AI Nuggetz
Posts
Mastering AI Image & Video Prompting

Mastering AI Image & Video Prompting

SECTION 1: FOUNDATIONS OF PROMPT ENGINEERING

Nardeep Singh
March 13, 2025

Understanding AI Models & How They "See" Text

Different AI models interpret your words in unique ways, much like different artists have their own styles. Let's understand the major players:

Image Generation Models:

DALL·E: OpenAI's model excels at creative concepts but trends toward illustration-style outputs
MidJourney: Discord-based AI known for stunning artistic quality and photorealism with minimal tweaking
Stable Diffusion: Open-source model offering maximum customization for those willing to adjust settings

Video Generation Models:

Runway Gen-2: Creates smooth, cinematic camera movements but sometimes misses specific prompt details
Pika Labs: Closely follows complex instructions and allows image+text inputs for greater control
Sora: OpenAI's upcoming model shows fewer artifacts and more realistic motion physics

When you type a prompt, the AI breaks it down into concepts and searches its training data for matching patterns. A vague prompt like "a cat" gives the AI minimal guidance, while "a Persian cat with blue eyes sitting on a Victorian chair beside a window during golden hour" provides rich details to work with.

Today's Exercise: Create a simple prompt about an animal in a location. Run it through two different AI tools if available to you (or use free options like Bing Image Creator and Craiyon).

Example to start with:

A tiger in a forest

Try expanding it slightly to:

A Bengal tiger walking through a misty jungle

Notice how even small additions dramatically improve the results!

The Five Pillars of Perfect Prompts

Subject: Building the Blueprint for AI Visuals

Every effective prompt addresses these five key elements:

Subject: The main focus (person, object, scene)
- Bad: "A woman"
- Better: "A female astronaut"
- Best: "A mid-30s female astronaut with short brown hair"
Style: The medium or aesthetic approach
- Options: Photography, oil painting, 3D render, anime, pencil sketch, etc.
- Add qualifiers: "Photorealistic," "impressionist," "in the style of Studio Ghibli"
Details: Distinctive attributes that anchor the scene
- Environment: Weather, time of day, era, location specifics
- Props/Clothing: Important objects, distinctive attire, textures
- Example: "wearing a white spacesuit with NASA patches, holding a helmet"
Mood: The emotional tone or atmosphere
- Options: Serene, ominous, joyful, melancholic, energetic
- Create through: Lighting descriptions, color palettes, weather effects
Composition: The framing and arrangement
- Camera angle: Low angle, eye-level, aerial view
- Shot type: Close-up, medium shot, wide landscape
- Perspective: First-person view, side profile, three-quarters view

Today's Exercise: Take the animal prompt from section 1 and systematically enhance it by addressing all five elements.

Example structure:

A majestic Bengal tiger with piercing amber eyes walking through a misty tropical jungle at dawn, photorealistic wildlife photography, with dewdrops on large monstera leaves and mist curling around ancient tree trunks, creating a serene yet mysterious atmosphere. Wide shot with the tiger positioned on the left side of the frame, shot with a telephoto lens.

Wrapping Up: Your AI Prompting Journey Understanding how AI interprets prompts is the key to getting the best results. By refining your descriptions and experimenting with different models, you gain greater control over the images they generate.

Take today’s exercises a step further—compare your results, tweak your wording, and notice how small changes impact the outcome. The more you practice, the better you’ll become at crafting prompts that turn ideas into striking visuals.

Got an interesting result? Share it with others, discuss what worked, and keep refining your AI-generated art. The best prompts come from experimentation and curiosity!

Until next time, keep creating.