AI Video Prompts: How to Direct Runway, Kling & Pika Like a Pro
AI video generation is the newest frontier in generative AI. In 2024, tools like Runway Gen-3, Kling, and Pika produced the first genuinely usable AI video clips. By 2026, they're approaching production-ready quality for specific use cases.
But video is significantly harder than images. Each clip is hundreds of individual frames that must remain coherent. Motion must feel natural. Physics must be convincing. And all of this needs to be controlled through text prompts.
This guide teaches you how to prompt AI video generators like a cinematographer, covering camera movement vocabulary, motion descriptors, and platform-specific techniques for Runway, Kling, and Pika.
How Video AI Works
Understanding the technical basics helps you write better prompts.
Two primary modes:
Text-to-video:
- Write a prompt describing the scene
- AI generates video from scratch
- Starting from noise (like image generation)
- More creative freedom, less control
Image-to-video:
- Provide a starting image (AI-generated or real photo)
- Write a prompt describing the motion
- AI animates the still image
- More control, more predictable results
Why video is harder than images:
Temporal coherence:
- Every frame must be consistent with the previous frame
- Characters/objects can't morph or disappear mid-clip
- Lighting and atmosphere must stay stable
- Current AI struggles with this beyond 3-10 seconds
Physics simulation:
- Motion must obey physics (gravity, momentum, inertia)
- AI learned physics implicitly from training videos
- Still produces impossible movements regularly
Camera work:
- AI must understand camera movement (pan, zoom, dolly, etc.)
- Interpret whether motion is subject moving or camera moving
- Maintain framing and composition throughout
Computing cost:
- Video generation is 100-1000ร more expensive than images
- Limits iteration speed
- Most platforms charge per-second generated
The field is evolving rapidly. What's impossible today may be trivial in six months.
The Video Prompt Formula
Effective video prompts need more specificity than image prompts.
Core formula:
[Scene description] + [Subject action] + [Camera movement] + [Lighting] + [Mood] + [Technical details] + [Duration context]Example:
A woman walking through a forest at golden hour, slow motion, camera follows beside her, warm afternoon light, peaceful atmosphere, cinematic, 4KComponent priority for video:
- Scene (setting and context)
- Action (what moves, how it moves)
- Camera movement (static, pan, zoom, dolly, etc.)
- Subject details (appearance, clothing, expression)
- Lighting (natural, dramatic, soft, etc.)
- Mood/atmosphere
- Technical specs (4K, slow-mo, frame rate)
Key difference from image prompts:
- Action verbs are critical โ video needs movement
- Camera language is essential โ must specify camera behavior
- Duration cues help (slow, fast, time-lapse, slow motion)
Optimal prompt length:
- Runway Gen-3: 20-50 words
- Kling: 25-60 words
- Pika: 15-40 words
Longer than image prompts because motion needs description.
Camera Movement Vocabulary
These words reliably trigger specific camera behaviors. Master them.
Basic Camera Movements
Static shot:
"Static camera, no movement"
"Fixed camera angle"
"Locked camera"Effect: Camera doesn't move. Subject movement only.
Pan (horizontal camera rotation):
"Pan left" โ Camera rotates left, scene moves right across frame
"Pan right" โ Camera rotates right, scene moves left across frame
"Slow pan across the landscape"Effect: Reveals scene horizontally, like turning your head.
Tilt (vertical camera rotation):
"Tilt up" โ Camera rotates upward
"Tilt down" โ Camera rotates downwardEffect: Reveals scene vertically, like looking up/down.
Zoom in:
"Zoom in on subject"
"Slow zoom into face"
"Dramatic zoom"Effect: Magnifies subject, moves closer optically (not physically).
Zoom out:
"Zoom out to reveal"
"Pull back to show full scene"Effect: Widens view, reveals more context.
Advanced Camera Movements
Dolly (camera moves physically forward/backward):
"Dolly in" โ Camera physically moves toward subject
"Dolly out" โ Camera physically moves away
"Slow dolly push"Effect: Similar to zoom but with depth/parallax shift. More cinematic.
Truck (camera moves physically left/right):
"Truck left" โ Camera physically slides left
"Truck right" โ Camera physically slides right
"Lateral tracking shot"Effect: Moves parallel to subject, often following action.
Crane/Boom (camera moves vertically):
"Crane up" โ Camera rises vertically
"Crane down" โ Camera descends
"Ascending crane shot revealing cityscape"Effect: Dramatic vertical movement, god's-eye view transitions.
Orbit/Arc:
"Orbit around subject"
"Camera circles the character"
"360-degree rotation around object"Effect: Camera moves in circular path around subject.
Handheld:
"Handheld camera, shaky"
"Documentary style, unstable"
"Realistic handheld movement"Effect: Subtle shake, organic feeling, increases realism.
Steadicam:
"Smooth steadicam movement"
"Gliding camera following subject"Effect: Smooth, flowing movement without shake.
Drone/Aerial:
"Aerial drone shot"
"Bird's eye view descending"
"Fly-over establishing shot"Effect: High altitude perspective, sweeping movements.
First-person POV:
"POV shot, first-person perspective"
"Camera is the character's eyes"Effect: Viewer sees through character's eyes.
Combination Movements
AI video can handle combined movements:
"Camera orbits around subject while slowly zooming in"
"Dolly in while panning right"
"Crane up and pull back to reveal"Combinations produce more cinematic results but are less predictable.
Action Descriptors โ Making Things Move
Video needs motion. Here's how to describe it.
Subject Movement Verbs
Walking/Running:
"walking slowly toward camera"
"running away from camera"
"strolling through park"
"sprinting across field"Looking/Turning:
"turns head to look at camera"
"glances over shoulder"
"looks up suddenly"Hand/Arm movements:
"waves hand"
"reaches out"
"raises arms triumphantly"Object interaction:
"picks up object"
"places cup on table"
"opens door"Facial expressions:
"smiles gradually"
"eyes widen in surprise"
"expression shifts from sad to hopeful"Environmental Motion
Natural elements:
"leaves rustling in wind"
"water flowing over rocks"
"clouds drifting across sky"
"rain falling heavily"
"fire flickering"Atmospheric effects:
"fog rolling in"
"dust particles floating in light beam"
"smoke rising slowly"Background activity:
"cars passing in background"
"people walking by out of focus"
"birds flying across frame"Speed and Tempo Descriptors
"slow motion" โ Captures action at reduced speed
"time-lapse" โ Accelerated passage of time
"normal speed"
"fast motion, accelerated"
"frame-by-frame, stuttering motion"Duration cues:
"quick pan" vs. "slow pan"
"sudden zoom" vs. "gradual zoom"
"gentle movement" vs. "rapid movement"Runway Gen-3 Prompting
Runway is the industry leader in AI video, known for Gen-3's photorealism and motion quality.
Text-to-Video vs. Image-to-Video Modes
Text-to-video:
- More creative freedom
- Less predictable
- Best for abstract concepts, motion graphics, effects
Image-to-video (recommended):
- Start with AI-generated or uploaded image
- Describe motion to apply
- Much more controllable
- Professional results
Workflow for best results:
1. Generate perfect still image (Midjourney, DALL-E, SD)
2. Upload to Runway Gen-3
3. Write motion prompt
4. Generate 5-10 second clipWhat Runway Handles Well
Strengths:
- Smooth, realistic motion
- Photorealistic quality
- Natural physics (most of the time)
- Camera movements
- Environmental effects (water, smoke, wind)
Weaknesses:
- Fast action (motion blur, artifacts)
- Complex human movements (dancing, sports)
- Facial expressions (still limited)
- Long durations (quality degrades after 10 seconds)
- Text appearing in video
Motion Brush Technique
Runway Gen-3 has "Motion Brush"โlets you paint which parts of the image should move and in what direction.
How to use:
- Upload image
- Select Motion Brush tool
- Paint over areas you want to move
- Draw arrow indicating direction
- Set motion intensity (subtle to strong)
- Generate
Use cases:
- Make specific elements move while others stay static
- Direct hair, clothing, flags to blow in specific direction
- Control water flow direction
- Animate specific objects while keeping background still
Pro tip: Combine Motion Brush with text prompts for maximum control.
10 Example Runway Prompts with Expected Results
1. Portrait to Subtle Animation
Image: Close-up portrait
Prompt: Gentle breeze moving hair, subject looks slightly to the right, soft smile forms, natural eye blinks, golden hour lightingExpected: Subtle, realistic portrait animation with natural micro-movements
2. Nature Scene
Image: Forest landscape
Prompt: Camera slowly dollies forward through forest, leaves gently rustling, dappled sunlight shifts, peaceful atmosphereExpected: Smooth forward movement with environmental details animated
3. Product Reveal
Image: Product on table
Prompt: Camera orbits around product, dramatic lighting, slow rotation revealing all angles, professional commercial styleExpected: 360-degree product showcase, studio-quality lighting
4. Cityscape Time-Lapse
Image: City skyline at dusk
Prompt: Time-lapse transition from dusk to night, lights turn on in buildings, clouds move quickly, traffic streams belowExpected: Accelerated day-to-night transition with urban activity
5. Character Walking
Image: Person standing in street
Prompt: Subject walks toward camera, confident stride, coat billowing slightly, camera slowly dollies back, cinematicExpected: Natural walking motion with camera tracking backward
6. Abstract Motion Graphics
Text-to-video (no image)
Prompt: Abstract colorful liquid shapes morphing and flowing, smooth transitions, vibrant gradients, mesmerizing motionExpected: Fluid abstract animation, background/transition material
7. Wildlife Close-Up
Image: Bird on branch
Prompt: Bird turns head, looks at camera, ruffles feathers, subtle branch sway, natural documentary styleExpected: Realistic wildlife micro-movements
8. Establishing Shot
Image: Mountain landscape
Prompt: Aerial drone shot, camera flies forward and up, revealing vast landscape, epic scale, golden hour lightingExpected: Cinematic establishing shot with altitude and forward movement
9. Food Commercial
Image: Plated dish
Prompt: Slow zoom into food, steam rising, drizzle of sauce in slow motion, professional food photography lightingExpected: Appetizing commercial-style food shot
10. Sci-Fi Scene
Image: Futuristic city
Prompt: Camera pans across neon-lit city, flying cars pass by, holographic advertisements flicker, rain-slicked streets reflect lights, Blade Runner aestheticExpected: Cyberpunk atmosphere with motion and depth
Common Failure Modes and Fixes
Problem: Subject morphs or distorts
Fix: Use image-to-video mode, simpler motion prompts, shorter duration
Problem: Camera movement too aggressive
Fix: Add "slow" or "gentle" to camera movement descriptors
Problem: Motion feels floaty/unrealistic
Fix: Add "natural physics" or "realistic motion" to prompt
Problem: Multiple subjects move wrong
Fix: Use Motion Brush to isolate which parts move
Kling AI Prompting
Kling (Chinese AI video platform) is known for longer coherent sequencesโup to 10 seconds of stable motion.
Kling's Strengths
What Kling does better:
- Longer clips with maintained coherence (10 sec vs. 5 sec typical)
- Better at complex scenes
- Strong with Asian aesthetics and subjects
- Good motion consistency
What Kling struggles with:
- Western photorealism (trained more on Asian datasets)
- Extreme closeups
- Very fast action
When to use Kling:
- Need longer clips
- Asian subjects/settings
- When Runway produces too much distortion
- Budget-conscious (often cheaper per second)
Prompt Structure for Kling
Kling responds well to structured, detailed prompts:
[Detailed scene description] + [Subject and action] + [Camera movement] + [Duration/pace] + [Lighting/mood] + [Style reference]Example:
A serene Japanese garden with koi pond, woman in kimono walks slowly along stone path, camera tracks beside her, cherry blossoms falling gently, soft afternoon light, peaceful atmosphere, cinematic travel video style8 Example Kling Prompts
1. Urban Street Scene
Busy Tokyo street at night, neon signs reflected on wet pavement, people walking with umbrellas, camera slowly pans right, rain falling, vibrant colors, cyberpunk atmosphere2. Nature Documentary
Macro shot of butterfly landing on flower, wings slowly open and close, gentle breeze, bokeh background, natural lighting, wildlife documentary style3. Architectural Flythrough
Modern minimalist house interior, camera glides through rooms, sunlight streaming through windows, clean aesthetic, architectural visualization, smooth steadicam movement4. Fantasy Landscape
Mystical forest with glowing mushrooms, camera moves forward along path, magical particles floating in air, ethereal lighting, fantasy concept art style5. Action Sports
Skateboarder performing trick, slow motion capture, camera follows action, urban setting, golden hour light, youth culture aesthetic6. Food Preparation
Close-up of chef's hands chopping vegetables, smooth knife motion, ingredients on wooden cutting board, natural kitchen lighting, cooking show style7. Fashion/Portrait
Model walking down runway, confident stride, camera tracks forward movement, studio lighting, high fashion aesthetic, professional production8. Atmospheric Landscape
Misty mountain valley at sunrise, fog slowly clearing, camera slowly rises to reveal vista, dramatic lighting breaking through clouds, epic nature cinematographyPika Prompting
Pika focuses on fast, social-media-optimized video generation with unique "Pikaffects."
Pika's Unique Features
Speed:
- Fastest generation times (often under 30 seconds)
- Great for rapid iteration
- Lower cost per clip
Pikaffects:
- Preset effects you can apply: "inflate," "explode," "melt," "crumble," "squish"
- One-click style effects
- Social media-friendly
Style:
- Slightly less photorealistic than Runway
- More stylized, artistic
- Perfect for social content where perfect realism isn't needed
When to use Pika:
- Social media content (TikTok, Instagram Reels, YouTube Shorts)
- When you need many clips fast
- Stylized videos over photorealism
- Budget-conscious projects
- Experimental/creative effects
6 Example Pika Prompts for Social Content
1. Product Demo (Instagram Reel)
Product spinning on turntable, dynamic lighting, text overlay space, colorful background, energetic, short-form content style2. Motivation Quote (TikTok)
Sunrise over ocean, waves crashing, camera slowly zooms out, inspirational mood, space for text overlay, peaceful3. Recipe Clip (YouTube Short)
Overhead shot of ingredients coming together, time-lapse assembly, final dish reveal, bright cooking lighting, appetizing4. Before/After (Social comparison)
Split screen effect showing transformation, smooth transition, contrasting lighting left to right, viral video style5. Dance Clip
Person dancing to beat, energetic movement, colorful lighting, camera slightly orbits, party vibe, social media aesthetic6. Pet Content
Cute dog tilts head, ears perk up, looks at camera, shallow depth of field, warm lighting, viral pet video styleMaintaining Consistency Across Shots
The single biggest challenge in AI video: generating multiple clips that feel like the same scene/character.
Techniques for Consistency
1. Seed locking (when available):
- Some platforms offer seed control
- Same seed = more consistent results
- Limited effectiveness for video (vs. images)
2. Image-to-video workflow:
Generate character image (Midjourney) โ Save
Use same character image for ALL video generations
Add different motion prompts
Result: Same character, different actions3. Style consistency:
- Use identical lighting descriptors
- Same camera lens descriptors (85mm, etc.)
- Same quality/aesthetic tags
- Same color grading references
4. Reference image reuse:
- Generate one great clip
- Extract best frame
- Use that frame as starting point for next clip
5. Deliberate continuity planning:
Clip 1: Character enters from left
Clip 2: Start with character already in frame, continue action
Clip 3: Character exits rightMatch action at edit points.
Pro tip: Plan AI video like stop-motionโeach clip is a separate take you'll stitch together in editing. Don't expect AI to maintain continuity across generations.
A/B Testing Video Prompts
Video generation is expensive. Test systematically.
Iteration strategy:
Round 1: Core concept
Test basic prompt โ does it produce desired subject/scene?
If no: revise scene description
If yes: proceed to Round 2Round 2: Motion
Test camera movement variations:
- Static vs. moving camera
- Different camera movements (pan vs. dolly vs. orbit)
Keep best motion approachRound 3: Technical refinement
Test lighting, mood, pacing descriptors
Add cinematic quality words
Adjust speed descriptors (slow, fast, etc.)Round 4: Platform comparison
Take best prompt, try on different platforms
Runway vs. Kling vs. Pika
Choose platform that nails your aestheticCost management:
- Start with cheapest/fastest platform (Pika)
- Move to higher quality when concept is proven
- Use Runway for final production clips
Using AI Video in a Real Production Pipeline
AI video is rarely the final outputโit's an element in your edit.
Professional workflow:
1. Generate clips:
- Use AI for specific shots (establishing, B-roll, effects)
- Not entire videos (yet)
2. Stitch in editing software:
- Premiere Pro, Final Cut Pro, DaVinci Resolve
- Mix AI clips with real footage, stock footage, graphics
3. Add audio:
- AI-generated music (Suno, Udio)
- Sound effects
- Voiceover
- Proper audio mixing
4. Color grade:
- AI clips may have inconsistent color
- Color grade for continuity
- Match AI footage to real footage
5. Transitions and effects:
- Smooth cuts between AI clips
- Add motion graphics, text overlays
- Professional polish
Current best use cases:
- Establishing shots (exteriors, landscapes)
- B-roll (atmospheric shots, inserts)
- Impossible shots (fantasy, sci-fi, abstract)
- Storyboarding and animatics
- Social media content
Not yet reliable for:
- Dialogue scenes (facial sync issues)
- Extended narrative sequences
- Anything requiring precise physical interaction
- Matching existing footage perfectly
Summary: Video Prompting Best Practices
โ Do:
- Use image-to-video mode for more control
- Specify camera movement explicitly
- Use strong action verbs
- Start simple (short clips, simple motion)
- Describe lighting and mood
- Test on multiple platforms
- Plan for post-production editing
โ Don't:
- Expect long, perfect sequences (stitch shorter clips)
- Rely on AI for dialogue/lip sync (not there yet)
- Over-complicate first prompt (iterate)
- Forget duration cues (slow, fast, time-lapse)
- Expect consistency across generations without image reuse
- Generate entire videos in one prompt (yet)
Platform quick comparison:
| Platform | Best For | Strength | Weakness |
|---|---|---|---|
| Runway Gen-3 | Photorealism, pro work | Quality, motion smoothness | Cost |
| Kling | Longer clips, Asian subjects | Coherence, length | Western photorealism |
| Pika | Social media, speed | Fast, affordable, Pikaffects | Less realistic |
Next steps:
- AI YouTube Video Workflow โ Complete video production pipeline
- Runway Gen-3 Review โ Full platform breakdown
- Prompt Anatomy Guide โ Core prompting principles
Explore:
- 100+ Prompt Templates โ Ready-to-use video prompts
- AI Tools Directory โ Compare all video AI platforms
AI video is early but evolving fast. Master the camera language, understand platform strengths, and you'll produce shots impossible or prohibitively expensive to film traditionally. The future of video production is hybridโAI for impossible shots, real cameras for everything else.
๐ Video production and cinematography books translate directly into better AI video prompts โ shot type, camera movement, and lighting language all carry over from traditional filmmaking vocabulary.
Some links in this article are affiliate links โ we may earn a small commission if you purchase, at no extra cost to you. Full disclosure โ