Sora 2 Prompting Guide
Write prompts that get the most out of Sora 2's world simulation engine.
Sora 2 is OpenAI's world simulation model — it doesn't just generate video, it simulates a physical world and films it. This distinction matters for prompting. The techniques that work for image generators or other video models often produce mediocre Sora 2 output. Here's what actually works.
The Sora 2 prompt structure
Sora 2 responds best to prompts structured as scene descriptions rather than shot directions. Think of it as writing for a cinematographer who will make camera decisions for you.
Optimal structure
1. Setting — where and when 2. Subject — who or what 3. Action — what happens 4. Atmosphere — mood, lighting, weather 5. Style (optional) — visual treatment
Example: "A rain-soaked Tokyo alley at 2 AM. A street cat picks its way between puddles, stopping to shake water from one paw. Neon signs reflect in the wet pavement. The scene feels quiet and contemplative. Shot on 35mm film with shallow depth of field."
Physics hints
Sora 2's world simulation engine is its biggest advantage. You can give it physics hints and it will compute the results:
- "A glass of water tips over on a wooden table" — Sora 2 simulates the splash pattern, water spreading, dripping off the edge
- "A feather and a bowling ball dropped from the same height" — it gets the differential fall rates right
- "Wind catches a stack of papers on a desk" — individual sheets scatter with realistic aerodynamics
The key: describe the initial conditions and let the model simulate what happens. Don't describe the result — describe the cause.
Good: "A gust of wind hits an open umbrella" Bad: "An umbrella flips inside out in the wind"
The first lets Sora 2 simulate; the second forces a specific outcome that may fight the physics engine.
Style control
Sora 2 handles an exceptionally wide range of visual styles. Here are the most reliable style keywords:
Photorealistic
- "Shot on [camera model]" — e.g., "Shot on ARRI Alexa Mini"
- "35mm film," "16mm film," "anamorphic lens"
- "[Director name] style" — e.g., "Wes Anderson style" (framing) or "Roger Deakins lighting"
Stylized
- "Studio Ghibli style," "Pixar style," "stop motion animation"
- "Oil painting," "watercolor," "charcoal sketch"
- "Vaporwave aesthetic," "film noir," "cyberpunk"
Tip: Commit to one style
Mixing style references in a single prompt ("anime style shot on 35mm film") produces muddy results. Pick one visual direction and reinforce it.
Lighting language
Sora 2 is remarkably responsive to lighting descriptions:
- "Golden hour" — warm directional light, long shadows
- "Overcast flat light" — even, shadowless illumination
- "Single practical light source" — dramatic, high-contrast
- "Rim lighting" — backlit subject with glowing edges
- "Dappled light through leaves" — the model generates accurate caustic patterns
Specifying light direction helps too: "Key light from upper left, soft fill from right."
Duration and pacing
Sora 2 generates up to 12 seconds. To control pacing within that window:
- Slow scenes: Use words like "slowly," "gradually," "lingering." Describe small details — "she traces a finger along the spines of old books" — to signal the model to slow down.
- Fast scenes: "Suddenly," "in a flash," "rapid." Describe high-energy actions.
- Build-ups: Start with stillness and introduce motion mid-prompt. "A still lake at dawn. A fish suddenly breaks the surface."
What Sora 2 does poorly
Being honest about limitations saves you credits: 1. Text rendering — Sora 2 still struggles with legible text in video. If you need readable text, add it in post. 2. Precise hand detail — Hands are better than they were but still occasionally produce extra or missing fingers in complex poses. 3. Counting — "Five birds fly across the sky" might give you 4 or 7. If exact count matters, keep numbers small (2-3). 4. Multi-shot sequences — Sora 2 generates single continuous shots. Use Kling 3.0 or PonPon Flow for multi-shot.
Prompt refinement workflow
1. Start minimal: Subject + action + one style word. See what the model's default interpretation is. 2. Add atmosphere: Lighting, weather, time of day. 3. Refine style: Camera/lens references or artistic style. 4. Iterate on specifics: Adjust the details that matter, leave the rest to the model.
Don't write a 200-word prompt on the first try. Build up incrementally — you'll learn what Sora 2 responds to and what it ignores.