Video prompt
Paste into Sora, Kling, Runway, Luma Dream Machine or Pika. These models read plain-English shot descriptions.
Describe your subject and pick the camera, lighting and style above to build a prompt.
A tool that builds AI text-to-video prompts the way a film shot is described. Type your subject / scene (e.g. an astronaut on a red desert) and the action / motion (e.g. walking toward the horizon), then just pick from dropdowns: shot (wide, close-up, low angle…), camera move (dolly in, tracking, aerial drone…), lighting (golden hour, neon, backlit…), style (cinematic, anime, vintage 8mm…), mood (epic, dreamy, suspenseful…), speed (slow motion, time-lapse…), aspect ratio (16:9, 9:16…) and length (5 s, 10 s…). It joins them into one natural-English sentence — "[style] [shot] of [subject], [action], [camera], [lighting], [mood], [speed]. [aspect], [length]." — in the form models like Sora, Kling, Runway, Luma Dream Machine and Pika read best. These models respond strongly to English cinematography terms (dolly in, golden hour, 16:9), so the output is in English. The subject, action and extra-detail fields are free text and only the options you select are added (unselected ones are skipped), so you can keep a prompt minimal or pile on detail. Your camera / lighting / style choices are saved in the browser so you start from the same defaults next time (the text you type is not stored). All assembly is plain string work in your browser — the ideas you type and the prompt it produces are never sent to any API or server.
How to use
- Type the subject / scene and the action / motion you want in the video.
- Pick the shot, camera move, lighting, style, mood, aspect and length from the dropdowns — only the ones you choose are added.
- Copy the assembled English prompt and paste it into Sora / Kling / Runway / Luma / Pika.
FAQ
Is anything I type sent anywhere?
No. The whole prompt is assembled by plain string joining in your browser. The subject, action and extra details you type, and the prompt it produces, are never sent to any API or server — it all stays on your device.
Why is the output in English?
The major video models — Sora, Kling, Runway, Luma, Pika — follow English cinematography terms (dolly in / golden hour / wide shot / 16:9) most accurately. The subject and action are free text, so you can write those however you like.
What order does the prompt use?
The common cinematography order: "[style] [shot] of [subject], [action], [camera move], [lighting], [mood], [speed]. [aspect], [length]." Unselected fields are skipped, so only the parts you set are strung together.
Does it work beyond Sora?
Yes. The output is a plain English shot description, not service-specific syntax. Kling, Runway Gen-3 / Gen-4, Luma Dream Machine, Pika, Haiper and other text-to-video tools all accept it pasted directly.
Length and aspect are set in the model UI too — is it useful in the text?
Most models prioritize the UI setting, but appending 16:9 or a 5-second clip to the prompt can still nudge composition and pacing. Set "Aspect" or "Length" to "—" if you'd rather leave them out of the sentence.