Cost shown before you generate
The credit counter updates as you change duration, resolution, quality mode, and audio settings. You always know the cost before committing.
The model you pick matters more than the prompt you write. Start by choosing the workflow that matches your output — video draft, reference-guided clip, motion transfer, image asset, or 4K final — then generate with the right Gemini Omni model.
Preview your credit cost before every generation. One shared credit pool covers all models — video, image, motion, and 4K.
Use Veo 3.1 for text-to-video or image-to-video drafts with strong motion quality
Use Veo 3.1 when the output must match a reference image for style, identity, or composition
Use Motion Control to apply movement from a reference video to your subject
No surprises. See the exact credit cost before you submit — then decide whether to iterate cheaply or commit to a higher-quality render.
The credit counter updates as you change duration, resolution, quality mode, and audio settings. You always know the cost before committing.
Video, image, motion control, and 4K all draw from the same balance. No separate wallets or confusing tier splits.
If the system fails to produce output, credits are not deducted. You only pay for completed results.
Use Standard or Draft modes for iteration at lower cost. Move to Pro or 4K only when the direction is locked and the output needs to ship.
Most bad AI video drafts fail before generation starts — because the user picked the wrong model for the job. Use this guide to choose the right workflow before you write a single prompt.

For text-to-video, image-to-video, multi-shot sequences, and reference-guided clips. Choose Veo 3.1 for general video or O3 when reference fidelity matters.

For movement transfer from reference video, or creating still image assets before video generation.
Define the output
Video clip, reference-guided video, motion transfer, image asset, or image edit.
Choose the model
Veo 3.1, O3, Motion Control 3.0, Motion Control 2.6, O3 Image, or O3 Bild bearbeiten.
Add inputs
Prompt, reference images, @element tags, motion reference video — whatever the model needs.
Preview credits and generate
Check the cost, choose quality level, then generate. Iterate cheaply before committing to Pro or 4K.
Veo 3.1 handles most text-to-video and image-to-video tasks. Strong motion, good prompt following, Standard/Pro/4K quality tiers.
Veo 3.1
Veo 3.1 locks style, identity, and composition from a reference image. Use when brand consistency or character identity matters.
Veo 3.1
Each model solves a different problem. Choosing wrong wastes credits and produces off-target results. Use this guide.
Your default choice for most video tasks. Handles text-to-video, image-to-video, and multi-shot sequences with strong motion quality and good prompt adherence.
Cinematic Scene Generation
An AI-generated cinematic scene demonstrating Veo 3.1 text-to-video output.
Use when the output must closely match a reference image in style, composition, or character identity. Stronger reference fidelity than Veo 3.1.
Not a general video model. Use specifically when you have a still subject and want it to follow movement from a reference video.
Create reference frames, product visuals, thumbnails, or style frames before moving to video generation.
When an image is close but needs prompt-guided changes before it becomes a reference or final asset.
Not a separate model — a quality tier available on supported workflows. Use only after the direction is locked.
Start with what you need, not which button to press. The right model choice saves credits and produces better results on the first try.
Video clip? Reference-guided video? Motion transfer? Image asset? Image edit? Start with the result.
Task
Veo 3.1 for general video. O3 for reference control. Motion Control for movement. O3 Image for stills.
Model
Write the prompt, upload references, check the credit cost. Adjust quality settings before generating.
Setup
Start with Standard quality for fast iteration. Move to Pro or 4K only when the direction is confirmed.
Generate
The fastest path to a good result is choosing the right model before you write the prompt. Here is when to use each.
Use Veo 3.1. It handles most video generation tasks with strong motion quality. Add a reference image if you want visual guidance, or use text-only for creative exploration.
→ Veo 3.1
Use Veo 3.1. It prioritizes reference fidelity over creative freedom. Best when brand identity, character consistency, or composition must be preserved.
→ Veo 3.1
Use Motion Control. Upload your subject image and a motion reference video. The model applies that movement to your subject.
→ Motion Control
Use O3 Image to generate reference frames, product visuals, or style explorations. Then use those images as references for video generation.
→ O3 Image
Answers for choosing the right Gemini Omni model and workflow.
Pick the model that matches your task. Preview credits before every job. Iterate cheaply, then upgrade to Pro or 4K when the direction is locked.