Create AI video from a prompt or reference images with the Gemini Omni Flash workflow. Choose 4, 6, or 8 seconds, set 16:9 or 9:16, and add optional audio when the clip needs sound.
Sample outputs generated with the Gemini Omni Flash workflow.
Video generated from mixed text and image inputs showing multimodal capabilities.
Gemini Omni Flash is the site's fast Gemini Omni video workflow powered by the PoYo VEO 3.1 official API. It supports text-to-video, image-to-video, first/last-frame generation, up to 3 image references, optional audio, 4/6/8 second duration, and 16:9 or 9:16 output.
Start with a text prompt when you need an original scene, camera move, product clip, or social video draft.
Upload reference images to guide the clip. One image works as image-to-video; two images can guide first and last frames; three images can be used for supported reference mode.
Enable audio when the final clip needs sound. Keep drafts silent to reduce credit use while you iterate.
Use Gemini Omni Flash when you need quick text-to-video or image-to-video drafts before committing to a higher-cost final render.
Guide the output with images instead of relying on a prompt alone. This is useful for products, characters, style frames, and start/end framing.
Sound is optional, so you can keep early tests silent and turn audio on for clips that are ready to present.
The workflow exposes practical controls: duration, aspect ratio, resolution, audio, and reference images.
Write a scene prompt and generate a 4, 6, or 8 second video clip.
Use up to 3 image references for image-guided video, first/last-frame control, or supported reference mode.
Turn on audio for complete clips, or keep output silent for lower-cost drafts.
Choose 720p, 1080p, or 4K where supported by the selected VEO 3.1 official mode.
Generate 16:9 horizontal clips or 9:16 vertical clips for social platforms.
Check estimated credit cost before generation so you can adjust duration, resolution, and audio.
Generate 9:16 portrait clips for Reels, Shorts, and TikTok from short prompts or reference images.
Turn product photos into short video ads with controlled framing and optional audio.
Test scene direction at 4 seconds before spending more credits on longer or higher-resolution output.
Create multiple prompt or reference-image variations while keeping cost visible before every run.
Create AI video from prompts or image references with optional audio, visible credit cost, and 4/6/8 second output settings.