Gemini Omni Video Generator
Google's first anything-from-anything multimodal AI — text, images, video, or audio input → stunning 4K video output
Aspect Ratio
Duration
Resolution
Seed
Input Quota
0/7 used
Images × 1 + Video × 2 + Character IDs × 1 ≤ 7
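The quota arithmetic above can be sketched in a few lines of Python. This is only an illustration of the stated formula: the weights (1, 2, 1) and the limit of 7 come from the quota line, while the function and constant names are hypothetical and not part of any published Gemini Omni API.

```python
# Illustrative sketch of the input quota formula shown on this page.
# Weights and limit are taken from the page; all names here are hypothetical.
QUOTA_LIMIT = 7
WEIGHTS = {"images": 1, "videos": 2, "character_ids": 1}


def quota_used(images: int = 0, videos: int = 0, character_ids: int = 0) -> int:
    """Return the quota units consumed by the given input counts."""
    counts = {"images": images, "videos": videos, "character_ids": character_ids}
    if any(n < 0 for n in counts.values()):
        raise ValueError("counts must be non-negative")
    # Each input type contributes (weight * count) units.
    return sum(WEIGHTS[name] * n for name, n in counts.items())


def fits_quota(images: int = 0, videos: int = 0, character_ids: int = 0) -> bool:
    """Return True if the combined inputs stay within the 7-unit quota."""
    return quota_used(images, videos, character_ids) <= QUOTA_LIMIT
```

For example, three images, one reference video, and two character IDs use 3 + 2 + 2 = 7 units, which exactly fills the quota; adding a fourth image would exceed it.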
Credits per generation
200 credits
Reference Images (optional, up to 7)
Upload images as visual references. Gemini Omni will use them to guide video generation.
Audio IDs
Optional — up to 3 audio IDs to include in the video
Character IDs
0/3 used
Optional, up to 3 character IDs for consistent characters
Reference Video URL
URL, start and end time are all required. Trim range must be ≤ 10 seconds.
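The reference video rules above (all three fields required, trim range of at most 10 seconds) could be checked before submitting, roughly as follows. This is a minimal sketch: the function name, the use of seconds as the unit for start and end, and the rule that end must come after a non-negative start are assumptions, not behavior confirmed by this page.

```python
from typing import List, Optional

# Maximum trim range stated on this page, in seconds.
MAX_TRIM_SECONDS = 10.0


def trim_errors(url: Optional[str],
                start: Optional[float],
                end: Optional[float]) -> List[str]:
    """Return a list of validation problems; an empty list means the input looks valid."""
    errors = []
    if not url:
        errors.append("url is required")
    if start is None:
        errors.append("start time is required")
    if end is None:
        errors.append("end time is required")
    if start is not None and end is not None:
        # Assumption: times are in seconds and end must follow a non-negative start.
        if start < 0 or end <= start:
            errors.append("end must be after a non-negative start")
        elif end - start > MAX_TRIM_SECONDS:
            errors.append("trim range must be at most 10 seconds")
    return errors
```

Returning a list of messages rather than a single boolean lets a form report every missing field at once.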
Describe what you want in your video. Be detailed — include style, mood, lighting, and camera movement.
Result
Example output — enter a prompt to get started