Google Gemini Omni Flash Opens Developer Public Beta: Text-to-Video with Native Audio

13    2026-07-01

On June 30, Google officially opened public beta access to the Gemini Omni Flash developer API. Through the Gemini API, it generates 3-10 second videos with native audio directly from text, animates static images, and supports conversational iterative editing. Unveiled at Google I/O 2026, it integrates Veo video generation with Gemini multimodal reasoning capabilities, with built-in SynthID watermarking for traceability. Now available on Gemini App, YouTube Shorts, and Google Flow, it democratizes multimodal AI video generation.

#GeminiOmni #GoogleAI #TextToVideo #MultimodalAI #AIGC

10730_rkns_6057.png