On June 30, Google officially opened the public beta of the Gemini Omni Flash developer API. Through the Gemini API, developers can generate 3 to 10 second videos with native audio directly from text, animate static images, and refine results through conversational, iterative editing. Unveiled at Google I/O 2026, the model combines Veo video generation with Gemini's multimodal reasoning and has built-in SynthID watermarking for traceability. It is also available in the Gemini app, YouTube Shorts, and Google Flow, bringing multimodal AI video generation to a broader audience.
#GeminiOmni #GoogleAI #TextToVideo #MultimodalAI #AIGC
