Business Stories AI Technology Travel Visa Asia Business Registration Telecommunication Medical Services

Google Gemini Omni Flash Opens Developer Public Beta: Text-to-Video with Native Audio

13 2026-07-01

On June 30, Google officially opened public beta access to the Gemini Omni Flash developer API. Through the Gemini API, it generates 3-10 second videos with native audio directly from text, animates static images, and supports conversational iterative editing. Unveiled at Google I/O 2026, it integrates Veo video generation with Gemini multimodal reasoning capabilities, with built-in SynthID watermarking for traceability. Now available on Gemini App, YouTube Shorts, and Google Flow, it democratizes multimodal AI video generation.

#GeminiOmni #GoogleAI #TextToVideo #MultimodalAI #AIGC

South Korea Launches Trillion-Dollar AI Chip Initiative Led by Samsung and SK

Anthropic Claude Full Family Launches on Microsoft Azure Powered by NVIDIA GB300 Blackwell Ultra

new

Anthropic Claude Full Family Launches on Microsoft Azure Powered by NVIDIA GB300 Blackwell Ultra Google Gemini Omni Flash Opens Developer Public Beta: Text-to-Video with Native Audio South Korea Launches Trillion-Dollar AI Chip Initiative Led by Samsung and SK Meituan Unveils LongCat-2.0: World's First Trillion-Parameter Model Trained Entirely on Domestic Computing Power OpenAI Unveils GPT-5.6 Series, Outperforming Claude Mythos 5 Meta Launches $299 Own-Brand AI Glasses to Accelerate Consumer AI Device Adoption NVIDIA Launches BioNeMo Agent Toolkit: AI Deepens Integration into Life Sciences Anthropic Launches Claude Tag: AI Evolves from Personal Assistant to Team Collaborator Volcano Engine Launches Doubao LLM 2.1 Pro with Enhanced Multimodal Capabilities China's "Lingsheng" Supercomputer Tops Global TOP500 List