xAI Grok-4.1 Adds Native Video Understanding and Temporal Action Localization

2026-02-20

[AI-NEWS-ENTRY]

Date: 2026-02-20

Title: xAI Grok-4.1 Adds Native Video Understanding and Temporal Action Localization

Content: The Grok-4.1 preview integrates dense video captioning, temporal action localization, and event grounding directly into the core model. It can answer detailed questions about video events (“What object was picked up right after the door opened?”) with sub-second precision, and it can generate structured timelines from hour-long footage.

Keywords: video understanding, temporal localization, event grounding, video reasoning, Grok-4.1, multimodal video
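
To make "structured timeline" and "temporal action localization" concrete, here is a minimal Python sketch of how localized events might be represented and queried. It is an illustration only and is not xAI's API or output format: the `Event` type, the `temporal_iou` and `next_event_after` helpers, and the sample labels and timestamps are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Event:
    """One localized action: a label plus start/end times in seconds (hypothetical schema)."""
    label: str
    start: float
    end: float


def temporal_iou(a: Event, b: Event) -> float:
    """Intersection-over-union of two time intervals, a common score for localization quality."""
    inter = max(0.0, min(a.end, b.end) - max(a.start, b.start))
    union = (a.end - a.start) + (b.end - b.start) - inter
    return inter / union if union > 0 else 0.0


def next_event_after(timeline: List[Event], label: str) -> Optional[Event]:
    """Return the first event that starts at or after the end of the first event named `label`."""
    ordered = sorted(timeline, key=lambda e: e.start)
    for i, event in enumerate(ordered):
        if event.label == label:
            for later in ordered[i + 1:]:
                if later.start >= event.end:
                    return later
            return None
    return None


# Made-up sample timeline with sub-second timestamps (not real model output).
timeline = [
    Event("door opens", 12.4, 13.1),
    Event("person picks up keys", 13.6, 14.9),
    Event("person leaves room", 20.0, 22.5),
]

answer = next_event_after(timeline, "door opens")
```

With this sample data, a question like "what happened right after the door opened?" becomes a lookup over sorted intervals, and `temporal_iou` shows how a predicted interval could be compared against a reference interval.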
