News
Mistral Pixtral 2.0: Open Multimodal Model with Advanced Image Generation and Editing
Mistral released Pixtral 2.0 (open-weight 30B model), combining high-fidelity image generation
OpenAI o4-medium: Enhanced Self-Reflective Reasoning for Ethical Decision-Making in Agents
o4-medium introduces self-reflective reasoning loops that allow agents to evaluate ethical implications in real time
Google DeepMind Gemini 3.1: Frontier Performance in Quantum Circuit Optimization and Simulation
Gemini 3.1 achieves state-of-the-art in quantum computing tasks
xAI Grok-3.6: Native Physics Simulation and Real-Time 3D Environment Reasoning
Grok-3.6 integrates native physics engines for real-time simulation of 3D environments
Anthropic Claude 4.2 Opus: Autonomous Multi-Day Software Project Completion with Zero Human Input
Claude 4.2 Opus demonstrated the ability to complete a multi-day software project
Mistral NeMo 2.0: 12B Model with State-of-the-Art On-Device Multimodal Performance
Mistral released NeMo 2.0 (12B parameter model optimized for on-device inference)
Anthropic Claude 4.1 Opus: First Model to Pass Internal 10-Hour Autonomous Software Engineering Test
Claude 4.1 Opus became the first publicly known frontier model to successfully complete a
Google Gemini Robotics 1.0: End-to-End Multimodal Policy Model for Dexterous Manipulation
Google DeepMind unveiled Gemini Robotics 1.0, an end-to-end multimodal policy model that directly maps vision + language
DeepSeek-V3.5: 671B MoE Model Surpasses GPT-5.2 on Chinese & English Long-Context Benchmarks
DeepSeek open-sourced V3.5 (671B MoE), setting new state-of-the-art on 1M+ token long-context Chinese
OpenAI o4-mini Pro Launches with Breakthrough Chain-of-Verification Reasoning
OpenAI released o4-mini Pro, featuring a new chain-of-verification (CoVe) reasoning mechanism that significantly