Researchers at The University of Hong Kong (HKU) and collaborators have developed OpenCUA, an open-source framework to create robust AI agents for computer operation. OpenCUA…
View More OpenCUA’s Open Source Agents Compete with Proprietary Models from OpenAI and AnthropicTag: reasoning
MCP-Universe Benchmark Reveals GPT-5 Fails Over 50% of Real-World Orchestration Tasks
The adoption of interoperability standards, like the Model Context Protocol (MCP), can offer enterprises deeper insights into how agents and models operate beyond their boundaries.…
View More MCP-Universe Benchmark Reveals GPT-5 Fails Over 50% of Real-World Orchestration TasksDon’t Overlook Cohere: Command A Reasoning, Its First Model for Enterprise Customer Service and More
I was in more meetings than usual today so I just caught up with the news that Cohere, the Canadian startup co-founded by former Transformer…
View More Don’t Overlook Cohere: Command A Reasoning, Its First Model for Enterprise Customer Service and MoreByteDance, TikTok’s parent, unveils new open-source Seed-OSS-36B model with 512K token context.
The company’s Seed Team of AI researchers today released Seed-OSS-36B on AI code sharing website Hugging Face. Seed-OSS-36B is a new line of open-source, large…
View More ByteDance, TikTok’s parent, unveils new open-source Seed-OSS-36B model with 512K token context.LLMs Produce ‘Fluent Nonsense’ When Operating Beyond Their Training Bounds
Researchers at Arizona State University have released a new study challenging the idea that Chain-of-Thought (CoT) reasoning in Large Language Models (LLMs) indicates genuine intelligence.…
View More LLMs Produce ‘Fluent Nonsense’ When Operating Beyond Their Training BoundsIntroducing DeepSeek V3.1: Possibly the Most Powerful Open AI to Date
Chinese AI startup DeepSeek surprised the global AI community with the release of its 685-billion parameter model, challenging American AI giants and reshaping the landscape…
View More Introducing DeepSeek V3.1: Possibly the Most Powerful Open AI to DateNvidia Unveils Nemotron-Nano-9B-v2: A Compact, Open Model with Toggle On/Off Reasoning
Small AI models are gaining traction. Following the release of an AI vision model small enough for smartwatches by MIT spinoff Liquid AI and another…
View More Nvidia Unveils Nemotron-Nano-9B-v2: A Compact, Open Model with Toggle On/Off ReasoningHugging Face: 5 Ways Enterprises Can Reduce AI Costs Without Compromising Performance
Enterprises often believe AI models inherently need substantial computing power, leading them to seek out more resources. Sasha Luccioni from Hugging Face suggests a shift…
View More Hugging Face: 5 Ways Enterprises Can Reduce AI Costs Without Compromising PerformanceDesigning Feedback Loops: Enhancing LLMs to Improve Continuously
Large language models (LLMs) excel at reasoning, generating, and automating, but transforming a demo into a sustainable product requires the system to learn from actual…
View More Designing Feedback Loops: Enhancing LLMs to Improve ContinuouslyTencent Introduces Versatile Open-Source Hunyuan AI Models for Various Industry Uses
Tencent Reveals Advanced Open-Source Hunyuan AI Models for Various Industries Tencent has taken a significant step in the AI race by introducing its open-source Hunyuan…
View More Tencent Introduces Versatile Open-Source Hunyuan AI Models for Various Industry Uses