OpenCUA's Open Source Agents Compete with Proprietary Models from OpenAI and Anthropic

Researchers at The University of Hong Kong (HKU) and collaborators have developed OpenCUA, an open-source framework to create robust AI agents for computer operation. OpenCUA…

MCP-Universe Benchmark Reveals GPT-5 Fails Over 50% of Real-World Orchestration Tasks

The adoption of interoperability standards, like the Model Context Protocol (MCP), can offer enterprises deeper insights into how agents and models operate beyond their boundaries.…

Don't Overlook Cohere: Command A Reasoning, Its First Model for Enterprise Customer Service and More

I was in more meetings than usual today so I just caught up with the news that Cohere, the Canadian startup co-founded by former Transformer…

ByteDance, TikTok's parent, unveils new open-source Seed-OSS-36B model with 512K token context.

The company’s Seed Team of AI researchers today released Seed-OSS-36B on AI code sharing website Hugging Face. Seed-OSS-36B is a new line of open-source, large…

LLMs Produce 'Fluent Nonsense' When Operating Beyond Their Training Bounds

Researchers at Arizona State University have released a new study challenging the idea that Chain-of-Thought (CoT) reasoning in Large Language Models (LLMs) indicates genuine intelligence.…

Introducing DeepSeek V3.1: Possibly the Most Powerful Open AI to Date

Chinese AI startup DeepSeek surprised the global AI community with the release of its 685-billion parameter model, challenging American AI giants and reshaping the landscape…

Nvidia Unveils Nemotron-Nano-9B-v2: A Compact, Open Model with Toggle On/Off Reasoning

Small AI models are gaining traction. Following the release of an AI vision model small enough for smartwatches by MIT spinoff Liquid AI and another…

Hugging Face: 5 Ways Enterprises Can Reduce AI Costs Without Compromising Performance

Enterprises often believe AI models inherently need substantial computing power, leading them to seek out more resources. Sasha Luccioni from Hugging Face suggests a shift…

Designing Feedback Loops: Enhancing LLMs to Improve Continuously

Large language models (LLMs) excel at reasoning, generating, and automating, but transforming a demo into a sustainable product requires the system to learn from actual…

Tencent Introduces Versatile Open-Source Hunyuan AI Models for Various Industry Uses

Tencent has taken a significant step in the AI race by introducing its open-source Hunyuan…
