When OpenAI launched GPT-5 about two weeks ago, CEO Sam Altman promised it would be the company’s “smartest, fastest, most useful model yet.” Instead, the…
View More This website allows you to blind-test GPT-5 vs. GPT-4o—and the results may surprise youTag: benchmarks
Curious About Your Startup’s Value? Use the FREE SaaStr VC Valuation Calculator
Curious about how B2B + AI VCs value your start-up today? We’ve compiled all the data into a straightforward VC Valuation Calculator here. Try it…
View More Curious About Your Startup’s Value? Use the FREE SaaStr VC Valuation CalculatorOpenCUA’s Open Source Agents Compete with Proprietary Models from OpenAI and Anthropic
Researchers at The University of Hong Kong (HKU) and collaborators have developed OpenCUA, an open-source framework to create robust AI agents for computer operation. OpenCUA…
View More OpenCUA’s Open Source Agents Compete with Proprietary Models from OpenAI and AnthropicMCP-Universe Benchmark Reveals GPT-5 Fails Over 50% of Real-World Orchestration Tasks
The adoption of interoperability standards, like the Model Context Protocol (MCP), can offer enterprises deeper insights into how agents and models operate beyond their boundaries.…
View More MCP-Universe Benchmark Reveals GPT-5 Fails Over 50% of Real-World Orchestration TasksDon’t Overlook Cohere: Command A Reasoning, Its First Model for Enterprise Customer Service and More
I was in more meetings than usual today so I just caught up with the news that Cohere, the Canadian startup co-founded by former Transformer…
View More Don’t Overlook Cohere: Command A Reasoning, Its First Model for Enterprise Customer Service and MoreChan Zuckerberg Initiative’s rBio Trains AI Using Virtual Cells Instead of Lab Work
The Chan Zuckerberg Initiative announced the launch of rBio, an artificial intelligence model designed to understand cellular biology through virtual simulations instead of expensive lab…
View More Chan Zuckerberg Initiative’s rBio Trains AI Using Virtual Cells Instead of Lab WorkHow AI Startup Delphi Managed User Data and Scaled with Pinecone
Delphi, a San Francisco-based AI startup named after the Ancient Greek oracle, was grappling with a modern issue: its “Digital Minds,” personalized chatbots that embody…
View More How AI Startup Delphi Managed User Data and Scaled with PineconeHow to Conduct an Effective QBR (Quarterly Business Review)
**Dear SaaStr: How Do I Conduct an Effective Quarterly Business Review (QBR) with Customers?** QBRs have often lost their original value, becoming mere upsell sessions…
View More How to Conduct an Effective QBR (Quarterly Business Review)ByteDance, TikTok’s parent, unveils new open-source Seed-OSS-36B model with 512K token context.
The company’s Seed Team of AI researchers today released Seed-OSS-36B on AI code sharing website Hugging Face. Seed-OSS-36B is a new line of open-source, large…
View More ByteDance, TikTok’s parent, unveils new open-source Seed-OSS-36B model with 512K token context.Fluence Technology in Warsaw Secures €6.6 Million for Femtosecond Laser Systems
Fluence Technology, a company founded by four Polish physicists, has raised €6.6 million (PLN 28.1 million) in a seed funding round to enhance the global…
View More Fluence Technology in Warsaw Secures €6.6 Million for Femtosecond Laser Systems