In January 2025, a Hangzhou-based AI lab called DeepSeek dropped a reasoning model that, by its own benchmarks, went ...
The real headline is what ZAYA1-8B was trained on: a full stack of AMD Instinct MI300 graphics processing units (GPUs), AMD's rival to Nvidia's GPUs.
OpenAI announced this week that one of its general-purpose reasoning models made a breakthrough that has grabbed the ...
By putting the weights of a highly capable, 33B-parameter agentic model in the hands of researchers and startups, Poolside is positioning itself as a cornerstone of the open-AI ecosystem.
Cohere, the trusted sovereign AI platform for governments and regulated industries worldwide, today announced the release of Command A+, a powerful open-source Mixture-of-Experts (MoE) model released ...
OpenAI has said that one of its unreleased AI reasoning models has solved a long-standing mathematics problem first proposed ...
DeepSeek says both models are more efficient and performant than DeepSeek V3.2 due to architectural improvements, and have almost "closed the gap" with current leading models, both open and closed, on ...
DeepSeek, the Chinese artificial intelligence startup that shook world markets last year, launched preview versions of its latest major update Friday as the AI rivalry between China and the U.S. heats ...
What if the fragmented world of open AI models could finally speak the same language? Sam Witteveen explores the newly introduced “Open Responses,” an open inference standard. Initiated ...
Open source robotics AI model MolmoAct2 from the Allen Institute for AI runs up to 37 times faster than its predecessor, ...
Chinese open models are spreading fast, from Hugging Face to Silicon Valley. Here’s why that matters. MIT Technology Review’s What’s Next series looks across industries, trends, and technologies to ...