Skip to main content

News Roundup 25

Created on August 14|Last edited on August 14
OpenAI releases SWE-bench Verified
OpenAI refines an existing coding benchmark to improve evaluation accuracy of AI in software engineering
Sakana reveals AI Scientist Agent
This new AI system automates research discovery, offering potential for significant scientific breakthroughs
TII unveils Falcon Mamba
A powerful new state space language model challenges traditional transformer-based architectures
Lumina-mGPT: a new paradigm for image generation?
Lumina-mGPT redefines image generation by avoiding diffusion, using a streamlined decoder-only architecture
xAI enters the chat with Grok 2
Elon Musk’s xAI releases Grok 2, enhancing capabilities in chat, coding, and reasoning