An international research team has developed a new benchmark that reveals the current limitations of LLMs. Even the most advanced models fail at 90 percent of the tasks - for now. The test, called ...
Inside sources told the Financial Times that the Stargate AI infrastructure project will provide computing power exclusively to OpenAI. The project, announced earlier this week by OpenAI, SoftBank, ...
Donald Trump has eliminated his predecessor's AI safety regulations, creating a regulatory gap for artificial intelligence development in the United States. In one of his first moves as president, ...
A new study by OpenAI shows that AI models become more robust against manipulation attempts if they are given more time to "think". The researchers also discovered new methods of attack. A recent ...
OpenAI has just launched Operator, an AI assistant that can navigate the web on its own. The tool, currently only available to US ChatGPT Pro subscribers, represents a step toward AI assistants that ...
Perplexity is stepping into Google's territory with a new AI assistant for Android that can control apps and handle tasks on its own. The move puts the startup in direct competition with Google's ...
Chinese AI startup DeepSeek has released two new AI models that they say match OpenAI's o1 in performance. Along with their main models, DeepSeek-R1 and DeepSeek-R1-Zero, they've also launched six ...
OpenAI's involvement in funding FrontierMath, a leading AI math benchmark, only came to light when the company announced its record-breaking performance on the test. Now, the benchmark's developer ...
A team of researchers from NYU, MIT, and Google has found a way to improve AI-generated images by borrowing ideas from recent AI reasoning models like OpenAI's o1. Their approach enhances image ...
OpenAI's AI reasoning expert Noam Brown says there is "lots of vague AI hype" on social media. While acknowledging there are "good reasons to be optimistic" about AI progress, Brown emphasized that ...
While today's AI systems are typically trained once to handle various tasks like writing text and answering questions, they often struggle with new, unexpected challenges. Transformer² aims to solve ...
OpenAI is stepping into life sciences with a new LLM designed to optimize proteins. Early testing suggests the system might work better than human researchers at certain tasks. Working with startup ...