large language models (LLMs)

Google’s new diffusion AI agent mimics human writing to improve enterprise research

admin August 6, 2025

A diffusion model inspired by the human process of drafting, searching for information, and making iterative revisions.Read...

‘Subliminal learning’: Anthropic uncovers how AI fine-tuning secretly teaches bad habits

admin July 30, 2025

A common AI fine-tuning practice could be unintentionally poisoning your models with hidden biases and risks, a...

New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples

admin July 25, 2025

Hierarchical Reasoning Models (HRM) tackle complex reasoning tasks while being smaller, faster, and more data-efficient than large...

Mixture-of-recursions delivers 2x faster inference—Here’s how to implement it

admin July 23, 2025

Mixture-of-Recursions (MoR) is a new AI architecture that promises to cut LLM inference costs and memory use...

Open-source MCPEval makes protocol-level agent testing plug-and-play

admin July 22, 2025

Researchers from Salesforce unveiled MCPEval, a new method to evaluate AI agent performance and tool use within...

New embedding model leaderboard shakeup: Google takes #1 while Alibaba’s open source alternative closes gap

admin July 19, 2025

Google’s new Gemini Embedding model now leads the MTEB benchmark. But it is facing fierce competition from...

Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems

admin July 16, 2025

A DeepMind study finds LLMs are both stubborn and easily swayed. This confidence paradox has key implications...

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

admin July 11, 2025

A new AI model learns to “think” longer on hard problems, achieving more robust reasoning and better...

New 1.5B router model achieves 93% accuracy without costly retraining

admin July 7, 2025

Katanemo Labs’ new LLM routing framework aligns with human preferences and adapts to new models without retraining.Read...

Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%

admin July 3, 2025

Sakana AI’s new inference-time scaling technique uses Monte-Carlo Tree Search to orchestrate multiple LLMs to collaborate on...

Google’s new diffusion AI agent mimics human writing to improve enterprise research

‘Subliminal learning’: Anthropic uncovers how AI fine-tuning secretly teaches bad habits

New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples

Mixture-of-recursions delivers 2x faster inference—Here’s how to implement it

Open-source MCPEval makes protocol-level agent testing plug-and-play

New embedding model leaderboard shakeup: Google takes #1 while Alibaba’s open source alternative closes gap

Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

New 1.5B router model achieves 93% accuracy without costly retraining

Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%

You may have missed

‘Stranger Things’ Lets It Rip to Kick Off Its Final Season

A Long-Lost Chapter of Quentin Tarantino’s ‘Kill Bill’ Is Coming to… ‘Fortnite’?

‘Magic: The Gathering’ Is Scrapping Its ‘Monster Hunter’ Crossover and Starting Over

How the ‘Sinners’ Costume Designer Helped Wunmi Mosaku Shape the Movie’s Secret MVP