A diffusion model inspired by the human process of drafting, searching for information, and making iterative revisions.Read...
large language models (LLMs)
Auto Added by WPeMatico
A common AI fine-tuning practice could be unintentionally poisoning your models with hidden biases and risks, a...
Hierarchical Reasoning Models (HRM) tackle complex reasoning tasks while being smaller, faster, and more data-efficient than large...
Mixture-of-Recursions (MoR) is a new AI architecture that promises to cut LLM inference costs and memory use...
Researchers from Salesforce unveiled MCPEval, a new method to evaluate AI agent performance and tool use within...
Google’s new Gemini Embedding model now leads the MTEB benchmark. But it is facing fierce competition from...
A DeepMind study finds LLMs are both stubborn and easily swayed. This confidence paradox has key implications...
A new AI model learns to “think” longer on hard problems, achieving more robust reasoning and better...
Katanemo Labs’ new LLM routing framework aligns with human preferences and adapts to new models without retraining.Read...
Sakana AI’s new inference-time scaling technique uses Monte-Carlo Tree Search to orchestrate multiple LLMs to collaborate on...