Reward models holding back AI? DeepSeek’s SPCT creates self-guiding critiques, promising more scalable intelligence for enterprise LLMs.
research
While DeepSeek R1 and OpenAI o1 edge out Behemoth on a couple of metrics, Llama 4 Behemoth remains...
CoTools uses hidden states and in-context learning to enable LLMs to use more than 1,000 tools very...
Researchers from Singapore Management University developed a new domain-specific language for agents to remain reliable.
The researchers compared two versions of OLMo-1b: one pre-trained on 2.3 trillion tokens and another on 3...
METASCALE uses a three-stage approach to dynamically choose the right reasoning technique for each problem.
With multiple sampling and self-verification, Gemini 1.5 Pro can outperform o1-preview in reasoning tasks.
SEARCH-R1 trains LLMs to gradually think and conduct online search as they generate answers for reasoning problems.
Chain-of-experts chains LLM experts in a sequence, outperforming mixture-of-experts (MoE) with lower memory and compute costs.
A-MEM uses embeddings and LLMs to create dynamic memory notes that automatically link to create complex knowledge...