The Hype Flow

Supervised fine-tuning (SFT)

QwenLong-L1 solves long-context reasoning challenge that stumps current LLMs

admin May 30, 2025

Alibaba’s QwenLong-L1 helps LLMs deeply understand long documents, unlocking advanced reasoning for practical enterprise applications.

30 seconds vs. 3: The d1 reasoning framework that’s slashing AI response times

admin April 28, 2025

The d1 framework boosts diffusion LLMs with novel reinforcement learning, unlocking efficient, problem-solving AI possibilities.

Less supervision, better results: Study shows AI models generalize more effectively on their own

admin February 12, 2025

Training LLMs and VLMs through reinforcement learning delivers better results than using hand-crafted examples.