Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of ...
The new AI model uses diffusion reasoning to generate 1,000 tokens per second and runs about 5x faster than Haiku; speed limits are ...
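The throughput claim comes from how diffusion LLMs decode: instead of emitting one token at a time autoregressively, they start from a fully masked sequence and unmask many positions per denoising step. Below is a toy sketch of that decoding loop with a random stand-in scorer in place of a real model; none of the names or numbers come from Inception's release.

```python
import random

MASK = "<mask>"
VOCAB = ["the", "cat", "sat", "on", "a", "mat", "."]

def toy_denoiser(seq):
    """Stand-in for the model: propose a token and a confidence for each masked slot."""
    return {i: (random.choice(VOCAB), random.random())
            for i, tok in enumerate(seq) if tok == MASK}

def diffusion_decode(length=8, steps=4):
    seq = [MASK] * length
    for step in range(steps):
        proposals = toy_denoiser(seq)
        if not proposals:
            break
        # Commit the most confident fraction of masked positions in parallel each step,
        # rather than a single next token as an autoregressive decoder would.
        k = max(1, len(proposals) // (steps - step))
        for i, (tok, _) in sorted(proposals.items(),
                                  key=lambda kv: kv[1][1], reverse=True)[:k]:
            seq[i] = tok
    return seq

print(diffusion_decode())
```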
Nvidia researchers developed dynamic memory sparsification (DMS), a technique that compresses the KV cache in large language models by up to 8x while maintaining reasoning accuracy — and it can be ...
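The snippet doesn't describe how DMS itself works, but the general idea of trading KV-cache memory for accuracy can be illustrated with a generic importance-based eviction sketch; the 8x ratio and the attention-mass heuristic below are illustrative assumptions, not Nvidia's method.

```python
import numpy as np

def compress_kv_cache(keys, values, attn_weights, ratio=8):
    """Keep only the 1/ratio cached positions that received the most attention.

    keys, values: (seq_len, head_dim) arrays for one attention head.
    attn_weights: (num_queries, seq_len) attention probabilities observed so far.
    Generic eviction sketch for illustration only, not the DMS algorithm.
    """
    seq_len = keys.shape[0]
    keep = max(1, seq_len // ratio)
    importance = attn_weights.sum(axis=0)          # total attention each position received
    idx = np.sort(np.argsort(importance)[-keep:])  # most-attended positions, kept in order
    return keys[idx], values[idx]

# Example: compress a 64-entry cache down to 8 entries (8x).
rng = np.random.default_rng(0)
k, v = rng.normal(size=(64, 128)), rng.normal(size=(64, 128))
attn = rng.random((16, 64))
k_small, v_small = compress_kv_cache(k, v, attn)
print(k_small.shape)  # (8, 128)
```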
Google LLC today made Gemini 2.5 Pro, an advanced large language model it debuted last month, available in public preview. Until now, the LLM was accessible through a free application programming ...
Researchers from the University of Maryland, Lawrence Livermore, Columbia and TogetherAI have developed a training technique that triples LLM inference speed without auxiliary models or infrastructure ...
With reported 3x speed gains and limited degradation in output quality, the method targets one of the biggest pain points in production AI systems: latency at scale.
XDA Developers on MSN
You're using your local LLM wrong if you're prompting it like a cloud LLM
Local models work best when you meet them halfway ...
The third entrant is the most unusual. BharatGen is led by IIT Bombay and backed by the IndiaAI Mission to the tune of Rs. 900 crore, making it the largest single beneficiary of government AI funding ...
SAN FRANCISCO--(BUSINESS WIRE)-- Writer, the leader in enterprise generative AI, today released its newest and most advanced foundation model, Palmyra X5. The state-of-the-art adaptive reasoning model ...
A 9-language interface and LLM Selector expand global accessibility while giving enterprises greater control over AI ...
XDA Developers on MSN
I didn't think a local LLM could work this well for research, but LM Studio proved me wrong
A local LLM makes better sense for serious work ...
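One reason a tool like LM Studio works for research use is that it exposes an OpenAI-compatible server for whatever model is loaded, so scripts can query it like any hosted API. A minimal sketch, assuming the server is running on its default localhost:1234; the model name is a placeholder for whichever model you have loaded.

```python
import requests

# LM Studio serves an OpenAI-compatible API locally (default port 1234).
# "llama-3.1-8b-instruct" is a placeholder; substitute the model you've loaded.
resp = requests.post(
    "http://localhost:1234/v1/chat/completions",
    json={
        "model": "llama-3.1-8b-instruct",
        "messages": [
            {"role": "system", "content": "You are a concise research assistant."},
            {"role": "user", "content": "Summarize the main arguments for local LLMs."},
        ],
        "temperature": 0.2,
    },
    timeout=120,
)
print(resp.json()["choices"][0]["message"]["content"])
```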