News Nug
all agent api update benchmark dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering rag research tool training tutorial workflow
High Dimensional, Dynamic Rotary Positional Embedding [P]
r/MachineLearning · 1d ago · 5
Quoting Tom MacWright
Simon Willison · 1d ago · 5
Introducing computer use in Gemini 3.5 Flash
DeepMind Blog · 1d ago · 5
Find the best open-source OCR models in one place at Papers with Code [P]
r/MachineLearning · 1d ago · 5
I made a superhuman Generals.io agent with self-play RL [P]
r/MachineLearning · 1d ago · 5
Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel
HuggingFace Blog · 1d ago · 5
I did some model hacks, and got GLM5.2 from about 2.5 tok/s to >50 tok/s on my GH200 system.
r/LocalLLaMA · 1d ago · 5
I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]
r/MachineLearning · 1d ago · 5
Could it be that there aren’t really any medical LLM APIs available right now? [D]
r/MachineLearning · 1d ago · 5
[AINews] Claude Tag: Multiplayer, Proactive, Persistent Agents in Slack
Latent Space · 1d ago · 5
OpenAI and Broadcom unveil LLM-optimized inference chip
OpenAI Blog · 1d ago · 5
Speaking of those chinese chips... "Chinese supercomputer displaces US machines as world's fastest for first time since 2017"
r/LocalLLaMA · 1d ago · 5
Seems this community might have missed it: Bill that would mandate AI chip location tracking gains industry support | Half a dozen companies have come out in support of the Chip Security Act, which would require location-tracking mechanisms for America’s most advanced computing chips.
r/LocalLLaMA · 1d ago · 5
DeepSWE: new benchmark looking at how well today's frontier models can actually write code [R]
r/MachineLearning · 1d ago · 5
Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World
HuggingFace Blog · 1d ago · 5
Mimo 2.5 is _fast_ at large context (dual RTX Pro 6000)
r/LocalLLaMA · 1d ago · 5
datasette 1.0a35
Simon Willison · 2d ago · 5
Will I be desk rejected for this[R]
r/MachineLearning · 2d ago · 5
MiniMax2.7 @47tg 1200pp
r/LocalLLaMA · 2d ago · 5
SpaceX Colossus deal 🚀, GPT-5.5 Cyber launch 🛡️, Codex as workspace 🤖
TLDR AI · 2d ago · 5
<12345…65>