News Nug
all agent api update benchmark dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering rag research tool training tutorial workflow
Which tokens does a hybrid model predict better?
HuggingFace Blog · 1h ago · 5
Looking for arXiv endorsement (eess.AS or cs.SD) [R]
r/MachineLearning · 6h ago · 5
New sampler + verifier *drastically* improves tiny 0.5b model coding performance
r/LocalLLaMA · 6h ago · 5
I stopped trusting model benchmarks and started running my own eval set, here is what changed[D]
r/MachineLearning · 7h ago · 5
NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built from the Nemotron 3 Nano 30B-A3B backbone.
r/LocalLLaMA · 8h ago · 5
Any ideas for unconventional ML projects? [D]
r/MachineLearning · 12h ago · 5
Xperience-10M Download Help [D]
r/MachineLearning · 13h ago · 5
[AINews] It's Meta-Harness Summer
Latent Space · 15h ago · 5
How agents are transforming work
OpenAI Blog · 15h ago · 5
simonw/browser-compat-db
Simon Willison · 17h ago · 5
MuJoCo derived Simulator for High Fidelity Vision RL training natively on GPU [D]
r/MachineLearning · 22h ago · 5
Why the Frontier Ecosystem must be Open — Matei Zaharia and Reynold Xin, Databricks
Latent Space · 22h ago · 5
High Dimensional, Dynamic Rotary Positional Embedding [P]
r/MachineLearning · 22h ago · 5
Quoting Tom MacWright
Simon Willison · 23h ago · 5
Introducing computer use in Gemini 3.5 Flash
DeepMind Blog · 1d ago · 5
Find the best open-source OCR models in one place at Papers with Code [P]
r/MachineLearning · 1d ago · 5
I made a superhuman Generals.io agent with self-play RL [P]
r/MachineLearning · 1d ago · 5
Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel
HuggingFace Blog · 1d ago · 5
I did some model hacks, and got GLM5.2 from about 2.5 tok/s to >50 tok/s on my GH200 system.
r/LocalLLaMA · 1d ago · 5
I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]
r/MachineLearning · 1d ago · 5
<12345…65>