News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering research tool training tutorial workflow
Do we still need to study algorithms now that AI writes most of our code? [D]
r/MachineLearning · 2d ago · 5
Benchmarking Self-Hosted Gemma 2 9B vs. Frontier APIs: The FP8 Quantization Prefill Tax and VRAM Realities on an NVIDIA L4 [P]
r/MachineLearning · 2d ago · 5
MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]
r/MachineLearning · 2d ago · 5
Even Google still believes in small models for coding.
r/LocalLLaMA · 2d ago · 5
Built an LLM training framework that actually runs on older GPUs without crashing [P]
r/MachineLearning · 2d ago · 5
Hiding messages in the least significant mantissa bits of fine-tuned ONNX model weights [P]
r/MachineLearning · 2d ago · 5
We built a calibration-aware Q4_K_M quant of Qwen3.5 0.8B that recovers 96.5% of the BF16 gap vs pure llama.cpp Q4_K_M (SpectralQuant)
r/LocalLLaMA · 3d ago · 5
Using Local Coding Agents
Ahead of AI · 3d ago · 5
Orthrus (diffusion head) trained Qwen 3.5/3.6 and Gemma 4 models are dropping soon
r/LocalLLaMA · 3d ago · 5
Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]
r/MachineLearning · 3d ago · 5
Kicking off GPU Mode [D]
r/MachineLearning · 3d ago · 5
I silently break training codes or configs so I made pybench [P]
r/MachineLearning · 3d ago · 5
[AINews] OpenAI GPT-5.6 Sol / Terra / Luna — restricted to trusted partners
Latent Space · 3d ago · 5
Late Submission of NeurIPS Review [R]
r/MachineLearning · 3d ago · 5
Quoting Dean W. Ball
Simon Willison · 3d ago · 5
Quoting Timothy B. Lee
Simon Willison · 3d ago · 5
Nemotron-3-Super-120B-A12B (hybrid Mamba+MoE) holds perfect needle retrieval to 504K tokens on 4×3090
r/LocalLLaMA · 3d ago · 5
vulkan: make TP viable by pwilkin · Pull Request #25051 · ggml-org/llama.cpp
r/LocalLLaMA · 3d ago · 5
"What should I do?" - consider post-training
r/LocalLLaMA · 3d ago · 5
What happened after 2,000 people tried to hack my AI assistant
Simon Willison · 3d ago · 5
<12345…69>