News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering rag research tool training tutorial workflow
Hiding messages in the least significant mantissa bits of fine-tuned ONNX model weights [P]
r/MachineLearning · 1d ago · 5
We built a calibration-aware Q4_K_M quant of Qwen3.5 0.8B that recovers 96.5% of the BF16 gap vs pure llama.cpp Q4_K_M (SpectralQuant)
r/LocalLLaMA · 2d ago · 5
Using Local Coding Agents
Ahead of AI · 2d ago · 5
Orthrus (diffusion head) trained Qwen 3.5/3.6 and Gemma 4 models are dropping soon
r/LocalLLaMA · 2d ago · 5
Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]
r/MachineLearning · 2d ago · 5
Kicking off GPU Mode [D]
r/MachineLearning · 2d ago · 5
I silently break training codes or configs so I made pybench [P]
r/MachineLearning · 2d ago · 5
[AINews] OpenAI GPT-5.6 Sol / Terra / Luna — restricted to trusted partners
Latent Space · 2d ago · 5
Late Submission of NeurIPS Review [R]
r/MachineLearning · 2d ago · 5
Quoting Dean W. Ball
Simon Willison · 2d ago · 5
Quoting Timothy B. Lee
Simon Willison · 2d ago · 5
Nemotron-3-Super-120B-A12B (hybrid Mamba+MoE) holds perfect needle retrieval to 504K tokens on 4×3090
r/LocalLLaMA · 2d ago · 5
vulkan: make TP viable by pwilkin · Pull Request #25051 · ggml-org/llama.cpp
r/LocalLLaMA · 2d ago · 5
"What should I do?" - consider post-training
r/LocalLLaMA · 2d ago · 5
What happened after 2,000 people tried to hack my AI assistant
Simon Willison · 2d ago · 5
Incident Report: CVE-2026-LGTM
Simon Willison · 2d ago · 5
Quoting OpenAI
Simon Willison · 2d ago · 5
Jun 26, 2026Economic ResearchAnthropic Economic Index report: Cadences
Anthropic Research · 2d ago · 5
Why do people keep investing in Intel for AI?
r/LocalLLaMA · 2d ago · 5
A debugger for RL reward functions that detects reward hacking during training [P]
r/MachineLearning · 2d ago · 5
<12345…68>