News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering rag research tool training tutorial workflow
Orthrus (diffusion head) trained Qwen 3.5/3.6 and Gemma 4 models are dropping soon
r/LocalLLaMA · 1d ago · 5
Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]
r/MachineLearning · 1d ago · 5
Kicking off GPU Mode [D]
r/MachineLearning · 1d ago · 5
I silently break training codes or configs so I made pybench [P]
r/MachineLearning · 1d ago · 5
[AINews] OpenAI GPT-5.6 Sol / Terra / Luna — restricted to trusted partners
Latent Space · 2d ago · 5
Late Submission of NeurIPS Review [R]
r/MachineLearning · 2d ago · 5
Quoting Dean W. Ball
Simon Willison · 2d ago · 5
Quoting Timothy B. Lee
Simon Willison · 2d ago · 5
Nemotron-3-Super-120B-A12B (hybrid Mamba+MoE) holds perfect needle retrieval to 504K tokens on 4×3090
r/LocalLLaMA · 2d ago · 5
vulkan: make TP viable by pwilkin · Pull Request #25051 · ggml-org/llama.cpp
r/LocalLLaMA · 2d ago · 5
"What should I do?" - consider post-training
r/LocalLLaMA · 2d ago · 5
What happened after 2,000 people tried to hack my AI assistant
Simon Willison · 2d ago · 5
Incident Report: CVE-2026-LGTM
Simon Willison · 2d ago · 5
Quoting OpenAI
Simon Willison · 2d ago · 5
Jun 26, 2026Economic ResearchAnthropic Economic Index report: Cadences
Anthropic Research · 2d ago · 5
Why do people keep investing in Intel for AI?
r/LocalLLaMA · 2d ago · 5
A debugger for RL reward functions that detects reward hacking during training [P]
r/MachineLearning · 2d ago · 5
Live Continual Learning in Machine Learning [D]
r/MachineLearning · 2d ago · 5
What if context compression is a diffusion noise function? Proposal + honest results from untrained-model experiments [R]
r/MachineLearning · 2d ago · 5
Previewing GPT-5.6 Sol: a next-generation model
OpenAI Blog · 2d ago · 5
<12345…68>