News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering rag research tool training tutorial workflow
NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]
r/MachineLearning · 14h ago · 5
Koboldcpp v1.116 released
r/LocalLLaMA · 16h ago · 5
Do we still need to study algorithms now that AI writes most of our code? [D]
r/MachineLearning · 20h ago · 5
Benchmarking Self-Hosted Gemma 2 9B vs. Frontier APIs: The FP8 Quantization Prefill Tax and VRAM Realities on an NVIDIA L4 [P]
r/MachineLearning · 20h ago · 5
MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]
r/MachineLearning · 22h ago · 5
Even Google still believes in small models for coding.
r/LocalLLaMA · 23h ago · 5
Built an LLM training framework that actually runs on older GPUs without crashing [P]
r/MachineLearning · 1d ago · 5
Hiding messages in the least significant mantissa bits of fine-tuned ONNX model weights [P]
r/MachineLearning · 1d ago · 5
We built a calibration-aware Q4_K_M quant of Qwen3.5 0.8B that recovers 96.5% of the BF16 gap vs pure llama.cpp Q4_K_M (SpectralQuant)
r/LocalLLaMA · 1d ago · 5
Using Local Coding Agents
Ahead of AI · 1d ago · 5
Orthrus (diffusion head) trained Qwen 3.5/3.6 and Gemma 4 models are dropping soon
r/LocalLLaMA · 1d ago · 5
Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]
r/MachineLearning · 1d ago · 5
Kicking off GPU Mode [D]
r/MachineLearning · 1d ago · 5
I silently break training codes or configs so I made pybench [P]
r/MachineLearning · 1d ago · 5
[AINews] OpenAI GPT-5.6 Sol / Terra / Luna — restricted to trusted partners
Latent Space · 1d ago · 5
Late Submission of NeurIPS Review [R]
r/MachineLearning · 1d ago · 5
Quoting Dean W. Ball
Simon Willison · 1d ago · 5
Quoting Timothy B. Lee
Simon Willison · 1d ago · 5
Nemotron-3-Super-120B-A12B (hybrid Mamba+MoE) holds perfect needle retrieval to 504K tokens on 4×3090
r/LocalLLaMA · 1d ago · 5
vulkan: make TP viable by pwilkin · Pull Request #25051 · ggml-org/llama.cpp
r/LocalLLaMA · 1d ago · 5
<12345…68>