News Nug
all agent api update benchmark business cuda dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering quantization rag research security tool training tutorial workflow
Local LLM Inference Optimization: The Complete Guide
r/LocalLLaMA · 1d ago · 5
Samsung Electronics brings ChatGPT and Codex to employees
OpenAI Blog · 1d ago · 5
Temporary Cloudflare Accounts for AI agents
Simon Willison · 1d ago · 5
[ECCV 2026] Paper Decision Appeals Discussion [D]
r/MachineLearning · 1d ago · 5
An Update on Matrix Recurrent Units, an Attention Alternative [R]
r/MachineLearning · 1d ago · 5
Data-centric debugging for teams training neural nets [P]
r/MachineLearning · 1d ago · 5
Best current methods for finetuning whisper on domain specific vocabulary? [P]
r/MachineLearning · 1d ago · 5
EMA on LoRA ? [R]
r/MachineLearning · 1d ago · 5
A slightly improved DVD-JEPA demo [P]
r/MachineLearning · 1d ago · 5
[Exclusive] $250 off AI Engineer tix til Monday
Latent Space · 1d ago · 5
Tokenomics
r/LocalLLaMA · 1d ago · 5
When I can start applying for job[D]
r/MachineLearning · 1d ago · 5
8-16 MI50s Minimax M3 @19 tps TG (peak)
r/LocalLLaMA · 1d ago · 5
I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]
r/MachineLearning · 1d ago · 5
Gemma 4 QAT seems to respond significantly better to KV cache quantization
r/LocalLLaMA · 1d ago · 5
Vercel CEO: "Almost shocked" by how good GLM-5.2 is at coding
r/LocalLLaMA · 1d ago · 5
[NEW MODEL] SupraLabs started the Any2Any model family!
r/LocalLLaMA · 2d ago · 5
Python packages for particle swarms, genetic algorithms. Scikit-opt maybe? [D]
r/MachineLearning · 2d ago · 5
Six months ago I turned down $8,165 for an RTX 6000 PRO. Today the same vendor is selling them for $11,575. Oh, hindsight.
r/LocalLLaMA · 2d ago · 5
Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]
r/MachineLearning · 2d ago · 5
<12345…62>