News Nug
all agent api update benchmark business cuda dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering quantization rag research security tool training tutorial workflow
EMA on LoRA ? [R]
r/MachineLearning · 1d ago · 5
A slightly improved DVD-JEPA demo [P]
r/MachineLearning · 1d ago · 5
[Exclusive] $250 off AI Engineer tix til Monday
Latent Space · 1d ago · 5
Tokenomics
r/LocalLLaMA · 1d ago · 5
When I can start applying for job[D]
r/MachineLearning · 1d ago · 5
8-16 MI50s Minimax M3 @19 tps TG (peak)
r/LocalLLaMA · 1d ago · 5
I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]
r/MachineLearning · 1d ago · 5
Gemma 4 QAT seems to respond significantly better to KV cache quantization
r/LocalLLaMA · 1d ago · 5
Vercel CEO: "Almost shocked" by how good GLM-5.2 is at coding
r/LocalLLaMA · 1d ago · 5
[NEW MODEL] SupraLabs started the Any2Any model family!
r/LocalLLaMA · 1d ago · 5
Python packages for particle swarms, genetic algorithms. Scikit-opt maybe? [D]
r/MachineLearning · 2d ago · 5
Six months ago I turned down $8,165 for an RTX 6000 PRO. Today the same vendor is selling them for $11,575. Oh, hindsight.
r/LocalLLaMA · 2d ago · 5
Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]
r/MachineLearning · 2d ago · 5
TSAuditor: A time-series auditing framework [P]
r/MachineLearning · 2d ago · 5
Hi Reddit, I posted my Build Your Own LLM workshop to Youtube teaching ML, LLM and math intuition [P]
r/MachineLearning · 2d ago · 5
Would you let an ML PhD student graduate without a top-tier paper? [D]
r/MachineLearning · 2d ago · 5
z.AI as the number 2 gives praise to the number 1 open source model
r/LocalLLaMA · 2d ago · 5
[GLM 5.2 UD IQ2_M] That's the best pelican svg image I have ever seen
r/LocalLLaMA · 2d ago · 5
An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P]
r/MachineLearning · 2d ago · 5
DVD-JEPA: an open-source, fully-reproducible JEPA world model [P]
r/MachineLearning · 2d ago · 5
<12345…62>