News Nug
Tokenomics
r/LocalLLaMA · 4d ago · 5
When can I start applying for jobs? [D]
r/MachineLearning · 4d ago · 5
8-16 MI50s Minimax M3 @19 tps TG (peak)
r/LocalLLaMA · 4d ago · 5
I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]
r/MachineLearning · 4d ago · 5
Gemma 4 QAT seems to respond significantly better to KV cache quantization
r/LocalLLaMA · 4d ago · 5
Vercel CEO: "Almost shocked" by how good GLM-5.2 is at coding
r/LocalLLaMA · 4d ago · 5
[NEW MODEL] SupraLabs started the Any2Any model family!
r/LocalLLaMA · 4d ago · 5
Python packages for particle swarms, genetic algorithms. Scikit-opt maybe? [D]
r/MachineLearning · 4d ago · 5
Six months ago I turned down $8,165 for an RTX 6000 PRO. Today the same vendor is selling them for $11,575. Oh, hindsight.
r/LocalLLaMA · 4d ago · 5
Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]
r/MachineLearning · 5d ago · 5
TSAuditor: A time-series auditing framework [P]
r/MachineLearning · 5d ago · 5
Hi Reddit, I posted my Build Your Own LLM workshop to YouTube, teaching ML, LLM and math intuition [P]
r/MachineLearning · 5d ago · 5
Would you let an ML PhD student graduate without a top-tier paper? [D]
r/MachineLearning · 5d ago · 5
z.AI as the number 2 gives praise to the number 1 open source model
r/LocalLLaMA · 5d ago · 5
[GLM 5.2 UD IQ2_M] That's the best pelican svg image I have ever seen
r/LocalLLaMA · 5d ago · 5
An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P]
r/MachineLearning · 5d ago · 5
DVD-JEPA: an open-source, fully-reproducible JEPA world model [P]
r/MachineLearning · 5d ago · 5
Time Series Modeling Needs a Dynamical Systems Perspective [R]
r/MachineLearning · 5d ago · 5
Built a Global AQ (PM2.5) Forecaster ML Model [P]
r/MachineLearning · 5d ago · 5
[AINews] not much happened today
Latent Space · 5d ago · 5