News Nug
all agent api update architecture benchmark business cuda dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering quantization rag research security tool training tutorial workflow
[Exclusive] $250 off AI Engineer tix til Monday
Latent Space · 2h ago · 5
When I can start applying for job[D]
r/MachineLearning · 5h ago · 5
I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]
r/MachineLearning · 6h ago · 5
Gemma 4 QAT seems to respond significantly better to KV cache quantization
r/LocalLLaMA · 8h ago · 5
Vercel CEO: "Almost shocked" by how good GLM-5.2 is at coding
r/LocalLLaMA · 9h ago · 5
Python packages for particle swarms, genetic algorithms. Scikit-opt maybe? [D]
r/MachineLearning · 19h ago · 5
Six months ago I turned down $8,165 for an RTX 6000 PRO. Today the same vendor is selling them for $11,575. Oh, hindsight.
r/LocalLLaMA · 21h ago · 5
Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]
r/MachineLearning · 1d ago · 5
TSAuditor: A time-series auditing framework [P]
r/MachineLearning · 1d ago · 5
Hi Reddit, I posted my Build Your Own LLM workshop to Youtube teaching ML, LLM and math intuition [P]
r/MachineLearning · 1d ago · 5
Would you let an ML PhD student graduate without a top-tier paper? [D]
r/MachineLearning · 1d ago · 5
z.AI as the number 2 gives praise to the number 1 open source model
r/LocalLLaMA · 1d ago · 5
[GLM 5.2 UD IQ2_M] That's the best pelican svg image I have ever seen
r/LocalLLaMA · 1d ago · 5
An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P]
r/MachineLearning · 1d ago · 5
DVD-JEPA: an open-source, fully-reproducible JEPA world model [P]
r/MachineLearning · 1d ago · 5
Time Series Modeling Needs a Dynamical Systems Perspective [R]
r/MachineLearning · 1d ago · 5
Built a Global AQ (PM2.5) Forecaster ML Model [P]
r/MachineLearning · 1d ago · 5
[AINews] not much happened today
Latent Space · 1d ago · 5
how to access books3 dataset for research purposes? [R]
r/MachineLearning · 1d ago · 5
RTX 5090 MSI, only inference or training at 475-500W. Make sure to not bend you cable!
r/LocalLLaMA · 1d ago · 5
<12345…61>