News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering research tool training tutorial workflow
China Has Matched Anthropic in Cybersecurity, Resetting AI Race
r/LocalLLaMA · 3d ago · 5
A lot of good M5 Max options available at Apple Refurbished
r/LocalLLaMA · 3d ago · 5
HP Inc. launches Frontier strategic partnership with OpenAI
OpenAI Blog · 3d ago · 5
Evaluating long-term memory limits in stateless LLM chatbots — feedback needed [D]
r/MachineLearning · 3d ago · 5
DeepSpec - a deepseek-ai Collection
r/LocalLLaMA · 3d ago · 5
DFlash support merged into llama.cpp
r/LocalLLaMA · 3d ago · 5
I shrank a transformer until every number fitted on the screen and made the weights editable [R]
r/MachineLearning · 3d ago · 5
NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]
r/MachineLearning · 3d ago · 5
Finally.. my rig is maxed out
r/LocalLLaMA · 3d ago · 5
Koboldcpp v1.116 released
r/LocalLLaMA · 3d ago · 5
Do we still need to study algorithms now that AI writes most of our code? [D]
r/MachineLearning · 4d ago · 5
Benchmarking Self-Hosted Gemma 2 9B vs. Frontier APIs: The FP8 Quantization Prefill Tax and VRAM Realities on an NVIDIA L4 [P]
r/MachineLearning · 4d ago · 5
MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]
r/MachineLearning · 4d ago · 5
Even Google still believes in small models for coding.
r/LocalLLaMA · 4d ago · 5
Built an LLM training framework that actually runs on older GPUs without crashing [P]
r/MachineLearning · 4d ago · 5
Hiding messages in the least significant mantissa bits of fine-tuned ONNX model weights [P]
r/MachineLearning · 4d ago · 5
We built a calibration-aware Q4_K_M quant of Qwen3.5 0.8B that recovers 96.5% of the BF16 gap vs pure llama.cpp Q4_K_M (SpectralQuant)
r/LocalLLaMA · 4d ago · 5
Using Local Coding Agents
Ahead of AI · 4d ago · 5
Orthrus (diffusion head) trained Qwen 3.5/3.6 and Gemma 4 models are dropping soon
r/LocalLLaMA · 4d ago · 5
Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]
r/MachineLearning · 4d ago · 5
<1…456…72>