News Nug
all agent api update benchmark business cuda dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering quantization rag research security tool training tutorial workflow
Top-N-Sigma: Remove unconditional softmax+sort by TimNN · Pull Request #22645 · ggml-org/llama.cpp
r/LocalLLaMA · 1d ago · 5
Chinese Hackers Latest Masterpiece with NVIDIA
r/LocalLLaMA · 1d ago · 5
Some new updates to Papers with Code [P]
r/MachineLearning · 1d ago · 5
PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters
HuggingFace Blog · 1d ago · 5
Gemma 4 QAT 31B responds better to KV cache quantization too
r/LocalLLaMA · 1d ago · 5
Patch the Planet: a Daybreak initiative to support open source maintainers
OpenAI Blog · 1d ago · 5
Daybreak: Tools for securing every organization in the world
OpenAI Blog · 1d ago · 5
Support Step3.5/3.7 flash mtp3 by forforever73 · Pull Request #24340 · ggml-org/llama.cpp
r/LocalLLaMA · 1d ago · 5
GLM-5.2 is on DeepSWE
r/LocalLLaMA · 1d ago · 5
We got local models to triage the OpenClaw repo for FREE!*
HuggingFace Blog · 2d ago · 5
Codex-maxxing for long-running work
OpenAI Blog · 2d ago · 5
sqlite-utils 4.0rc1 adds migrations and nested transactions
Simon Willison · 2d ago · 5
sqlite-utils 4.0rc1
Simon Willison · 2d ago · 5
Local LLM Inference Optimization: The Complete Guide
r/LocalLLaMA · 2d ago · 5
Samsung Electronics brings ChatGPT and Codex to employees
OpenAI Blog · 2d ago · 5
Temporary Cloudflare Accounts for AI agents
Simon Willison · 2d ago · 5
[ECCV 2026] Paper Decision Appeals Discussion [D]
r/MachineLearning · 2d ago · 5
An Update on Matrix Recurrent Units, an Attention Alternative [R]
r/MachineLearning · 2d ago · 5
Data-centric debugging for teams training neural nets [P]
r/MachineLearning · 2d ago · 5
Best current methods for finetuning whisper on domain specific vocabulary? [P]
r/MachineLearning · 2d ago · 5
<12345…63>