News Nug
all agent api update benchmark dataset deployment eval fine tuning game dev hardware inference library monitoring new model open source optimization probe targeted prompt engineering rag research tool training tutorial workflow
Support Step3.5/3.7 flash mtp3 by forforever73 · Pull Request #24340 · ggml-org/llama.cpp
r/LocalLLaMA · 3d ago · 5
GLM-5.2 is on DeepSWE
r/LocalLLaMA · 4d ago · 5
We got local models to triage the OpenClaw repo for FREE!*
HuggingFace Blog · 4d ago · 5
Codex-maxxing for long-running work
OpenAI Blog · 4d ago · 5
sqlite-utils 4.0rc1 adds migrations and nested transactions
Simon Willison · 4d ago · 5
sqlite-utils 4.0rc1
Simon Willison · 4d ago · 5
Local LLM Inference Optimization: The Complete Guide
r/LocalLLaMA · 4d ago · 5
Samsung Electronics brings ChatGPT and Codex to employees
OpenAI Blog · 4d ago · 5
Temporary Cloudflare Accounts for AI agents
Simon Willison · 4d ago · 5
[ECCV 2026] Paper Decision Appeals Discussion [D]
r/MachineLearning · 4d ago · 5
An Update on Matrix Recurrent Units, an Attention Alternative [R]
r/MachineLearning · 4d ago · 5
Data-centric debugging for teams training neural nets [P]
r/MachineLearning · 4d ago · 5
Best current methods for finetuning whisper on domain specific vocabulary? [P]
r/MachineLearning · 4d ago · 5
EMA on LoRA ? [R]
r/MachineLearning · 4d ago · 5
A slightly improved DVD-JEPA demo [P]
r/MachineLearning · 4d ago · 5
[Exclusive] $250 off AI Engineer tix til Monday
Latent Space · 4d ago · 5
Tokenomics
r/LocalLLaMA · 4d ago · 5
When I can start applying for job[D]
r/MachineLearning · 4d ago · 5
8-16 MI50s Minimax M3 @19 tps TG (peak)
r/LocalLLaMA · 4d ago · 5
I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]
r/MachineLearning · 4d ago · 5
<1…456…66>