News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering research tool training tutorial workflow
Documented: Weight-Level Political Conditioning in Large Language Models - A Case Study in AI Bias on the Gaza Genocide Question Conditioning in Large Language Models [R]
r/MachineLearning · 4d ago · 5
For ECCV, Springer Metor. How are we supposed to upload the files? [D]
r/MachineLearning · 4d ago · 5
AI and Liability
Simon Willison · 4d ago · 5
US Govt to individually approve who gets GPT 5.6.
r/LocalLLaMA · 4d ago · 5
Kuma: compiling PyTorch models into self-contained WebGPU executables [P]
r/MachineLearning · 4d ago · 5
Dev Log on Steam Recommender[P]
r/MachineLearning · 4d ago · 5
Report: Apple to skip M6 Pro/Max chips, fast-track M7 for local AI
r/LocalLLaMA · 4d ago · 5
[R] All Routes Lead to Collapse: attention sinks, representation collapse, and norm stratification are what content-based routing does under a norm-blind metric
r/MachineLearning · 4d ago · 5
ECCV 2026 camera-ready deadline: June 27 or June 30? [D]
r/MachineLearning · 4d ago · 5
Would having a dedicated programming language specifically for LLMs be a viable solution? [D]
r/MachineLearning · 4d ago · 5
[R] Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost
r/MachineLearning · 4d ago · 5
datasette-export-database 0.3a2
Simon Willison · 5d ago · 5
Does ML background help or hurt when applying for security roles [D]
r/MachineLearning · 5d ago · 5
Which tokens does a hybrid model predict better?
HuggingFace Blog · 5d ago · 5
Optimising LMAPF guidance graphs using Evolutionary algorithms: Advice needed [R]
r/MachineLearning · 5d ago · 5
CALHippo - Mapping neurons and glial cells in the human brain hippocampus in 3D using SOTA segmentation and density estimation models [R]
r/MachineLearning · 5d ago · 5
Looking for arXiv endorsement (eess.AS or cs.SD) [R]
r/MachineLearning · 5d ago · 5
New sampler + verifier *drastically* improves tiny 0.5b model coding performance
r/LocalLLaMA · 5d ago · 5
I stopped trusting model benchmarks and started running my own eval set, here is what changed[D]
r/MachineLearning · 5d ago · 5
NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built from the Nemotron 3 Nano 30B-A3B backbone.
r/LocalLLaMA · 5d ago · 5
<1…456…70>