News Nug
all agent api update benchmark deployment eval fine tuning inference new model open source prompt engineering research tool workflow
ProductJun 30, 2026Introducing Claude Sonnet 5Sonnet 5 delivers frontier performance across coding, agents, and professional work at scale.
Anthropic Blog · 3d ago · 5
How to improve a 5-class Diabetic Retinopathy model (APTOS 2019) – Mixed predictions across classes[P]
r/MachineLearning · 3d ago · 5
ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration
HuggingFace Blog · 3d ago · 5
The AI Compass
Simon Willison · 3d ago · 5
Are all LLM research papers nowadays 100+ pages beasts?[D]
r/MachineLearning · 3d ago · 5
Have your agent record video demos of its work with shot-scraper video
Simon Willison · 3d ago · 5
Start building with Nano Banana 2 Lite and Gemini Omni Flash
DeepMind Blog · 3d ago · 5
shot-scraper 1.10
Simon Willison · 3d ago · 5
PageStorm: A Model Built for Creative Book Writing
r/LocalLLaMA · 3d ago · 5
Why Specialization Is Inevitable
HuggingFace Blog · 3d ago · 5
Well.. it's a step up from nonstop bot spam I guess
r/LocalLLaMA · 4d ago · 5
NEW on Hugging Face: Filter by hardware compatibility
r/LocalLLaMA · 4d ago · 5
A map of the latest 11 million papers split by semantic similarity and time slices [P]
r/MachineLearning · 4d ago · 5
Update on CVIL: the free CV interview prep checklist after landing my internship... just added Segmentation, OCR, and VLM sections [D]
r/MachineLearning · 4d ago · 5
How ChatGPT adoption has expanded
OpenAI Blog · 4d ago · 5
EACL 2027: Author response and author-reviewer discussion are now two separate stages and allow more time [D]
r/MachineLearning · 4d ago · 5
[AINews] not much happened today
Latent Space · 4d ago · 5
Core dump epidemiology: fixing an 18-year-old bug
OpenAI Blog · 4d ago · 5
Featuring Every Eval Ever Results on Hugging Face Model Pages
HuggingFace Blog · 4d ago · 5
Inside Genebench-Pro
OpenAI Blog · 4d ago · 5
<1…456…75>