News Nug
all agent api update benchmark deployment eval fine tuning hardware inference library monitoring new model open source optimization prompt engineering research tool training tutorial workflow
AnnouncementsJun 30, 2026Claude Science, an AI workbench for scientists, is now availableClaude Science is a customizable app that integrates the tools and packages researchers most often use, produces auditable artifacts, and provides flexible access to computing resources.
Anthropic Blog · 1d ago · 5
ProductJun 30, 2026Introducing Claude Sonnet 5Sonnet 5 delivers frontier performance across coding, agents, and professional work at scale.
Anthropic Blog · 1d ago · 5
How to improve a 5-class Diabetic Retinopathy model (APTOS 2019) – Mixed predictions across classes[P]
r/MachineLearning · 1d ago · 5
ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration
HuggingFace Blog · 1d ago · 5
The AI Compass
Simon Willison · 1d ago · 5
Are all LLM research papers nowadays 100+ pages beasts?[D]
r/MachineLearning · 1d ago · 5
Have your agent record video demos of its work with shot-scraper video
Simon Willison · 1d ago · 5
Start building with Nano Banana 2 Lite and Gemini Omni Flash
DeepMind Blog · 1d ago · 5
shot-scraper 1.10
Simon Willison · 1d ago · 5
PageStorm: A Model Built for Creative Book Writing
r/LocalLLaMA · 1d ago · 5
Why Specialization Is Inevitable
HuggingFace Blog · 1d ago · 5
Well.. it's a step up from nonstop bot spam I guess
r/LocalLLaMA · 2d ago · 5
NEW on Hugging Face: Filter by hardware compatibility
r/LocalLLaMA · 2d ago · 5
A map of the latest 11 million papers split by semantic similarity and time slices [P]
r/MachineLearning · 2d ago · 5
Update on CVIL: the free CV interview prep checklist after landing my internship... just added Segmentation, OCR, and VLM sections [D]
r/MachineLearning · 2d ago · 5
How ChatGPT adoption has expanded
OpenAI Blog · 2d ago · 5
EACL 2027: Author response and author-reviewer discussion are now two separate stages and allow more time [D]
r/MachineLearning · 2d ago · 5
[AINews] not much happened today
Latent Space · 2d ago · 5
Core dump epidemiology: fixing an 18-year-old bug
OpenAI Blog · 2d ago · 5
Introducing GeneBench-Pro
OpenAI Blog · 2d ago · 5
<12345…73>