Simon Willison · 22h ago · 7 · agent security prompt engineering

Microsoft Copilot Cowork contained a critical security vulnerability where agentic systems could exfiltrate files through unapproved email messages with external image requests and pre-authenticated OneDrive links. This highlights a major design challenge in building safe autonomous agents: preventing prompt injection attacks from enabling data theft while maintaining agent autonomy.

r/MachineLearning · 22h ago · 7 · research agent inference

A technical essay critiques reasoning models' ability to perform faithful inference, arguing that jointly-generated reasoning traces and final answers lack genuine separation of concerns. The piece engages empirically with recent work (Lanham/Turpin/Mirzadeh) and compares architectural approaches (HRM, TRM, GRAM, AlphaProof, Kona/Aleph), offering conceptual framing around constraints vs. influence that's relevant for engineers building reasoning systems.

r/MachineLearning · 1d ago · 8 · open source agent workflow tool

Spice is an open-source decision layer framework that sits above execution agents to make agent decision-making explicit and interpretable. It captures what was observed, options considered, reasoning for selection, trade-offs rejected, and execution outcomes—addressing a key gap where agents excel at execution but lack transparent decision-making processes. The project is early-stage but functional, installable, and designed to work with existing agents like Claude Code and other tools.

r/MachineLearning · 2d ago · 8 · tool agent deployment open source

Production-tested solution for enforcing tool-call constraints in LangGraph agents using a YAML-based contract layer that validates rules deterministically before execution. Addresses critical failure mode where prompt engineering and post-hoc auditing fail to prevent compliance violations, with the approach open-sourced as Sponsio for community feedback.

HuggingFace Blog · 2d ago · 7 · agent workflow

A practical glossary clarifying commonly confused terminology in AI agent development (model, scaffold, harness, tool definitions) with examples from frameworks like Claude Code and Codex. Provides mental models for understanding agent architecture that's essential when building or deploying agentic systems, though not a technical tutorial.

Simon Willison · 2d ago · 6 · tool api update agent

Datasette 1.0a30 introduced a new makeJumpSections() JavaScript plugin hook that datasette-agent leverages to add agent chat functionality directly into the Jump to menu interface. This represents a practical integration pattern for embedding AI agents into existing tools, though it's specific to the Datasette ecosystem rather than broadly applicable.

r/MachineLearning · 3d ago · 8 · tool open source agent deployment

AgentLantern is an open-source devtool that provides visibility into AI agent project structure and execution, addressing the debugging and observability challenges in multi-agent systems. It offers three components: static documentation generation, linting for design issues, and a runtime viewer for observing agent behavior—currently supporting CrewAI with plans for broader framework support.

r/MachineLearning · 4d ago · 7 · open source agent tool workflow

Spice is an open-source decision layer framework that sits above execution agents, providing context-aware task routing and decision-making through a perception → simulation → decision → execution → reflection loop. Rather than replacing agents like Claude or Codex, it adds orchestration capabilities including state modeling, option simulation, and outcome reflection to coordinate multi-agent workflows.

Latent Space · 4d ago · 6 · agent workflow api update

Industry shift from models as primary product to agents as integrated systems combining models, harnesses, UI, and workflows. Major players (OpenAI, AI21, DeepSeek) are building dedicated agent teams and reducing standalone model focus, with concrete shipping examples like OpenAI's Codex updates and Claude's auto-mode expansion showing product differentiation moving beyond model quality alone.

r/MachineLearning · 5d ago · 7 · benchmark workflow agent

Community discussion identifying gaps between standard benchmarks and real-world AI system robustness, particularly around ambiguous intent, context handling, and multi-turn sessions. Highlights the disconnect between optimizing for clean evaluation metrics versus building production-resilient systems.

Latent Space · 5d ago · 7 · tool deployment agent inference

Daytona provides cloud-based sandboxed compute infrastructure optimized for AI agents, enabling stateful, instantly-spinnable environments that handle massive scale (850k+ sandboxes/day). The infrastructure supports agentic workflows requiring composable computers with dynamic resource scaling, bare-metal architecture, and instant startup times (~60ms), addressing the emerging market gap between traditional code execution and agent-specific compute needs.

Simon Willison · 5d ago · 8 · tool agent open source library plugin

Datasette Agent is a new conversational AI assistant that lets users query data stored in Datasette using natural language, with LLM-powered SQL generation and an extensible plugin architecture. The tool integrates with modern LLMs (Gemini, Claude, local models) for reliable tool calling and SQL generation, and includes plugins for charts and other functionality. This represents a practical fusion of data querying and LLM agents with immediate applicability for engineers working with databases and AI.

Simon Willison · 5d ago · 7 · tool agent open source

A new Datasette Agent plugin enables running commands in a Fly Sprites sandbox environment, extending Datasette's capabilities for AI agents to execute code safely. This is a practical tool for developers building agentic systems that need sandboxed command execution alongside database operations.

Simon Willison · 5d ago · 7 · tool agent open source

Datasette Agent is a new extensible AI assistant built for Datasette, enabling users to query and interact with databases through an agentic interface. This tool bridges LLMs with database systems, useful for engineers building AI applications that need structured data access patterns.

r/MachineLearning · 6d ago · 8 · research agent open source benchmark

Research on masked diffusion language models (MDLMs) for world modeling in RL environments, addressing mode collapse and diversity limitations of autoregressive models. Introduces GRPO training framework with zero-shot transfer across multiple open-source environments and agent backbones, with open-sourced code and dataset of 239K trajectories.

r/MachineLearning · 6d ago · 8 · agent inference deployment benchmark

Practical cost-optimization study comparing five LLMs (Opus, GPT-5, Sonnet, DeepSeek V4, Hunyuan) on an MCP-based file management agent across 500+ tool calls, revealing surprisingly small quality gaps (96-99% success) despite 10x price differences. Author deployed Hunyuan locally via MLX on M2 Ultra for $5.5k, reducing daily inference costs from $40 to $9 through intelligent routing (local/cheap API for routine tasks, expensive models for complex failures).

Simon Willison · 6d ago · 6 · new model agent deployment

Google I/O 2026 introduced Gemini 3.5 Flash and Gemini Spark, a new AI agent product integrating with Google Workspace apps, running on Gemini 3.5 Flash and a closed-source Go binary called Antigravity. Key technical consideration: Spark uses isolated ephemeral VMs with DLP policies for enterprise security, though the author notes this is a critical area given prompt injection risks with sensitive data flows.

r/MachineLearning · 7d ago · 8 · open source research library agent

Engineer open-sourced NOML, a custom RL algorithm for continuous control that addresses instability in flight simulation by combining anchor policy (safe action fallback), hierarchical actor architecture (independent MLP heads per control axis), and mirror learning for data efficiency. The approach diverges from standard TD3 by eliminating exploration noise while maintaining stability through structural constraints rather than reward shaping.

r/MachineLearning · 7d ago · 8 · agent prompt engineering research open source

CANTANTE is a novel framework that automates multi-agent LLM system configuration by solving the credit assignment problem, allowing per-agent prompt optimization from global task rewards rather than manual tuning. The approach outperforms DSPy baselines (GEPA, MIPROv2) by 12-19 points on standard benchmarks while maintaining inference costs, with open-source code available.

Latent Space · 7d ago · 9 · new model api update agent workflow

Google released Gemini 3.5 Flash (GA immediately) with 1M context window, 65k max output, and agentic/coding capabilities, plus the new Gemini Omni multimodal family for video/audio generation and editing. The stack includes expanded Antigravity agents across desktop/CLI/SDK/API, with Google reporting 3.2 quadrillion tokens/month processed and 900M+ monthly users.