GitHub Trending AI · 34d ago · 7 · agent architecture tutorial workflow

A comprehensive Chinese technical guide ("御舆") that deconstructs AI Agent architecture, specifically analyzing Claude Code's design patterns including conversation loops, tool permission pipelines, context compression, and the Agent Harness runtime framework. Provides a transferable mental model for building production-grade agent systems across different frameworks without relying on prompt engineering tutorials.

HuggingFace Blog · 34d ago · 8 · new model open source fine tuning research

IBM releases Granite 4.0 3B Vision, a modular vision-language model optimized for chart and document understanding, delivered as a LoRA adapter on Granite 4.0 Micro with a novel DeepStack architecture for multi-layer visual feature injection. The release includes ChartNet, a 1.7M-sample synthetic dataset for chart interpretation with code-guided augmentation, addressing a key VLM weakness in structured data reasoning.

GitHub Trending AI · 34d ago · 6 · research open source

A comprehensive architectural analysis of Claude Code v2.1.88's TypeScript codebase (1,902 files, 477K lines), written by Claude itself, covering the query engine, 42 tools, multi-agent coordination, and 7-layer security model. While meta and entertaining, this is primarily documentation/breakdown of Anthropic's internal tooling rather than actionable technical content for building with AI.

GitHub Trending AI · 34d ago · 9 · agent architecture tutorial open source workflow

In-depth technical analysis of Claude Code's source architecture, covering the agent loop, context engineering, tool system, and production-grade error recovery strategies. Includes a companion project (Claude Code From Scratch) with ~4000 lines of TypeScript/Python and 11-chapter tutorial for building your own AI programming agent from scratch.

GitHub Trending AI · 34d ago · 7 · rag research workflow

M-flow introduces a novel RAG architecture where the knowledge graph becomes the scoring engine rather than a supporting structure, using path-cost reasoning and granularity-matched retrieval to find relevant evidence chains instead of relying on vector similarity alone. The system organizes knowledge in a four-layer cone graph (Episode→Facet→FacetPoint→Entity) and propagates evidence through typed edges to score relevance by coherent reasoning paths rather than embedding proximity.

HuggingFace Blog · 34d ago · 8 · open source tool research benchmark fine tuning

OpenMed built an end-to-end open-source protein engineering pipeline combining structure prediction, sequence design, and codon optimization, with novel contributions in codon-level language modeling. They benchmarked transformer architectures (CodonRoBERTa-large-v2 vs ModernBERT) for codon optimization, scaled to 25 species in 55 GPU-hours, and released runnable code with full experimental transparency—directly applicable for engineers building biological AI systems.

HuggingFace Blog · 35d ago · 8 · library fine tuning workflow research

TRL v1.0 introduces architectural lessons for building stable post-training libraries that can adapt as methods evolve from PPO to DPO to RLVR approaches. The library design prioritizes flexibility over fixed abstractions, recognizing that core concepts like reward models shift between being fundamental, optional, or reimagined as verifiers across different training paradigms.

DeepMind Blog · 39d ago · 8 · new model api update agent

Google released Gemini 3.1 Flash Live, an improved real-time audio model with better precision, lower latency, and enhanced tonal understanding for voice-first applications. Available via Gemini Live API, it achieves 90.8% on ComplexFuncBench Audio and 36.1% on Scale AI's Audio MultiChallenge, enabling developers to build voice agents that handle complex tasks with natural dialogue in noisy environments.

GitHub Trending AI · 39d ago · 7 · agent deployment tool open source

Phantom is an AI agent platform that provides persistent, autonomous workspace capabilities—agents get their own dedicated machine to install software, manage databases, and maintain context across sessions rather than disposable chat interactions. The key technical differentiator is autonomous infrastructure building: agents can independently deploy databases (ClickHouse), create REST APIs, integrate open-source tools, build observability dashboards, and even extend their own capabilities (e.g., adding Discord support via Bot API integration).

GitHub Trending AI · 40d ago · 8 · tool open source agent api update

An open-source MCP (Model Context Protocol) server that connects AI agents (Claude, GPT, Copilot) to 41 Brazilian government APIs covering economics, legislation, transparency, judiciary, elections, and more—38 APIs require no authentication. This is a practical tool for engineers building AI applications that need access to structured public sector data with ready-made integrations and natural language query capabilities.

DeepMind Blog · 40d ago · 6 · research benchmark safety

Research release on empirically validated toolkit for measuring AI manipulation capabilities, tested across 10,000+ participants in finance and health domains. Provides open-source methodology and materials for evaluating how AI systems can be misused to deceptively influence human behavior and beliefs in high-stakes scenarios.

DeepMind Blog · 40d ago · 7 · new model api update tool

Google released Lyria 3 Pro, an advanced music generation model supporting 3-minute tracks with structural awareness (verses, choruses, bridges). The model is available across multiple platforms including Vertex AI, Gemini API, Google AI Studio, and consumer apps, enabling developers to integrate custom music generation at scale.

OpenAI Research · 40d ago · 6 · workflow api update

OpenAI published a Model Spec that documents expected behavior, safety constraints, and design principles for their AI models. This provides engineers with official guidance on model capabilities and limitations, useful for understanding how to work within OpenAI's systems and for designing similar frameworks in their own applications.

GitHub Trending AI · 41d ago · 8 · tool open source api update inference deployment

apfel is an open-source tool that exposes Apple's on-device foundation model through a CLI, OpenAI-compatible API server, and shell integration—enabling local LLM inference on Apple Silicon Macs with no cloud dependency, API keys, or per-token billing. It supports tool calling via Model Context Protocol (MCP), includes demo shell scripts for practical workflows, and manages a 4096-token context window automatically.

GitHub Trending AI · 41d ago · 7 · tool open source library agent rag deployment

A curated directory of production-ready open-source AI tools and libraries organized by category (core frameworks, models, inference, agents, RAG, training, deployment, benchmarks, safety). Highlights practical CLI tools like PR-Agent, Gemini CLI, LLM, and Repomix that directly integrate AI into developer workflows.

Ahead of AI · 43d ago · 8 · research tutorial open source

Comprehensive reference guide organizing 45+ LLM architectures with visual model cards and detailed explanations of attention variants (MHA, GQA, sliding window, etc.) used in modern models. Includes both a web gallery and printable poster, serving as a practical learning resource for understanding contemporary transformer architectures.

GitHub Trending AI · 43d ago · 7 · tool open source agent deployment

holaOS is an agent operating system framework that provides infrastructure for long-running AI agents with persistent memory, durable state, and continuity across executions rather than one-off tasks. The project includes a local desktop environment (Holaboss) with quick-start installation and integration points for coding agents like Claude, Cursor, and Windsurf.

GitHub Trending AI · 44d ago · 7 · api update tool inference

A curated resource listing LLM APIs with permanent free tiers for text inference, including first-party APIs from model trainers and third-party platforms hosting open-weight models. Covers rate limits, available regions, and notable models—useful reference for engineers exploring cost-free inference options during development and experimentation.

GitHub Trending AI · 47d ago · 7 · tutorial workflow agent open source

A comprehensive AI engineering curriculum spanning 260+ lessons across 20 phases (~290 hours) covering fundamentals from linear algebra to autonomous agent swarms in Python, TypeScript, Rust, and Julia. Each lesson produces reusable artifacts (prompts, skills, agents, MCP servers) that can be immediately integrated into AI coding workflows, with personalized learning paths based on existing ML/DL knowledge.

DeepMind Blog · 48d ago · 7 · benchmark research tool

Google DeepMind released a cognitive taxonomy framework for measuring AGI progress, grounded in psychology and neuroscience, identifying 10 key cognitive abilities. They're launching a $200K Kaggle hackathon where engineers can design evaluations for five priority abilities (learning, metacognition, attention, executive functions, social cognition) using their new Community Benchmarks platform to test against frontier models.