agents-course
Agent · Free

This repository contains the Hugging Face Agents Course.
Capabilities (12 decomposed)
progressive agent architecture curriculum with thought-action-observation cycle teaching
Medium confidence

Teaches the foundational TAO (Thought-Action-Observation) cycle through structured lessons that decompose agent decision-making into discrete steps: LLM reasoning (Thought), tool invocation (Action), and result integration (Observation). The course uses a four-unit progression model that builds from basic LLM concepts to complex multi-framework implementations, with each unit scaffolding knowledge through conceptual explanations, code walkthroughs, and interactive quizzes that validate understanding of agent loop mechanics.
Structures agent learning around the explicit TAO cycle rather than framework-specific APIs, allowing learners to understand agent mechanics independently before choosing implementation frameworks. Uses a hierarchical table-of-contents system that maps conceptual progression to concrete code patterns across multiple frameworks.
More comprehensive than framework-specific tutorials because it teaches agent theory first, then shows how different frameworks (smolagents, LlamaIndex, LangGraph) implement the same TAO concepts differently.
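To make the cycle concrete, here is a minimal sketch of a TAO loop in plain Python. `call_llm` and `run_tool` are toy stand-ins (not course code); only the control flow is meant to be faithful to what the course teaches.

```python
# Minimal sketch of the Thought-Action-Observation (TAO) loop.
# `call_llm` and `run_tool` are toy stand-ins for a model client
# and a tool registry; the control flow is the point here.

def call_llm(messages: list[dict]) -> dict:
    """Toy model: requests a tool on the first turn, then answers."""
    if not any(m["role"] == "tool" for m in messages):
        return {"content": "I should check the time.", "tool": "clock", "args": {}}
    return {"content": "It is noon.", "tool": None}

def run_tool(name: str, args: dict) -> str:
    return "12:00"  # toy tool result

def tao_loop(task: str, max_steps: int = 10) -> str:
    messages = [{"role": "user", "content": task}]
    for _ in range(max_steps):
        reply = call_llm(messages)                 # Thought: the model reasons
        if reply["tool"] is None:
            return reply["content"]                # no Action requested: done
        observation = run_tool(reply["tool"], reply["args"])       # Action
        messages.append({"role": "assistant", "content": reply["content"]})
        messages.append({"role": "tool", "content": observation})  # Observation
    return "step budget exhausted"

print(tao_loop("What time is it?"))
```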
multi-framework agent implementation comparison and pattern mapping
Medium confidence

Provides side-by-side architectural comparisons of three distinct agent frameworks (smolagents, LlamaIndex, LangGraph) by mapping their core classes, execution models, and use cases to the same underlying agent concepts. Each framework section explains how it implements the TAO cycle differently: smolagents uses code generation, LlamaIndex uses RAG-focused workflows with QueryEngine abstractions, and LangGraph uses explicit StateGraph nodes with conditional routing. The course teaches when to choose each framework based on problem characteristics (general-purpose vs. document-heavy vs. complex state management).
Maps frameworks to the same TAO abstraction layer rather than teaching them as isolated tools, enabling learners to understand framework selection as a design decision rather than a preference. Includes explicit comparison table showing core classes (CodeAgent vs. AgentWorkflow vs. StateGraph) and execution models side-by-side.
Broader than framework-specific documentation because it contextualizes each framework within the agent architecture landscape, helping developers understand trade-offs rather than just API usage.
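As a rough paraphrase of the comparison the course draws (the one-line summaries below are simplifications, not benchmark results):

| Framework | Core class | Execution model | Typical fit |
| --- | --- | --- | --- |
| smolagents | CodeAgent | LLM writes and executes Python each step | general-purpose tool use |
| LlamaIndex | AgentWorkflow | RAG workflows built on QueryEngine | document-heavy tasks |
| LangGraph | StateGraph | explicit graph with conditional routing | complex state management |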
gaia benchmark evaluation framework for standardized agent assessment
Medium confidence

Teaches how to use the GAIA (General AI Assistants) benchmark to evaluate agent reasoning quality across diverse tasks. GAIA provides a standardized set of multi-step reasoning tasks with ground truth answers, enabling consistent comparison of agent implementations, frameworks, and model choices. The course covers benchmark task structure (questions requiring multi-step reasoning, tool use, and information synthesis), evaluation metrics (exact match, partial credit), and how to interpret benchmark results to identify agent weaknesses. Includes patterns for running agents against benchmarks, collecting failure cases, and using benchmark results to guide agent improvements.
Provides integration with a published, standardized benchmark (GAIA) rather than custom evaluation metrics, enabling reproducible agent comparison across teams and implementations. Benchmark tasks require multi-step reasoning and tool use, testing agent capabilities beyond simple text generation.
More rigorous than custom evaluation because GAIA is published and reproducible; enables cross-team comparison unlike proprietary benchmarks; more comprehensive than single-task evaluation.
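A sketch of what an exact-match evaluation loop might look like with the Hugging Face datasets library. The dataset ID and column names ("Question", "Final answer") are assumptions based on the gated gaia-benchmark/GAIA release and should be verified before running.

```python
# Exact-match evaluation loop over GAIA-style tasks.
# ASSUMPTION: dataset ID, config name, and column names follow the gated
# gaia-benchmark/GAIA release; check them against the actual dataset card.
from datasets import load_dataset

def normalize(answer: str) -> str:
    return answer.strip().lower()

def evaluate(agent_fn, split: str = "validation") -> float:
    tasks = load_dataset("gaia-benchmark/GAIA", "2023_all", split=split)
    correct, failures = 0, []
    for task in tasks:
        prediction = agent_fn(task["Question"])        # run the agent end to end
        if normalize(prediction) == normalize(task["Final answer"]):
            correct += 1
        else:
            failures.append(task)                      # keep failure cases for analysis
    return correct / len(tasks)
```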
interactive course platform with multilingual content and community engagement
Medium confidence

Provides a structured learning platform built on Hugging Face's infrastructure with progressive units, quizzes, and community features (Discord integration). The course uses a hierarchical table-of-contents system that guides learners through four units plus bonus content, with each unit containing conceptual lessons, code walkthroughs, and knowledge checks. The platform supports multilingual content (English primary, partial Chinese translations), enabling global accessibility. Community features (Discord channel) enable peer learning and instructor support, creating a cohort-based learning experience.
Combines structured curriculum with community engagement through Discord, creating a cohort-based learning experience rather than isolated self-study. Hierarchical table-of-contents system maps conceptual progression to concrete code patterns, enabling learners to understand both theory and implementation.
More comprehensive than framework documentation because it teaches agent theory first, then shows implementation; more engaging than video courses because it includes interactive code examples and community support.
code-first agent development with smolagents codeagent and toolcallingagent patterns
Medium confidence

Teaches smolagents' dual-agent approach where CodeAgent generates executable Python code as its reasoning output (allowing complex logic, loops, and conditionals) while ToolCallingAgent uses structured JSON schemas for tool invocation. The course explains how smolagents integrates with Hugging Face Hub for model access, how to define custom tools with type hints and docstrings, and how the framework handles code execution sandboxing. Includes patterns for error recovery, tool chaining, and leveraging code generation for multi-step reasoning that would require explicit prompting in other frameworks.
Uses code generation as the primary reasoning mechanism rather than natural language planning, allowing agents to express complex logic (loops, conditionals, variable assignment) directly. Automatically extracts tool schemas from Python function signatures and docstrings, reducing boilerplate compared to manual schema definition in other frameworks.
More expressive than JSON-based tool calling for multi-step reasoning because generated code can contain loops and conditionals; more integrated with Hugging Face ecosystem than LangChain/LlamaIndex alternatives.
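A short sketch of the pattern, assuming a recent smolagents release; class names such as InferenceClientModel have shifted across versions, so treat the imports as assumptions and check your installed version.

```python
# Custom tool plus CodeAgent, in the smolagents style the course teaches.
# ASSUMPTION: import names match a recent smolagents release.
from smolagents import CodeAgent, InferenceClientModel, tool

@tool
def get_weather(city: str) -> str:
    """Return a short weather summary for a city.

    Args:
        city: Name of the city to look up.
    """
    return f"It is sunny in {city}."  # stub; a real tool would call an API

agent = CodeAgent(tools=[get_weather], model=InferenceClientModel())
# The agent writes and executes Python that may call get_weather inside
# loops or conditionals, rather than emitting one JSON tool call per step.
print(agent.run("Compare the weather in Paris and Rome."))
```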
rag-integrated agent workflows with llamaindex queryengine and agentworkflow abstractions
Medium confidence

Teaches LlamaIndex's agent architecture which couples retrieval-augmented generation (RAG) with agent reasoning through QueryEngine abstractions that encapsulate document indexing, retrieval, and synthesis. The course explains how LlamaIndex agents differ from general-purpose agents by optimizing for document-heavy workflows: agents use QueryEngine to retrieve relevant context before reasoning, reducing hallucination and grounding responses in source documents. Includes patterns for multi-document reasoning, hierarchical indexing, and combining multiple QueryEngines (e.g., vector search + keyword search) within a single agent.
Integrates RAG as a first-class agent capability rather than a post-hoc retrieval step, allowing agents to reason about which documents to retrieve and how to synthesize information across multiple sources. QueryEngine abstraction encapsulates the full retrieval pipeline (indexing, embedding, retrieval, synthesis) behind a single interface, reducing boilerplate for document-heavy agents.
More optimized for document-centric workflows than general-purpose frameworks because retrieval is built into the agent loop rather than added as a tool; better source attribution and explainability than pure LLM agents.
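A sketch of wrapping a retrieval pipeline as an agent tool, assuming recent llama-index-core module paths (older releases import from `llama_index` directly).

```python
# QueryEngine-backed tool, in the pattern described above.
# ASSUMPTION: module paths match a recent llama-index-core layout,
# and a ./docs directory with source files exists.
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
from llama_index.core.tools import QueryEngineTool

documents = SimpleDirectoryReader("./docs").load_data()   # load local files
index = VectorStoreIndex.from_documents(documents)        # embed and index
query_engine = index.as_query_engine(similarity_top_k=3)  # retrieval + synthesis

# Wrap the full RAG pipeline as a single tool an agent can reason about.
doc_tool = QueryEngineTool.from_defaults(
    query_engine=query_engine,
    name="project_docs",
    description="Answers questions grounded in the local project documents.",
)
```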
stateful agent orchestration with langgraph stategraph and conditional routing
Medium confidence

Teaches LangGraph's explicit state management approach where agents are modeled as directed graphs with nodes representing processing steps and edges representing conditional transitions. The course explains how StateGraph maintains typed state across agent steps, enabling complex workflows with branching logic, loops, and human-in-the-loop interventions. Unlike implicit state in other frameworks, LangGraph requires explicit state schema definition and transition rules, making agent flow transparent and debuggable. Includes patterns for error recovery, state persistence, and multi-agent coordination through shared state graphs.
Models agents as explicit directed graphs with typed state schemas, making agent flow and state transitions transparent and debuggable. Supports conditional routing, loops, and human-in-the-loop interventions as first-class graph constructs rather than workarounds, enabling complex workflows that would require custom code in other frameworks.
More suitable for complex, stateful workflows than CodeAgent or QueryEngine approaches because explicit state management prevents hidden state bugs and enables transparent debugging; better for multi-agent coordination than single-agent frameworks.
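A minimal StateGraph sketch with one conditional edge, assuming current langgraph module paths; the typed state schema is what makes transitions explicit and debuggable.

```python
# Minimal StateGraph with a conditional edge.
# ASSUMPTION: imports match a current langgraph release.
from typing import TypedDict
from langgraph.graph import END, START, StateGraph

class AgentState(TypedDict):
    question: str
    draft: str
    approved: bool

def draft_answer(state: AgentState) -> dict:
    return {"draft": f"Draft answer to: {state['question']}"}

def review(state: AgentState) -> dict:
    return {"approved": len(state["draft"]) > 0}  # stub quality check

def route(state: AgentState) -> str:
    return "done" if state["approved"] else "revise"  # conditional routing

graph = StateGraph(AgentState)
graph.add_node("draft", draft_answer)
graph.add_node("review", review)
graph.add_edge(START, "draft")
graph.add_edge("draft", "review")
graph.add_conditional_edges("review", route, {"done": END, "revise": "draft"})
app = graph.compile()
print(app.invoke({"question": "What is TAO?", "draft": "", "approved": False}))
```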
function calling schema definition and multi-provider llm binding
Medium confidence

Teaches how to define tool schemas using JSON Schema or Python type hints that enable LLMs to invoke functions reliably. The course covers how different LLM providers (OpenAI, Anthropic, Hugging Face) implement function calling differently (OpenAI uses tool_choice, Anthropic uses tool_use blocks, open-source models require prompt engineering), and how agent frameworks abstract these differences. Includes patterns for schema validation, error handling when LLMs generate invalid function calls, and optimizing schemas to reduce hallucination (e.g., using enums instead of free-text fields).
Abstracts provider-specific function calling implementations (OpenAI tool_choice vs. Anthropic tool_use vs. open-source prompt engineering) behind a unified schema interface, allowing agents to work across multiple LLM providers without code changes. Teaches schema optimization patterns (enums, descriptions, required fields) that reduce LLM hallucination.
More portable than provider-specific function calling because it abstracts differences; more reliable than free-text tool invocation because schemas enforce structure and enable validation.
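A provider-neutral sketch: one JSON Schema tool definition with an enum-constrained field, validated with the jsonschema package before execution. The schema and helper are illustrative, not course code.

```python
# Tool schema in the provider-neutral JSON Schema shape discussed above;
# the enum constrains the model's output and reduces free-text hallucination.
from jsonschema import ValidationError, validate

GET_WEATHER_SCHEMA = {
    "name": "get_weather",
    "description": "Look up current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {
            "city": {"type": "string", "description": "City name, e.g. 'Paris'"},
            "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
        },
        "required": ["city"],
    },
}

def check_call(arguments: dict) -> bool:
    """Reject malformed arguments before executing the tool."""
    try:
        validate(instance=arguments, schema=GET_WEATHER_SCHEMA["parameters"])
        return True
    except ValidationError:
        return False  # ask the LLM to retry with a corrected call

assert check_call({"city": "Paris", "unit": "celsius"})
assert not check_call({"unit": "kelvin"})  # missing required field, bad enum
```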
fine-tuning llms for improved function calling and agent reasoning
Medium confidence

Teaches techniques for fine-tuning LLMs to improve their ability to invoke functions correctly and reason through multi-step agent tasks. The course covers dataset preparation (collecting agent trajectories with correct function calls), training approaches (supervised fine-tuning on function calling examples), and evaluation metrics (function call accuracy, reasoning quality). Includes patterns for using synthetic data generation to create fine-tuning datasets when real agent logs are unavailable, and how to measure improvements in agent performance post-fine-tuning.
Focuses on fine-tuning for agent-specific tasks (function calling, multi-step reasoning) rather than general language understanding, using agent trajectories as training data. Includes synthetic data generation patterns for creating fine-tuning datasets without manual agent log collection.
More cost-effective than using expensive proprietary APIs for high-volume agent deployments; enables use of open-source models for specialized agent tasks where base models underperform.
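A sketch of turning logged trajectories into a chat-format JSONL file for supervised fine-tuning. The record layout is an assumption; match it to whatever trainer you use (e.g., TRL's SFTTrainer).

```python
# Convert agent trajectories into a chat-format SFT dataset.
# ASSUMPTION: the message layout matches your trainer's expected format.
import json

trajectories = [
    {
        "question": "What is 2 + 2?",
        "tool_call": {"name": "calculator", "arguments": {"expression": "2 + 2"}},
        "tool_result": "4",
        "final_answer": "2 + 2 = 4",
    },
]

with open("sft_function_calling.jsonl", "w") as f:
    for t in trajectories:
        record = {
            "messages": [
                {"role": "user", "content": t["question"]},
                # Teach the model to emit the structured call, not prose.
                {"role": "assistant", "content": json.dumps(t["tool_call"])},
                {"role": "tool", "content": t["tool_result"]},
                {"role": "assistant", "content": t["final_answer"]},
            ]
        }
        f.write(json.dumps(record) + "\n")
```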
agent observability, tracing, and evaluation against benchmarks
Medium confidence

Teaches techniques for monitoring agent behavior, tracing execution paths, and evaluating agent quality against standardized benchmarks. The course covers logging agent steps (thought, action, observation), visualizing agent decision trees, and using benchmarks like GAIA (General AI Assistants) to measure agent reasoning quality. Includes patterns for identifying failure modes (e.g., tool hallucination, reasoning loops), debugging agent behavior through execution traces, and comparing agent performance across frameworks and model choices.
Provides end-to-end observability patterns from execution tracing to benchmark evaluation, enabling teams to measure and improve agent quality systematically. Includes GAIA benchmark integration for standardized agent evaluation across different implementations.
More comprehensive than framework-specific logging because it covers the full observability pipeline from tracing to evaluation; enables cross-framework comparison unlike single-framework tools.
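A minimal trace recorder for TAO steps, standing in for a full tracing stack such as OpenTelemetry or Langfuse; the structure is illustrative, not course code.

```python
# Minimal trace recorder for thought/action/observation steps.
import json
import time
from dataclasses import asdict, dataclass, field

@dataclass
class TraceStep:
    thought: str
    action: str
    observation: str
    latency_s: float

@dataclass
class Trace:
    task: str
    steps: list[TraceStep] = field(default_factory=list)

    def record(self, thought: str, action: str, observation: str, started: float) -> None:
        self.steps.append(TraceStep(thought, action, observation, time.time() - started))

    def dump(self) -> str:
        """Serialize for offline failure analysis (e.g., spotting reasoning loops)."""
        return json.dumps(asdict(self), indent=2)

trace = Trace(task="demo")
t0 = time.time()
trace.record("Need the weather", "get_weather(city='Paris')", "sunny", t0)
print(trace.dump())
```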
agentic rag with alfred: document-aware agent reasoning and synthesis
Medium confidence

Teaches a specific agent application pattern called 'Agentic RAG' where agents actively decide which documents to retrieve and how to synthesize information across multiple sources, rather than passively using retrieved context. The course uses Alfred (a document-aware agent) as a concrete example, showing how agents can reason about document relevance, ask follow-up questions to refine retrieval, and synthesize contradictory information from multiple sources. Includes patterns for handling document uncertainty, managing context windows when dealing with large retrieved sets, and optimizing retrieval strategies based on agent reasoning.
Treats document retrieval as an active agent decision rather than a passive preprocessing step, allowing agents to reason about which documents to retrieve and how to synthesize information. Alfred example demonstrates how agents can ask follow-up questions to refine retrieval and handle contradictory information.
More flexible than passive RAG for complex information synthesis because agents can reason about retrieval decisions; more accurate than pure LLM reasoning because agents actively manage document context.
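A sketch of the active-retrieval loop behind this pattern. `retrieve`, `assess`, and `answer` are hypothetical helpers standing in for real components; a real sufficiency check would ask the LLM itself.

```python
# Active-retrieval loop: the agent decides whether context suffices
# before answering. All three helpers are hypothetical stubs.

def retrieve(query: str, round_: int) -> list[str]:
    return [f"passage for '{query}' (round {round_})"]  # stub retriever

def assess(context: list[str]) -> bool:
    return len(context) >= 2  # stub check; a real one would query the LLM

def answer(query: str, context: list[str]) -> str:
    return f"Answer to '{query}' grounded in {len(context)} passages."

def agentic_rag(query: str, max_rounds: int = 3) -> str:
    context: list[str] = []
    for round_ in range(max_rounds):   # retrieval is a decision, not a fixed step
        context += retrieve(query, round_)
        if assess(context):            # stop retrieving once grounded
            break
    return answer(query, context)

print(agentic_rag("Who maintains the course?"))
```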
multi-agent pokémon battle simulation with competitive agent reasoning
Medium confidence

Teaches agent design through a concrete game-playing application where agents control Pokémon in battles, requiring real-time decision-making, opponent modeling, and strategic reasoning. The course walks through building agents that evaluate game state, predict opponent moves, and select optimal actions (attack, switch, item use) under uncertainty. This application demonstrates agent capabilities beyond text generation: state management (game board), multi-step planning (battle strategy), and competitive reasoning (opponent modeling). Includes patterns for handling imperfect information, managing agent state across multiple turns, and evaluating agent performance through win rates.
Uses game-playing as a concrete domain for teaching agent design, demonstrating state management, multi-step planning, and competitive reasoning in a tangible, evaluable context. Pokémon battles provide clear win/loss metrics for agent evaluation, unlike open-ended text generation tasks.
More engaging and concrete than abstract agent tutorials because game outcomes are immediately visible; better for teaching state management and strategic reasoning than text-only examples.
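A toy sketch of turn-level action scoring under a simple opponent model; the actions, scores, and move names are invented for illustration, not taken from the course.

```python
# Toy action selection under uncertainty: score each legal action
# against the opponent's predicted move. All values are made up.
import random
from dataclasses import dataclass

@dataclass
class BattleState:
    my_hp: int
    foe_hp: int
    foe_likely_move: str  # from a simple opponent model

def expected_value(action: str, state: BattleState) -> float:
    """Score an action against the predicted opponent move."""
    if action == "attack":
        return 30 - (10 if state.foe_likely_move == "protect" else 0)
    if action == "switch":
        return 15 if state.foe_likely_move == "super_effective" else 5
    return 10  # item use: flat fallback value

def choose_action(state: BattleState) -> str:
    actions = ["attack", "switch", "item"]
    # Small random tie-break keeps play from being fully predictable.
    return max(actions, key=lambda a: expected_value(a, state) + random.random())

state = BattleState(my_hp=80, foe_hp=60, foe_likely_move="protect")
print(choose_action(state))
```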
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifacts (sharing capabilities)
Artifacts that share capabilities with agents-course, ranked by overlap. Discovered automatically through the match graph.
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
AgentGuide
https://adongwanai.github.io/AgentGuide | AI Agent Development Guide | LangGraph in Practice | Advanced RAG | Career Transition to LLMs | LLM Interviews | Algorithm Engineer | Interview Question Bank | Reinforcement Learning | Data Synthesis
ai-agents-for-beginners
12 Lessons to Get Started Building AI Agents
hello-agents
📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice
500-AI-Agents-Projects
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation, illustrating how AI agents are transforming sectors such as healthcare, finance, education, and retail.
GenAI_Agents
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
Best For
- ✓ ML engineers transitioning from traditional NLP to agentic systems
- ✓ Developers building their first autonomous agents
- ✓ Teams evaluating agent frameworks for production deployment
- ✓ Architects selecting agent frameworks for production systems
- ✓ Teams with existing LangChain/LlamaIndex investments evaluating smolagents
- ✓ Developers building complex agents requiring conditional logic or state persistence
- ✓ Researchers comparing agent architectures and approaches
- ✓ Projects with strict quality bars that require standardized evaluation
Known Limitations
- ⚠ Course material is static and does not adapt to learner pace or background
- ⚠ No built-in sandbox environment for hands-on experimentation during lessons
- ⚠ Multilingual coverage is incomplete (primarily English, with partial Chinese translations)
- ⚠ Framework comparison is educational rather than performance-benchmarked; no latency or throughput metrics are provided
- ⚠ Does not cover cross-framework integration patterns (e.g., using LangGraph with LlamaIndex RAG)
- ⚠ Framework versions taught may lag behind the latest releases and require manual updates
Repository Details
Last commit: Apr 17, 2026