Production Debugging Session Replay

1

AgentOpsAgent62/100

via “session-replay-with-point-in-time-debugging”

Observability platform for AI agent debugging.

Unique: Implements event-based replay architecture that captures granular LLM calls, tool invocations, and multi-agent interactions as discrete events, enabling point-in-time inspection without requiring agent re-execution. This differs from log-based debugging by providing structured, queryable event sequences with visual timeline rendering.

vs others: Provides richer visibility than traditional logging (structured events vs text logs) and faster debugging than re-running agents, though requires upfront SDK integration unlike post-hoc log analysis tools.

2

Playwright Test for VS CodeExtension61/100

via “trace viewing and playback for test execution analysis”

Official Playwright E2E testing with codegen.

Unique: Integrates Playwright's native trace recording and viewer into VS Code, providing frame-by-frame execution replay without leaving the IDE.

vs others: More detailed than test logs or screenshots alone; allows temporal analysis of execution flow and state changes.

3

Live ShareExtension59/100

via “shared debugging session with breakpoint and variable inspection synchronization”

Real-time collaborative editing for pair programming.

Unique: Hooks into VS Code's Debug Adapter Protocol (DAP) to intercept debugger state changes and broadcast them to remote participants, enabling shared debugging without requiring separate debugger instances on guest machines. Synchronizes debugger state at the protocol level rather than screen-sharing, preserving interactive debugging capabilities for all participants.

vs others: More interactive than screen-sharing tools (Zoom, TeamViewer) because guests can independently inspect variables and navigate the call stack without the host controlling their view; more lightweight than running separate debugger instances because it reuses the host's debugging session.

4

BrowserbasePlatform57/100

via “session-recording-and-playback”

Headless browser infrastructure for AI agents — stealth mode, CAPTCHA solving, session recording.

Unique: Provides built-in session recording without requiring separate video capture or event logging infrastructure, with tiered data retention aligned to plan level; however, recording format and export mechanisms are proprietary and undocumented

vs others: More integrated than external logging services (no separate instrumentation) but less transparent than open-source alternatives (Playwright traces) regarding what is recorded and how to export it

5

AutoGen StarterTemplate57/100

via “conversation state persistence and replay for debugging and audit”

Microsoft AutoGen multi-agent conversation samples.

Unique: AgentRuntime event subscription system enables agents to emit structured events without modifying agent code; persistence is decoupled from agent execution via event handlers

vs others: More flexible than built-in logging because events are structured and can be routed to multiple backends (database, file, observability platform) simultaneously

6

Agent framework that generates its own topology and evolves at runtimeFramework50/100

via “agent debugging and execution tracing with replay”

Hi HN,I’m Vincent from Aden. We spent 4 years building ERP automation for construction (PO/invoice reconciliation). We had real enterprise customers but hit a technical wall: Chatbots aren't for real work. Accountants don't want to chat; they want the ledger reconciled while they slee

Unique: Records detailed execution traces with replay capability, enabling deterministic debugging and analysis of agent behavior without modifying agent code

vs others: More integrated than generic logging, but requires careful handling of external dependencies for accurate replay

7

AutoGenAgent49/100

via “conversation replay and debugging with message history analysis”

Multi-agent framework with diversity of agents

Unique: Implements a conversation replay system that can reconstruct agent interactions from message history, enabling step-by-step debugging and analysis without re-running agents. Supports filtering and searching by agent, message type, or content, and can generate conversation graphs showing agent interactions.

vs others: More practical than re-running agents for debugging because it uses saved history and doesn't require LLM calls, and more comprehensive than simple log analysis because it understands agent roles and message types

8

Agent-of-empires: OpenCode and Claude Code session managerCLI Tool46/100

via “execution history tracking and replay”

Hi! I’m Nathan: an ML Engineer at Mozilla.ai: I built agent-of-empires (aoe): a CLI application to help you manage all of your running Claude Code/Opencode sessions and know when they are waiting for you.- Written in rust and relies on tmux for security and reliability - Monitors state of cli s

Unique: Implements provider-aware execution logging that captures not just code and output but provider-specific metadata (model version, execution time, token usage, provider-specific errors), enabling forensic analysis of provider behavior differences

vs others: Jupyter notebooks have cell history but no provider tracking; cloud IDEs log execution but not provider-specific metrics; this is designed for multi-provider comparison and audit compliance

9

Claude-replay – A video-like player for Claude Code sessionsCLI Tool45/100

via “terminal output capture and replay”

I got tired of sharing AI demos with terminal screenshots or screen recordings.Claude Code already stores full session transcripts locally as JSONL files. Those logs contain everything: prompts, tool calls, thinking blocks, and timestamps.I built a small CLI tool that converts those logs into an int

Unique: Preserves and replays ANSI-formatted terminal output as a first-class part of the session, not just code changes, enabling viewers to see build results, test output, and runtime behavior in context

vs others: More complete than code-only replay because it shows the full development workflow including compilation, testing, and execution, providing evidence that AI-assisted code actually works

10

Loopsy, a way for terminals and AI agents on different machines to talkRepository40/100

via “terminal session state serialization and replay”

I've always had the urge to have my two macbooks communicate. Having one idle while working on the other felt like underutilization of resources. So I built Loopsy. Initially the goal was to do file transfer via local network, and then came running commands. I then tried running coding agents f

Unique: Implements session capture at the terminal I/O level with timestamp preservation, enabling deterministic replay with original timing rather than just storing command history

vs others: More detailed than shell history files because it captures output and timing, but less comprehensive than full system call tracing and requires more storage

11

Meta-agent: self-improving agent harnesses from live tracesAgent38/100

via “trace replay and validation”

We built meta-agent: an open-source library that automatically and continuously improves agent harnesses from production traces.Point it at an existing agent, a stream of unlabeled production traces, and a small labeled holdout set.An LLM judge scores unlabeled production traces as they stream.A pro

Unique: Validates agent behavior by replaying traces rather than relying on unit tests or manual testing, ensuring that generated harnesses preserve the behavior observed in successful runs

vs others: More comprehensive than traditional unit tests because it validates entire agent execution flows including tool interactions and LLM behavior, not just individual functions

12

openclaw-superpowersSkill37/100

via “skill execution tracing and debugging”

44 plug-and-play skills for OpenClaw — self-modifying AI agent with cron scheduling, security guardrails, persistent memory, knowledge graphs, and MCP health monitoring. Your agent teaches itself new behaviors during conversation.

Unique: Provides skill-level execution tracing with replay capability, enabling developers to understand and reproduce agent behavior at a granular level

vs others: More comprehensive than basic logging because it captures full execution context (inputs, outputs, intermediate states) and enables interactive debugging and replay

13

footprintjsMCP Server36/100

via “time-travel debugging with state snapshots”

Explainable backend flows — automatic causal traces, decision evidence, and MCP tool generation for AI agents

Unique: Combines immutable state snapshots with structural sharing to enable efficient time-travel debugging without requiring external debugger attachment or process restart, making it practical for production incident investigation

vs others: More practical than traditional debuggers for production systems because it captures complete state history without requiring live process attachment, and more efficient than full execution replay because it uses snapshots rather than re-running code

14

InstruktAgent30/100

via “session recording and replay”

Terminal env for interacting with with AI agents

Unique: Integrates recording and replay directly into the terminal UI, allowing developers to step through recorded sessions with the same controls as live execution rather than requiring separate replay tools

vs others: More integrated debugging than external logging tools, with native replay capability that doesn't require post-processing or external analysis tools

15

teamcopilotAgent30/100

via “agent-execution-history-and-replay”

A shared AI Agent for Teams

Unique: Provides immutable, team-accessible execution history with replay capability, enabling collaborative debugging and forensic analysis of agent behavior across the entire team

vs others: More comprehensive than typical LLM logging (which often only captures final outputs) and more accessible than vendor-specific debugging tools by storing history in team-controlled infrastructure

16

agentopsAgent30/100

via “web dashboard for session visualization and replay”

Observability and DevTool Platform for AI Agents

Unique: Provides interactive timeline-based visualization with integrated cost breakdown and tool call details, specifically designed for agent execution patterns rather than generic log viewing

vs others: More intuitive than raw JSON logs and faster to navigate than terminal-based tools, while being more specialized than general observability platforms like Grafana

17

Test DriverAgent29/100

via “test-execution-video-replay-and-debugging”

AI Agent for QA in GitHub

Unique: Provides synchronized video replay with integrated logs and metrics, enabling developers to see exactly what happened during test execution without examining raw logs or re-running tests. This visual debugging approach is more intuitive than log analysis.

vs others: More effective for debugging than log-only analysis because visual evidence shows actual UI state and interactions; more efficient than re-running tests because videos provide immediate evidence without waiting for test completion

18

playwrightFramework29/100

via “video and trace recording for debugging”

A high-level API to automate web browsers

Unique: Captures both video and detailed trace files (with screenshots, network logs, and DOM snapshots) automatically during test execution, enabling post-test debugging without re-running or external recording tools

vs others: More comprehensive than video-only recording because traces include network logs and DOM snapshots, and more integrated than external recording tools because it's built into the context lifecycle

19

BlinkyRepository25/100

via “persistent debugging session state and conversation history”

An open-source AI debugging agent for VSCode

Unique: Implements a stateful session model that persists both conversation history and applied fixes, allowing users to resume debugging and review past analyses. Includes optional cloud sync for cross-device session continuity, though local-first storage is the default for privacy.

vs others: More persistent than stateless debugging tools because it maintains conversation context and fix history across editor sessions, enabling long-term debugging workflows and institutional learning.

20

HyperbrowserPlatform23/100

via “session replay and debugging”

Browser infrastructure and automation for AI Agents and Apps with advanced features like proxies, captcha solving, and session recording.

Unique: Combines event logging with state management for accurate session recreation, enhancing debugging capabilities.

vs others: More precise than traditional logging methods, allowing for detailed analysis of automation failures.

Top Matches

Also Known As

Company