haft
AgentFreeEngineering decisions engine that know when they're stale. Frame, compare, decide — with evidence decay and parity enforcement. For Claude Code, Cursor, Gemini CLI, Codex and more.
Capabilities11 decomposed
first principles framework (fpf) structured reasoning enforcement
Medium confidenceEnforces a disciplined 5-mode engineering cycle (Understand → Explore → Choose → Execute → Verify) by requiring AI agents to frame problems before solving them, generate genuinely different solution variants, and compare options under parity constraints. Implements this via MCP tools (haft_problem, haft_solution, haft_decision) that validate reasoning artifacts against a formal specification before allowing progression to implementation.
Implements a formal specification-driven reasoning cycle with maturity (Unassessed → Shipped) and freshness (Healthy → Stale → At Risk) tracking, enforcing parity in comparisons via a knowledge graph that links decisions to codebase artifacts — unlike generic prompt engineering, this creates falsifiable contracts with evidence decay mechanics
Differs from Cursor/Claude Code's native reasoning by adding governance layer that prevents decision drift and enforces structured comparison, whereas standard agents optimize for speed-to-code
mcp-based tool serving for ai agent integration
Medium confidenceExposes Haft's reasoning capabilities as a Model Context Protocol (MCP) server via JSON-RPC transport, providing six specialized tools (haft_problem, haft_solution, haft_decision, haft_evidence, haft_check, haft_search) that AI agents can invoke natively within their execution environment. The server runs as a subprocess managed by the agent's MCP client, maintaining a persistent SQLite state store and knowledge graph indexed to the codebase.
Implements MCP as the primary delivery surface (not a secondary plugin), with six domain-specific tools designed for the FPF cycle rather than generic function calling — includes codebase-aware search and evidence decay scoring built into the protocol layer
More specialized than generic MCP servers (e.g., Anthropic's file-system MCP) because tools are designed for reasoning governance, not file I/O; tighter integration with decision lifecycle than REST APIs
parity enforcement in solution comparison
Medium confidenceEnforces equal rigor in comparing competing solutions by requiring that all variants be evaluated against the same criteria, preventing bias toward preferred solutions. Implements parity checks via the haft_solution and haft_decision tools that validate solution descriptions follow the same structure and depth. Tracks comparison fairness metrics to ensure decisions are based on equivalent evidence.
Implements structural parity checks that validate all solutions follow the same evaluation template and depth — unlike generic decision frameworks, this prevents strawman alternatives and ensures fair comparison
More rigorous than informal decision-making because it enforces structural equivalence; differs from decision matrices by focusing on comparison process rather than scoring
evidence decay and decision freshness tracking
Medium confidenceMonitors the health of engineering decisions across two axes: maturity (progress from Unassessed to Shipped) and freshness (Healthy → Stale → At Risk based on evidence age and drift detection). Implements R_eff (effective reasoning score) that decays over time as supporting evidence ages, triggering alerts when decisions drift from their original context. Uses SQLite schema with timestamp-based queries to identify stale decisions and prompt re-evaluation.
Implements a two-axis decision lifecycle model (maturity + freshness) with time-decay scoring (R_eff) that automatically degrades decision confidence — unlike static decision logs, this creates a living system where old decisions are flagged for re-evaluation without manual intervention
More sophisticated than ADR (Architecture Decision Records) because it tracks decision health over time and flags staleness; differs from code review tools by focusing on decision validity rather than code quality
codebase-aware knowledge graph indexing and search
Medium confidenceBuilds a knowledge graph that links engineering decisions to codebase artifacts (modules, functions, files) using FPF Spec Search & Indexer. Enables semantic search over past decisions filtered by codebase context, allowing agents to query 'decisions affecting this module' or 'solutions tried for this problem pattern'. Stores graph in SQLite with projections that map decisions to code locations and vice versa.
Implements a bidirectional knowledge graph (decisions ↔ code artifacts) with FPF Spec Search that understands decision semantics and codebase structure simultaneously — unlike generic code search, this links reasoning to implementation and enables decision-centric queries
More targeted than full-text search because it understands decision structure and codebase topology; differs from RAG systems by maintaining explicit decision-to-code mappings rather than embedding-based retrieval
autonomous tui agent with react-style coordinator
Medium confidenceProvides a terminal-based autonomous agent (haft agent command) that executes the engineering cycle without human intervention, using a ReAct-style coordinator to move through Understand → Explore → Choose → Execute → Verify phases. The coordinator maintains state in SQLite and can pause at checkpoints for human review. Implements a lemniscate cycle pattern that allows looping back to earlier phases if verification fails.
Implements a lemniscate cycle (figure-8 loop) that allows backtracking from Verify to earlier phases if verification fails, rather than linear progression — enables iterative refinement without restarting the entire cycle
More structured than generic ReAct agents because it enforces FPF phases; differs from Devin/Claude Code by running autonomously in terminal without IDE, making it suitable for headless environments
multi-provider llm abstraction with provider-agnostic reasoning
Medium confidenceAbstracts LLM provider differences (OpenAI Codex, Anthropic Claude, Google Gemini) behind a unified interface, allowing the same FPF reasoning cycle to work across different models. Routes tool calls and reasoning prompts to the configured provider via a provider adapter pattern, with fallback support for multiple models. Stores provider configuration in project policy files.
Implements provider abstraction at the reasoning level (not just API calls), allowing the same FPF cycle to work across Claude, Codex, and Gemini with different tool-calling conventions — uses adapter pattern to normalize provider differences
More flexible than single-provider agents (Claude Code, Cursor) because it supports provider switching; differs from LangChain by focusing on reasoning governance rather than generic LLM chaining
project policy enforcement and governance configuration
Medium confidenceEnforces project-level governance policies via .haft/ directory containing formal specifications (FPF Spec), provider configurations, and decision templates. Policies are versioned and can be checked via haft check command to ensure decisions comply with project standards. Implements a policy-as-code approach where governance rules are stored alongside the project and enforced by the Haft runtime.
Implements governance as versioned policy files in .haft/ directory (similar to .github/ workflows), making policies auditable and version-controlled alongside code — unlike external governance systems, policies live in the repository
More integrated than external compliance tools because policies are co-located with code; differs from linters by enforcing reasoning discipline rather than code style
desktop visual cockpit for decision portfolio management
Medium confidenceProvides a Wails v2-based desktop application (Go backend + React frontend) that visualizes the project's reasoning state across multiple dimensions: decision maturity, freshness, evidence decay, and Pareto fronts of competing solutions. Allows human oversight of autonomous agent decisions, multi-project portfolio views, and interactive decision exploration. Status: pre-alpha.
Implements a visual decision governance cockpit with Pareto front visualization for trade-off analysis — unlike CLI tools, provides interactive exploration of competing solutions and their trade-offs
More specialized than generic project management dashboards because it focuses on decision health and evidence decay; differs from IDE integrations by providing portfolio-level oversight
sqlite-backed artifact store with schema migrations
Medium confidenceImplements a persistent data layer using SQLite with versioned schema migrations, storing decision artifacts, evidence records, codebase mappings, and reasoning state. Provides projection views that map decisions to code modules and track artifact relationships. Migrations are managed via a migration system that ensures schema consistency across versions.
Implements versioned schema migrations that allow Haft to evolve its data model without breaking existing projects — uses a migration system similar to database frameworks (Flyway, Alembic) but integrated into the Go binary
More durable than in-memory stores because it persists across sessions; differs from document databases by using relational schema for decision relationships
cli command suite for governance and initialization
Medium confidenceProvides a comprehensive CLI interface with commands for project initialization (haft init), governance checks (haft check), MCP server startup (haft serve), and autonomous agent execution (haft agent). Commands are composable and support piping for integration with shell scripts and CI/CD pipelines. Implements a task-based CLI pattern with structured output (JSON/YAML).
Implements a task-based CLI (similar to Taskfile.yaml) with structured output and composable commands, designed for CI/CD integration rather than interactive use — supports piping and JSON output for automation
More automation-friendly than interactive CLIs because it provides structured output and exit codes; differs from generic CLI tools by focusing on governance-specific commands
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifactssharing capabilities
Artifacts that share capabilities with haft, ranked by overlap. Discovered automatically through the match graph.
Codeium
Free AI code completion — 70+ languages, 40+ IDEs, inline suggestions, chat, free for individuals.
cherry-studio
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
Search1API
** - One API for Search, Crawling, and Sitemaps
Mistral: Devstral Medium
Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves...
servers
Model Context Protocol Servers
mcp-demo-example
MCP demo — ReAct agent using @modelcontextprotocol/server-filesystem via @flomatai/mcp-client
Best For
- ✓Teams using Claude Code, Cursor, or Gemini CLI who want governance over agentic reasoning
- ✓Organizations requiring decision traceability and falsifiable engineering contracts
- ✓Solo developers building complex systems who need to avoid decision drift
- ✓IDE-integrated AI agents (Cursor, Claude Code) that support MCP
- ✓Teams wanting to enforce governance without breaking agent workflows
- ✓Organizations with existing MCP infrastructure
- ✓Teams making high-stakes architectural decisions
- ✓Organizations wanting to prevent decision bias
Known Limitations
- ⚠Requires explicit adoption of FPF methodology — agents must be prompted to use MCP tools
- ⚠No automatic enforcement if agent chooses to bypass the reasoning cycle
- ⚠Pre-alpha TUI agent mode may have stability issues in production workflows
- ⚠MCP transport adds ~50-100ms latency per tool invocation
- ⚠Requires agent to explicitly call tools — no automatic decision capture
- ⚠JSON-RPC serialization overhead for large decision artifacts (>10KB)
Requirements
Input / Output
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
Repository Details
Last commit: Apr 20, 2026
About
Engineering decisions engine that know when they're stale. Frame, compare, decide — with evidence decay and parity enforcement. For Claude Code, Cursor, Gemini CLI, Codex and more.
Categories
Alternatives to haft
Are you the builder of haft?
Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.
Get the weekly brief
New tools, rising stars, and what's actually worth your time. No spam.
Data Sources
Looking for something else?
Search →