visual agent orchestration with rag and workflow integration
Provides a React 18-based visual canvas IDE for composing conversational AI agents by connecting LLMs, RAG knowledge bases, plugins, and workflow nodes without writing code. Uses a FlowGram engine to render and manage directed acyclic graphs of agent logic, Zustand for real-time canvas state synchronization, and a Thrift IDL layer that enforces strict type contracts between the frontend and the Go backend services that execute the composed workflows.
Unique: Combines the FlowGram visual canvas with Thrift-based type-safe RPC contracts and a Go backend structured around domain-driven design (DDD), enabling visual agent composition with strict schema validation and multi-provider LLM support (OpenAI, Volcengine) in a single monorepo
vs alternatives: Offers tighter type safety and built-in visual debugging compared with LangChain's Python-based DAG approach, and lower operational complexity than Kubernetes-native orchestration platforms by bundling UI, backend, and deployment in a single Docker Compose stack
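As a rough illustration of the DAG model, here is a minimal Go sketch of how a composed workflow could be represented and ordered for execution; the node kinds, field names, and the Kahn's-algorithm helper are illustrative assumptions, not the repo's actual FlowGram or Thrift types.

```go
package workflow

import "fmt"

// Hypothetical node/edge types for a composed agent workflow; the actual
// FlowGram canvas schema and Thrift types in the repo differ.
type NodeKind string

const (
	NodeLLM       NodeKind = "llm"
	NodeKnowledge NodeKind = "knowledge" // RAG retrieval step
	NodePlugin    NodeKind = "plugin"
	NodeCondition NodeKind = "condition"
)

type Node struct {
	ID     string
	Kind   NodeKind
	Config map[string]any // node-specific settings: model, prompt, plugin ID, ...
}

type Edge struct {
	From, To string // node IDs; the edges must form a DAG
}

type Workflow struct {
	Nodes []Node
	Edges []Edge
}

// TopoOrder returns an execution order via Kahn's algorithm and rejects graphs
// that are not valid DAGs.
func (w Workflow) TopoOrder() ([]string, error) {
	inDeg := make(map[string]int, len(w.Nodes))
	next := make(map[string][]string)
	for _, n := range w.Nodes {
		inDeg[n.ID] = 0
	}
	for _, e := range w.Edges {
		inDeg[e.To]++
		next[e.From] = append(next[e.From], e.To)
	}
	var queue, order []string
	for id, d := range inDeg {
		if d == 0 {
			queue = append(queue, id)
		}
	}
	for len(queue) > 0 {
		id := queue[0]
		queue = queue[1:]
		order = append(order, id)
		for _, to := range next[id] {
			if inDeg[to]--; inDeg[to] == 0 {
				queue = append(queue, to)
			}
		}
	}
	if len(order) != len(w.Nodes) {
		return nil, fmt.Errorf("workflow graph contains a cycle or dangling edge")
	}
	return order, nil
}
```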
multi-provider llm model service management and routing
Abstracts LLM provider APIs (OpenAI, Volcengine, and others) behind a unified model service layer that manages model lists, credentials, and request routing. The Go backend uses the Hertz HTTP framework with domain-driven service handlers that normalize provider-specific request/response formats into a common interface, allowing agents to switch models or providers without workflow changes.
Unique: Implements provider abstraction via Go domain services with Hertz HTTP handlers that normalize OpenAI, Volcengine, and custom provider APIs into a single Thrift-defined interface, enabling zero-code provider switching at runtime
vs alternatives: More tightly integrated than LiteLLM (a Python library) because routing is built into the backend service layer with native Go performance; simpler than Anthropic's batch API or OpenAI's fine-tuning workflows because it focuses purely on request routing and credential management
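A minimal sketch of the normalization pattern, assuming hypothetical request/response shapes and a Provider interface; the repo's Thrift-defined types and Hertz handlers are richer than this:

```go
package modelmgr

import (
	"context"
	"fmt"
)

// Hypothetical unified request/response shapes; the real Thrift-defined
// interface carries more fields (streaming, tools, usage, ...).
type Message struct {
	Role    string // "system" | "user" | "assistant"
	Content string
}

type ChatRequest struct {
	Model    string
	Messages []Message
}

type ChatResponse struct {
	Content string
}

// Provider is the abstraction each LLM vendor adapter implements.
type Provider interface {
	Name() string
	Chat(ctx context.Context, req ChatRequest) (ChatResponse, error)
}

// Router picks a provider by name and forwards the normalized request,
// so callers never touch vendor-specific formats.
type Router struct {
	providers map[string]Provider
}

func NewRouter(ps ...Provider) *Router {
	r := &Router{providers: make(map[string]Provider, len(ps))}
	for _, p := range ps {
		r.providers[p.Name()] = p
	}
	return r
}

func (r *Router) Chat(ctx context.Context, provider string, req ChatRequest) (ChatResponse, error) {
	p, ok := r.providers[provider]
	if !ok {
		return ChatResponse{}, fmt.Errorf("unknown provider %q", provider)
	}
	return p.Chat(ctx, req)
}
```

Switching providers then reduces to passing a different provider name (or credential set) to the router; the workflow definition itself never changes.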
openapi and chat sdk for agent integration and deployment
Exposes agent functionality through OpenAPI endpoints for chat session management and a Chat SDK (TypeScript/Python) for application integration. The OpenAPI spec is auto-generated from Thrift IDL, providing standard REST endpoints for creating sessions, sending messages, and retrieving traces. The Chat SDK wraps these endpoints with convenience methods, error handling, and streaming support for real-time agent responses.
Unique: Auto-generates the OpenAPI spec from Thrift IDL and provides Chat SDK wrappers for TypeScript/Python with streaming support, enabling low-friction agent integration into external applications
vs alternatives: More standardized than custom REST APIs because OpenAPI spec is auto-generated; more convenient than raw HTTP because Chat SDK handles authentication, error handling, and streaming automatically
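A hedged Go sketch of the kind of thin client such an SDK wraps; the endpoint paths, payload fields, and bearer-token auth shown here are assumptions, and the generated OpenAPI spec is the authority for the real routes:

```go
package chatclient

import (
	"bytes"
	"context"
	"encoding/json"
	"fmt"
	"net/http"
)

// Client is a minimal, hypothetical wrapper over the chat OpenAPI endpoints.
type Client struct {
	BaseURL string
	Token   string
	HTTP    *http.Client
}

type createSessionResp struct {
	SessionID string `json:"session_id"`
}

func (c *Client) post(ctx context.Context, path string, in, out any) error {
	body, err := json.Marshal(in)
	if err != nil {
		return err
	}
	req, err := http.NewRequestWithContext(ctx, http.MethodPost, c.BaseURL+path, bytes.NewReader(body))
	if err != nil {
		return err
	}
	req.Header.Set("Content-Type", "application/json")
	req.Header.Set("Authorization", "Bearer "+c.Token)
	resp, err := c.HTTP.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode >= 300 {
		return fmt.Errorf("unexpected status %s", resp.Status)
	}
	return json.NewDecoder(resp.Body).Decode(out)
}

// CreateSession and SendMessage mirror the convenience methods a Chat SDK
// layers over the raw REST endpoints (error handling, auth, decoding).
func (c *Client) CreateSession(ctx context.Context, agentID string) (string, error) {
	var out createSessionResp
	err := c.post(ctx, "/v1/sessions", map[string]string{"agent_id": agentID}, &out)
	return out.SessionID, err
}

func (c *Client) SendMessage(ctx context.Context, sessionID, text string) (string, error) {
	var out struct {
		Content string `json:"content"`
	}
	err := c.post(ctx, "/v1/sessions/"+sessionID+"/messages", map[string]string{"content": text}, &out)
	return out.Content, err
}
```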
infrastructure-as-code deployment with docker, kubernetes, and helm
Provides Docker Compose configurations for local development and Kubernetes Helm charts for production deployment. The Docker Compose setup includes all services (frontend, backend, MySQL, Redis, Elasticsearch, vector databases) with environment variable configuration. Helm charts abstract Kubernetes resources (Deployments, Services, ConfigMaps, Secrets) and enable parameterized multi-environment deployments (staging, production) with different resource limits and replica counts.
Unique: Provides both Docker Compose for local development and Kubernetes Helm charts for production, with parameterized multi-environment support and infrastructure abstraction
vs alternatives: More flexible than managed Coze Cloud because it enables on-premises deployment; simpler than writing raw Kubernetes YAML because Helm charts provide templating and parameterization
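One consequence of the Compose/Helm split is that the same binaries read their configuration from injected environment variables; a hedged Go sketch of that pattern follows, with variable names that are assumptions rather than the repo's actual keys:

```go
package config

import (
	"fmt"
	"os"
)

// Config holds the connection settings a backend service needs; hypothetical
// field and env names, the real keys live in the repo's .env, docker-compose
// files, and Helm values.
type Config struct {
	MySQLDSN     string
	RedisAddr    string
	ElasticAddr  string
	VectorDBAddr string
}

func getenv(key, fallback string) string {
	if v := os.Getenv(key); v != "" {
		return v
	}
	return fallback
}

// Load shows the pattern: the same binary runs unchanged under Docker Compose
// (local) and Helm-managed Kubernetes (production); only the injected
// environment differs between the two.
func Load() (Config, error) {
	cfg := Config{
		MySQLDSN:     os.Getenv("MYSQL_DSN"),
		RedisAddr:    getenv("REDIS_ADDR", "redis:6379"),
		ElasticAddr:  getenv("ES_ADDR", "http://elasticsearch:9200"),
		VectorDBAddr: getenv("VECTOR_DB_ADDR", "milvus:19530"),
	}
	if cfg.MySQLDSN == "" {
		return Config{}, fmt.Errorf("MYSQL_DSN is required")
	}
	return cfg, nil
}
```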
rag knowledge base indexing, retrieval, and semantic search
Provides a resource management system for uploading, indexing, and retrieving documents through a RAG pipeline built on the Eino framework. Documents are embedded with configurable vector models, stored in vector databases (Milvus, OceanBase, or similar), and retrieved via hybrid ranking that combines semantic vector search with BM25 keyword matching. The backend Go services handle chunking, embedding, and retrieval orchestration, while the frontend provides UI for knowledge base CRUD and search testing.
Unique: Integrates Eino framework for RAG orchestration with hybrid BM25+semantic search, supports multiple vector databases (Milvus, OceanBase) via pluggable adapters, and provides visual knowledge base management UI with retrieval testing in the same monorepo
vs alternatives: More integrated than LangChain's RAG chains because vector DB and embedding management are built into the backend service layer; simpler than Vespa or Elasticsearch-only solutions because it combines semantic and keyword search without separate infrastructure
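A simplified Go sketch of the index/retrieve flow under assumed Chunker/Embedder/VectorStore interfaces; the real pipeline is orchestrated through Eino, and the actual hybrid-ranking formula will differ from the naive weighted sum shown here:

```go
package knowledge

import (
	"context"
	"sort"
)

// Hypothetical pipeline interfaces; concrete adapters (Milvus, OceanBase,
// Elasticsearch BM25, embedding models) plug in behind them.
type Chunker interface {
	Split(doc string) []string
}

type Embedder interface {
	Embed(ctx context.Context, texts []string) ([][]float32, error)
}

type Hit struct {
	Chunk string
	Score float64
}

type VectorStore interface {
	Upsert(ctx context.Context, chunks []string, vecs [][]float32) error
	SearchByVector(ctx context.Context, vec []float32, topK int) ([]Hit, error)
}

type KeywordIndex interface {
	SearchBM25(ctx context.Context, query string, topK int) ([]Hit, error)
}

type Pipeline struct {
	Chunk  Chunker
	Embed  Embedder
	Vector VectorStore
	BM25   KeywordIndex
}

// Index chunks a document, embeds the chunks, and writes them to the vector store.
func (p Pipeline) Index(ctx context.Context, doc string) error {
	chunks := p.Chunk.Split(doc)
	vecs, err := p.Embed.Embed(ctx, chunks)
	if err != nil {
		return err
	}
	return p.Vector.Upsert(ctx, chunks, vecs)
}

// Retrieve runs semantic and BM25 search, then merges scores with a naive
// weighted sum as a stand-in for the real hybrid ranking.
func (p Pipeline) Retrieve(ctx context.Context, query string, topK int) ([]Hit, error) {
	qvecs, err := p.Embed.Embed(ctx, []string{query})
	if err != nil {
		return nil, err
	}
	semantic, err := p.Vector.SearchByVector(ctx, qvecs[0], topK)
	if err != nil {
		return nil, err
	}
	keyword, err := p.BM25.SearchBM25(ctx, query, topK)
	if err != nil {
		return nil, err
	}
	merged := map[string]float64{}
	for _, h := range semantic {
		merged[h.Chunk] += 0.7 * h.Score
	}
	for _, h := range keyword {
		merged[h.Chunk] += 0.3 * h.Score
	}
	out := make([]Hit, 0, len(merged))
	for c, s := range merged {
		out = append(out, Hit{Chunk: c, Score: s})
	}
	sort.Slice(out, func(i, j int) bool { return out[i].Score > out[j].Score })
	if len(out) > topK {
		out = out[:topK]
	}
	return out, nil
}
```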
plugin and tool integration with schema-based function calling
Enables agents to invoke external tools and APIs through a plugin registry system where each plugin defines a Thrift-based schema specifying inputs, outputs, and execution logic. The backend maintains a plugin service that validates requests against schemas, handles authentication/credentials, and orchestrates execution via HTTP or gRPC. Plugins can be built as standalone services or embedded Go modules, and the frontend provides UI for plugin discovery, configuration, and testing.
Unique: Uses Thrift-based schema definitions for strict plugin contracts, supports both HTTP and gRPC plugin execution, and provides centralized credential management with visual plugin testing UI in the frontend
vs alternatives: More type-safe than OpenAI's function calling because schemas are enforced at the IDL layer; more flexible than LangChain's tool decorators because plugins can be external services or embedded modules
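A minimal Go sketch of schema-validated plugin dispatch, assuming simplified ParamSpec/Plugin types in place of the real Thrift-defined contracts:

```go
package plugin

import (
	"context"
	"fmt"
)

// ParamSpec is a hypothetical, simplified stand-in for a Thrift-defined
// plugin parameter: each plugin declares typed inputs that are validated
// before execution.
type ParamSpec struct {
	Name     string
	Type     string // "string" | "number" | "boolean"
	Required bool
}

type Plugin interface {
	Name() string
	Params() []ParamSpec
	Invoke(ctx context.Context, args map[string]any) (any, error) // HTTP, gRPC, or in-process
}

type Registry struct {
	plugins map[string]Plugin
}

func NewRegistry(ps ...Plugin) *Registry {
	r := &Registry{plugins: map[string]Plugin{}}
	for _, p := range ps {
		r.plugins[p.Name()] = p
	}
	return r
}

// Call validates arguments against the declared schema, then dispatches.
func (r *Registry) Call(ctx context.Context, name string, args map[string]any) (any, error) {
	p, ok := r.plugins[name]
	if !ok {
		return nil, fmt.Errorf("unknown plugin %q", name)
	}
	for _, spec := range p.Params() {
		v, present := args[spec.Name]
		if !present {
			if spec.Required {
				return nil, fmt.Errorf("missing required parameter %q", spec.Name)
			}
			continue
		}
		if !typeMatches(spec.Type, v) {
			return nil, fmt.Errorf("parameter %q: expected %s", spec.Name, spec.Type)
		}
	}
	return p.Invoke(ctx, args)
}

func typeMatches(t string, v any) bool {
	switch t {
	case "string":
		_, ok := v.(string)
		return ok
	case "number":
		switch v.(type) {
		case float64, int, int64:
			return true
		}
		return false
	case "boolean":
		_, ok := v.(bool)
		return ok
	}
	return false
}
```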
agent lifecycle management with versioning, publishing, and deployment
Manages the complete agent lifecycle from creation through deployment, including version control, publishing to registries, and deployment to production environments. The backend stores agent definitions (prompts, workflows, RAG bindings, plugins) in MySQL, tracks version history, and provides APIs for publishing agents as immutable releases. The frontend IDE includes publish workflows, deployment configuration UI, and agent marketplace browsing for discovering and importing published agents.
Unique: Provides end-to-end agent lifecycle management with MySQL-backed version history, immutable published releases, and a visual agent marketplace UI, integrated into the same monorepo as the IDE
vs alternatives: More comprehensive than Hugging Face Model Hub because it versions entire agent configurations (not just models), and simpler than Kubernetes Helm because deployment is abstracted through a UI rather than requiring YAML templating
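A hedged Go sketch of the mutable-draft / immutable-release pattern, using an in-memory store and hypothetical field names in place of the MySQL-backed schema:

```go
package agent

import (
	"fmt"
	"time"
)

// Draft is the mutable working copy of an agent definition; Release is a
// frozen, versioned snapshot. Field names here are illustrative assumptions.
type Draft struct {
	AgentID   string
	Prompt    string
	Workflow  []byte   // serialized workflow DAG
	Knowledge []string // bound knowledge base IDs
	Plugins   []string // bound plugin IDs
	UpdatedAt time.Time
}

type Release struct {
	AgentID     string
	Version     int   // monotonically increasing per agent
	Snapshot    Draft // frozen copy of the draft at publish time
	PublishedAt time.Time
}

type Store struct {
	releases map[string][]Release // keyed by agent ID; a database in the real system
}

func NewStore() *Store { return &Store{releases: map[string][]Release{}} }

// Publish freezes the current draft as the next immutable version.
func (s *Store) Publish(d Draft) Release {
	next := len(s.releases[d.AgentID]) + 1
	rel := Release{AgentID: d.AgentID, Version: next, Snapshot: d, PublishedAt: time.Now()}
	s.releases[d.AgentID] = append(s.releases[d.AgentID], rel)
	return rel
}

// Get returns a specific published version, e.g. for deployment or rollback.
func (s *Store) Get(agentID string, version int) (Release, error) {
	rels := s.releases[agentID]
	if version < 1 || version > len(rels) {
		return Release{}, fmt.Errorf("agent %s has no version %d", agentID, version)
	}
	return rels[version-1], nil
}
```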
real-time chat session management with execution tracing
Manages chat sessions between users and deployed agents, capturing full execution traces including LLM calls, tool invocations, RAG retrievals, and workflow steps. Sessions are stored in MySQL with Redis caching for active sessions, and the backend exposes OpenAPI endpoints for session creation, message sending, and trace retrieval. The frontend provides a chat UI with side-by-side execution trace visualization, allowing developers to inspect intermediate states and debug agent behavior.
Unique: Captures full execution traces with nested LLM calls, tool invocations, and RAG retrievals in a single session record, provides visual trace inspection UI in the frontend, and exposes both OpenAPI and Chat SDK for integration
vs alternatives: More detailed than LangSmith's tracing because traces are captured at the backend service layer with full context; simpler than Datadog APM because it's purpose-built for agent debugging rather than general observability
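A rough Go sketch of nested execution-trace records and the kind of aggregate a trace UI might compute; the span kinds and fields are assumptions, not the stored schema:

```go
package trace

import "time"

// Span kinds covering the events captured per chat turn; hypothetical names.
type SpanKind string

const (
	SpanLLMCall   SpanKind = "llm_call"
	SpanToolCall  SpanKind = "tool_call"
	SpanRetrieval SpanKind = "rag_retrieval"
	SpanWorkflow  SpanKind = "workflow_step"
)

type Span struct {
	Kind     SpanKind
	Name     string // model name, plugin name, knowledge base, or node ID
	Input    string
	Output   string
	Start    time.Time
	Duration time.Duration
	Children []Span // nested calls, e.g. a workflow step that triggers an LLM call
}

// SessionTrace groups the spans recorded for a single message in a session;
// in the real system these records live in MySQL with Redis caching.
type SessionTrace struct {
	SessionID string
	MessageID string
	Spans     []Span
}

// TotalLLMTime walks the span tree and sums time spent inside LLM calls,
// illustrating the kind of per-turn aggregate a trace view can surface.
func (t SessionTrace) TotalLLMTime() time.Duration {
	var walk func(spans []Span) time.Duration
	walk = func(spans []Span) time.Duration {
		var sum time.Duration
		for _, s := range spans {
			if s.Kind == SpanLLMCall {
				sum += s.Duration
			}
			sum += walk(s.Children)
		}
		return sum
	}
	return walk(t.Spans)
}
```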
+4 more capabilities