Capability
3 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →via “tree-sitter-based code compression and comment stripping”
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
Unique: Uses Tree-sitter AST parsing for language-aware comment removal instead of regex patterns, enabling structural understanding of code syntax. Supports 40+ languages natively with automatic fallback to regex-based stripping for unsupported languages, providing consistent compression across heterogeneous codebases.
vs others: More accurate than regex-based comment stripping because it understands language syntax and can distinguish between comments and string literals containing comment-like text. Reduces token consumption by 20-40% compared to naive concatenation while preserving code semantics.
via “comment and docstring filtering with preservation options”
Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap technique from Aider Chat.
Unique: Provides configurable comment and docstring filtering with language-aware detection of multiple comment styles, enabling fine-grained control over documentation preservation in condensed code
vs others: More sophisticated than naive regex-based comment removal because it understands language-specific comment syntax and docstring formats, while remaining simpler than full AST-based approaches
via “tree-sitter based code parsing and semantic chunking”
** - MCP for semantic code search & navigation that reduces token waste
Unique: Uses Tree-sitter AST parsing instead of regex or simple text splitting, enabling structurally-aware chunking that respects language syntax boundaries and extracts semantic units (functions, classes) with full context preservation
vs others: More accurate than line-based or regex-based chunking because it understands actual code structure; more maintainable than custom parsers because Tree-sitter grammars are community-maintained and battle-tested
Building an AI tool with “Tree Sitter Based Code Compression And Comment Stripping”?
Submit your artifact →curl unfragile.ai/agents.md | sh© 2026 Unfragile. The platform for software for agents.