AI Repositories
The open-source AI ecosystem — frameworks like LangChain and CrewAI, libraries, research implementations, awesome-lists, and the building blocks developers use to create AI applications.
100-dimensional English word embeddings for wink-nlp
Digital brain as skills for AI coding CLIs — no vector DB, no embeddings, no infrastructure
A Vitest reporter optimized for LLM parsing with structured, concise output
A lightweight, file-backed vector database for Node.js and browsers with Pinecone-compatible filtering and hybrid BM25 search.
VectoriaDB - A lightweight, production-ready in-memory vector database for semantic search
TalaDB React Native module — document and vector database via JSI HostObject
Local-first document and vector database for React, React Native, and Node.js
AI embeddings and semantic search plugin for Strapi v5 with pgvector support
Lightweight vector database with SQL, SPARQL, and Cypher - runs everywhere (Node.js, Browser, Edge)
Portable WASM embedding generation with SIMD and parallel workers - run text embeddings in browsers, Cloudflare Workers, Deno, and Node.js
The AWS (Bedrock) backend module for the @roadiehq/rag-ai plugin.
Semantic embeddings and vector search - find concepts that resonate
TypeScript bridge for recursive-llm: Recursive Language Models for unbounded context processing with structured outputs
Internal shared utilities for RAG-Forge packages
Real Geeks UI Kit.
LLM eval & testing toolkit
Parse partial JSON generated by LLM
OpenCode plugin that gives coding agents persistent memory using local vector database
QNN LLM binding for Node.js
n8n community node for LM Studio Embeddings API with encoding format selection
Azure OpenAI Chat Model and Embeddings with MS OAuth2 for n8n
Library to query multiple LLM providers in a consistent way
MemberJunction: AI Vector Database Module
Popular AI / LLM Model Brand SVG Logo and Icon Collection
100+ LLM models. Pricing, capabilities, context windows. Always current.
[llm-ui](https://llm-ui.com) markdown block.
Efficient, configurable text chunking utility for LLM vectorization. Returns rich chunk metadata.
Information on LLM models, context window token limit, output token limit, pricing and more
A TypeScript library for validating and securing LLM prompts
[](https://github.com/rogeriochaves/llm-cost/actions/workflows/node.js.yml) [](https://www.npmjs.com/package/ll
Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap technique from Aider Chat.
A super simple text splitter for LLM
Mind engine adapter for KB Labs Mind (RAG, embeddings, vector store integration).
Semantic code search for coding agents. Local embeddings, LLM summaries, call graph tracing.
Generate LLM-friendly llms.txt files from markdown and MDX content files
Genkit AI framework plugin for Pinecone vector database.
OpenAI intelligence adapter for Engram — embeddings, summarization, entity extraction, cross-encoder reranking
TypeScript client for encrypted vector database with maximum security and speed
Effect modules for working with AI apis
EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js
AI support bot framework with RAG and ticket management
CloseVector is fundamentally a vector database. We have made dedicated libraries available for both browsers and node.js, aiming for easy integration no matter your platform. One feature we've been working on is its potential for scalability. Instead of b
Azure AI Projects client library.
autogen types for proxy gql
autogen for directory srv
autogen for calendar srv
autogen for adopus srv
Force Remove Copilot, Recall and More in Windows 11
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
OpenUI let's you describe UI using your imagination, then see it rendered live.
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
AI + Data, online. https://vespa.ai
fast-stable-diffusion + DreamBooth
Community interface for generative AI
Entitas is a super fast Entity Component System (ECS) Framework specifically made for C# and Unity
🚀 Beautiful highly customizable statusline for Claude Code CLI with powerline support, themes, and more.
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Private & local AI personal knowledge management app for high entropy people.
A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.
An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms
An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9
Run Stable Diffusion on Mac natively
The best agent harness.
A multi-module course teaching everything you need to know about using GitHub Copilot as an AI Peer Programming resource.
Python tool for converting files and office documents to Markdown.
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry.
Multi-Platform Package Manager for Stable Diffusion
THE Copilot in Obsidian
Official repository for LTX-Video
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Meta-programming for Swift, stop writing boilerplate code.
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. #1 coding agent on OpenRouter. 1.5M+ Kilo Coders. 25T+ tokens processed
Java 1-25 Parser and Abstract Syntax Tree for Java with advanced analysis functionalities.
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial product
The first GitHub Copilot, Codeium and ChatGPT Xcode Source Editor Extension
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Design-first Go framework that generates API code, documentation, and clients. Define once in an elegant DSL, deploy as HTTP and gRPC services with zero drift between code and docs.
💫 Toolkit to help you get started with Spec-Driven Development
TypeScript Compiler API wrapper for static analysis and programmatic code changes.
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux)
Free universal database tool and SQL client
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
The ultimate LLM/AI application development framework in Go.
A repository of models, textual inversions, and more
Powerful AI Client
🌻 一键拥有你自己的 ChatGPT+众多AI 网页服务 | One click access to your own ChatGPT+Many AI web services
Stable Diffusion built-in to Blender
stable diffusion webui colab
What are AI Repositories?
Open-source AI repositories are the building blocks of the AI ecosystem. They include frameworks (LangChain, Transformers), tools (Ollama, vLLM), research implementations, and community projects. GitHub is the primary host, with repositories ranging from production-ready libraries to cutting-edge research code.
How to Choose
Beyond star count, evaluate: maintenance activity (last commit date, PR response time), documentation quality, test coverage, and community health (Discord/issues responsiveness). For production use, check the release cadence and breaking change history. Star count indicates popularity, not quality.
Key Capabilities to Evaluate
Common Patterns
Install as a dependency (npm, pip). The most common pattern — import and use in your code.
Provides the application structure — you write code within its patterns. More opinionated, more features.
Clone and run. Complete application with its own UI, API, and storage.
Paper companion code. Often requires adaptation for production use.
What to Watch Out For
Top Capabilities
Browse all →Analyzes selected code or entire files and generates natural language explanations of what the code does, how it works, and why certain patterns were chosen. The feature can produce documentation in multiple formats (docstrings, comments, markdown) and supports various documentation styles (JSDoc, Sphinx, etc.). Developers can request explanations at different levels of detail (high-level overview, line-by-line breakdown, architectural context) through the chat interface, with responses appearing as formatted text or code comments.
Translates non-English speech directly to English text using the same Transformer encoder-decoder architecture by prepending a 'translate' task token during decoding, bypassing explicit transcription. The AudioEncoder processes mel spectrograms identically to transcription, but the TextDecoder generates English tokens directly from audio embeddings. This end-to-end approach avoids cascading errors from intermediate transcription-then-translation pipelines and enables language-agnostic audio understanding.
Detects the spoken language in audio by analyzing the AudioEncoder embeddings and using the TextDecoder to predict a language token before generating transcription text. Language detection is implicit in the multitask training; the model learns to identify language from acoustic features without a separate classification head. Supports 99 languages with varying confidence based on training data representation (English: 65% of training data, others: 0.1-2%).
Maintains conversation history within a single chat session, allowing developers to ask follow-up questions, request refinements, and build on previous responses without re-providing context. The extension manages conversation state (messages, responses, context) and sends the full conversation history to ChatGPT's API with each request, enabling contextual understanding of refinement requests like 'make it faster' or 'add error handling'.
Generates new code snippets based on natural language descriptions by sending the user's intent and current editor selection context to OpenAI's API, then inserting the generated code at the cursor position or displaying it in the sidebar. The extension reads the active editor's selected text to provide code context, enabling the model to generate syntactically appropriate code for the detected language. Generation is triggered via keyboard shortcut (Ctrl+Alt+G), command palette, or toolbar button.
Generates docstrings, comments, and API documentation for functions, classes, and modules by analyzing code structure and semantics using GPT-4o. The extension detects function signatures, parameter types, and return types, then generates documentation in multiple formats (JSDoc, Python docstrings, Javadoc, etc.) matching the language and project conventions. Generated docs are inserted inline with proper indentation and formatting.
Analyzes staged or modified code changes in the current Git repository and generates descriptive commit messages using the configured AI provider. The feature integrates with VS Code's Git context to identify changed files and diffs, then sends this information to the AI model to produce commit messages following conventional commit formats or project-specific conventions. This automation reduces the cognitive load of writing commit messages while maintaining code quality and repository history clarity.
Offers a freemium pricing structure where basic problem detection and explanations are available for free, with premium features (likely advanced fix generation, priority support, or higher API quotas) available through paid subscription. The free tier includes GNN-based problem detection and LLM-powered explanations using Metabob's default backend, while premium tiers likely unlock OpenAI ChatGPT integration, higher analysis quotas, or team features. Pricing details are not publicly documented in the marketplace listing.
Browse Other Types
Autonomous AI systems that act on your behalf
ModelsFoundation models, fine-tunes, and specialized AI models
MCP ServersModel Context Protocol tools and integrations
APIsProgrammatic endpoints for AI capabilities
ExtensionsBrowser and IDE extensions powered by AI
WorkflowsAutomation sequences and AI pipelines
View all 14 types →Frequently Asked Questions
How do I evaluate an open-source AI project?
Look beyond stars: check last commit date, open issue count vs. closed ratio, release frequency, documentation quality, test coverage, and license terms. A repo with 500 stars and weekly commits is often more reliable than one with 5000 stars and no commits in 6 months.