Semantic Search For Long Term Memories

1

sentence-transformersRepository56/100

via “semantic-search-with-query-document-retrieval”

Framework for sentence embeddings and semantic search.

Unique: Provides unified API for semantic search combining embedding generation, similarity computation, and result ranking; differentiates by supporting both in-memory search and external vector database integration without requiring separate libraries for each approach

vs others: More semantically accurate than keyword-based search (BM25, Elasticsearch) because it understands meaning rather than string matching, and simpler than building custom retrieval systems with separate embedding and ranking components

2

agents-towards-productionRepository55/100

via “dual-memory-system-with-semantic-search”

End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.

Unique: Explicitly separates short-term (Redis) and long-term (vector DB) memory with configurable retrieval strategies, using RedisConfig and VectorStore abstractions — most frameworks conflate these into a single context window, losing the ability to scale memory independently

vs others: Outperforms naive RAG approaches (e.g., LangChain's memory classes) by decoupling recency from relevance; agents can access week-old memories if semantically similar while keeping recent context in fast Redis, reducing both latency and token waste

3

Pieces for DevelopersProduct55/100

via “natural language search across 9-month memory with time-based filtering”

AI code snippet manager with context capture.

Unique: Combines vector-based semantic search with time-based filtering and implicit relationship graphs linking snippets to related activity (chats, tabs, documents), enabling 'bigger picture' context retrieval rather than isolated snippet matching. Local-first processing avoids cloud transmission of search queries.

vs others: Searches personal context (not generic knowledge), supports time-based filtering, and associates results with related activity — unlike GitHub Gist search or IDE snippet managers which lack temporal filtering and activity correlation.

4

mem0Agent54/100

via “semantic memory search with vector and graph-based retrieval”

Universal memory layer for AI Agents

Unique: Supports both vector-based semantic search (24+ vector store providers) and graph-based entity/relationship search (multiple graph store providers) with a unified API, allowing developers to choose or combine retrieval strategies. Includes configurable similarity thresholds and reranking to optimize result quality without requiring manual prompt engineering.

vs others: More flexible than pure vector search (Pinecone, Weaviate) because it adds graph-based relationship traversal, and more practical than pure graph search because it combines semantic similarity scoring with structural queries, enabling both fuzzy and precise memory retrieval.

5

MemOSMCP Server54/100

via “hybrid vector-graph search with multi-modal embedding support”

AI memory OS for LLM and Agent systems(moltbot,clawdbot,openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.

Unique: Fuses vector similarity and graph pattern matching in a single query pipeline with pluggable embedding models for multi-modal inputs, rather than treating vector search and structured queries as separate concerns — enables relationship-aware semantic search.

vs others: Outperforms pure vector databases on relationship-filtered queries and provides explainability via graph paths; slower than vector-only search due to dual-path execution, but more semantically structured than keyword search.

6

all-MiniLM-L6-v2Model51/100

via “semantic-text-search-with-ranking”

feature-extraction model by undefined. 32,39,437 downloads.

Unique: Combines embedding-based retrieval with similarity ranking to enable semantic search without keyword matching — the distilled BERT model is optimized for semantic similarity, making search results more relevant than BM25 for intent-based queries

vs others: More accurate than BM25 keyword search for semantic relevance; faster than cross-encoder reranking because it uses pre-computed embeddings; simpler than learning-to-rank approaches because it requires no training data

7

mcp-memory-serviceMCP Server50/100

via “semantic-memory-retrieval-with-local-embeddings”

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

Unique: Uses ONNX-based local embeddings instead of cloud APIs (OpenAI, Cohere), eliminating per-query costs and latency; combines sqlite-vec for dense search with optional ONNX re-ranker for quality without external dependencies. Supports both local SQLite and remote Cloudflare Vectorize backends with transparent fallback.

vs others: Faster and cheaper than Pinecone/Weaviate for single-agent deployments due to local ONNX inference; more flexible than Anthropic's native memory because it supports arbitrary knowledge graphs and multi-provider agent frameworks.

8

AI memory with biological decayRepository40/100

via “embedding-based semantic memory retrieval”

Most RAG setups fail because they treat memory like a static filing cabinet. When every transient bug fix or abandoned rule is stored forever, the context window eventually chokes on noise, spiking token costs and degrading the agent's reasoning.This implementation experiments with a biological

Unique: Integrates semantic embedding-based retrieval with decay probability scoring, ranking memories by both semantic relevance and temporal confidence. Decay filtering is applied post-retrieval, not pre-computed, allowing dynamic threshold adjustment.

vs others: More flexible than keyword-based search (handles paraphrasing and semantic drift) but more expensive and slower than simple BM25; enables natural language queries without requiring structured memory schemas.

9

Neo4j Knowledge Graph MemoryMCP Server38/100

via “hybrid semantic and exact search”

Store and retrieve user-specific memories across sessions using Neo4j graph database. This MCP memory infrastructure enables AI assistants to maintain context, recall past interactions, and manage memories with semantic search capabilities. Transform your agent's conversations into a searchable memo

Unique: Combines semantic search with exact search capabilities, providing a more comprehensive retrieval system than typical memory solutions.

vs others: Offers a dual approach to search that outperforms single-method systems in accuracy and relevance.

10

Mem0 Memory ServerMCP Server35/100

via “semantic search for memory retrieval”

Enable AI agents to store, search, and delete persistent memories across sessions to enhance context retention and recall. Integrate seamlessly with Mem0.ai's cloud or self-hosted Supabase storage for scalable and reliable memory management. Optimize your LLM applications with advanced filtering, se

Unique: Incorporates advanced NLP techniques for semantic understanding, allowing for more intuitive and context-aware memory retrieval compared to traditional keyword-based systems.

vs others: Offers superior context awareness over standard search systems, making it easier for AI agents to find relevant memories.

11

agent-recall-coreAgent35/100

via “semantic-memory-retrieval-with-ranking”

Core memory palace engine for AgentRecall

Unique: Combines three independent ranking signals (semantic similarity, temporal decay, access frequency) into a unified score rather than relying solely on embedding similarity like standard RAG. Uses spatial memory palace structure to pre-filter candidates before ranking, reducing computation vs. flat vector search.

vs others: More sophisticated than simple vector similarity search because it weights recency and usage patterns, preventing old but semantically similar memories from drowning out recent relevant ones. Spatial pre-filtering reduces ranking computation vs. exhaustive similarity search.

12

Collabmem – a memory system for long-term collaboration with AIRepository34/100

via “persistent conversation memory with semantic indexing”

Hello HN! I built collabmem, a simple memory system for long-term collaboration between humans and AI assistants. And it's easy to install, just ask Claude Code: Install the long-term collaboration memory system by cloning https://github.com/visionscaper/collabmem to a te

Unique: Implements collaborative memory specifically designed for multi-turn AI interactions, using semantic embeddings to surface relevant past context automatically rather than relying on manual memory management or fixed context windows

vs others: Enables true long-term collaboration memory where context persists across sessions and is retrieved semantically, unlike stateless LLM APIs or simple conversation logs that require manual context injection

13

mem0_mcp_privateMCP Server33/100

via “semantic search for long-term memories”

Save, search, and manage long-term memories across users and apps. Quickly recall facts, preferences, and past conversations with semantic search and structured filters. Update or delete specific entries, or bulk-clear a scope to keep context accurate and tidy.

Unique: Integrates a custom-built vector embedding model tailored for user memory contexts, enhancing retrieval accuracy over generic models.

vs others: More efficient than traditional keyword-based searches as it understands context, reducing irrelevant results.

14

Memory Box MCP ServerMCP Server33/100

via “semantic-memory-search-with-intent-matching”

Save, search, and format memories with semantic understanding. Enhance your memory management by leveraging advanced semantic search capabilities directly from Cline. Organize and retrieve your memories efficiently with structured formatting and detailed context.

Unique: Operates as an MCP tool within Cline's context, enabling semantic search directly in the code editor workflow without context-switching to a separate search interface or database tool

vs others: More integrated than standalone vector databases for developer workflows, with direct MCP bindings that reduce latency and context loss compared to REST API calls

15

MementoMCP Server33/100

via “semantic search with temporal awareness”

Enhance your LLM applications with a scalable knowledge graph memory system. Utilize semantic search and temporal awareness to manage and retrieve information effectively, ensuring your agents have persistent and contextual memory capabilities.

Unique: Memento's semantic search integrates temporal awareness directly into the knowledge graph, enabling contextually relevant results based on the timing of information.

vs others: More effective than traditional keyword-based search engines by incorporating temporal context into the retrieval process.

16

Mem0 MemoriesMCP Server33/100

via “contextual memory retrieval”

Store and retrieve user-specific memories to maintain reliable long-term context. Search past memories to surface the most relevant details instantly. Organize preferences and facts per user for consistent, personalized interactions across sessions.

Unique: Incorporates both keyword indexing and semantic search to enhance the relevance of retrieved memories, unlike simpler keyword-only systems.

vs others: Provides faster and more relevant memory retrieval than systems relying solely on keyword matching.

17

Memory-PlusRepository31/100

via “semantic-memory-retrieval-with-similarity-search”

** a lightweight, local RAG memory store to record, retrieve, update, delete, and visualize persistent "memories" across sessions—perfect for developers working with multiple AI coders (like Windsurf, Cursor, or Copilot) or anyone who wants their AI to actually remember them.

Unique: Implements category-aware filtering and recent-memory shortcuts alongside semantic search, allowing agents to choose between expensive semantic queries and fast recency-based lookups depending on context needs

vs others: More lightweight than LangChain's memory modules by focusing purely on vector similarity without additional re-ranking or fusion strategies, trading some ranking sophistication for lower latency and simpler integration

18

AI-Augmented Memory for GroupsProduct30/100

via “semantic search for group memory”

We’re building Largemem, (https://largemem.com) a shared knowledge base where groups upload and maintain a common set of documents (PDFs, scans, audio) and query them conversationally.Each group has its own persistent knowledge base. We parse content into chunks, extract entities, and comb

Unique: Incorporates semantic understanding to enhance search relevance, unlike traditional keyword-based search engines.

vs others: Delivers more relevant results than standard search tools by understanding the context of queries.

19

lettaFramework30/100

via “semantic memory retrieval with context-aware recall”

Create LLM agents with long-term memory and custom tools

Unique: Integrates semantic memory retrieval directly into agent decision-making, allowing agents to actively search their memory rather than relying on fixed context windows or external RAG systems

vs others: More tightly integrated with agent state than external RAG systems, enabling agents to reason about what memories to retrieve and how to use them

20

mem0aiMCP Server29/100

via “semantic memory retrieval with hybrid search”

Long-term memory for AI Agents

Unique: Combines configurable embedding models with provider-agnostic vector search, supporting both semantic and keyword retrieval in a unified query interface, with automatic re-ranking based on metadata filters and relevance scores

vs others: More integrated than using raw vector DB SDKs (handles embedding generation and ranking) while remaining more flexible than LangChain's memory (supports multiple embedding models and hybrid search strategies)

Top Matches

Also Known As

Company