RAG & Vector Search

211 repos
Production-ready platform for agentic workflow development.
★ 139,176TypeScriptupdated 2026-04-20agentagentic-aiagentic-frameworkagentic-workflowai
The agent engineering platform
★ 134,936Pythonupdated 2026-04-19agentsaiai-agentsanthropicchatgpt
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
★ 134,144Pythonupdated 2026-04-20aillmllm-uillm-webuillms
21 Lessons, Get Started Building with Generative AI
★ 109,836Jupyter Notebookupdated 2026-04-16aiazurechatgptdall-egenerative-ai
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
★ 107,537Pythonupdated 2026-04-19agentsllmspythonrag
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
★ 101,440TypeScriptupdated 2026-04-20aialternativeauthdatabasedeno
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
★ 79,009Pythonupdated 2026-04-20llm-app
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
★ 76,556Pythonupdated 2026-04-20ai4sciencechineseocrdocument-parsingdocument-translationkie
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
★ 73,813MDXupdated 2026-03-11agentagentsai-agentschatgptdeep-learning
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future sessions.
★ 67,599TypeScriptupdated 2026-04-20aiai-agentsai-memoryanthropicartificial-intelligence
12 Lessons to Get Started Building AI Agents
★ 59,438Jupyter Notebookupdated 2026-04-20agentic-aiagentic-frameworkagentic-ragai-agentsai-agents-framework
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
★ 58,999JavaScriptupdated 2026-04-17ai-agentscustom-ai-agentsdeepseekkimillama3
Universal memory layer for AI Agents
★ 54,076Pythonupdated 2026-04-20agentsaiai-agentsapplicationchatbots
The best-benchmarked open-source AI memory system. And it's free.
★ 49,688Pythonupdated 2026-04-19aichromadbllmmcpmemory
LlamaIndex is the leading document agent and OCR platform
★ 48,939Pythonupdated 2026-04-16agentsapplicationdatafine-tuningframework
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
★ 43,986Goupdated 2026-04-20annscloud-nativediskanndistributedembedding-database
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
★ 39,126Pythonupdated 2025-07-09aiapichatbotchatgptdatabase
AI Data Vault - A query engine for AI Agents to securely query data from any datasource
★ 39,054Pythonupdated 2026-04-20agentsaianalyticsartificial-inteligencebigquery
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
★ 34,251Pythonupdated 2026-03-26agentaiassistantchatchatgpt
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
★ 34,110Jupyter Notebookupdated 2026-03-23agentsaillmsmachine-learningmcp
Vane is an AI-powered answering engine.
★ 33,989TypeScriptupdated 2026-04-11ai-agentsai-search-engineanswering-engineartificial-intelligencellm
🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex Integration
★ 33,399TypeScriptupdated 2026-04-11agentic-aiagentic-engineeringagentic-frameworkagentic-ragagentic-workflow
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
★ 30,698Rustupdated 2026-04-20ai-searchai-search-engineembeddings-similarityhnswhybrid-search
Build resilient language agents as graphs.
★ 30,406Pythonupdated 2026-04-19agentsaiai-agentschatgptdeepagents
Open Source AI Platform - AI Chat with advanced features that works with every LLM
★ 28,514Pythonupdated 2026-04-20aiai-chatchatgptchatuienterprise-search
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
★ 28,130Pythonupdated 2025-09-30agentic-ragdeep-researchemnlp2024knowledge-curationlarge-language-models
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
★ 27,875TypeScriptupdated 2026-04-20agent-workflowagentic-workflowagentsaiaiagents
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
★ 27,833TypeScriptupdated 2026-04-20agentclaudedeepseekllmmcp
Data infrastructure for AI
★ 27,626Rustupdated 2026-04-19agentsaiai-agentsdatabaserust
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
★ 26,982Jupyter Notebookupdated 2026-04-15aiembeddingslangchainllama-indexllm
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
★ 26,818TypeScriptupdated 2026-04-17applicant-tracking-systematshacktoberfestmachine-learningnatural-language-processing
Build Real-Time Knowledge Graphs for AI Agents
★ 25,389Pythonupdated 2026-04-18agentsgraphllmsrag
An open-source RAG-based tool for chatting with your documents.
★ 25,310Pythonupdated 2026-04-03chatbotllmsopen-sourcerag
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
★ 24,992MDXupdated 2026-04-20agentagentsaigeminigenerative-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
★ 23,485Pythonupdated 2025-10-28aicsvdatadata-analysisdata-science
Python scraper based on AI
★ 23,388Pythonupdated 2026-04-19ai-crawlerai-scrapingai-searchcrawlerdata-extraction
OpenViking is an open-source context database designed specifically for AI Agents (such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolution.
★ 23,068Pythonupdated 2026-04-20agentagentic-ragai-agentsclawbotcontext-database
280+ free n8n automation templates — ready-to-use workflows for Gmail, Telegram, Slack, Discord, WhatsApp, Google Drive, Notion, OpenAI, and more. AI agents, RAG chatbots, email automation, social media, DevOps, and document processing. The largest open-source n8n template collection.
★ 21,599updated 2026-04-09ai-agentsai-automationautomationautomation-templatesawesome
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
★ 21,566Jupyter Notebookupdated 2026-04-15agentsaiai-agentsgenailangchain
🔥 MaxKB is an open-source platform for building enterprise-grade agents. A powerful, easy-to-use, open-source enterprise-grade agent platform.
★ 20,809Pythonupdated 2026-04-20agentagentic-aichatbotdeepseek-r1knowledgebase
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
★ 20,627TypeScriptupdated 2026-04-20agentagent-platformai-pluginschatbotchatbot-framework
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
★ 20,570TypeScriptupdated 2026-04-20cici-cdcicdevaluationevaluation-framework
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
★ 19,489TypeScriptupdated 2025-09-2112-factor12-factor-agentsagentsaicontext-window
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
★ 19,052Pythonupdated 2026-04-20evaluationhacktoberfesthacktoberfest2025langchainllama-index
Open-source agentic AI data assistant for the next generation of AI + Data products.
★ 18,604Pythonupdated 2026-04-03agentsbgidatabasedeepseekgpt
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family on various provider services.
★ 18,303Jupyter Notebookupdated 2026-04-17aifinetuninglangchainllamallama2
Autonomous agents for everyone
★ 18,242TypeScriptupdated 2026-04-20agentagenticaiautonomouschatbot
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
★ 17,849Pythonupdated 2026-04-20agent-builderagentsaichatgptdocsgpt
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
★ 17,479Pythonupdated 2025-01-22agentsagiaiartificial-general-intelligenceartificial-intelligence
Knowledge Engine for AI Agent Memory in 6 lines of code
★ 16,781Pythonupdated 2026-04-20aiai-agentsai-memorycognitive-architecturecognitive-memory
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
★ 16,178Pythonupdated 2026-03-04
Weaviate is an open-source vector database that stores both objects and vectors, combining vector search and structured filtering with the fault tolerance and scalability of a cloud-native database.
★ 16,080Goupdated 2026-04-20approximate-nearest-neighbor-searchgenerative-searchgrpchnswhybrid-search
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language — get accurate SQL, charts, and BI insights. Supports 12+ data sources (PostgreSQL, BigQuery, Snowflake, etc.) and any LLM (OpenAI, Claude, Gemini, Ollama).
★ 15,013TypeScriptupdated 2026-04-16agentanthropicbedrockbigquerybusiness-intelligence
A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval
★ 14,068JavaScriptupdated 2026-04-10datasetfine-tuningjavascriptllmrag
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
★ 14,005Goupdated 2026-04-20agentagenticaichatbotchatbots
An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9
★ 13,996Pythonupdated 2026-04-18agentagentsaichrome-extensionextension
Memori is agent-native memory infrastructure. An LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.
★ 13,868Pythonupdated 2026-04-17agentagent-memoryaiai-memoryaiagent
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
★ 12,424Pythonupdated 2026-04-14agentsaiai-agentsembeddingsinformation-retrieval
A cross-platform Markdown AI note-taking software.
★ 11,313TypeScriptupdated 2026-04-16agentchatbotknowledge-basellmmarkdown
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, content analysis, and semantic search.
★ 11,299TypeScriptupdated 2026-01-06
Agent S: an open agentic framework that uses computers like a human
★ 10,915Pythonupdated 2026-02-21agent-computer-interfaceai-agentscomputer-automationcomputer-usecomputer-use-agent
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
★ 10,466Pythonupdated 2026-03-27agentic-aiagentic-frameworkagentic-workflowagentsai-framework
A collection of projects showcasing RAG, agents, workflows, and other AI use cases
★ 10,258Pythonupdated 2026-04-20agentsaihacktoberfestllmmcp
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
★ 9,887Rustupdated 2026-02-23aiai-agentschatbotclaudecli
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
★ 9,443TypeScriptupdated 2026-04-20agentagentic-ragai-codingclaude-codecode-generation
Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.
★ 9,100C++updated 2026-02-16agentagentic-ragaiclawbotcomputer-vision
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
★ 8,822Pythonupdated 2026-04-23apifyautomationbeautifulsoupcrawlercrawling
AI memory OS for LLM and Agent systems (moltbot, clawdbot, openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.
★ 8,685TypeScriptupdated 2026-04-20agentagent-memoryclawdbotllmllm-memory
Private & local AI personal knowledge management app for high entropy people.
★ 8,558JavaScriptupdated 2025-05-13ailancedbllamallamacpplocal-first
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
★ 8,483TypeScriptupdated 2026-04-15agentsaiai-agentsai-agents-frameworkaiagentframework
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use via CLI, REST API, or MCP server.
★ 8,133Rustupdated 2026-04-29buncsharpdocument-intelligenceelixirffi
Build autonomous AI agents in Python.
★ 7,836Pythonupdated 2026-04-17agentagent-frameworkautonomous-agentautonomous-agentsclaude
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
★ 7,784Pythonupdated 2025-11-19agentagentic-ragclaudedeep-researchdeepseek
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
★ 7,775Pythonupdated 2025-11-07artificial-intelligencelarge-language-modelspythonquestion-answeringrag
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
★ 7,663Pythonupdated 2026-04-20
PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, RAG, and support for 100+ LLMs.
★ 6,986Pythonupdated 2026-04-20agentsaiai-agent-frameworkai-agent-sdkai-agents
AI + Data, online. https://vespa.ai
★ 6,896Javaupdated 2026-04-20aibig-datajavamachine-learningrag
Large Action Model framework to develop AI Web Agents
★ 6,326Pythonupdated 2025-01-21aibrowserlarge-action-modelllmoss
Open-source context retrieval layer for AI agents
★ 6,260Pythonupdated 2026-04-20agent-infrastructureaiai-agentsai-infrastructureapi
Context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools.
★ 5,852C#updated 2026-04-19agentaiavaloniachatclaude
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
★ 5,802Pythonupdated 2025-09-12article-extractorcorpus-buildercorpus-toolscrawlerhtml-to-markdown
A visual playground for agentic workflows: Iterate over your agents 10x faster
★ 5,713TypeScriptupdated 2025-07-20agentagentsaibuilderdeepseek
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
★ 5,437Rustupdated 2026-04-20ai-engineeringai-pipelinearrowartificial-intelligencebig-data
The open source platform for AI-native application development.
★ 5,381Pythonupdated 2024-12-02agentaiai-nativefunction-callgenerative-ai
Spec-driven development for large codebases
★ 5,351Pythonupdated 2026-04-20agentsai-agentsai-agents-frameworkartificial-intelligencedeveloper-tools
🐢 Open-Source Evaluation & Testing library for LLM Agents
★ 5,307Pythonupdated 2026-04-25agent-evaluationai-red-teamai-securityai-testingfairness-ai
MineContext is your proactive context-aware AI partner (Context-Engineering+ChatGPT Pulse)
★ 5,288Pythonupdated 2026-03-12agentcontext-engineeringelectronembedding-modelsjavascript
Superduper: End-to-end framework for building custom AI applications and agents.
★ 5,271Pythonupdated 2025-09-01aichatbotdatadatabasedistributed-ml
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
★ 4,901JavaScriptupdated 2026-04-18chatgptclaudeembeddingsgeminillama3
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
★ 4,727Pythonupdated 2026-04-18analysisautomlbenchmarkingdocument-parserembeddings
Low-latency AI engine for mobile devices & wearables
★ 4,689Cupdated 2026-04-20aiandroidarmedgeedge-ai
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
★ 4,503Goupdated 2026-04-26a2aagentagenticagentic-aiagi
The easiest way to use Agentic RAG in any enterprise
★ 4,431TypeScriptupdated 2025-01-22agenticagentsaidockerllamaindex
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
★ 4,409Pythonupdated 2026-03-13agentaiapplicationdatadeep-learning
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and your private documents. Everything Local & Encrypted.
★ 4,381Pythonupdated 2026-04-20academiaanthropicarxivbravedeep-research
Nexent is a zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles — unified tools, skills, memory, and orchestration with built-in constraints, feedback loops, and control planes.
★ 4,360Pythonupdated 2026-04-20agentagentic-aiagentic-frameworkagentic-ragagentic-workflow
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
★ 4,280Pythonupdated 2026-04-20agentagentsai-agentsai-assistantanthropic
Build, evaluate, and integrate long-term memory for self-evolving agents.
★ 4,212Pythonupdated 2026-04-20agent-memoryagentic-aiaichatsclawdbot
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
★ 4,060TypeScriptupdated 2026-04-20agentsevaluationllm-as-a-judgellm-evaluationllm-framework
Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.
★ 4,020TypeScriptupdated 2026-04-19aiai-agentsai-infrastructureai-memoryartificial-intelligence
Harness LLMs with Multi-Agent Programming
★ 3,985Pythonupdated 2026-04-07agentsaichatgptfunction-callinggpt
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
★ 3,180Pythonupdated 2026-04-20agent-llmagiagixtaiartificial
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
★ 3,085updated 2026-03-10agentagentic-aiagiawesome-listcognitive-science
AI agent microservice
★ 3,034Pythonupdated 2026-03-14ag-ui-protocolagentaiassistantchatbot
The no-code platform for building custom LLM Agents
★ 2,940updated 2024-06-17aiaichatbotchatbotchatbotschatgpt
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
★ 2,897Pythonupdated 2026-04-17agentaiai-agentsllmsmemory
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
★ 2,723Pythonupdated 2026-04-20evaluationllmperformanceragvlm
RAG on Paul Graham's essays.
★ 2,670TypeScriptupdated 2023-07-28
Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.
★ 2,663Jupyter Notebookupdated 2026-04-06agentsai-systemsdata-engineeringfine-tuninghuggingface
All-in-one platform for search, recommendations, RAG, and analytics offered via API
★ 2,641Rustupdated 2026-01-25actixactix-webaiartificial-intelligencediesel
[EMNLP-2024] Build multimodal language agents for fast prototype and production
★ 2,640Pythonupdated 2025-03-19agentchatbotgeminigptgpt4
The Open Source Memory Layer For Autonomous Agents
★ 2,578Jupyter Notebookupdated 2024-10-22agentsknowledge-graphmemorymultiagent-systemsrag
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.
★ 2,239Jupyter Notebookupdated 2026-04-18agentic-aiagentic-frameworkclaudegeminigenai
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
★ 2,213TypeScriptupdated 2025-04-15aiai-agentsaitoolschromadatabase-management
The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.
★ 2,029Pythonupdated 2026-04-24agentagent-memoryai-infraai-toolscontext
🤖 A Python library for learning and evaluating knowledge graph embeddings
★ 1,983Pythonupdated 2026-04-21cudadeep-learningknowledge-base-completionknowledge-graph-embeddingsknowledge-graphs
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
★ 1,974updated 2026-04-15agentawsome-listbenchmarkblogscompress
Nomic Developer API SDK
★ 1,877Pythonupdated 2025-11-11clusteringduplicate-detectionembeddingspythontext
The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data.
★ 1,870PHPupdated 2026-04-20agentagentic-aiagentic-frameworkagentsai
Semantic Intelligence for Large-Scale Engineering. Context+ is an MCP server designed for developers who demand 99% accuracy. By combining RAG, Tree-sitter AST, Spectral Clustering, and Obsidian-style linking, Context+ turns a massive codebase into a searchable, hierarchical feature graph.
★ 1,776TypeScriptupdated 2026-04-06mcp-server
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants, and more. Linux, Windows, Mac
★ 1,749Pythonupdated 2026-02-06aiai-assistantartificial-intelligenceautonomous-agentchatbot
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
★ 1,730Pythonupdated 2026-04-19agent-memoryagentic-aiai-agentsautogenclaude
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
★ 1,715updated 2026-04-17aiartificial-intelligencelarge-language-modelsllmmachine-learning
The Context Optimization Layer for LLM Applications
★ 1,593Pythonupdated 2026-04-26agentaianthropiccompressioncontext-engineering
This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses a sophisticated graph-based algorithm to handle these tasks.
★ 1,586Jupyter Notebookupdated 2025-06-17advanced-ragagentgenailangchainlanggraph
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
★ 1,582updated 2025-10-20agenticagentic-aiagentic-frameworkagentic-patternagentic-rag
PageLM is a community-driven version of NotebookLM and an education platform that transforms study materials into interactive resources like quizzes, flashcards, notes, and podcasts.
★ 1,574TypeScriptupdated 2026-03-27aidockeredtecheducationflashcards
A playground of highly experimental prompts, Jinja2 templates & scripts for machine intelligence models from OpenAI, Anthropic, DeepSeek, Meta, Mistral, Google, xAI & others. Alex Bilzerian (2022-2025).
★ 1,574Jinjaupdated 2025-07-12ai-agentsjinjajinja2-templatesmeta-promptingmultimodal
MemFree - Hybrid AI Search Engine & AI Page Generator
★ 1,495TypeScriptupdated 2025-08-08aiai-searchai-search-enginedevfastgenerate-ui
Build, run and scale AI agents like APIs and microservices: observable, auditable, and identity-aware from day one.
★ 1,476Goupdated 2026-04-18agentagent-authagent-authenticationagent-indentityagent-scaling
AI video agents framework for next-gen video interactions and workflows.
★ 1,367Pythonupdated 2026-01-23agentagent-frameworkai-agentsframeworkllm
An official Qdrant Model Context Protocol (MCP) server implementation
★ 1,366Pythonupdated 2026-03-31claudecursorllmmcpmcp-server
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
★ 1,343Pythonupdated 2026-03-24agentlanguage-modelllmlong-term-memoryoperating-system
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
★ 1,324updated 2026-04-20embeddingslarge-language-modelsllmragrag-embeddings
A full-featured image/video management app with AI-powered organization and semantic search. Supports metadata from SD-webui, ComfyUI, Fooocus, NovelAI, StableSwarmUI, and more. Available as standalone app, SD-webui extension, or library.
★ 1,291Vueupdated 2026-04-08audiocomfyuiextensionfile-explorerfile-server
List of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search
★ 1,286HTMLupdated 2026-03-04aiai-search-engineartificial-intelligenceartificial-intelligence-projectsawesome
SuperEasy 100% Local RAG with Ollama + Email RAG
★ 1,223Pythonupdated 2024-06-04
😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.
★ 1,170updated 2026-04-11artificial-intelligencegenerative-ailarge-language-modelsmachine-learningretrieval-augmented-generation
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
★ 1,152Pythonupdated 2026-04-16chainlitcolbertduckdbevalslate-chunking
Open Source Semantic Search for your AI Agent
★ 1,132TypeScriptupdated 2026-01-17colbertembeddingsgrepgrep-search
Search + Chat = SearChat (AI Chat with Search). Supports OpenAI/Anthropic/VertexAI/Gemini, DeepResearch, SearXNG, and Docker. An AI conversational search engine that supports DeepResearch, the OpenAI/Anthropic/VertexAI/Gemini APIs, the SearXNG metasearch engine, and one-click Docker deployment.
★ 1,043TypeScriptupdated 2026-04-16aianthropicdeepresearchgeminillm
Dynamiq is an orchestration framework for agentic AI and LLM applications
★ 1,039Pythonupdated 2026-04-20agentsaigenerative-aigptllm
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
★ 1,032Pythonupdated 2024-11-13generative-aillmragvector-database
TypeScript AI Function Calling Framework enhanced by compiler skills.
★ 1,021TypeScriptupdated 2026-04-14agentagenticagentic-aiagentic-frameworkai
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector
★ 794Pythonupdated 2026-04-20apiapi-restembeddingsfastapilangchain
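Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory is a centralized, continuously updated AI memory knowledge base that systematically organizes cutting-edge research, engineering frameworks, system designs, evaluation benchmarks, and real-world application practices related to LLM Memory and Agent Memory.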
★ 784Pythonupdated 2026-04-18agent-memoryai-memoryai-memory-systemawesome-ai-memorycontinual-learning
A list of AI memory projects
★ 741Pythonupdated 2025-01-10aiai-agentsai-engineeringai-memoryai-ml
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
★ 701Pythonupdated 2024-05-16aidata-engineeringembeddingsmachine-learningnlp
Agent Cloud is like having your own GPT builder with a bunch of extra goodies. The GUI features 1) RAG pipeline which can natively embed 260+ datasources 2) Create Conversational apps (like GPTs) 3) Create Multi Agent process automation apps (crewai) 4) Tools 5) Teams+user permissions. Get started fast with Docker and our install.sh
★ 683TypeScriptupdated 2025-07-21aiautogenchatgptcrewaidockerized-application
ARGO is an open-source AI Agent platform that brings Local Manus to your desktop. With one-click model downloads, seamless closed LLM integration, and offline-first RAG knowledge bases, ARGO becomes a DeepResearch powerhouse for autonomous thinking and task planning, while 100% of your data stays local. Supports Win/Mac/Docker.
★ 676Pythonupdated 2026-01-06agentagentic-aiaiaigcanthropic
A curated collection of AI agent research papers released in 2026, covering agent engineering, memory, evaluation, workflows, and autonomous systems.
★ 672updated 2026-04-16ai-agentsawesomeawesome-listllmllm-agents
An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos. Allows full local deployment (web app, RAG server, LLM server). Supports multi-modal RAG content Q&A.
★ 619TypeScriptupdated 2026-04-09contentcontent-searchragsearchsearch-engine
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
★ 597Rustupdated 2026-04-17aiai-agentsai-scrapingclicrawler
GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combines local, global, and web searches for advanced Q&A systems and search engines. This tool simplifies graph-based retrieval integration in open web environments.
★ 595Pythonupdated 2025-01-10aiagentsgraphragllmsollamaopenai
Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
★ 559TypeScriptupdated 2026-04-20aiai-search-engineartificial-intelligencegenerative-aigpu-accelerated
A Model Context Protocol (MCP) server implementation that provides database capabilities for Chroma
★ 541Pythonupdated 2025-09-17
Giselle: AI App Builder. Open Source.
★ 519TypeScriptupdated 2026-04-17agentagent-builderagentic-aiaiai-agent
Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants.
★ 489Goupdated 2026-04-19agentsai-agentsai-assistantai-toolsartificial-intelligence
Local semantic search. Stupidly simple.
★ 453Pythonupdated 2024-07-07
The Pinecone Python client
★ 434Pythonupdated 2026-04-08
★ 409Pythonupdated 2026-04-01
Self-hosted web UI for Qdrant
★ 392JavaScriptupdated 2026-04-20
Context-Engine MCP - Agentic Context Compression Suite
★ 387Svelteupdated 2026-04-25aiai-agentscodexcompressioncontext
On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.
★ 379Kotlinupdated 2026-03-15ai-personasandroidgguf-modelsjetpack-composekotlin
A curated list of retrieval-augmented generation (RAG) in large language models
★ 379updated 2025-12-01awesome-listawesome-resourcesembeddingslarge-language-modelsllm
★ 376updated 2025-09-07ragrag-evaluationrag-implementation
The Open Assistant API is a ready-to-use, open-source, self-hosted framework for creating and orchestrating agents/GPTs, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It also supports seamless integration with the OpenAI/LangChain SDKs.
★ 362Pythonupdated 2025-06-24agentaiassistantsassistants-apichatgpt
Browser based tool to convert PDFs to Markdown
★ 335TypeScriptupdated 2025-12-25convertermarkdownnextjspdfrag
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate experiments and evaluations using Azure Cognitive Search and the RAG pattern.
★ 303Pythonupdated 2026-04-20acsazurechunkingdenseembedding
Home of the AI workforce - Multi-agent system, AI agents & tools
★ 279Pythonupdated 2026-01-15clusteringcomputer-visionembeddingsnatural-language-processingnlp
DeepContext is an MCP server that adds symbol-aware semantic search to Claude Code, Codex CLI, and other agents for faster, smarter context on large codebases.
★ 275TypeScriptupdated 2025-09-22aiai-agentsclaudeclaude-codecode
An MCP server implementation that provides tools for retrieving and processing documentation through vector search, enabling AI assistants to augment their responses with relevant documentation context.
★ 259TypeScriptupdated 2025-07-18llmmcpmcp-serversragvector-database
Unified benchmark for evaluating conversational memory and RAG across multiple datasets
★ 248TypeScriptupdated 2026-03-27aiai-memorybenchmark-framework
Your personal free-to-use AI assistant, built with Gemini & Flutter.
★ 231Dartupdated 2025-02-24embeddingsfluttergeminihive
Code search MCP for Claude Code. Make your entire codebase the context for any coding agent. Embeddings are created and stored locally. No API cost.
★ 219Pythonupdated 2025-11-13agentai-codingclaudeclaude-codecode-generation
BondAI is an open-source tool for developing AI Agent Systems. BondAI handles the implementation complexities, including memory/context management, error handling, and vector/semantic search, and includes a powerful set of out-of-the-box tools and integrations.
★ 219Pythonupdated 2024-01-14
MongoDB Knowledge Service. Powered by MongoDB and Atlas Vector Search.
★ 196TypeScriptupdated 2025-09-29chatbotmongodbmongodb-atlasragretrieval-augmented-generation
See how to augment LLMs with real-time data for dynamic, context-aware apps: RAG + Agents + GraphRAG.
★ 165Jupyter Notebookupdated 2026-02-17agentsgenerative-aigraphraglanggraphllms
Model Context Protocol server for reading from and writing to Pinecone. Rudimentary RAG.
★ 148Pythonupdated 2025-01-31claudemcpmcp-servermodel-context-protocolpinecone
All-in-one local low-code AI agent development platform. Installs and runs n8n, Flowise, Browser-Use, Qdrant, Ollama, and more. Proxies LLM requests through LiteLLM with Langfuse for observability.
★ 142TypeScriptupdated 2026-02-03
Local RAG researcher agent built using LangGraph, DeepSeek R1 and Ollama
★ 140Pythonupdated 2025-02-13ai-agentsdeep-researchdeepseekdeepseek-r1langchain
Lightweight, simple embedded Open WebUI widget, allowing you to easily implement chatbot capabilities and RAG workflows into your existing tools, apps and webpages!
★ 114Svelteupdated 2025-09-03aiartifical-intelligencellmopen-webuiopenwebui
Simple Graph Memory for AI applications
★ 104Jupyter Notebookupdated 2026-02-23databasedspygraphsqlite3vector-database
Local-GenAI-Search is a generative search engine based on Llama 3, LangChain and Qdrant that answers questions based on your local files.
★ 96Pythonupdated 2024-08-19generative-ailangchainlarge-language-modelsllama3local
🌟 DataTonic: A Data-Capable AGI-style Agent Builder of Agents that creates swarms, runs commands, and securely processes and creates datasets, databases, visualisations, and analyses.
★ 95Jupyter Notebookupdated 2025-07-19agent-builderagiautogenazurechroma
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
★ 95Pythonupdated 2026-04-08chunkingdependency-inversion-principledockerembeddingsfactory-pattern
The SQLite for vector embeddings — A simple, embedded vector database that stores everything in a single file.
★ 81Pythonupdated 2026-03-02
Weaviate Web UI
★ 79TypeScriptupdated 2023-09-14weaviate
The most accurate and comprehensive Context Engine as a service, optimized for large codebases, powered by advanced GraphRAG and accessible via MCP. It enriches the context for AI agents like Codex, Claude Code, Cursor, etc., making them 35% more efficient and up to 84% faster.
★ 78Pythonupdated 2026-04-17claude-codecursorgraphraglarge-codebasemcp
🤖🔎 STREAM: Search with Top Result Extraction & Answer Model 🔤📊 SEEKTOPIC 🚜📜 Tractor the Text Extractor 📈📝 REASON Docs Writing Agent
★ 72HTMLupdated 2025-12-29ai-searchautocompletehacktoberfestkeywordsknowledge-graph
Shinkai allows you to create AI agents without touching code. Define tasks, schedule actions, and let Shinkai write custom code for you. Native crypto support included.
★ 67Rustupdated 2026-04-19aichatgptguillama3llm
The official Pinecone marketplace for Claude Code Plugins
★ 57Pythonupdated 2026-03-06anthropic-claudeclaude-codeclaude-code-pluginclaude-code-plugin-marketplacehybrid-search
Local RAG server for code editors. Scans your codebase, builds a local context index, and connects to any external LLM for context-aware completions and assistance.
★ 44Pythonupdated 2025-09-11ai-assistantclaudecodegendeepseekgemini
Awesome-RAG: a curated list of Retrieval-Augmented Generation
★ 43TypeScriptupdated 2024-12-31
MovieGPT: A RAG, Gen AI application for Movie Recommendations
★ 38Jupyter Notebookupdated 2024-08-19chatgptlarge-language-modelsllama-indexllmrecommendation-systems
A study assistant powered by Claude Opus. It provides various tools to assist with different tasks, such as researching, coding, note-taking, and more.
★ 26Pythonupdated 2024-07-18aillmpythonrag
An application that enables users to upload PDF files and ask questions about their content using Retrieval Augmented Generation (RAG).
★ 23Pythonupdated 2024-10-18aicosine-similarityembeddingsgoogle-drive-apigpt-4
Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally supports Microsoft LLMLingua prompt token compression.
★ 21updated 2025-12-28ai-ragcloudflare-aicloudflare-ai-gatewaylinkupllm-inference
Agent skills for working with Chroma
★ 15TypeScriptupdated 2026-04-14
AI-powered file launcher and semantic search assistant. Like Spotlight/Alfred but with advanced AI capabilities for understanding context and meaning. Features local processing, privacy-first design, and seamless integration with your workflow.
★ 15TypeScriptupdated 2025-09-25aialfreddesktopdocument-searchelectron
`VectorMD` transforms markdown files into a semantically searchable database, leveraging vector embeddings to efficiently retrieve relevant code snippets or information based on query meanings.
★ 10Pythonupdated 2023-08-19
A multi-modal MCP layer for real life: built on continuous video, semantic search, and natural language video understanding.
★ 9Pythonupdated 2025-09-03
Embedded single-file knowledge graph database with vector search and full-text search for AI/RAG apps
★ 5Zigupdated 2026-04-19
Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.
★ 5TeXupdated 2024-03-22data-extractionevaluationinformation-extractionllmlong-context
A reproducible evaluation of how frontier LLMs handle grounded vs. ungrounded questions in RAG systems — measuring correctness, grounding, and calibrated refusal.
★ 2Pythonupdated 2026-04-24ai-safetyanthropicbenchmarkevaluationgemini
Advanced GenAI Legal Ecosystem for Israeli Law. Powered by Autonomous Agents, Multi-Step Tool Calling (Skills), and Gemini Pro. Architected for Precision with RAG, Vector Embeddings, and Agentic Workflows to deliver
★ 1Pythonupdated 2026-04-21
🏡 Transform real estate searches with natural language queries; find contextually relevant listings effortlessly using ML embeddings and vector search.
★ 1Pythonupdated 2026-04-20bootstrapdialogdotfile-managerdotfilesdotfiles-linux
Product deduplication pipeline for Israeli price-comparison — Hebrew/English normalization, FAISS embeddings, LLM cluster refinement. Pair F1: 0.955
★ 1Pythonupdated 2026-04-07deduplicationembeddingsfaissnlpopenai
Agentic research assistant for Israeli Knesset data — LangGraph + RAG + MCP
★ 1Pythonupdated 2026-04-08
Agentic RAG system for architecture firms to query Israeli planning regulations and local projects
★ 1Pythonupdated 2026-03-24
A concise guide for using Google's Agent Development Kit (ADK) to build various AI applications.
★ 1Pythonupdated 2025-08-09adkadk-pythonagentic-aiagentsgoogle