RAG & Vector Search
211 repos
Production-ready platform for agentic workflow development.
The agent engineering platform
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
21 Lessons, Get Started Building with Generative AI
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future sessions.
12 Lessons to Get Started Building AI Agents
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Universal memory layer for AI Agents
The best-benchmarked open-source AI memory system. And it's free.
LlamaIndex is the leading document agent and OCR platform
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
AI Data Vault - A query engine for AI Agents to securely query data from any datasource
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Vane is an AI-powered answering engine.
🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex Integration
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Build resilient language agents as graphs.
Open Source AI Platform - AI Chat with advanced features that works with every LLM
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
Data infrastructure for AI
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
Build Real-Time Knowledge Graphs for AI Agents
An open-source RAG-based tool for chatting with your documents.
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Python scraper based on AI
OpenViking is an open-source context database designed specifically for AI Agents (such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolution.
280+ free n8n automation templates — ready-to-use workflows for Gmail, Telegram, Slack, Discord, WhatsApp, Google Drive, Notion, OpenAI, and more. AI agents, RAG chatbots, email automation, social media, DevOps, and document processing. The largest open-source n8n template collection.
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
🔥 MaxKB is an open-source platform for building enterprise-grade agents. A powerful, easy-to-use open-source enterprise-grade agent platform.
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Open-source agentic AI data assistant for the next generation of AI + Data products.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family on various provider services.
Autonomous agents for everyone
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Knowledge Engine for AI Agent Memory in 6 lines of code
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language — get accurate SQL, charts, and BI insights. Supports 12+ data sources (PostgreSQL, BigQuery, Snowflake, etc.) and any LLM (OpenAI, Claude, Gemini, Ollama).
A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9
Memori is agent-native memory infrastructure. A LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
A cross-platform Markdown AI note-taking software.
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, content analysis, and semantic search.
Agent S: an open agentic framework that uses computers like a human
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
A collection of projects showcasing RAG, agents, workflows, and other AI use cases
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
AI memory OS for LLM and Agent systems (moltbot, clawdbot, openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.
Private & local AI personal knowledge management app for high entropy people.
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
Build autonomous AI agents in Python.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, RAG, and support for 100+ LLMs.
AI + Data, online. https://vespa.ai
Large Action Model framework to develop AI Web Agents
Open-source context retrieval layer for AI agents
Context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
A visual playground for agentic workflows: Iterate over your agents 10x faster
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
The open source platform for AI-native application development.
Spec-driven development for large codebases
🐢 Open-Source Evaluation & Testing library for LLM Agents
MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)
Superduper: End-to-end framework for building custom AI applications and agents.
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Low-latency AI engine for mobile devices & wearables
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
The easiest way to use Agentic RAG in any enterprise
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and your private documents. Everything Local & Encrypted.
Nexent is a zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles — unified tools, skills, memory, and orchestration with built-in constraints, feedback loops, and control planes.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
Build, evaluate, and integrate long-term memory for self-evolving agents.
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
Local persistent memory store for LLM applications including Claude Desktop, GitHub Copilot, Codex, Antigravity, etc.
Harness LLMs with Multi-Agent Programming
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
AI agent microservice
The no-code platform for building custom LLM Agents
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
RAG on Paul Graham's essays.
Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.
All-in-one platform for search, recommendations, RAG, and analytics offered via API
[EMNLP-2024] Build multimodal language agents for fast prototype and production
The Open Source Memory Layer For Autonomous Agents
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.
🤖 A Python library for learning and evaluating knowledge graph embeddings
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
Nomic Developer API SDK
The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data.
Semantic Intelligence for Large-Scale Engineering. Context+ is an MCP server designed for developers who demand 99% accuracy. By combining RAG, Tree-sitter AST, Spectral Clustering, and Obsidian-style linking, Context+ turns a massive codebase into a searchable, hierarchical feature graph.
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants, and more. Linux, Windows, Mac
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
The Context Optimization Layer for LLM Applications
This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses a sophisticated graph-based algorithm to handle these tasks.
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
PageLM is a community-driven version of NotebookLM and an education platform that transforms study materials into interactive resources like quizzes, flashcards, notes, and podcasts.
A playground of highly experimental prompts, Jinja2 templates & scripts for machine intelligence models from OpenAI, Anthropic, DeepSeek, Meta, Mistral, Google, xAI & others. Alex Bilzerian (2022-2025).
MemFree - Hybrid AI Search Engine & AI Page Generator
Build, run and scale AI agents like APIs and microservices - observable, auditable and identity-aware from day one.
AI video agents framework for next-gen video interactions and workflows.
An official Qdrant Model Context Protocol (MCP) server implementation
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
A full-featured image/video management app with AI-powered organization and semantic search. Supports metadata from SD-webui, ComfyUI, Fooocus, NovelAI, StableSwarmUI, and more. Available as standalone app, SD-webui extension, or library.
List of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search
SuperEasy 100% Local RAG with Ollama + Email RAG
😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
Open Source Semantic Search for your AI Agent
Search + Chat = SearChat (AI Chat with Search). A conversational AI search engine with DeepResearch, support for OpenAI/Anthropic/VertexAI/Gemini APIs, the SearXNG metasearch engine, and one-click Docker deployment.
Dynamiq is an orchestration framework for agentic AI and LLM applications
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
TypeScript AI Function Calling Framework enhanced by compiler skills.
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector
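Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory is a centralized, continuously updated AI memory knowledge base that systematically organizes cutting-edge research, engineering frameworks, system designs, evaluation benchmarks, and real-world application practices related to LLM memory and agent memory.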
A list of AI memory projects
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
Agent Cloud is like having your own GPT builder with a bunch of extra goodies. The GUI features 1) RAG pipeline which can natively embed 260+ datasources 2) Create Conversational apps (like GPTs) 3) Create Multi Agent process automation apps (crewai) 4) Tools 5) Teams+user permissions. Get started fast with Docker and our install.sh
ARGO is an open-source AI Agent platform that brings Local Manus to your desktop. With one-click model downloads, seamless closed LLM integration, and offline-first RAG knowledge bases, ARGO becomes a DeepResearch powerhouse for autonomous thinking and task planning, with 100% of your data staying local. Supports Win/Mac/Docker.
A curated collection of AI agent research papers released in 2026, covering agent engineering, memory, evaluation, workflows, and autonomous systems.
An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos. Allows full local deployment (web app, RAG server, LLM server). Supports multi-modal RAG content Q&A.
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combines local, global, and web searches for advanced Q&A systems and search engines. This tool simplifies graph-based retrieval integration in open web environments.
Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
A Model Context Protocol (MCP) server implementation that provides database capabilities for Chroma
Giselle: AI App Builder. Open Source.
Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants.
Local semantic search. Stupidly simple.
The Pinecone Python client
Self-hosted web UI for Qdrant
Context-Engine MCP - Agentic Context Compression Suite
On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.
A curated list of retrieval-augmented generation (RAG) in large language models
The Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It also supports seamless integration with the openai/langchain sdk.
Browser based tool to convert PDFs to Markdown
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
Home of the AI workforce - Multi-agent system, AI agents & tools
DeepContext is an MCP server that adds symbol-aware semantic search to Claude Code, Codex CLI, and other agents for faster, smarter context on large codebases.
An MCP server implementation that provides tools for retrieving and processing documentation through vector search, enabling AI assistants to augment their responses with relevant documentation context.
Unified benchmark for evaluating conversational memory and RAG across multiple datasets
Your personal free-to-use AI assistant, built with Gemini & Flutter.
Code search MCP for Claude Code. Make entire codebase the context for any coding agent. Embeddings are created and stored locally. No API cost.
BondAI is an open-source tool for developing AI Agent Systems. BondAI handles the implementation complexities including memory/context management, error handling, vector/semantic search and includes a powerful set of out of the box tools and integrations.
MongoDB Knowledge Service. Powered by MongoDB and Atlas Vector Search.
See how to augment LLMs with real-time data for dynamic, context-aware apps - RAG + Agents + GraphRAG.
Model Context Protocol server to allow for reading and writing from Pinecone. Rudimentary RAG
All-in-one local low-code AI agent development platform. Installs and runs n8n, Flowise, Browser-Use, Qdrant, Ollama, and more. Proxies LLM requests through LiteLLM with Langfuse for observability.
Local RAG researcher agent built using LangGraph, DeepSeek R1 and Ollama
Lightweight, simple embedded Open WebUI widget, allowing you to easily implement chatbot capabilities and RAG workflows into your existing tools, apps and webpages!
Simple Graph Memory for AI applications
Local-GenAI-Search is a generative search engine based on Llama 3, LangChain and Qdrant that answers questions based on your local files
🌟DataTonic: A Data-Capable AGI-style Agent Builder of Agents that creates swarms, runs commands, and securely processes and creates datasets, databases, visualisations, and analyses.
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
The SQLite for vector embeddings — A simple, embedded vector database that stores everything in a single file.
Weaviate Web UI
The most accurate and comprehensive Context Engine as a service, optimized for large codebases, powered by advanced GraphRAG and accessible via MCP. It enriches the context for AI agents like Codex, Claude Code, Cursor, etc., making them 35% more efficient and up to 84% faster.
🤖🔎 STREAM: Search with Top Result Extraction & Answer Model 🔤📊 SEEKTOPIC 🚜📜 Tractor the Text Extractor 📈📝 REASON Docs Writing Agent
Shinkai allows you to create AI agents without touching code. Define tasks, schedule actions, and let Shinkai write custom code for you. Native crypto support included.
The official Pinecone marketplace for Claude Code Plugins
Local RAG server for code editors. Scans your codebase, builds a local context index, and connects to any external LLM for context-aware completions and assistance.
Awesome-RAG: a curated list of Retrieval-Augmented Generation
MovieGPT: A RAG, Gen AI application for Movie Recommendations
A study assistant powered by Claude Opus. It provides various tools to assist with different tasks, such as researching, coding, note-taking and more.
An application that enables users to upload PDF files and ask questions about their content using Retrieval Augmented Generation (RAG)
Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally supports Microsoft LLMLingua prompt token compression
Agent skills for working with Chroma
AI-powered file launcher and semantic search assistant. Like Spotlight/Alfred but with advanced AI capabilities for understanding context and meaning. Features local processing, privacy-first design, and seamless integration with your workflow.
`VectorMD` transforms markdown files into a semantically searchable database, leveraging vector embeddings to efficiently retrieve relevant code snippets or information based on query meanings.
A multi-modal MCP layer for real life, built on continuous video, semantic search and natural language video understanding.
Embedded single-file knowledge graph database with vector search and full-text search for AI/RAG apps
Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.
A reproducible evaluation of how frontier LLMs handle grounded vs. ungrounded questions in RAG systems — measuring correctness, grounding, and calibrated refusal.
Advanced GenAI Legal Ecosystem for Israeli Law. Powered by Autonomous Agents, Multi-Step Tool Calling (Skills), and Gemini Pro. Architected for Precision with RAG, Vector Embeddings, and Agentic Workflows to deliver
🏡 Transform real estate searches with natural language queries; find contextually relevant listings effortlessly using ML embeddings and vector search.
Product deduplication pipeline for Israeli price-comparison — Hebrew/English normalization, FAISS embeddings, LLM cluster refinement. Pair F1: 0.955
Agentic research assistant for Israeli Knesset data — LangGraph + RAG + MCP
Agentic RAG system for architecture firms to query Israeli planning regulations and local projects
A concise guide for using Google's Agent Development Kit (ADK) to build various AI applications.