LLMs & Generative AI
1,269 repos
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Production-ready platform for agentic workflow development.
The agent engineering platform
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Python tool for converting files and office documents to Markdown.
The agent that grows with you
🔥 The API to search, scrape, and interact with the web for AI
21 Lessons, Get Started Building with Generative AI
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
LLM inference in C/C++
An open-source AI agent that brings the power of Gemini directly into your terminal.
Robust Speech Recognition via Large-Scale Weak Supervision
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
Lightweight coding agent that runs in your terminal
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
🙌 OpenHands: AI-Driven Development
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.
A natural language interface for computers
Web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
12 Lessons to Get Started Building AI Agents
Inference code for Llama models
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
A programming framework for agentic AI
A Simple and Universal Swarm Intelligence Engine, Predicting Anything.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
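⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 Say goodbye to information overload with an AI assistant for public-opinion monitoring and trending-topic filtering. Aggregates trending topics from multiple platforms plus RSS subscriptions, with precise keyword filtering. AI-filtered news, AI translation, and AI analysis briefings are pushed straight to your phone. Also supports MCP integration for natural-language conversational analysis, sentiment insight, and trend prediction. Supports Docker, with data self-hosted locally or in the cloud. Smart push notifications via WeChat, Feishu, DingTalk, Telegram, email, ntfy, bark, Slack, and other channels.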
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
omo; the best agent harness - previously oh-my-opencode
Universal memory layer for AI Agents
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
TradingAgents: Multi-Agents LLM Financial Trading Framework
The best ChatGPT that $100 can buy.
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
The best-benchmarked open-source AI memory system. And it's free.
Port of OpenAI's Whisper model in C/C++
LlamaIndex is the leading document agent and OCR platform
The original local LLM interface. Text, vision, tool-calling, training. UI + API, 100% offline and private.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
aider is AI pair programming in your terminal
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
🦍 The API and AI Gateway
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
Powerful AI Client
A generative speech model for daily dialogue.
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
AI Data Vault - A query engine for AI Agents to securely query data from any datasource
Extracted system prompts from ChatGPT (GPT-5.4, GPT-5.3, Codex), Claude (Opus 4.6, Sonnet 4.6, Claude Code), Gemini (3.1 Pro, 3 Flash, CLI), Grok (4.2, 4), Perplexity, and more. Updated regularly.
Integrate the DeepSeek API into popular software
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.
🔥 1Panel is a modern, open-source VPS control panel — and the only one with native AI agent support. Run Ollama models, deploy OpenClaw agents, and manage your entire server stack from one clean web interface.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Vane is an AI-powered answering engine.
The first real AI developer
Self-hosted AI coding assistant
🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex Integration
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
LLM API management & key redistribution system that unifies multiple providers under a single API. Supports mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. Single executable with a Docker image for one-click, out-of-the-box deployment, and an English UI.
Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.
The official Python library for the OpenAI API
The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol
Build resilient language agents as graphs.
SOTA Open Source TTS
AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥
Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini 2.5 Pro, GPT 5, Claude model through API
Open Source AI Platform - AI Chat with advanced features that works with every LLM
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without extensive setup or configuration.
Integrate cutting-edge LLM technology quickly and easily into your apps
A list of AI autonomous agents
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
An autonomous agent that conducts deep research on any data using any LLM providers
Awesome-LLM: a curated list of Large Language Models
Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and use the CLI.
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Find secrets with Gitleaks 🔑
LLM Frontend for Power Users.
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993886460 (Beware of fake accounts)
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.
Build Real-Time Knowledge Graphs for AI Agents
An open-source RAG-based tool for chatting with your documents.
A lightweight, powerful framework for multi-agent workflows
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
🚀 The fast, Pythonic way to build MCP servers and clients.
Marketing skills for Claude Code and AI agents. CRO, copywriting, SEO, analytics, and growth engineering.
Open-source AI hackers to find and fix your app’s vulnerabilities.
Build and run agents you can see, understand and trust.
Distribute and run LLMs with a single file.
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
Glamourous agentic coding for all 💘
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.
A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent
Python scraper based on AI
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
DeepSeek Coder: Let the Code Write Itself
OpenViking is an open-source context database designed specifically for AI Agents (such as openclaw). OpenViking unifies the management of the context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolution.
Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you like it!
Universal LLM Deployment Engine with ML Compilation
Faster Whisper transcription with CTranslate2
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based models. We also release Nano-consistent-150K openly to support the community's development of image generation and unified models (visit the website to see our blog)
The SDK For Browser Agents
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
OpenUI lets you describe UI using your imagination, then see it rendered live.
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
280+ free n8n automation templates — ready-to-use workflows for Gmail, Telegram, Slack, Discord, WhatsApp, Google Drive, Notion, OpenAI, and more. AI agents, RAG chatbots, email automation, social media, DevOps, and document processing. The largest open-source n8n template collection.
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Automate browser based workflows with AI
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
A full-featured, hackable Next.js AI chatbot built by Vercel
Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it!
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
The Autonomous Company Operating System
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
A list of free LLM inference resources accessible via API.
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Instruct-tune LLaMA on consumer hardware
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the simplest implementation of a deep research agent, i.e. an agent that can refine its research direction over time and dive deep into a topic.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
open-source agentic AI data assistant for the next generation of AI + Data products.
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent.
Run agents that work for you based on what you do. AI finally knows what you are doing
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family on various provider services.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Autonomous agents for everyone
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.
Your API ⇒ Paid MCP. Instantly.
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
The interaction control harness for customer-facing AI agents - optimized for building controlled, consistent, and predictable customer interactions with LLMs.
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
LLM Council works together to answer your hardest questions
Janus-Series: Unified Multimodal Understanding and Generation Models
🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life.
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
🧠 Leon is your open-source personal assistant.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Examples and guides for using the Gemini API
The absolute trainer to light up AI agents.
Open-source Agent Operating System
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Knowledge Engine for AI Agent Memory in 6 lines of code
Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform
AI Agent Framework, the Pydantic way
Prompt, run, edit, and deploy full-stack web applications. -- bolt.new -- Help Center: https://support.bolt.new/ -- Community Support: https://discord.com/invite/stackblitz
Inference code for CodeLlama models
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view
Fully autonomous AI Agents system capable of performing complex penetration testing tasks
🏛️ Three Departments and Six Ministries · OpenClaw Multi-Agent Orchestration System: 9 specialized AI agents with real-time dashboard, model config, and full audit trails
Open source AI coding agent. Designed for large projects and real world tasks.
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language — get accurate SQL, charts, and BI insights. Supports 12+ data sources (PostgreSQL, BigQuery, Snowflake, etc.) and any LLM (OpenAI, Claude, Gemini, Ollama).
The LLM Evaluation Framework
FinRL®: Financial Reinforcement Learning. 🔥
MCP Toolbox for Databases is an open source MCP server for databases.
The open-source hub to build & deploy GPT/LLM Agents ⚡️
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
Collection of leaked system prompts
A 100% free modern JS SaaS boilerplate (React, NodeJS, Prisma). Full-featured: Auth (email, google, github, slack, MS), Email sending, Background jobs, Landing page, Payments (Stripe, Polar.sh), Shadcn UI, S3 file upload. AI-ready with tailored AGENTS.md, skills, and Claude Code plugin. One cmd deploy. Powered by Wasp full-stack framework.
A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9
Memori is agent-native memory infrastructure. An LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.
An open-source Agent-first Identity and Access Management (IAM) /LLM MCP & agent gateway and auth server with web UI supporting OpenClaw, MCP, OAuth, OIDC, SAML, CAS, LDAP, SCIM, WebAuthn, TOTP, MFA, Face ID, Google Workspace, Azure AD
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Open-source AI coworker, with memory
The simplest way to run LLaMA on your local machine
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
AI-powered, vision-driven UI automation for every platform.
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
📋 A list of open LLMs available for commercial use.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
A command-line productivity tool powered by AI large language models like GPT-5 that helps you accomplish your tasks faster and more efficiently.
Build Conversational AI in minutes ⚡️
LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 Originator of the Structured Prompt 📌 Initiator of the Meta-Prompt 📌 The most popular paradigm for applying prompts in practice | Language of GPT: the pioneering framework for structured & meta-prompt design. 10,000+ ⭐ | Battle-tested by thousands of users worldwide. Created by 云中江树
Open-source, secure environment with real-world tools for enterprise-grade agents.
A curated list of modern Generative Artificial Intelligence projects and services
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
Go ahead and axolotl questions
Access large language models from the command-line
Low-code framework for building custom LLMs, neural networks, and other AI models
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
🍌 World's largest Nano Banana Pro prompt library — 10,000+ curated prompts with preview images, 16 languages. Google Gemini AI image generation. Free & open source.
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.
A cross-platform Markdown AI note-taking software.
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
Large Language Model Text Generation Inference
The open source codebase powering HuggingChat
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
🌐 The open-source Agentic browser; alternative to ChatGPT Atlas, Perplexity Comet, Dia.
The best way to get AI coding agents to solve hard problems in complex codebases.
A collection of GPT system prompts and various prompt injection/leaking knowledge.
The world's best AI personal assistant for email. Open source app to help you reach inbox zero fast.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
A collection of projects showcasing RAG, agents, workflows, and other AI use cases
A framework for building realtime voice AI agents 🤖🎙️📹
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 14 platforms
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Multi-Agent Harness for Production AI
An open-source, self-hosted personal AI note tool prioritizing privacy, built using TypeScript.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Build, Manage and Deploy AI/ML Systems
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
💬 Typebot is a powerful chatbot builder that you can self-host.
🚀 An awesome list of curated Nano Banana Pro prompts and examples. Your go-to resource for mastering prompt engineering and exploring the creative potential of the Nano Banana Pro (Nano Banana 2) AI image model.
The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.
Typescript/React Library for AI Chat💬🚀
An Open-Source Asynchronous Coding Agent
AI-powered open source recommender system engine that supports classical/LLM rankers and multimodal content via embeddings
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
AI Observability & Evaluation
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
😎 Awesome list of tools and projects with the awesome LangChain framework
Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.
🚀 Less chaos. More flow.
An AI-powered search engine with a generative UI
AI memory OS for LLM and Agent systems (moltbot, clawdbot, openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.
A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.
Private & local AI personal knowledge management app for high entropy people.
An Autonomous LLM Agent for Complex Task Solving
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
Uncensored, open-source alternative to Higgsfield AI, Freepik AI, Krea AI, Openart AI — Free, unrestricted AI image & video generation studio with 200+ models (Flux, Midjourney, Kling, Sora, Veo). No content filters. Self-hosted, MIT licensed.
HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug bounty automation, and security research. Seamlessly bridge LLMs with real-world offensive security capabilities.
Build effective agents using Model Context Protocol and simple workflow patterns
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
Multilingual Voice Understanding Model
Put an end to code hallucinations! GitMCP is a free, open-source, remote MCP server for any GitHub project
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more.
Build autonomous AI agents in Python.
Use your locally running AI models to assist you in your web browsing
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
OpenAI + LINE + Vercel = GPT AI Assistant
Giving Kubernetes Superpowers to everyone
Sweep: AI coding assistant for JetBrains
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
✨ Build AI agents and web apps — with a single binary.
22 prompt engineering techniques with hands-on Jupyter Notebook tutorials, from fundamental concepts to advanced strategies for leveraging LLMs.
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your Claude Code/Codex/Gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
Manage multiple AI terminal agents like Claude Code, Codex, OpenCode, and Amp.
An open source library for deep learning end-to-end dialog systems and chatbots.
AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web without worrying about infrastructure.
An Open-Ended Embodied Agent with Large Language Models
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
THE Copilot in Obsidian
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
Free online AI resume editor; the only official website is https://magicv.art
Adding guardrails to large language models.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Deploy serverless AI workflows at scale. Firebase for AI agents
Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers.
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
A curated list of GPT agents for cybersecurity
WebChatGPT: A browser extension that augments your ChatGPT prompts with web results.
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
An LLM playground you can run on your laptop
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Turn any webpage into structured data using LLMs
Ship AI Agents to Google Cloud in minutes, not months. Production-ready templates with built-in CI/CD, evaluation, and observability.
Large Action Model framework to develop AI Web Agents
The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.
The best agent harness.
Open-source context retrieval layer for AI agents
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
🤖 Awesome list for ChatGPT — an artificial intelligence chatbot developed by OpenAI
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.
an ambient intelligence library
🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.
Towards Human-Sounding Speech
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Curated collection of open source projects related to GPT 🚀🔥🔥
This is an MCP server for Claude that gives it terminal control, file system search and diff file editing capabilities
Zero-Config Code Flow for Claude Code & Codex
TypeScript multi-agent orchestration engine — one runTeam() call from goal to result. Multi-model teams, auto task decomposition, parallel execution. 3 runtime dependencies.
Collection of AI-related utilities. Pull requests are welcome.
Context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools.
Transcribe on your own!
Building AI agents, atomically
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
🧠 Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Including: ChatGPT, Claude, Perplexity, Manus, Claude-Code, Loveable, v0, Grok, same new, windsurf, notion, and MetaAI.
An awesome & curated list of best LLMOps tools for developers
🐬DeepChat - A smart assistant that connects powerful AI to your personal world
A self-organizing file system with llama 3
Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale
A visual playground for agentic workflows: Iterate over your agents 10x faster
53AI Hub is an open-source AI portal, which enables you to quickly build an operational-level AI portal to launch and operate AI agents, prompts, and AI tools. It supports seamless integration with development platforms like Coze, Dify, FastGPT, RAGFlow.
Rules and Knowledge to work better with agents such as Claude Code or Cursor
Real-time webcam demo with SmolVLM and llama.cpp server
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.
The open source platform for AI-native application development.
ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.
Spec-driven development for large codebases
Free prompt engineering online course. ChatGPT and Midjourney tutorials are now included!
🐢 Open-Source Evaluation & Testing library for LLM Agents
MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)
Langchain + Docker + Neo4j + Ollama
Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding agents
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Superduper: End-to-end framework for building custom AI applications and agents.
5ire is a cross-platform desktop AI assistant and MCP client. It is compatible with major service providers and supports local knowledge bases and tools via Model Context Protocol servers.
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
Adversary simulation and Red teaming platform with AI
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
Perplexity Inspired Answer Engine
A collection of prompts, system prompts and LLM instructions
Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy, over six times better than GPT-4. Built with Adaptive Graph-Guided Retrieval and Persistent Debug Memory. Model available Q1 2026 via Kodezi OS.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
A VSCode extension that allows you to use ChatGPT
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
This repo includes Claude prompt curation to use Claude better.
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
AI-Powered Dark Web OSINT Tool
A fast, helpful, and open-source document parser
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
Magicrew. The first open-source all-in-one AI productivity platform (Generalist AI Agent + Workflow Engine + IM + Online collaborative office system)
Lord of Large Language and Multi modal Systems Web User Interface
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
A lightweight next-gen data explorer - Postgres, MySQL, SQLite, MongoDB, Redis, MariaDB, Elastic Search, and Clickhouse with Chat interface
ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Low-latency AI engine for mobile devices & wearables
Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community
UI over MCP. Create next-gen UI experiences with the protocol and SDK!
ByteRover CLI (brv) - The portable memory layer for autonomous coding agents (formerly Cipher)
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.
Use any LLM (Large Language Model) for Deep Research. Supports SSE API and MCP server.
The open-source visual AI programming environment and TypeScript library
A simple yet powerful agent framework that delivers with open-source models
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
A fast multimodal LLM for real-time voice
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and your private documents. Everything Local & Encrypted.
GitHub Agentic Workflows
Nexent is a zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles — unified tools, skills, memory, and orchestration with built-in constraints, feedback loops, and control planes.
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
Mac app for crushing tech interviews with AI
Operit: the most powerful AI agent and AI chat software on Android
Build, evaluate, and integrate long-term memory for self-evolving agents.
Learn Agentic AI using Dapr Agentic Cloud Ascent (DACA) Design Pattern and Agent-Native Cloud Technologies: OpenAI Agents SDK, Memory, MCP, A2A, Knowledge Graphs, Dapr, Rancher Desktop, and Kubernetes.
DeepSeek-VL: Towards Real-World Vision-Language Understanding
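DeepAnalyze is the first agentic LLM for autonomous data science. 🎈 Your AI data analyst: automatically analyzes large volumes of data and generates professional analysis reports in one click!
Open-source evaluation toolkit of large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks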
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
Local persistent memory store for LLM applications including Claude Desktop, GitHub Copilot, Codex, Antigravity, etc.
🤖 A visualization MCP & skills with 25+ visual charts using @antvis. Used for chart generation and data analysis.
Harness LLMs with Multi-Agent Programming
A nearly-live implementation of OpenAI's Whisper.
Every front-end GUI client for ChatGPT, Claude, and other LLMs
Examples and tutorials to help developers build AI systems
Latitude is the open-source agent engineering platform
Like Manus, Computer Use Agent (CUA), and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural language to make computers work by themselves.
Claudable is an open-source web builder that leverages local CLI agents, such as Claude Code, Codex, Gemini CLI, Qwen Code, and Cursor Agent, to build and deploy products effortlessly.
ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.
Claraverse is an open-source, privacy-focused ecosystem to replace ChatGPT, Claude, N8N, ImageGen with your own hosted LLM, keys and compute. With desktop, iOS, and Android apps.
🤖 Build voice-based LLM agents. Modular + open source.
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
An AI Gateway, registry, and proxy that sits in front of any MCP, A2A, or REST/gRPC APIs, exposing a unified endpoint with centralized discovery, guardrails and management. Optimizes Agent & Tool calling, and supports plugins.
Enterprise AI Platform with guardrails, MCP registry, gateway & orchestrator
Fully customizable AI chatbot component for your website
An open source implementation of OpenAI's ChatGPT Code interpreter
Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI agent state management for next-generation autonomous systems.
🔍 AI search engine - self-host with local or cloud LLMs
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
Run Claude Code on OpenAI models
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
Semi-automated research assistant for academic research and software development. Supports Claude Code, OpenCode, and Codex CLI across ideation, coding, experiments, writing, and publication.
OpenClaw for Teams
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Agent Skills as a Memory Layer
II-Agent: a new open-source framework to build and deploy intelligent agents
Skills for the Gemini API, SDK and model/agent interactions
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
OpenAgents - AI Agent Networks for Open Collaboration
🤖 Telegram Bot API PHP SDK. Lets you build Telegram Bots easily! Supports Laravel out of the box.
Open-source Claude Design alternative. One-click import your Claude Code / Codex API key. Prompt → prototype / slides / PDF. Multi-model (Claude, GPT, Gemini, Kimi, GLM, Ollama). BYOK, local-first, MIT.
Evaluation and Tracking for LLM Experiments and AI Agents
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
🌊 AChat - An open-source/self-hosted/local-first AI platform, designed for enterprises and teams, perfectly combining powerful local processing capabilities with seamless remote synchronization.
Automatic Generation of Visualizations and Infographics using Large Language Models
World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.
The platform for LLM evaluations and AI agent testing
Build production-ready AI agents in both Python and Typescript.
Code for the paper "Evaluating Large Language Models Trained on Code"
Turn approved specs into long-running autonomous implementation. A minimal, adaptable SDD harness with Agent Skills for Claude Code, Codex, Cursor, Copilot, Windsurf, OpenCode, Gemini CLI, and Antigravity.
The missing DevTools for Claude Code — inspect session logs, tool calls, token usage, subagents, and context window in a visual UI. Free, open source.
All Cursor AI's official download links for both the latest and older versions, making it easy for you to update, downgrade, and choose any version. 🚀
An AI-powered file management tool that ensures privacy by organizing local texts and images. Using Llama3.2 3B and Llava v1.6 models with the Nexa SDK, it intuitively scans, restructures, and organizes files for quick, seamless access and easy retrieval.
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
AI Chatbots in terminal for free
162 production-ready AI agent templates for OpenClaw. SOUL.md configs across 19 categories. Submit yours!
Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
(Crystal is now Nimbalyst) Run multiple Codex and Claude Code AI sessions in parallel git worktrees. Test, compare approaches & manage AI-assisted development workflows in one desktop app.
AI agent microservice
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. Deploy your private AutoGPT web app for free with one click.
Must-read Papers on LLM Agents.
An open-source visual programming environment for battle-testing prompts to LLMs.
A Python program that turns an LLM running on Ollama into an automated researcher: from a single query it determines focus areas to investigate, runs web searches, scrapes content from relevant websites, and does the research for you on its own. Among other features, it saves the findings for you.
The no-code platform for building custom LLM Agents
🏆 Top-1 on 5+ benchmarks | Web UI | Supports MiroThinker, Claude, Kimi, OpenAI
Real time transcription with OpenAI Whisper.
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
One command brings a complete pre-wired LLM stack with hundreds of services to explore.
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalities such as code generation, file editing, and project planning to streamline your development workflow.
Put up to 8 AI models on every coding task — blind spots surface before you ship. Claude Code plugin.
Laminar - open-source observability platform purpose-built for AI agents. YC S24.
Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible. 🌊 Streaming & Non-Streaming Support. ✨ Experience the Future of AI – Today! Click to Try Now! ✨
🦜💬 Web app for interacting with any LangGraph agent (PY & TS) via a chat interface.
ALLWEONE® Open source AI presentation generator Gamma Alternative. Create professional slides with customizable themes and AI-generated content in minutes.
An improved implementation of the Ralph Wiggum technique for autonomous AI agent orchestration
Data context layer for unstructured data - images, video, sensor data, text and PDFs
A portable open-source operating system for agents. ~6 ms coldstarts, 32x cheaper than sandboxes. Powered by WebAssembly and V8 isolates.
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
RAG on Paul Graham's essays.
Ruler — apply the same rules to all coding agents
Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.
Control Any Computer Using LLMs.
My own Prompts for Custom instructions ChatGPT
All-in-one platform for search, recommendations, RAG, and analytics offered via API
[EMNLP-2024] Build multimodal language agents for fast prototype and production
Multi-agent orchestration workflow (Claude Code, Codex, Gemini, OpenCode)
A react-based starter app for using the Live API over websockets with Gemini
Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
CodeMachine is an open-source tool that orchestrates AI coding agents into repeatable, long-running workflows. ⚡️
✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.
Brings MCP to ChatGPT, DeepSeek, Perplexity, Grok, Gemini, Google AI Studio, OpenRouter, T3 Chat and more...
Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
Export and Share your ChatGPT conversation history
OpenAPI specification for the OpenAI API
the terminal client for Ollama
A Tool to Visualize Claude Code's LLM Interactions
Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework
Real-time multi-AI collaboration: Claude, Codex & Gemini with persistent context, minimal token overhead
extendable code review and QA agent 🚢
Research into how agentic AI coding assistants work — reconstructed prompt patterns, agent coordination, and security classification
Bridge Claude Code / Codex to IM platforms — chat with AI coding agents from Telegram, Discord, or Feishu/Lark.
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
No-code multi-agent framework to build LLM Agents, workflows and applications with your data
This SDK is now deprecated, use the new unified Google GenAI SDK.
A repo listing papers related to LLM-based agents
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
A Lightweight LLM Post-Training Library
A lightweight framework for building LLM-based agents
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
agentUniverse is an LLM multi-agent framework that allows developers to easily build multi-agent applications.
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool
The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.
Automatically generate and overlay subtitles for any video.
Terminal session manager for AI coding agents. One TUI for Claude, Gemini, OpenCode, Codex, and more.
🧠 Make your agents learn from experience. Now available as a hosted solution at kayba.ai
(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows.
Open-source AI-driven quantitative trading platform for crypto, stocks, and forex with backtesting, live trading, market data, and multi-agent research.
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
Your Automatic Prompt Engineering Assistant for GenAI Applications
Replace Copilot with local AI
AI coding workstation: Claude Code + web UI + 7 AI CLIs + headless browser + 50+ tools
Whisper as a Service (GUI and API with queuing for OpenAI Whisper)
Perplexity style AI Search engine clone built with Gemini 2.0 Flash and Grounding
An Obsidian vault that gives AI coding agents persistent memory. Claude Code, Codex CLI, Gemini CLI.
Completely free, private, UI-based Tech Documentation MCP server. Designed with coders and software developers in mind. Easily integrate into Cursor, Windsurf, Cline, Roo Code, Claude Desktop App
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
Offline multi-agent simulation & prediction engine. English fork of MiroFish with Neo4j + Ollama local stack.
OpenDAN is an open source Personal AI OS, which consolidates various AI modules in one place for your personal use.
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level
ChatGPT web interface using the OpenAI API
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
AI computer use powered by open source LLMs and E2B Desktop Sandbox
A Unified MCP Server Management App (MCP Manager).
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
OpenAI Assistants API quickstart with Next.js.
Witsy: desktop AI assistant / universal MCP client
🌸 Best framework to build web agents, and deploy serverless web automation functions on reliable browser infra.
Text Generator is a versatile plugin for Obsidian that allows you to generate text content using various AI providers, including OpenAI, Anthropic, Google and local models.
JSON-driven multi-agent cadence-team development framework with intelligent CLI orchestration (Gemini/Qwen/Codex), context-first architecture, and automated workflow execution
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.
Development platform to debug, chat, inspect, and evaluate MCP servers, MCP apps, and ChatGPT apps.
The leading open-source AI copilot for JetBrains. Connect to any model in any environment, and customize your coding experience in any way you like.
GPT-5 coding examples
AutoChain: Build lightweight, extensible, and testable LLM Agents
Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)
The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data.
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
ContextGem: Effortless LLM extraction from documents
AI as Workspace - An elegant AI chat client. Full-featured, lightweight. Supports multiple workspaces, plugin system, cross-platform, local first + real-time cloud sync, Artifacts, MCP | A better AI client
Collect the prompts of GPTs
Dive is an open-source MCP Host Desktop Application that seamlessly integrates with any LLM supporting function calling capabilities. ✨
Run Claude Code, Gemini, Codex — or any coding agent — in a clean, isolated sandbox with sensitive data redaction and observability baked in.
A beautiful local-first coding agent running in your terminal - built by the community for the community ⚒
LocalAGI is a powerful, self-hostable AI Agent platform designed for maximum privacy and flexibility. A complete drop-in replacement for OpenAI's Responses APIs with advanced agentic capabilities. No clouds. Local AI that works on consumer-grade hardware (CPU and GPU).
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants, and more. Linux, Windows, Mac
Practical productivity tools for Claude Code, Codex-CLI, and similar CLI coding agents.
An AutoGPT agent that controls Chrome on your desktop
The AI Agent Workforce Platform — where teams scale beyond headcount. Give every team member an AI agent squad.
An index of the LangChain + LangGraph ecosystem: concepts, projects, tools, templates, and guides for LLM & multi-agent apps.
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
GPTeam: An open-source multi-agent simulation
An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework. It leverages Google's Gemini 2.5 Flash and OpenAI's GPT-5.1 on the backend for inference.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing.
Aria is Your AI Research Assistant Powered by GPT Large Language Models
Curated GPT-Image-2 prompts for the OpenAI API — portraits, posters, UI mockups, game screenshots, character sheets, and more. Ready-to-use prompts for gpt-image-2.
ChatGPT with superpowers! Search chat history, create folders, export all chats, pin messages, access thousands of community prompts, incognito mode, language and tone selection, and many more features
Use Kimi's latest model (kimi-k2-0711-preview) to drive your Claude Code.
Create a Custom GPT and add/embed it on your site using the Assistants API
Build Anything with AI Agents
The Context Optimization Layer for LLM Applications
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
A fast and lightweight framework for creating decentralized agents with ease.
This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses a sophisticated graph-based algorithm to handle the tasks.
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
A playground of highly experimental prompts, Jinja2 templates & scripts for machine intelligence models from OpenAI, Anthropic, DeepSeek, Meta, Mistral, Google, xAI & others. Alex Bilzerian (2022-2025).
A high-performance inference engine for AI models
A curated list of awesome resources, tools, and other shiny things for LLM prompt engineering.
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
[GenAI Application Development Framework] 🚀 Build GenAI applications quickly and easily 💬 Interact easily with GenAI agents in code using structured data and chained-call syntax 🧩 Use the event-driven flow *TriggerFlow* to manage complex GenAI working logic 🔀 Switch to any model without rewriting application code
🦙 Local and online AI hub
An open-source, code-first Java toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Lemon AI is the first Full-stack Open-source Self-Evolving General AI Agent, offering a fully local alternative to Agentic platforms like Manus & Genspark AI. 🔔 Official updates on X (Twitter): @LemonAI_cc
"Your Fully-Automated Personal AI Assistant"
Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.
Evaluate your LLM-powered apps with TypeScript
Not just another ChatGPT user-interface!
In the evolving world of Large Language Models (LLMs), crafting effective prompts has become an essential skill. That's why I've created this collection, showcasing the most impactful prompts of the year across various intriguing domains. 🌐
Build, run and scale AI agents like APIs and microservices - observable, auditable and identity-aware from day one.
Claude Code settings, commands and agents for vibe coding
Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building agents.
OBS plugin for local speech recognition and captioning using AI
A curated list of awesome LLM agents frameworks.
An LLM-powered autonomous agent platform
Make your own story. User-friendly software for LLM roleplaying
SALMONN family: A suite of advanced multi-modal LLMs
Full computer-use for AI agents. Self-learning workflows. Native macOS. No screenshots required.
🔥 A list of tools, frameworks, and resources for building AI web agents
AIlice is a fully autonomous, general-purpose AI agent.
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser
A (nearly) seamless integration of ChatGPT into Obsidian.
Home Assistant custom component of conversation agent. It uses OpenAI to control your devices.
HTTP API for Claude Code, Goose, Aider, Gemini, Amp, and Codex
AI video agents framework for next-gen video interactions and workflows.
An official Qdrant Model Context Protocol (MCP) server implementation
Universal Claude Code workflow plugin with agents, skills, hooks, and commands
⚡️ 10x - Up to 20x faster AI coding with multi-step Superpowers. Open-source agent with smart model routing, BYOK, fully self-hosted.
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
Custom AI agent platform to speed up your work.
Easily select and manage your preferred AI digital assistants on Android.
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.
The Simple Agent Development Kit.
AI Vibe Coding Agent of TS backend server, enhanced by compiler skills, generating 100% working code
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
Google AI Studio Starter Apps
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Visual intelligence for your home.
AI Browser Automation
Bub it. Build it. A hook-first runtime for agents that live alongside people.
List of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search
👨‍💻 An awesome and curated list of the best code LLMs for research.
Curate a custom library of AI Prompts
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere
Fine-tune LLM agents with online reinforcement learning
BrowserWing turns your browser actions into MCP commands or Claude Skills, allowing AI agents to control browsers efficiently and reliably. Say goodbye to slow, token-heavy LLM interactions and let agents call commands directly for faster automation. Perfect for AI-driven tasks, browser automation, and boosting productivity.
Autonomous Agents (LLMs) research papers. Updated Daily.
Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)
Simple shell script to use OpenAI's ChatGPT and DALL-E from the terminal. No Python or JS required. Formerly https://gptshell.cc
Samurai-inspired multi-agent system for Claude Code. Orchestrate parallel AI tasks via tmux with shogun → karo → ashigaru hierarchy.
BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command.
Skill that audits and rewrites content to remove AI writing patterns. Use it with your favorite agents including Claude Code, OpenClaw, and Hermes.
SuperEasy 100% Local RAG with Ollama + Email RAG
A curated list of awesome AI assistants. Example Telegram bot with all these assistants can be tested on the link below.
OWASP Top 10 for Large Language Model Apps (Part of the GenAI Security Project)
🌎💪 BrowserGym, a Gym environment for web task automation
open-source coding LLM for software engineering tasks
Prompty makes it easy to create, manage, debug, and evaluate LLM prompts for your AI applications. Prompty is an asset class and format for LLM prompts designed to enhance observability, understandability, and portability for developers.
This repository contains a collection of the best system prompts for ChatGPT, a conversational AI model developed by OpenAI. Star this repository to help us reach 5,000 stars!
A generalized information-seeking agent system with Large Language Models (LLMs).
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
Connect AI models like Claude & GPT with robots using MCP and ROS.
Install our local first extensions for your favorite AI IDE or Terminal Agent. Sync your conversations to the cloud. File issues and requests.
Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.
TypeScript AI platform with AI chat, Autonomous agents, Software developer agents, chatbots and more
🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.
😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.
A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents
A curated list of awesome LLM and AI Agent Skills, resources and tools for customising AI Agent workflows - that works with Claude Code, Codex, Gemini CLI and your custom AI Agents
Orchestrate Claude Code, Codex, and Gemini sessions on a multiplayer canvas. Manage git worktrees, track AI conversations, and visualize your team's agentic work in real-time.
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim.
AIPex: AI browser automation assistant, no migration and privacy first. Alternative to Manus Browser Operator, Claude Chrome and Agent Browser
Context manager for all agents
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
Elixir implementation of a LangChain style framework that lets Elixir projects integrate with and leverage LLMs.
A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech models. Supports OpenAI, Groq, ElevenLabs, CartesiaAI, and Deepgram APIs, plus local models via Ollama. Ideal for research and development in voice technology.
Command Your AI Agent Empire from the CEO Desk — A local-first AI agent office simulator that orchestrates CLI, OAuth, and API-connected agents (Claude Code, Codex CLI, Gemini CLI, OpenCode, and more) as a virtual autonomous company.
ChatDBG - AI-assisted debugging. Uses AI to answer 'why'
A collection of autonomous agents 🤖️ powered by LLM.
Low code tool to rapidly build and coordinate multi-agent teams
Just a Better Chatbot. Powered by Agent & MCP & Workflows.
PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai
AI agent expert in PostgreSQL
🤖 / 🏪 Agent Index - This is the agent index for LobeChat. It accesses index.json from this repository to display a list of available agents for LobeChat to the agent market.
Search + Chat = SearChat (AI Chat with Search). A conversational AI search engine with DeepResearch, support for OpenAI/Anthropic/VertexAI/Gemini APIs, the SearXNG metasearch engine, and one-click Docker deployment.
Autonomous GPT-4 agent platform
Build agents which are controlled by LLMs
Dynamiq is an orchestration framework for agentic AI and LLM applications
Readymade evaluators for your LLM apps
GeoIntel using Google's Gemini API to uncover the location where photos were taken through AI-powered geo-location analysis.
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
Coding Agent Session Manager for Claude Code / Gemini CLI / Codex CLI / Cursor Agent / Copilot CLI / Cline CLI / OpenCode / Kimi CLI
AI Agent Evaluator & Red Team Platform
🍌 The official starter kit for the Nano Banana Hackathon. Clone this repo to get building fast!
TypeScript AI Function Calling Framework enhanced by compiler skills.
📊 llm.report is an open-source logging and analytics platform for OpenAI: Log your ChatGPT API requests, analyze costs, and improve your prompts.
Web UI for AutoGen (A Framework for Multi-Agent LLM Applications)
A pattern for an always-on AI Assistant powered by Deepseek-V3, RealtimeSTT, and Typer for engineering
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
Simple Go utility to download HuggingFace Models and Datasets
A.S.E (AICGSecEval) is a repository-level AI-generated code security evaluation benchmark developed by Tencent Wukong Code Security Team.
CodeProject.AI Server is a self-contained service that software developers can include in, and distribute with, their applications in order to augment their apps with the power of AI.
Use your Claude Max subscription with OpenCode, Pi, Droid, Aider, Crush, Cline. Proxy that bridges Anthropic's official SDK to enable Claude Max in third-party tools.
Claude Code for Finance
AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents
Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease.
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
CLI MCP package manager & registry for all platforms and all clients. Search & configure MCP servers. Advanced Router & Profile features.
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
A ChatGPT bot for Kubernetes issues.
ChatGPT CLI is a powerful, multi-provider command-line interface for working with modern LLMs. It supports OpenAI, Azure, Perplexity, LLaMA, and more, with features like streaming, interactive chat, prompt files, image/audio I/O, MCP tool calls, and an experimental agent mode for safe, multi-step automation.
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
Audio Large Language Models
The Best AI Agent Framework for Agent Collaboration.
Labs to explore AI Models, MCP servers, and Agents with the AI Gateway powered by Azure API Management and Microsoft Foundry 🚀
A curated list of OpenClaw resources, tools, skills, tutorials & articles. OpenClaw (formerly Moltbot / Clawdbot) — open-source self-hosted AI agent for WhatsApp, Telegram, Discord & 50+ integrations.
The most advanced Web UI for AI chat
ChatGPT and Bing AI prompt curation
Supercharge AI coding agents with portable skills. Install, translate & share skills across Claude Code, Cursor, Codex, Copilot & 40 more
Hand-crafted Claude Code Skills focused on improving agent results quality. Compatible with OpenCode, Cursor, Antigravity, Gemini CLI, and others.
An open-source alternative to OpenAI and Gemini's deep research.
The best way to create, deploy, and share MCP Servers
Deep research agent to help you find the best GitHub repositories 🕵️!
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
Your local AI Desktop Agent for Windows, macOS & Linux. Agent Skills (SKILL.md), autonomous coding (Codework), multi-agent teams, desktop automation, 15+ AI providers, Desktop Buddy. No Docker, no terminal. Free.
AgentKit: Build multi-agent networks in TypeScript with deterministic routing and rich tooling via MCP.
An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
Using Tree-of-Thought Prompting to boost ChatGPT's reasoning
ChattyUI - your private AI chat for running LLMs in the browser
WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only create practical chatbots, but to extend any kind of application that connects to an LLM via REST API. Wilmer sits between your app and your many LLM APIs, so that you can manipulate prompts as needed.
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
CodeGate: Security, Workspaces and Multiplexing for AI Agentic Frameworks
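Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory is a centralized, continuously updated AI memory knowledge base that systematically organizes cutting-edge research, engineering frameworks, system designs, evaluation benchmarks, and real-world application practices related to LLM memory and agent memory.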
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
AlwaysReddy is an LLM voice assistant that is always just a hotkey away.
A collection of standardized Agent Skills to teach GitHub Copilot, Claude, Gemini and Cursor about modern Android development (Kotlin, Jetpack Compose, etc.).
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
A curated collection of awesome AI Agents and LLM Apps built with multiple tech stacks, showcasing real-world implementations using OpenAI, Gemini, local models, and various AI frameworks.
An implementation of iterative deep research using the OpenAI Agents SDK
Compare open-source local LLM inference projects by their metrics to assess popularity and activeness.
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Sharing early versions of Ada, a personal AI Assistant built on OpenAI's Realtime API
MCP Server for SearXNG
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.
🥥 Coco AI App - Search, Connect, Collaborate, Personal AI Search and Assistant, all in one space.
An agent benchmark with tasks in a simulated software company.
AI coding tools that give free Claude Opus/Sonnet, GPT-5, Gemini Pro, and other pro-grade models
Cross-platform desktop application for content-aware file organization and renaming. Supports local and remote LLMs, preview-based workflows, and fully user-controlled changes.
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
Agent Cloud is like having your own GPT builder with a bunch of extra goodies. The GUI features 1) a RAG pipeline which can natively embed 260+ datasources 2) Create Conversational apps (like GPTs) 3) Create Multi Agent process automation apps (crewai) 4) Tools 5) Teams+user permissions. Get started fast with Docker and our install.sh
Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. With a suite of over 15 specialized tools, function pipelines, and filters, this project supports academic research, agentic autonomy, multimodal creativity, workflows, and more
Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.
ARGO is an open-source AI Agent platform that brings Local Manus to your desktop. With one-click model downloads, seamless closed LLM integration, and offline-first RAG knowledge bases, ARGO becomes a DeepResearch powerhouse for autonomous thinking and task planning, and 100% of your data stays local. Supports Win/Mac/Docker.
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, version history, and more. Powered by Gemini 2.5 Flash images API.
A curated collection of AI agent research papers released in 2026, covering agent engineering, memory, evaluation, workflows, and autonomous systems.
A simple CLI to run LLM prompts and implement an MCP client.
A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-loop, thinking mode, model params config, MCP prompts, custom system prompt and saved preferences. Built for developers working with local LLMs.
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
Easily switch between alternative low-cost AI models in Claude Code/Agent SDK. For those comfortable using Claude agents and commands, it lets you take what you've created and deploy fully hosted agents for real business purposes. Use Claude Code to get the agent working, then deploy it in your favorite cloud.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
AI-powered video podcast creation skill for coding agents. Supports Bilibili & YouTube, multi-language (zh-CN/en-US), 6 TTS engines (Edge/Azure/ElevenLabs/OpenAI/Doubao/CosyVoice), 4K Remotion rendering.
My personal Claude Code and OpenAI Codex setup with battle-tested skills, plugins, hooks and agents that I use daily.
Local Ollama and OpenAI-like GPT's assistance for maximum privacy and offline access
Conversational voice AI agents
Make contextual data visualizations from tabular datasets with a chat interface. AI data visualization.
An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos. Allows full local deployment (web app, RAG server, LLM server). Supports multi-modal RAG content Q&A.
Real-time transcription using faster-whisper
A coding agent and general agent harness for building and orchestrating agentic applications.
a magical LLM desktop client that makes it easy for *anyone* to use LLMs and MCP
Tock, the open source conversational AI toolkit.
"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"
Daydreams is a set of tools for building agents for commerce
MCP Gateway is a reverse proxy and management layer for MCP servers, enabling scalable, session-aware stateful routing and lifecycle management of MCP servers in Kubernetes environments.
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combines local, global, and web searches for advanced Q&A systems and search engines. This tool simplifies graph-based retrieval integration in open web environments.
Elevate vibe coding to vibe engineering: Get consistent GitHub Copilot custom instructions, Cursor, Roo Code, Cline, Windsurf, Claude Code, Gemini CLI, Codex CLI, Kilo Code, Warp custom rules via a universal, managed template. Features vibe coding memory bank & best practices for large codebases.
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
multilspy is an LSP client library in Python intended to be used to build applications around language servers.
Structured deep research skill for Claude Code/Open Code/Codex with human-in-the-loop control
The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods".
ChatGPT PROMPTs Splitter. Tool for safely processing chunks of up to 15,000 characters per request
Build agents that scale with a zero-cost abstraction.
Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
AI-based stock analysis and trading system
MAD: The first work to explore Multi-Agent Debate with Large Language Models :D
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
A versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows.
Self-hosted version of OpenAI’s new stateful Assistants API
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
🤖 AI browser extensions & userscripts to augment your web experience
AI-agents that automatically generate and use Langchain Tools and ChatGPT plugins
Instruction-based prompts for generating and classifying text.
AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.
JobSync is a self-hosted, open-source job application tracker and AI-powered career assistant. Built with Next.js and Shadcn UI, it helps job seekers manage their search journey with AI resume review, job matching, task logging, and application analytics—all while keeping your data private.
State of the Art 82% OSWorld Verified Computer Using Agent, fully open-source, safe, auditable, and production-ready.
Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, and more for Obsidian.
A personal context store for AI agents and assistants—reuse your existing coding agent CLI (Codex/Claude/OpenCode) with built‑in Skills/tools and a desktop GUI to capture, search, and reuse project knowledge across agents and repos.
🤖 Awesome list of AGI Agents. A curated collection of agent resources.
Universal MCP-Server for your Databases optimized for LLMs and AI-Agents.
AI-powered CMS core for personal blogs and creator websites, with AI summaries, translation, moderation, and writing workflows.
Giselle: AI App Builder. Open Source.
Use Claude Code / CodeX CLI to perform multiple tasks in parallel with a Codex-style UI. Your personal codex/cursor-background agent. Claude Code UI.
18 AI personas deliberate your hardest decisions across multiple LLM providers. Aristotle, Feynman, Kahneman, Torvalds & more — structured multi-round deliberation with genuine model diversity. One command: /council
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
Enable any LLM (e.g. Claude) to interactively debug any language for you via MCP and a VS Code Extension
OpenAI API-compatible wrapper for Claude Code
Local Video-LLM powered AI Baby Monitor
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
Claw-Eval is an evaluation harness for evaluating LLMs as agents. All tasks verified by humans.
A next-generation AI-powered career platform transforming how employers and job seekers connect
Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants.
Huge AI models catalog. A curated list of AI tools, platforms, and resources across various domains.
Simple yet effective command line client for chatting with ChatGPT using the official API
A centralized manager for Model Context Protocol (MCP) servers with dynamic server management and monitoring
Quantalogic ReAct Agent - Coding Agent Framework - Give a ⭐️ if you like the project
The ultimate no-code platform to build and share AI apps with beautiful UI.
No code AI agents
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
An LLM-first SEO analysis skill for Antigravity, Codex, Claude with 16 specialized sub-skills, 10 specialist agents, and 33 optional utility scripts used as evidence collectors.
Build reliable AI Workflows and Agents with humans in the loop, structured outputs and durable execution.
End-to-end platform for building voice first multimodal agents
A tool for evaluating LLMs
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcribe the audio, Eleven Labs to generate voice and Rhubarb Lip Sync to generate the lip sync.
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs, Kokoro, Typecast or xAI
The best way to use AI is on your own computer. Use local or paid API models, and ctrl+k to show/hide the chat UI. Experience the future of AI, and help build it too!
A Community Open-Source SaaS for Crafting/Building/Creating Chatbots with OpenAI's Assistant API that you can add to your website.
Deeper Seeker is a simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT. It is an agentic research tool to reason, create multi-step tasks, synthesize data from multiple online resources, and create neat reports
An agent that uses OpenAI's Agents SDK to generate new agents
Integrate AI Assistants with Django to build intelligent applications
Cannoli allows you to build and run no-code LLM scripts using the Obsidian Canvas editor.
Real-time behavioral enforcement for Claude Code. Monitors AI actions, detects violations, and interrupts misbehavior. Also has a cute pet.
🤖 Intelligent integration between Claude Code and Google Gemini for large-scale code analysis
Jupyter code notebooks of "ChatGPT Prompt Engineering for Developers" by DeepLearning.AI and OpenAI.
A Model Context Protocol (MCP) server that provides tools for fetching and analyzing Reddit content.
Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration
AI-powered video editor that turns raw footage and a creative brief into a polished ad using an ensemble of AI agents (Google Gemini + FFmpeg)
Context-Engine MCP - Agentic Context Compression Suite
Browser extension to convert web pages to clean Markdown and copy to clipboard so you can feed it to your favorite LLM model as context with just 1 click!
Use Claude Code with any LLM provider - GLM-4.5, Kimi-K2, Qwen3-Coder, DeepSeek, etc.
🏔️ The AI-native RSS reader
Bring Agent Skills to Any AI Agent and Coding Agent — via CLI or MCP. Manage once, serve anywhere.
On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.
This is a repository to be used when creating a repository template.
A curated list of retrieval-augmented generation (RAG) in large language models
Framework to bring LLM applications to production
The Claude Code alternative
An autonomous LLM-agent for large-scale, repository-level code auditing
Create a plan from a description in minutes
A collection of Model Context Protocol (MCP) servers, clients and developer tools by IBM.
A plugin-based gateway that orchestrates other MCPs and allows developers to build upon it enterprise-grade agents.
Reproducible, flexible LLM evaluations
Local coding agent with neat UI
The Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It also supports seamless integration with the openai/langchain sdk.
VoxNovel: generate audiobooks giving each character a different voice actor.
Open-source CLI coding agent, a free alternative to Claude Code. Generate, debug, and manage code seamlessly.
Autocomplete your obsidian notes with AI, including ChatGPT, through a copilot-like interface.
🧭 Open source tools for air quality data analysis
Create state-machine-powered LLM agents using XState
This python program allows you to use Claude Code with Google's Gemini models.
VoiceGPT is a voice assistant that leverages the powerful ChatGPT chatbot to answer your questions.
🚀 Powerful Local AI Chat Application - MCP, Secure, Efficient, Personalized. A locally deployed client for large models.
A terminal utility for intelligent shell command generation
A curated list of awesome ChatGPT software.
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
A CLI tool for logging and analyzing Claude Code and Cursor AI-driven coding sessions.
A lightweight Python framework for Parallel AI Reasoning.
⚡ Stream browser logs to terminal, zero setup, perfect for AI agents
Tiledesk is the open source AI agent builder, written in Node.js and Angular. This repository is dedicated to the WebApp dashboard to manage Tiledesk: open-source alternative to Voiceflow, enabling easy creation of advanced LLM-powered Agents with seamless human-in-the-loop (HITL).
Integrating AI into every workflow with our open-source, no-code platform, powered by the actor model for dynamic, graph-based solutions.
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
Prompts for playable games in ChatGPT
An MCP server for Massive.com Financial Market Data
MCP store manager: adds and syncs MCP server configurations across clients like Claude Code and Cursor 💡 mcphub
A Model Context Protocol (MCP) server that helps read GitHub repository structure and important files.
Some prompts about cyber security
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
Multi-Agent Collaboration Platform✨
🧠 Second Brain AI agent
🔍 DiscovAI-Search: An AI-powered search engine for AI tools and custom data. Built with Next.js, OpenAI, Supabase, and more. Features vector-based search, Redis caching, and LLM-powered responses.
Simple web UI to manage MCP (Model Context Protocol) servers in the Claude app
AI video generation SDK — JSX for videos. One API for Kling, Flux, ElevenLabs, Sora. Built on Vercel AI SDK.
Seamlessly integrate state-of-the-art transformer models into robotics stacks
Calculate prices for calling LLM inference APIs.
DeepContext is an MCP server that adds symbol-aware semantic search to Claude Code, Codex CLI, and other agents for faster, smarter context on large codebases.
Claude Code with any LLM
Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants
A universal git-native AI agent framework. Your agent lives inside a git repo — identity, rules, memory, tools, and skills are all version-controlled files.
A collection of agents that use Large Language Models (LLMs) to perform tasks common on our day to day jobs in cyber security.
An introduction to the world of AI Agents
TerminalGPT - Terminal-based ChatGPT personal assistant app. Provides optimized, tailored answers for your machine's terminal.
A Multi-Agent framework that enables AI agents to collaborate effectively, helping you build powerful agent teams for solving complex tasks.
Social and customizable AI writing assistant! ✍️
A manager for AI coding agents that works with Claude Code, Cursor, Gemini, Codex, and Qwen.
Here are the prompts I’ve created and want to share.
An MCP server implementation that provides tools for retrieving and processing documentation through vector search, enabling AI assistants to augment their responses with relevant documentation context.
Continuous Integration for LLM powered applications
Web-Use is a CDP powered Browser Agent
Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.
Vibe code with Claude in parallel git worktrees
Sample application to add voice capabilities to the Agents SDK
OmniAI standardizes the APIs for multiple AI providers like OpenAI's ChatGPT, Mistral's Le Chat, Anthropic's Claude, Google's Gemini, and DeepSeek's Chat.
AI chat client
A library of shared system prompts for creating customized educational GPT agents.
** THIS REPO HAS MOVED TO https://github.com/langchain-ai/langchainjs/tree/main/libs/langchain-mcp-adapters ** Adapters for integrating Model Context Protocol (MCP) tools with LangChain.js applications, supporting both stdio and SSE transports.
Simplified Gemini for Claude Code.
Welcome to the ChatGPT Prompts Library! This repository contains a diverse collection of over 100,000 prompts tailored for ChatGPT. Our prompts cover a wide range of topics, including marketing, business, fun, and much more.
Your personal free-to-use AI assistant, built with Gemini & Flutter.
OpenBrowser is a framework for intelligent browser automation. It combines direct CDP communication with a CodeAgent architecture, where the LLM writes Python code executed in a persistent namespace, to navigate, interact with, and extract information from web pages autonomously.
Lightweight MCP integration bringing Google's Gemini AI capabilities to Claude Code with 1M+ token context window, smart model selection, and powerful code analysis tools
Run Anthropic's Claude Code CLI with OpenAI models such as GPT-5-Codex, GPT-5.1, and others via a local LiteLLM proxy.
MedEvalKit: A Unified Medical Evaluation Framework
A Desktop Application for Managing Multiple OpenAI Codex CLI Accounts
🧬 Generate visual charts using ECharts with AI MCP dynamically, used for chart generation and data analysis.
🤖 AI-powered code generation tool for scratch development of web applications with a team collaboration of autonomous AI agents.
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜
An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
Bria RMBG 2.0 - image background removal
A curated list of useful Generative AI APIs for developers
AI-native SaaS framework that builds full-stack apps using autonomous AI agents
[DEPRECATED] Superseded by systempromptio/systemprompt-template and systempromptio/systemprompt-core. Multi-modal MCP client for voice-powered agentic workflows.
Add AI capabilities to your file system using Ollama, Groq, OpenAI, and other APIs
A playful script to get two AI assistants to converse using OpenAI Assistants API
A fully autonomous, AI-powered DevOps platform for managing cloud infrastructure across multiple providers, with AWS and GitHub integration, powered by OpenAI's Agents SDK.
AI agents platform that gives you a workspace with an integrated team of personal assistants that can work behind the scenes to handle daily monotonous tasks.
🍃🔎 MongoDB Lens: Full Featured MCP Server for MongoDB Databases
A modern, real-time speech recognition application built with OpenAI's Whisper and PySide6. This application provides a beautiful, native-looking interface for transcribing audio in real-time with support for multiple languages.
Simplify interactions with Large Language Models
A powerful Whisper AI keyboard for reliable speech transcription
Official implementation of InstructZero, the first framework to optimize bad prompts for ChatGPT (API LLMs) and finally obtain good prompts!
Get up and running with the Gemini API using Node.js and Python
MongoDB Knowledge Service. Powered by MongoDB and Atlas Vector Search.
A collection of prompts for use with GPT-4 via ChatGPT, OpenAI API w/ Gradio frontend & notebook
A toolkit for building computer use AI agents
Workbench for learning and practicing on-device AI technology in real scenarios with online TV on an Android phone, powered by ggml (llama.cpp, whisper.cpp, ...), FFmpeg, and opencv-mobile
DeepSeek CLI, a command-line AI coding assistant that leverages the powerful DeepSeek Coder models
Tool for working with arXiv that gives LLMs the ability to search and read papers from there
Natural language → ComfyUI workflow JSON. 34 built-in templates, 360+ node definitions, auto model download. Supports txt2img, img2img, txt2vid, img2vid, audio, 3D generation across SD1.5/SDXL/SD3/FLUX/Wan2.2/HunyuanVideo/LTXV/Mochi/Cosmos + LLM integration. Works as a skill for Claude Code, Cursor, and other AI coding agents.
Open-source alternative to Claude Cowork - Browser automation & AI assistant powered by DeepSeek
Samantha OS1 is a conversational AI assistant powered by the Realtime API from OpenAI
A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integration for optimal answers. Simulating a team that discusses, debates, and refines responses, it enables complex problem-solving and precise results. Now with Prompt Caching to reduce latency and costs.
Spongecake is the easiest way to launch computer use agents.
AI-api text generation
Generate interactive flashcards from your notes using models from OpenAI (ChatGPT), Google (Gemini), Ollama (local LLMs), and more. Or manually create your own to use with the quiz UI.
Automate Creation of Story-Based Videos.
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs via llama.cpp.
Convert a Git repo into a ChatGPT prompt!
AI Agents are missing the UI! We're here to change it. Build Business AI Agents for your company: business workflows, APIs, bookings, e-commerce, social commerce, b2b, CPQ, intake forms, NPS tests, made-to-order use cases
Awesome ChatGPT prompts for engineers😇
MCP SSH Server: 37 tools for remote SSH management | Claude Code & OpenAI Codex | DevOps automation, backups, database operations, health monitoring
Modern GUI application that transcribes and translates audio files using OpenAI Whisper.
See how to augment LLMs with real-time data for dynamic, context-aware apps - Rag + Agents + GraphRAG.
A curated collection of system prompts and tool definitions from production AI coding agents
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as Llama 2 and Whisper
Model Context Protocol server that integrates AgentQL's data extraction capabilities.
Model Context Protocol (MCP) Server for Langfuse Prompt Management. This server allows you to access and manage your Langfuse prompts through the Model Context Protocol.
Build a powerful Deep Research AI agent like Gemini or ChatGPT using Next.js, Vercel AI SDK, and Exa Search API: an intelligent system that generates follow-up questions, crafts optimal search queries, and compiles comprehensive research reports.
Python CLI for AI Chat with MCP support
This is my collection of helpful priming prompts for ChatGPT when discussing various angles of software development and architecture.
Terminal-based AI Coding Agent, similar to Claude Code, OpenAI Codex etc. but works with many more LLMs e.g. Gemini, Groq, Deepseek
Dynamically expose tools from proxied servers based on an Agent Persona
Ollama conversation integration for Home Assistant
Multi-Agent Conversation Framework in TypeScript
A Model Context Protocol (MCP) server that allows Claude to communicate with locally running LLMs via LM Studio.
VividNode: Multi-purpose Text & Image Generation Desktop Chatbot (supporting various models including GPT).
Connect any Open Data to any LLM with Model Context Protocol.
Build, test and manage your AI Agents in the central place.
Automatically generate engaging AI podcasts from nothing but an episode title.
Typst MCP Server is an MCP (Model Context Protocol) implementation that helps AI models interact with Typst, a markup-based typesetting system. The server provides tools for converting between LaTeX and Typst, validating Typst syntax, and generating images from Typst code.
This MCP or multi-AI setup lets Claude Code use Grok, Gemini, and DeepSeek for reviews and fixes
Command line artificial intelligence - Your local LLM context-feeder
All-in-one local low-code AI agent development platform. Installs and runs n8n, Flowise, Browser-Use, Qdrant, Ollama, and more. Proxies LLM requests through LiteLLM with Langfuse for observability.
Job search Agent (AI) searches and applies to jobs on your behalf, creating tailored applications for positions that match your skills, making your entire job search hassle-free.
This repo aims to record resources on role-playing abilities in LLMs, including datasets, papers, applications, etc.
Obsidian canvas plugin for using AI completion with threads of canvas nodes
Desktop AI assistant for Windows, Mac and Linux
Local RAG researcher agent built using Langgraph, DeepSeek R1 and Ollama
[DEPRECATED] Superseded by systempromptio/systemprompt-template and systempromptio/systemprompt-core. MCP server for orchestrating AI coding agents (Claude Code CLI & Gemini CLI).
ChatGPT page that uses the API instead of the official pages. You can modify parameters, then save and download the result as a txt file along with the prompt.
Neo AI integrates into the Linux terminal, capable of executing system commands and providing helpful information.
An Agentic Deep Research Assistant similar to Gemini and OpenAI Deep Research
Multi-agent collaboration plugin for Claude Code - orchestrate multiple AI agents (Codex CLI, Gemini CLI, etc.) for diverse perspectives
A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life.
Auto classification plugin for Obsidian using ChatGPT.
Prompt Engineering Research Tool for AI APIs
HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.
AI tools for OSINT
This repo curates ChatGPT prompts to help you use ChatGPT for content creation more effectively.
A ChatGPT bot trained on your vault notes. Ask your AI questions about your own thoughts and ideas!
Harness GPT's expertise with curated prompts for consistent, high-quality professional consultations.
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and subtitle generation using OpenAI’s Whisper on CPU, Nvidia GPU and Apple MLX.
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and inputs the recognized text. Supports English, Chinese, Japanese, etc., and even mixed languages.
Welcome to AgentGenesis, your source for customizable Gen AI code snippets that you can easily copy and paste into your applications.
This project is a Streamlit-based web application that leverages OpenAI's Assistants API to provide a ChatGPT-like experience. Users can have real-time conversations with the AI, upload documents to be used as context, and even scrape and convert website content to PDFs to enrich the AI's knowledge base.
Automatically generate high-quality markdown context files for AI coding agents like Claude, Cursor, and Gemini. Powered by DeepWiki
A Home Assistant custom component that provides an AI-powered agent capable of generating automations based on natural language queries. The agent connects to all entities in your Home Assistant instance and uses OpenAI's or Llama API to translate user requests into valid Home operations including creating automations for you!
Simplifies the process of creating and managing LLM workflows.
Lightweight, simple embedded Open WebUI widget, allowing you to easily implement chatbot capabilities and RAG workflows into your existing tools, apps and webpages!
Here are over 200 AI prompts that cover Blog Writing, Email Marketing, YouTube Ad Scripts, Facebook Ads, YouTube Video Ideas, Twitter Threads, Cold DM Ideas, Influencer Marketing, Copywriting, and Instagram Stories.
A simple to use python library for creating podcasts with support for many LLM and TTS providers
A minimal, end‑to‑end deep‑research agent implemented with AI SDK and Next.js
Researcher Agent to write blog posts and articles using Amazon Bedrock and web search.
Automated Multi AI Agent for company research with LangGraph: scrapes web data, extracts business insights, and generates structured reports using LLM analysis.
A comprehensive evaluation framework for AI agents and LLM applications.
One-stop shop for AI skills and agents. Search 110K+ community skills, install and track them declaratively, and deploy across all major AI coding tools (Claude Code, Codex, Cursor, Antigravity and more)
SourceGPT - prompt manager and source code analyzer built on top of ChatGPT as the oracle
The coding agent for professionals
Demo app: how to use an LLM as a general-purpose classifier
This shows the results from using a second, filter LLM that analyses prompts before sending them to GPT-Chat
A Dall-E 3 localhost web UI for using advanced settings like style (vivid vs natural) or quality (standard vs hd). Can also be used when ChatGPT's Dall-E throttles you for the day, if you are ready to pay the API call costs. Comes integrated with Prompt Inspirer!
A serverless, AI-powered deep research agent built with Cloudflare Workers and Google Gemini 2.5
A plugin for Obsidian that allows you to create a canvas conversation using ChatGPT.
This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, chemistry, physics, etc.). We call this method Deep-Research.
ChatGPT Desktop Application Supercharged with prompts
Use AI chatbots like Claude or ChatGPT, particularly their projects function, to help track, diagnose, and recover from chronic illness.
Automated system for LLM evaluation via agents.
High-performance LLM evaluation framework with parallel API calls — up to 17× faster than sequential tools. Supports box, math, and logit-based evaluation.
A visual node-based editor for building, sharing, and executing complex AI workflows with Fal.ai and Replicate.
Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files
Home Assistant Card to display the LLM Vision Timeline
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
Promptdesk is a tool designed for effectively creating, organizing, and evaluating prompts and large language models (LLMs).
End-to-end workflow to automatically generate show notes from audio/video transcripts
MCP OAuth Proxy incl. dynamic client registration (DCR), MCP prompt analytics and MCP firewall to build enterprise grade MCP servers.
Precis is an extensible self-hosted AI-enabled RSS reader with a focus on notifications and support for theming
Deep Research through Multi-Agents, using GraphRAG
Open-source FRED MCP Server (Federal Reserve Economic Data)
☕ GPT-2 chatbot for daily conversation
Chinese edition of Andrew Ng's course "ChatGPT Prompt Engineering for Developers"
AI-powered tool for automatic podcast script and audio generation.
Personal Context Manager for Claude Code. Your life in walnuts.
A reference containing useful prompts for ChatGPT. Many prompts have parameters that allow you to customize them to your liking.
An MCP server that provides LLMs with efficient access to package documentation across multiple programming languages
One-click deploy your own advanced ChatGPT prompts database with Vercel 🙌 Besides prompts, it also includes prompt engineering resources!
A ComfyUI custom node for Google's Gemini 2.5 Flash Image (aka "Nano Banana") model - the state-of-the-art image generation and editing AI that went viral for its incredible quality and capabilities.
Skills to augment LLM thinking process, integrated with InfraNodus insight generation tool
The first and largest GPTs database
😎 Sagentic.ai Agent Framework - Sagentic.ai is a unified platform for building, running and scaling autonomous agents.
Build Awesome MCPs with Awesome Best Practices for MCP Servers and MCP Clients
Real-time speech recognition & AI-powered note-taking app for macOS with offline/online modes, multilingual transcription, and Japanese translation support.
A list of LLM benchmark frameworks.
Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.
A very simple Whisper Python FastAPI for OpenAI API, Android voice-typing (konele), Home Assistant (wyoming), and a voice-typing script on Linux and macOS!
Record, transcribe, and transform voice notes into structured insights. Leverage Whisper or AssemblyAI and ChatGPT to fill in gaps, generate summaries, and visualize ideas — all seamlessly integrated within Obsidian.
A curated list of awesome Gemini Nano Banana model cases, prompts, and sources
Stable Diffusion Desktop client for Windows, macOS, and Linux built in Embarcadero Delphi.
ChatGPT Prompts for Devops Mastery
A decision operating system for high-stakes choices — business, strategy, career. Simulates disagreement, stress-tests assumptions, and converges on what actually holds up. Claude Code skill inspired by Karpathy's autoresearch + LLM council.
MCP Deep Research Server using Gemini creating a Research AI Agent
Web Graphical User Interface (GUI) for Gas Town multi-agent orchestrator - A companion interface for steveyegge/gastown
This repository provides resources and guidelines to facilitate the integration of Open-WebUI and Langfuse, enabling seamless monitoring and management of AI model usage statistics.
Simple GUI around whisper.cpp for voice-to-text on Linux
Shinkai allows you to create AI agents without touching code. Define tasks, schedule actions, and let Shinkai write custom code for you. Native crypto support included.
Cerno is a local-first research platform that leverages agentic AI to break down complex queries into verifiable, multi-step workflows. Switch seamlessly between cloud LLMs and self-hosted models, track every reasoning step, and optimize cost and tokens—all while keeping your data on your machine.
Claude Code with any LLM provider (OpenRouter, Gemini, Kimi K2)
Token-efficient data serialization for LLM/AI. 50% fewer tokens than JSON, 93% better value/token. Rust, schema validation, LSP.
A powerful command-line interface for interacting with DeepSeek's AI models.
Lets you use local LLMs in your Obsidian vaults to extend your stories or create entirely new texts based on your previous input
ChatGPT Desktop Application with prompt hints and voice control
Universal MCP Server Configuration Manager
Zettelkasten note taking powered by Large Language Models
Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash
Advanced ChatGPT Prompts
Open AI Chat Bot in the Menu Bar: ChatGPT desktop app for Windows, Mac, and Linux
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation results — from leaderboard scrapes and research papers to local evaluation runs — so that results from different frameworks can be compared, reproduced, and reused.
Mark web pages for use with vision-language models
An Obsidian plugin to process text, chat with AI, and semantically search your notes — works with any OpenAI-compatible LLM server (Ollama, LM Studio, vLLM, and more).
🤖 Community fork of Google's Gemini CLI for Qwen AI models. A powerful command-line tool that uses Alibaba Cloud's Qwen models to understand your code, automate workflows, and accelerate development. Features multilingual support (EN/CN), model switching, and web search integration.
A tool to OCR PDFs using gen-AI models
Open-source, local-first AI CLI for developers. It connects to your local Ollama models or remote providers like OpenAI, Anthropic, Gemini, and Hugging Face, letting you chat with your code, review changes, and craft commits, all from the terminal.
This is a Streamlit application that allows two local Ollama models to chat with each other.
AI-Powered Podcast Generator: A Python-based tool that converts text scripts into realistic audio podcasts using Google's Generative AI API. This project leverages advanced text-to-speech technology to create dynamic, multi-speaker conversations with customizable voices.
Professional AI assistant configurations for Claude Code, Codex CLI, OpenCode and Gemini CLI with enterprise-grade defaults for Solidity and TypeScript development.
🎯 AI-powered voice assistant for TickTick, enabling natural language task management through speech. Built with OpenAI's speech recognition and TickTick's API integration, this assistant helps you manage your todos hands-free - create tasks, set reminders, and organize your schedule using just your voice.
A curated list of all things awesome about OpenAI
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
An MCP server to communicate with, use, and wrap the API for the OpenAI Codex CLI tool.
Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionality.
A curated list of materials on AI guardrails
An MCP Server for audio transcription using OpenAI
Rules and instructions for agentic coding tools like Cursor, Claude CLI, Gemini CLI, Qodo, Cline and more
Open-ended wargames with large language models
AI-powered CLI tool to automatically organize your GitHub Stars into Lists.
A Claude Code skill that enables brainstorming with other LLMs (ChatGPT, Gemini) before presenting the implementation plan to the user
Use Large Language Models (such as ChatGPT) to automatically generate flashcards from Obsidian notes
OpenAI-compatible TTS API that unifies multiple backends with smart chunking for unlimited-length generation
Ask GPT from your notes and get personalized answers based on your knowledge base.
Curated list of real‑world AGENTS.md files, templates, guides & tools for OpenAI Codex‑based agents.
An index of prompting libraries for GPTs, including ChatGPT. Some of these are on GitHub and others are hosted externally.
A Claude Code framework for multi-LLM planning and development agents
Local RAG server for code editors. Scans your codebase, builds a local context index, and connects to any external LLM for context-aware completions and assistance.
MCP server for Fal.ai - Generate images, videos, music and audio with Claude
Do literature review Fast, Simple and Reliable
List of ChatGPT prompts for assistance with various tasks.
An LLM project as a part of CS-5660
MovieGPT: A RAG, Gen AI application for Movie Recommendations
Let the OpenAI LLM add nodes to your Obsidian canvas
Self-correcting image generation for Gemini's Nano Banana model
A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local LLMs (via Ollama), speech-to-text (Vosk), and text-to-speech (Piper) for fast, wake-free voice interaction. No cloud. No APIs. Just Python, a mic, and your voice.
MCP server that gives Claude ability to use OpenAI's GPTs assistants
A fully local, open-source voice-to-text tool that acts as a system-wide AI dictation layer, converting speech into clean, formatted text.
A modern project scaffold for AI-assisted development workflows—providing reusable Claude Code commands (.claude/commands/), structured AI documentation (ai_docs/), and standardized feature specifications (specs/) to streamline collaboration with Claude Code and OpenAI Codex.
Multi-Agent Debugger: An AI-powered debugging system using CrewAI to orchestrate specialized agents that analyze logs, trace code, and uncover root causes across your stack — powered by LLM providers.
AI-powered image generation using Google Gemini, integrated with Claude Code via Skills or Claude.ai via MCP (Model Context Protocol).
Discover the ultimate collection of top AI prompts for ChatGPT, Bard, and beyond. Elevate your prompt skills with this open-source project. Unleash the full potential of AI-driven interactions. 🔥
Easy to use, scalable admin portal for deploying and managing AI Assistants
A VSCode extension for running LLM prompts. It turns VSCode into a powerful prompt IDE.
Automate ChatGPT prompt input from Airtable data
Prompt Engineering for Everybody with ChatGPT and GPT4, by Packt Publishing
LLM Gateway to call 100+ LLM APIs in OpenAI format with MCP supported.
An AI-powered content generator app using Next.js, React, Tailwind CSS, Drizzle, TypeScript, Gemini, and Clerk. Implements authentication, a modern UI, and a Postgres database, and uses the Google Gemini API to generate content.
Compare and generate AI videos across Sora, Veo, Kling, Seedance & more.
Learn how multimodal AI merges text, image, and audio for smarter models
Universal AI CLI & Python SDK for 8+ providers (OpenAI, Claude, Gemini, Cohere, Perplexity, IBM watsonx, Groq, Together AI). Multi-provider chat, code generation, cost optimization, age-appropriate AI. Claude Code/Gemini CLI alternative with zero vendor lock-in.
A free and open-source library of AI prompts.
Claude Multi-Agent Project Management Framework - AI-driven orchestration with LangGraph and OpenAI integration
The Yahoo Finance Agent is an application that combines OpenAI's LLMs, the Yahoo Finance Python library, and LangChain's tools to provide real-time financial data. It features stock information, financial statements, and an interactive chat interface, all while maintaining conversation context and integrating with Langsmith for debugging
Open DeepResearch is an application for assisting in research by conducting comprehensive research on any topic.
A cross-platform desktop application that records audio and transcribes it to text using OpenAI's Whisper API or compatible services. Perfect for dictation, note-taking, and accessibility.
A study assistant powered by Claude Opus. It provides various tools to assist with different tasks, such as researching, coding, note-taking, and more.
awesome for your to collections
💹 StockSim: Multi-Agent LLM Financial Market Simulator — A realistic trading simulation platform for evaluating large language models in dynamic financial environments.
A simple API that provides information about various Large Language Models (LLMs) and their pricing
💬 Experience OpenAI ChatGPT assistance directly within Obsidian, drafting content without interrupting your creative flow.
Text-to-speech plugin for Claude Code — multi-provider support (ElevenLabs, OpenAI, Google, Amazon Polly, Azure, Kitten, local system TTS) on macOS, Linux, and Windows
A stand-alone application with GUI for OpenAI's Whisper
Professional Wargaming LLM Toolbox
Multi-AI consensus MCP server that queries multiple AI models (OpenAI, Claude, Gemini, custom APIs) in parallel and synthesizes responses to reduce bias and improve accuracy. A Python implementation of the wisdom-of-crowds approach for AI decision making.
🚀 Claude Code API | CLI to API Platform | Transform Claude CLI, Gemini CLI, Cursor CLI into REST APIs | One-Click Deploy | Web Terminal
Prompt Management System for Interaction with the ChatGPT API
An application that enables users to upload PDF files and ask questions regarding their content using Retrieval Augmented Generation (RAG)
AI-powered deep research tool leveraging web scraping for cost-effective, comprehensive analysis. Open-source and API-cost free!
Makes the Jewish library accessible to LLMs through the MCP protocol
InsightA is an Obsidian plugin that uses an LLM to transform long articles into concise, atomic notes and create a well-organized Map of Content (MOC) for them. This tool is ideal for anyone aiming to distill complex information into structured, interconnected notes, drawing inspiration from the Zettelkasten method.
Automatically categorize your GitHub starred repos into Star Lists using LLM.
Use Claude Code CLI with any LLM provider - OpenAI, local models, or any OpenAI-compatible API
Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally supports Microsoft LLMLingua prompt token compression
The only tool that replays Claude, Codex, Cursor, AND Gemini AI coding sessions in one unified UI. Vibe coding companion for reviewing, searching, and sharing your AI pair programming transcripts.
MCP server that enables AI agents to analyze projects, initialize vibe unified rules for AI coding across Claude Code, Cursor, GitHub Copilot, and Gemini CLI
An interactive AI voice agent that can capture and transcribe speech in real-time, generate intelligent responses using the DeepSeek R1 (7B model) AI, and convert the responses back to natural speech for immediate playback. The agent maintains conversation context and supports cross-platform usage on macOS, Linux, and Windows.
Train and use generative text models in a few lines of code.
The best Android keyboard for offline speech recognition, using OpenAI's whisper model through whisper.cpp for fast and accurate output.
A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.
UI for the OpenAI API
The LLM Council works together to answer your hardest questions
🐕 Universal MCP Server Manager - Configure once, manage multiple MCP servers through a single interface. Perfect for Claude Desktop, Claude Code, Cursor, Gemini CLI & AI assistants. Web dashboard, auto-detection, unified proxy layer.
A Model Context Protocol (MCP) server that enables Claude Desktop to interact with Google's Gemini AI - featuring 7 tools with Smart Tool Intelligence that learns from your usage patterns
🤖 A Telegram bot, written in Go, that integrates with Open WebUI's OpenAI-compatible APIs to provide chat functionality
A small script that types what you say using whisper while holding a hotkey
AI-powered file launcher and semantic search assistant. Like Spotlight/Alfred but with advanced AI capabilities for understanding context and meaning. Features local processing, privacy-first design, and seamless integration with your workflow.
Create your assistant in the OpenAI dashboard, generate an API key, and use this code to integrate it into Open WebUI. Remember that it is a function, not a pipeline.
🔭 My open-source implementation of Deep Research using Google’s new Gemini 2.0 Flash model.
Running Ollama and Open WebUI in a Kubernetes Cluster
Simple Prompt Plugin is a plugin for Obsidian that allows you to generate content in your notes using LLMs.
🤖 AI-powered ADR generation - Automatically capture and document architectural decisions as you code
A gateway service designed to manage and orchestrate OpenAI-compatible API servers with MCP support.
Privacy-first meeting transcription and voice-to-text tool for Linux. 100% local AI processing with faster-whisper and Ollama.
Node.js app that transcribes WhatsApp voice notes to text using OpenAI's Whisper API. The text can also be translated to the user's preferred language and sent back to their WhatsApp account.
An open source deep research clone. AI Agent (Local LLM or Gemini) that reasons over large amounts of web data extracted with SwiftSoup.
Chrome extension that allows dictating anywhere using OpenAI Whisper
OpenRouter skill for AI agents: model discovery, multimodal chat, tool calling, routing, and starter templates.
Star Manager is a local-first web app for organizing GitHub starred repositories with Star Lists and optional LLM-assisted classification.
Open-source voice-first AI assistant for Android that actually controls your phone.
Turn any Excel spreadsheet into LLM consumable JSON
Skill management tool for AI agents - discover and install pre-built workflows from Anthropic, OpenAI, and community sources
Open-source clone of OpenAI's Deep Research. Works with any transformer, gpt4free, & runs in browser. No Firecrawl needed.
A collection of custom tools and extensions for Open WebUI that enhance its capabilities
🤖 A CLI tool that transforms web content into polished blog posts using AI. Scrape websites, pull Reddit discussions, and let OpenAI/Claude craft engaging articles - all from your terminal. Features a sleek TUI interface and smart image handling with AI-generated alt text.
These are my custom scripts for using Whisper / OpenAI via keyboard shortcuts and voice input.
A Python-based AI agentic assistant that uses Google's Gemini AI to provide natural language computer control through voice commands.
SDK, skills, and examples for Inkbox: give your AI agents identities, inboxes, phone numbers, and encrypted vaults.
AI-powered web app that automatically generates resumes based on GitHub or LinkedIn profiles using LLM.
An MCP server that provides audio transcription capabilities using OpenAI's Whisper API
A dictation application for Linux using OpenAI's Whisper. Currently only used on KDE Wayland.
Local AI CLI alternative to Gemini CLI/Claude Code - works with LM Studio and local models
Convert audio files (flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, and webm) to SRT subtitles with OpenAI Whisper. Easy script for fast, accurate transcription.
DeepSeekDeepResearch is a Python-based AI assistant that leverages a local LLM (DeepSeek-R1:7b via Ollama), the Google Custom Search API, and asynchronous web scraping to automatically gather and refine information on any research topic.
The AI Assistant uses OpenAI's GPT models and Langchain for agent management and memory handling. With a Streamlit interface, it offers interactive responses and supports efficient document search with FAISS. Users can upload and search pdf, docx, and txt files, making it a versatile tool for answering questions and retrieving content.
Kivywhisper is a cross platform Python GUI for OpenAI's Whisper.
A repository of prompts I've used when working with LLMs as well as some example outputs and notes
Agent-native image-editing SDK for Claude Code. 21 MCP tools + /decompose skill — semantic layer splits, L1–L5 cultural scoring, region inpaint. Powered by ComfyUI, Gemini, or mock.
Autonomous CLI Agent that uses Local LLM with Ollama to execute tasks in CLI.
Add voice-to-text capabilities to Claude Code using OpenAI Whisper for speech recognition.
One-key voice-to-transcription tool: record speech, transcribe locally with Whisper, then paste. Never lose your audio files again!
AI-powered Design Authority — 5 specialist agents evaluate architecture decisions and deliver structured rulings in under 60 seconds
The ultimate PyQt6 application that integrates the power of OpenAI, Google Gemini, Claude, and other open-source AI models
The TeamAI application allows users to create a team of AI-powered assistants with individual capabilities and personas. The AI assistants solve the task requested by the user as a team effort, each bot contributing its respective capabilities. Supported providers are Ollama and OpenAI.
V1Claw is a self-hosted AI assistant that runs on your Mac, Linux, Windows, or Android via Termux. Connect any LLM (Claude, GPT, Gemini, or local), talk to it through voice or text, and let it control your device - read files, run commands, browse the web, send messages, & more. One binary. No cloud dependency. Your data stays on your machine.
DeepSeek-R1 on modal.com using Open WebUI and Ollama - quick one-command run
LLM supported tools for Homebox storage
TalkType is a cross-platform application built with Electron, supporting Windows, macOS, and Linux. By combining Automatic Speech Recognition (ASR) with Large Language Models (LLM), it goes beyond simple dictation to offer "Understanding", "Polishing", and "Q&A" capabilities — your all-in-one voice writing assistant.
Linux alternative to Wispr Flow for vibe coding with Cursor but works with any app. Uses OpenAI optionally for dictation clean up.
AudioWrite: Effortless voice dictation powered by Google's Gemini API. Record, transcribe, and transform rambling audio into polished, multi-language notes. PWA ready.
A chat-based discovery tool leveraging AI function calling to explore diverse content via search APIs. Supports Gemini AI & OpenAI models.
arXiv.org ChatGPT Plugin
Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.
Fast CLI-first LLM council + bundler + multimodal gateway
Twidi Speech To Text (openai, push to talk, linux, wayland, deepgram)
Voice dictation for Linux/Wayland (like Wispr Flow). 100% offline, GPU-accelerated, and actually works with Wayland compositors.
An offline-first desktop app to automatically transcribe and edit video subtitles using OpenAI Whisper. Full control over text, timing, and advanced styling in a powerful, intuitive editor.
🛠️ Build and customize Claude Code agents with tools and Docker isolation for efficient production workflows and advanced reasoning capabilities.
WhisperVoice: Covert voice notes. Encrypts text and hides it via LLM-generated acrostic sentences. Murf.ai creates natural audio. Browser extension decrypts with passcode, revealing hidden message or playing decoy for unauthorized listeners. Uses LLM, Murf.ai, STT APIs
Speech-to-Text/Code using a fast local LLM, for Linux, uses Whisper
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Researching use cases for LLMs using Sefaria's data
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
Generate lyric-aligned SRT subtitles with Gemini and review them in a local viewer.
An Agent SKILL for Israel Rail API
GPU-accelerated speech-to-text service that types what you say, powered by OpenAI's Whisper AI
Your voice - VocalFlow dictation, harnessing Whisper and faster-whisper for real-time transcription, adaptive learning, and NLP. Built with Python, it spans Linux, Windows, and macOS, boosting productivity through voice-assisted workflows.
Real-time desktop audio transcription using OpenAI Whisper for Arch Linux with CUDA acceleration
A powerful audio transcription server that seamlessly transcribes meeting recordings, generates notes, and intelligently splits audio files for efficient management. Open-source and built with FastMCP and Groq/OpenAI Whisper
MCP server for real-time audio transcription using OpenAI Whisper
Next.js app for Claude Code best practices built with Fumadocs. Document and explore content sources, routes, and MDX docs for fast, scalable docs sites. 🐙
Simple Python Tkinter GUI app for Linux that uses OpenAI's Whisper for transcription.
An admin application I'm developing for managing all aspects of working with LLMs professionally and at scale, with functions for prompt logging, prompt library, and custom LLM agent inventorising. Aspiration: add code to notes gradually as project matures!
Great online communities discussing use of LLMs including (but not limited to) ChatGPT
A reproducible evaluation of how frontier LLMs handle grounded vs. ungrounded questions in RAG systems — measuring correctness, grounding, and calibrated refusal.
Meeting Minutes of the AI Council: Ten Leading AI Models Deliberate on the Human-Machine Trust Crisis Triggered by the Claude Mythos. A transparent experiment for human-AI symbiosis governance. (This project and its meetings were conducted entirely in Chinese; if you want an English version, please translate it yourself.)
LLM-powered agentic system for Israeli Knesset election analysis — comparing 4 tool routing strategies (DS-UA 301)
Open-Source Speech-to-Text Evaluation Framework
An LLM agent for talking to your CSVs.
Awesome-LLM: a curated list of Large Language Models
Utility for identifying newly released custom ChatGPTs shared via the ChatGPT App Store
is a professional-grade translation engine designed to bridge the gap between English and Hebrew with surgical precision. Unlike standard translators, Aegis employs a multi-layered "Shield" architecture featuring automated auditing and AI-driven quality assurance to ensure every line is natural, accurate, and perfectly formatted.
Advanced GenAI Legal Ecosystem for Israeli Law. Powered by Autonomous Agents, Multi-Step Tool Calling (Skills), and Gemini Pro. Architected for Precision with RAG, Vector Embeddings, and Agentic Workflows to deliver
AI Presidential Briefing: Daily knowledge synthesis system with memory layer, LLM council, and LinkedIn post generation
Product deduplication pipeline for Israeli price-comparison — Hebrew/English normalization, FAISS embeddings, LLM cluster refinement. Pair F1: 0.955
A Cloudflare web app (Nuxt + Hono worker) that uses the Gemini API to analyze Israeli payslips
AI Recommendation Platform for Israeli Small Businesses - Get recommended by ChatGPT and Perplexity
Israeli career forecast for the high-tech industry using LLM forecasting via Claude Code.
Terminal UI for chatting with MiMo models through Xiaomi's OpenAI-compatible API
An index of prompting libraries for GPTs, including ChatGPT. Some of these are on Github and others are hosted externally.
A fast, lightweight Linux tool that converts speech to text and types it into any window using OpenAI's Whisper API.
Voice-to-text input daemon for Linux using OpenAI Whisper
Offline Voice Dictation & Text Enhancement: a lightweight, 100% local Linux tool for real-time voice-to-text transcription and LLM-powered writing improvements.
from microphone directly to your app
Linux voice transcription with hotkey using faster-whisper (local) with optional GPT-4o mini polishing
Tests for using multiple different frameworks to replicate "Deep Agents" aka multi-agent systems that run on long tasks (such as Claude Code, OpenAI Deep Research, Manus, etc.)
Langflow-based LLM agent that keeps track of my personal projects. Based on integration with WhatsApp voice messages, Whisper, OpenAI/Mistral models and local MCP.
App for transcribing audio/video to editable SRT subtitles using Whisper. Supports mp3/mp4/wav inputs, audio extraction, and local download.
A clean, stateless AI-powered text transformation tool designed for local AI models and OpenAI-compatible APIs. Each transformation is treated as an independent request with no chat history, optimizing performance for local AI inference.
A free and open-source tool designed to automate typing tasks, with built-in AI for advanced users.
Talk to your documents with Ollama and Langchain - supports CSV, PDF, DOCX, PPTX, TXT, XLSX.
ChatGPT-generated Python scripts for various purposes
LLM-generated documents describing potential use-cases for GPTs & other LLMs across both professional and business contexts with sections describing potential/predicted evolutions as underlying technology advances