Machine Learning & Deep Learning

326 repos
An Open Source Machine Learning Framework for Everyone
★ 194,883C++updated 2026-04-20deep-learningdeep-neural-networksdistributedmachine-learningml
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
★ 160,697HTMLupdated 2026-04-20aiartificial-intelligenceawesome-listchatgptchatgpt-prompts
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
★ 159,926Pythonupdated 2026-04-20audiodeep-learningdeepseekgemmaglm
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
★ 110,102Pythonupdated 2026-04-20aicomfycomfyuipythonpytorch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
★ 91,441Jupyter Notebookupdated 2026-04-16aiartificial-intelligencechatbotchatgptdeep-learning
A high-throughput and memory-efficient inference and serving engine for LLMs
★ 78,154Pythonupdated 2026-04-20amdblackwellcudadeepseekdeepseek-v3
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
★ 73,813MDXupdated 2026-03-11agentagentsai-agentschatgptdeep-learning
Tesseract Open Source OCR Engine (main repository)
★ 73,725C++updated 2026-04-19hacktoberfestlstmmachine-learningocrocr-engine
A curated list of awesome Machine Learning frameworks, libraries and software.
★ 72,295Pythonupdated 2026-04-12
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
★ 70,611Pythonupdated 2026-04-20agentaideepseekfine-tuninggemma
Financial data platform for analysts, quants and AI agents.
★ 66,523Pythonupdated 2026-04-19aicryptoderivativeseconomicsequity
Clone a voice in 5 seconds to generate arbitrary speech in real-time
★ 59,641Pythonupdated 2026-03-09deep-learningpythonpytorchtensorflowtts
Ultralytics YOLO 🚀
★ 56,435Pythonupdated 2026-04-20clicomputer-visiondeep-learninghubimage-classification
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
★ 45,200Pythonupdated 2026-04-20airflowapacheapache-airflowautomationdag
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
★ 45,178Pythonupdated 2024-08-16deep-learningglow-ttshifiganmelganmulti-speaker-tts
Streamlit — A faster way to build and share data apps.
★ 44,356Pythonupdated 2026-04-20data-analysisdata-sciencedata-visualizationdeep-learningdeveloper-tools
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
★ 42,311Pythonupdated 2026-04-20data-sciencedeep-learningdeploymentdistributedhyperparameter-optimization
Making large AI models cheaper, faster and more accessible
★ 41,379Pythonupdated 2026-04-13aibig-modeldata-parallelismdeep-learningdistributed-computing
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
★ 41,260Pythonupdated 2026-04-17algorithmic-tradingauto-quantdeep-learningfinancefintech
AI-Powered Photos App for the Decentralized Web 🌈💎✨
★ 39,573Goupdated 2026-04-20aigolanggoogle-photosmachine-learningphotography
Google Research
★ 37,792Jupyter Notebookupdated 2026-04-19aimachine-learningresearch
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
★ 36,710Pythonupdated 2026-04-17augmixconvnextdistributed-trainingefficientnetimage-classification
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
★ 35,193Pythonupdated 2024-08-06aminedenoiseesrganimage-restorationjpeg-compression
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
★ 34,110Jupyter Notebookupdated 2026-03-23agentsaillmsmachine-learningmcp
Vane is an AI-powered answering engine.
★ 33,989TypeScriptupdated 2026-04-11ai-agentsai-search-engineanswering-engineartificial-intelligencellm
💫 Industrial-strength Natural Language Processing (NLP) in Python
★ 33,513Pythonupdated 2026-03-28aiartificial-intelligencecythondata-sciencedeep-learning
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
★ 33,452Pythonupdated 2026-04-18deep-learningdiffusionfluximage-generationimage2image
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
★ 33,327Jupyter Notebookupdated 2026-03-25deep-learningmachine-learning
Visualizer for neural network, deep learning and machine learning models
★ 32,805JavaScriptupdated 2026-04-20aicoremldeep-learningdeeplearningkeras
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
★ 32,207Pythonupdated 2025-09-30artificial-intelligencepythonpytorch
NVR with realtime local object detection for IP cameras
★ 31,587TypeScriptupdated 2026-04-20aicameragoogle-coralhome-assistanthome-automation
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
★ 31,080Pythonupdated 2026-04-20aiartificial-intelligencedata-sciencedeep-learningmachine-learning
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
★ 30,698Rustupdated 2026-04-20ai-searchai-search-engineembeddings-similarityhnswhybrid-search
:memo: An awesome Data Science repository to learn and apply for real world problems.
★ 28,891updated 2026-04-18analyticsawesome-listdata-miningdata-sciencedata-scientists
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
★ 28,130Pythonupdated 2025-09-30agentic-ragdeep-researchemnlp2024knowledge-curationlarge-language-models
Label Studio is a multi-type data labeling and annotation tool with standardized output format
★ 27,135TypeScriptupdated 2026-04-20annotationannotation-toolannotationsboundingboxcomputer-vision
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
★ 26,982Jupyter Notebookupdated 2026-04-15aiembeddingslangchainllama-indexllm
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
★ 26,818TypeScriptupdated 2026-04-17applicant-tracking-systematshacktoberfestmachine-learningnatural-language-processing
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
★ 25,566Pythonupdated 2026-04-20agentopsagentsaiai-governanceapache-spark
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
★ 24,992MDXupdated 2026-04-20agentagentsaigeminigenerative-ai
GUI for a Vocal Remover that uses Deep Neural Networks.
★ 24,423Pythonupdated 2025-03-13audioinstrumentalkaraokekareokeemusic
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
★ 22,972Pythonupdated 2024-07-28dialoguemachine-learningmachine-translationnamed-entity-recognitionnatural-language-processing
Learn OpenCV : C++ and Python Examples
★ 22,892Jupyter Notebookupdated 2026-04-16aicomputer-visioncomputervisiondeep-learningdeep-neural-networks
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
★ 22,628Pythonupdated 2026-03-01age-predictionarcfacedeep-learningdeepfacedeepid
Universal LLM Deployment Engine with ML Compilation
★ 22,526Pythonupdated 2026-04-20language-modelllmmachine-learning-compilationtvm
Faster Whisper transcription with CTranslate2
★ 22,404Pythonupdated 2025-11-19deep-learninginferenceopenaiquantizationspeech-recognition
Best Practices on Recommendation Systems
★ 21,660Pythonupdated 2026-04-18aiartificial-intelligencedata-sciencedeep-learningjupyter-notebook
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
★ 21,453Pythonupdated 2026-04-17aiartificial-intelligencecomputer-visiondataset-hubdatasets
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
★ 19,816Jupyter Notebookupdated 2026-04-19chatgptfinancefingptfintechlarge-language-models
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
★ 19,670C++updated 2026-03-07anime4kframe-interpolationmachine-learningneural-networksrealcugan
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
★ 19,355C#updated 2026-04-17deep-learningdeep-reinforcement-learningmachine-learningneural-networksreinforcement-learning
Tongyi Deep Research, the Leading Open-source Deep Research Agent
★ 18,746Pythonupdated 2026-02-27agentalibabaartificial-intelligencedeep-researchdeepresearch
Run agents that work for you based on what you do. AI finally knows what you are doing
★ 18,404Rustupdated 2026-04-20agentsagiaicomputer-visionllm
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
★ 18,303Jupyter Notebookupdated 2026-04-17aifinetuninglangchainllamallama2
The free and privacy-friendly screen recorder with no limits 🎥
★ 18,138JavaScriptupdated 2026-04-08annotationannotation-toolaudiocamerachrome-extension
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
★ 17,849Pythonupdated 2026-04-20agent-builderagentsaichatgptdocsgpt
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
★ 17,126Pythonupdated 2026-04-20asrdeeplearninggenerative-aimachine-translationneural-networks
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
★ 16,793Pythonupdated 2026-04-19agentai-societiesartificial-intelligencecommunicative-aicooperative-ai
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.
★ 16,080Goupdated 2026-04-20approximate-nearest-neighbor-searchgenerative-searchgrpchnswhybrid-search
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
★ 15,852Pythonupdated 2026-03-17audio-visual-speech-recognitionconformerdfsmnparaformerpretrained-model
🦉 Data Versioning and ML Experiments
★ 15,565Pythonupdated 2026-04-14aidata-sciencedata-version-controldeveloper-toolsmachine-learning
FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.
★ 15,037Pythonupdated 2026-04-20ai-agentsalgorithmic-tradingbloomberg-terminalcppfinance
This repository contains the source code for the paper First Order Motion Model for Image Animation
★ 15,008Jupyter Notebookupdated 2024-11-14deep-learninggenerative-modelimage-animationmotion-retargeting
FinRL®: Financial Reinforcement Learning. 🔥
★ 14,920Jupyter Notebookupdated 2026-04-05algorithmic-tradingdeep-reinforcement-learningdrl-algorithmsdrl-frameworkdrl-trading-agents
The open-source hub to build & deploy GPT/LLM Agents ⚡️
★ 14,652TypeScriptupdated 2026-04-17agentaibotpresschatbotchatgpt
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
★ 14,559HTMLupdated 2026-04-14data-pipelinesdeep-learningdocument-image-analysisdocument-image-processingdocument-parser
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
★ 14,102Jupyter Notebookupdated 2024-08-19darknetpytorchscaled-yolov4yoloryolov3
A curated list of references for MLOps
★ 13,874updated 2024-11-21aidata-sciencedevopsengineeringfederated-learning
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
★ 13,518updated 2025-08-12aiartificial-intelligencedeep-learningintelligent-machinesintelligent-systems
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
★ 13,326Pythonupdated 2026-04-16aiartificial-intelligencedeep-learninglarge-language-modelsllm
Node-based Visual Programming Toolbox
★ 12,688QMLupdated 2026-04-173d-reconstructionalicevisioncamera-trackingcomputer-visionhdr-imaging
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
★ 12,424Pythonupdated 2026-04-14agentsaiai-agentsembeddingsinformation-retrieval
Low-code framework for building custom LLMs, neural networks, and other AI models
★ 11,677Pythonupdated 2026-04-19computer-visiondata-centricdata-sciencedeepdeep-learning
COLMAP - Structure-from-Motion and Multi-View Stereo
★ 11,539C++updated 2026-04-19computer-visiongeometrymulti-view-stereoreconstructionstructure-from-motion
A PyTorch-based Speech Toolkit
★ 11,475Pythonupdated 2026-04-03asraudioaudio-processingdeep-learninghuggingface
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
★ 11,278Rustupdated 2026-04-19aiai-engineeringanthropicartificial-intelligencedeep-learning
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
★ 11,031Pythonupdated 2026-04-20aicollaborationdata-sciencedata-versioningdeep-learning
A fast, local neural text to speech system
★ 10,858C++updated 2025-08-26speech-synthesistext-to-speechtts
Large Language Model Text Generation Inference
★ 10,844Pythonupdated 2026-03-21bloomdeep-learningfalcongptinference
Open source annotation tool for machine learning practitioners.
★ 10,635Pythonupdated 2026-04-14annotation-tooldata-labelingdatasetdatasetsmachine-learning
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
★ 10,605Pythonupdated 2026-04-20clouddatacenterdeep-learningedgegpu
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
★ 10,142C++updated 2026-04-20aicomputer-visiondeep-learningdeploy-aidiffusion-models
Techniques for deep learning with satellite & aerial imagery
★ 10,119updated 2026-04-15convolutional-neural-networksdatasetdatasetsdeep-learningdeep-neural-networks
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
★ 10,090Pythonupdated 2024-09-07bloomchatbotdeep-learningdistributed-systemsfalcon
Build, Manage and Deploy AI/ML Systems
★ 10,056Pythonupdated 2026-04-20agentsaiawsazurecost-optimization
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
★ 9,830Jupyter Notebookupdated 2026-04-16overlapped-speech-detectionpretrained-modelspytorchspeaker-change-detectionspeaker-diarization
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
★ 9,805Pythonupdated 2026-04-09aicookiecuttercookiecutter-data-sciencecookiecutter-templatedata-science
AI powered open source recommender system engine supports classical/LLM rankers and multimodal content via embedding
★ 9,619Goupdated 2026-04-22collaborative-filteringgoknnmachine-learningrecommender-system
A collection of pre-trained, state-of-the-art models in the ONNX format
★ 9,562Jupyter Notebookupdated 2026-03-09deep-learningdownloadmodelsonnxpretrained
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
★ 9,516Pythonupdated 2026-04-21natural-language-processingnlpnltkpatternpython
Containers for machine learning
★ 9,402Goupdated 2026-04-17aicontainerscudadockermachine-learning
Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!
★ 9,270Pythonupdated 2026-03-02anomaly-detectionawesomeawesome-listdata-miningfraud
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
★ 9,261Pythonupdated 2026-04-20artificial-intelligencechatglmdeploymentflan-t5gemma
Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.
★ 9,100C++updated 2026-02-16agentagentic-ragaiclawbotcomputer-vision
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
★ 8,899Pythonupdated 2026-03-26onnxonnx-runtimeonnxruntimepytorchspeech
ModelScope: bring the notion of Model-as-a-Service to life.
★ 8,878Pythonupdated 2026-04-20cvdeep-learningmachine-learningmulti-modalnlp
🧙 Build, run, and manage data pipelines for integrating and transforming data.
★ 8,710Pythonupdated 2026-04-02artificial-intelligencedatadata-engineeringdata-integrationdata-pipelines
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
★ 8,695Pythonupdated 2026-04-09deep-learningextracthardsubocrripper
A curated list of practical financial machine learning tools and applications.
★ 8,530Pythonupdated 2025-01-03algorithmic-tradingcryptocurrencyfinanceinvestmentquant
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
★ 8,470Pythonupdated 2024-08-13aideep-learningemotionemotivoicemulti-speaker
A curated list of awesome embedded programming.
★ 8,466updated 2026-03-18aiautosarawesomebeaglebonebootloader
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
★ 8,066Jupyter Notebookupdated 2025-11-28agenticaiai-agentsai-engineeringdeep-learning
Multilingual Voice Understanding Model
★ 8,041Pythonupdated 2025-12-30aiaigcasraudio-event-classificationcross-lingual
A self-hosted open source photo management service.
★ 7,980Pythonupdated 2026-04-17djangoexifhacktoberfestmachine-learningphoto
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.
★ 7,852Pythonupdated 2026-03-21aibackground-removalbackground-removerbackgroundremoverphoto-editing
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
★ 7,806Pythonupdated 2026-04-20anonymizationdata-anonymizationdata-maskingdata-obfuscationdata-privacy
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
★ 7,587Pythonupdated 2026-04-17agentic-aiagentsai-agentsai-agents-frameworkanthropic
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
★ 7,501updated 2022-12-17computer-visiondeep-learningdetectionobject-detectionobject-localisation
A customisable 3D platform for agent-based AI research
★ 7,353Cupdated 2023-01-04artificial-intelligencedeep-learningmachine-learningneural-networks
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
★ 7,146Pythonupdated 2025-06-09aiartificial-intelligenceautomationcrawlermachine-learning
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
★ 7,075Pythonupdated 2026-04-20ai-artcaptiondiffusersgenerative-artpython
An open source library for deep learning end-to-end dialog systems and chatbots.
★ 6,975Pythonupdated 2025-08-06aiartificial-intelligencebotchatbotchitchat
AI + Data, online. https://vespa.ai
★ 6,896Javaupdated 2026-04-20aibig-datajavamachine-learningrag
Flower: A Friendly Federated AI Framework
★ 6,854Pythonupdated 2026-04-19aiandroidartificial-intelligencecppdeep-learning
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
★ 6,777Jupyter Notebookupdated 2026-04-03aiagentchatgptfinancefingptlarge-language-models
Mycroft Core, the Mycroft Artificial Intelligence platform.
★ 6,623Pythonupdated 2024-09-08aiarchartificial-intelligencefedorahacktoberfest
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
★ 6,543Pythonupdated 2026-04-20agentic-aiagentic-workflowagentsaiartificial-intelligence
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
★ 6,243Pythonupdated 2024-08-10adversarial-trainingdeep-learningdiffusion-modelsganlatent-diffusion
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
★ 6,167Pythonupdated 2025-06-04aiaudio-generationdeep-learningfoundation-modelsgpt
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
★ 6,100Pythonupdated 2026-04-19aidata-sciencedata-visualizationexperiment-trackingmachine-learning
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
★ 5,947Pythonupdated 2025-12-12adversarial-attacksadversarial-examplesadversarial-machine-learningaiartificial-intelligence
Silero Models: pre-trained text-to-speech models made embarrassingly simple
★ 5,888Jupyter Notebookupdated 2026-04-16armenianazerbaijanibelaruscolabgeorgian
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
★ 5,820TypeScriptupdated 2026-04-20chatgptchatgpt-apideep-learningfew-shot-learninggpt
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
★ 5,802Pythonupdated 2025-09-12article-extractorcorpus-buildercorpus-toolscrawlerhtml-to-markdown
🔥 Awesome list of resources on Web Development.
★ 5,723updated 2025-07-16aiawesome-listcssdeveloper-storiesjavascript
🍊 :bar_chart: :bulb: Orange: Interactive data analysis
★ 5,605Pythonupdated 2026-04-17classificationclusteringdata-miningdata-sciencedata-visualization
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
★ 5,437Rustupdated 2026-04-20ai-engineeringai-pipelinearrowartificial-intelligencebig-data
ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.
★ 5,362Pythonupdated 2026-04-20agentopsagentsaiautomldata-science
Superduper: End-to-end framework for building custom AI applications and agents.
★ 5,271Pythonupdated 2025-09-01aichatbotdatadatabasedistributed-ml
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
★ 5,178Pythonupdated 2026-01-28agentsaiai-engineerai-engineeringcopilot
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
★ 5,144Pythonupdated 2025-09-16face-detectionface-identificationface-recognitionface-trackingfacial-recognition
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
★ 5,025Jupyter Notebookupdated 2026-02-24dependency-graph
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
★ 5,002Pythonupdated 2026-03-18action-recognitionavabenchmarkdeep-learningi3d
A curated list of Artificial Intelligence Top Tools
★ 4,992updated 2025-12-31aiai-agentai-agentsai-assistantai-tools
Long list of geospatial tools and resources
★ 4,977updated 2026-04-22awesomeawesome-listdata-analysisdeep-learningearth-observation
Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy, over six times better than GPT-4. Built with Adaptive Graph-Guided Retrieval and Persistent Debug Memory. Model available Q1 2026 via Kodezi OS.
★ 4,945Javaupdated 2025-11-12artificial-intelligenceautonomous-debuggingbenchmarkbenchmark-reportbug-fixing
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
★ 4,945Pythonupdated 2026-04-13active-learningaiannotation-tooldeveloper-toolsgpt-4
An Open-Source Framework for Prompt-Learning.
★ 4,857Pythonupdated 2024-07-16aideep-learningnatural-language-processingnatural-language-understandingnlp
On-device wake word detection powered by deep learning
★ 4,798Pythonupdated 2026-04-17handsfreehotwordhotword-detectionhotword-detectorkeyword-spotter
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
★ 4,764Pythonupdated 2026-01-04fastapihuggingface-spaceskokorokokoro-ttsonnx
Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community
★ 4,688MDXupdated 2025-01-14chatgptchatgpt-apideep-learninggpt-3gpt-4
Build local voice agents with open-source models
★ 4,686Pythonupdated 2026-04-20aiassistantlanguage-modelmachine-learningpython
Next generation of automated data exploratory analysis and visualization platform.
★ 4,634TypeScriptupdated 2026-03-10augmented-analyticsautomated-data-analysisautomated-visualizationautoviscausal-discovery
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
★ 4,610Jupyter Notebookupdated 2026-04-13causal-inferencecausalityeconometricseconomicsmachine-learning
An Open Source text-to-speech system built by inverting Whisper.
★ 4,595Jupyter Notebookupdated 2025-12-14pytorchspeech-synthesistts
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
★ 4,588Pythonupdated 2026-03-27chatgptchatgpt-apichatgpt-pythongpt-3gpt-3-prompts
🤗 AutoTrain Advanced
★ 4,571Pythonupdated 2026-04-17autotraindeep-learninghuggingfacemachine-learningnatural-language-processing
Fast inference engine for Transformer models
★ 4,450C++updated 2026-02-04avxavx2cppcudadeep-learning
Visual Object Tagging Tool: An electron app for building end to end Object Detection Models from Images and Videos.
★ 4,430TypeScriptupdated 2021-12-06annotation-toolcntkdeep-learningdetectiondetection-model
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.
★ 4,412Pythonupdated 2026-04-17agent-based-frameworkagent-based-simulationai-societiesdeep-learninglarge-language-models
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
★ 4,409Pythonupdated 2026-03-13agentaiapplicationdatadeep-learning
A curated list of awesome data labeling tools
★ 4,314updated 2024-06-173d-annotationannotationannotation-toolaudio-annotationaudio-annotation-tool
modular quant framework.
★ 4,112Pythonupdated 2026-04-13algorithmic-tradingbacktestingcryptocurrencyfintechfundamental-analysis
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
★ 4,098Pythonupdated 2025-08-14audiobandwidth-extensiondeep-learningnoise-suppressionpytorch
Noise supression using deep filtering
★ 4,095Pythonupdated 2024-10-17audiodeep-learningnoise-suppressionpytorchrust
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
★ 4,079Pythonupdated 2026-04-10chatgptclaudeclipcomputer-visionevaluation
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
★ 4,009C#updated 2026-04-18aicomfyuicsharpimage-generationjavascript
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
★ 4,004Pythonupdated 2026-04-22computer-visiondatasetsdeep-learningearth-observationgeospatial
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
★ 3,974Pythonupdated 2026-04-20artificial-intelligencedeep-learninggeopythongeospatialmachine-learning
🛰️ List of satellite image training datasets with annotations for computer vision and deep learning
★ 3,884updated 2022-07-14computer-visiondeep-learningearth-observationinstance-segmentationmachine-learning
ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.
★ 3,864HTMLupdated 2026-03-06adversarial-machine-learningagentaiassistantchatgpt
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.
★ 3,861Jupyter Notebookupdated 2026-04-13agentagentic-aianomaly-detectionartificial-intelligencedeep-learning
Java dataframe and visualization library
★ 3,748Javaupdated 2026-03-02chartdata-analysisdata-framedata-sciencedata-visualization
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
★ 3,720Pythonupdated 2025-12-29comfycomfyuimachine-learning
A deep learning library for video understanding research.
★ 3,554Pythonupdated 2026-01-12
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
★ 3,530Jupyter Notebookupdated 2026-02-20artificial-intelligencehacktoberfestmachine-learningprojectspython
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
★ 3,312updated 2026-04-10aiawesomedatasetsllmnlp
A library for audio and music analysis, feature extraction.
★ 3,302Cupdated 2026-03-06audioaudio-analysisaudio-featuresaudio-processingdeep-learning
Algorithmic Trading in Python with Machine Learning
★ 3,274Pythonupdated 2026-04-20aialgorithmic-tradingalgotradingartificial-intelligencebacktesting
Evaluation and Tracking for LLM Experiments and AI Agents
★ 3,269Pythonupdated 2026-04-20agent-evaluationagentopsai-agentsai-monitoringai-observability
QualityScaler - image/video AI upscaler app
★ 3,036Pythonupdated 2026-04-05amdanimecompression-artifact-reductiondeep-learningdirectx-12
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
★ 3,017Cupdated 2026-02-13alexadeep-learningechoesp-adfesp-idf
Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home, office or any other place you want to monitor.
★ 3,011Pythonupdated 2026-04-17coralcudadarknetedgetpuface-recognition
Toolkit for creating, sharing and using natural language prompts.
★ 3,008Pythonupdated 2023-10-23machine-learningnatural-language-processingnlp
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
★ 3,007Pythonupdated 2026-01-16banditscontextual-banditsdqnmulti-armed-banditsreinforcement-learning
Must-read Papers on LLM Agents.
★ 2,984updated 2026-04-17agentagentsawsome-listenvironmentin-context-learning
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
★ 2,957Pythonupdated 2026-02-21aicomputer-visiondeep-learningdeep-reinforcement-learningpython
GeoAI: Artificial Intelligence for Geospatial Data
★ 2,913Pythonupdated 2026-04-15aidata-sciencedeep-learningearth-observationgeoai
Data manipulation and transformation for audio signal processing, powered by PyTorch
★ 2,869Pythonupdated 2026-04-20audioaudio-processingiomachine-learningpython
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
★ 2,803Pythonupdated 2025-09-09asrattention-is-all-you-needattention-mechanismattention-modelattention-network
A Web UI for easy subtitle using whisper model.
★ 2,764Pythonupdated 2025-12-29aigradioopen-sourcepythonpytorch
Control Any Computer Using LLMs.
★ 2,659Pythonupdated 2026-02-25assistantassistant-computer-controlautomationgptgpt4
✨ Build a machine learning model from a prompt
★ 2,564Pythonupdated 2026-03-06agentic-aiagentsaimachine-learningml
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
★ 2,520Pythonupdated 2024-08-13chatbotdeep-learningflaxjaxlanguage-model
A toolkit to run Ray applications on Kubernetes
★ 2,468Goupdated 2026-04-19apachedeep-learningkubernetesmachine-learningray
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
★ 2,444Pythonupdated 2026-04-17evaluationmachine-learning
news-please - an integrated web crawler and information extractor for news that just works
★ 2,441Pythonupdated 2026-04-14cc-newsccnewscommoncrawlcrawlerdata-gathering
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
★ 2,436updated 2026-02-07awseome-listgenerative-adversarial-networkimage-generationimage-manipulationimage-synthesis
the terminal client for Ollama
★ 2,360Pythonupdated 2025-12-19llmllmsmachine-learningollamapython
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
★ 2,344Pythonupdated 2024-08-18autoevaluationevaluationexperimentationhallucination-detectionjailbreak-detection
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
★ 2,327Jupyter Notebookupdated 2026-03-063d3d-reconstruction3d-visionaiaugmented-reality
A powerful tool that translates ComfyUI workflows into executable Python code.
★ 2,310Pythonupdated 2026-04-19ai-artcomfyuigenerative-artimage-generationpytorch
An open source library and framework for deep learning on satellite and aerial imagery.
★ 2,215Pythonupdated 2025-09-29classificationcomputer-visiondeep-learninggeospatialmachine-learning
A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.
★ 2,199updated 2026-04-16awesomechange-detectiondatasetdeep-learningremote-sensing
🧠 Make your agents learn from experience. Now available as a hosted solution at kayba.ai
★ 2,176Pythonupdated 2026-04-25agent-learningagent-memoryagentsaiai-agents
Open-source AI-driven quantitative trading platform for crypto, stocks, and forex with backtesting, live trading, market data, and multi-agent research.
★ 2,135Pythonupdated 2026-04-20ai-traderalgorithmic-trading-portfolioalgotradebacktesting-frameworkscryptocurrencies
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
★ 2,060JavaScriptupdated 2025-03-15annotate-imagesannotation-toolclassificationcomputer-visioncsv
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
★ 2,016Pythonupdated 2026-04-20aiai-challengesangularjsartificial-intelligencechallenge
🤖 A Python library for learning and evaluating knowledge graph embeddings
★ 1,983Pythonupdated 2026-04-21cudadeep-learningknowledge-base-completionknowledge-graph-embeddingsknowledge-graphs
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
★ 1,977Jupyter Notebookupdated 2025-08-09deep-learningevaluationfoundation-modelsinstruction-followinglarge-language-models
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.
★ 1,891JavaScriptupdated 2026-03-20anthropicanthropic-claudebrowser-automationbrowser-extensioncomputer-use
ContextGem: Effortless LLM extraction from documents
★ 1,823Pythonupdated 2026-03-16aicontract-analysisdata-extractiondocument-intelligencedocx
Cross-Platform, GPU Accelerated Whisper 🏎️
★ 1,800TypeScriptupdated 2024-02-27audiomachine-learningrustspeech-recognitionwebgpu
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
★ 1,728updated 2025-05-31cudadatasetsdeepseekfew-shot-object-detectiongui
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
★ 1,715updated 2026-04-17aiartificial-intelligencelarge-language-modelsllmmachine-learning
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
★ 1,591Pythonupdated 2025-01-01deep-learninglanguage-modelmachine-learningmulti-modal-learningnatural-language-processing
AI for GNU Image Manipulation Program
★ 1,542Pythonupdated 2024-10-12coloring-imagecomputer-visioncomputervisiondeblurringdeep-learning
Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.
★ 1,492TypeScriptupdated 2025-07-23aiassistant-chat-botscomputer-visionllmspeech-recognition
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
★ 1,443Pythonupdated 2025-11-26agentnlp
A Python/Pytorch app for easily synthesising human voices
★ 1,440Pythonupdated 2024-12-02deep-learningpythonpytorchtacotron2text-to-speech
This repository aims to map the ecosystem of artificial intelligence guidelines, principles, codes of ethics, standards, regulation and beyond.
★ 1,428updated 2026-04-18aiai-ethicsai-guidelinesai-policydata-ethics
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
★ 1,314Pythonupdated 2026-02-10aiai-artartasset-generatorchatbot
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
★ 1,311Pythonupdated 2026-04-20agentsaillmmcpmcp-client
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere
★ 1,259Pythonupdated 2026-04-12blackwellchatbotdecentralized-inferencedeepseekdistributed-systems
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
★ 1,211Cupdated 2025-12-17androidasrchinesectranslate2huggingface
Unofficial PyTorch implementation of Google AI's VoiceFilter system
★ 1,201Pythonupdated 2024-07-25audio-separationpytorchsource-separationspeech-separationvoicefilter
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
★ 1,187Pythonupdated 2026-04-02aiapi-serveraudio-generationchatterboxchatterbox-tts
Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.
★ 1,185TypeScriptupdated 2025-07-153d-annotationannotationannotation-toolcomputer-visionimage-annotation
Examples of programs built using Modal
★ 1,172Pythonupdated 2026-04-20clouddistributedgpumachine-learningmodal
😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.
★ 1,170updated 2026-04-11artificial-intelligencegenerative-ailarge-language-modelsmachine-learningretrieval-augmented-generation
Open source audio annotation tool for humans
★ 1,133TypeScriptupdated 2026-02-03annotation-toolaudio-annotationaudio-processingdatasetsmachine-learning
Datasets for deep learning with satellite & aerial imagery
★ 1,131updated 2026-04-15datasetsearth-observationremote-sensingsatellite-datasatellite-imagery
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
★ 1,113Pythonupdated 2026-04-16
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
★ 1,103updated 2025-12-15audio-captioningaudio-language-modelsaudio-question-answeringaudio-reasoningmultimodal-large-language-models
GRASS - free and open-source geospatial processing engine
★ 1,102Cupdated 2026-04-22arraysdata-scienceearth-observationgeospatialgeospatial-analysis
Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial defense, etc.;
★ 1,096Jupyter Notebookupdated 2026-04-24adversarialaspect-based-sentiment-analysisaspect-sentiment-triplet-extractionaspect-term-extractionlcf-bert
Build agents which are controlled by LLMs
★ 1,040Pythonupdated 2025-06-23deep-learninglangchainllmsmachine-learning
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
★ 1,030Pythonupdated 2026-04-17benchmarkpytorch
Web UI for AutoGen (A Framework Multi-Agent LLM Applications)
★ 997TypeScriptupdated 2024-11-14agent-based-frameworkaiai-agentsautogenautogen-sample
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
★ 984Jupyter Notebookupdated 2024-11-22large-language-modelsmachine-learningnlgnlpobservability
Rasa UI is a frontend for the Rasa Framework
★ 966JavaScriptupdated 2025-11-12angularmanage-botsnlpnlp-apisnlp-machine-learning
CodeProject.AI Server is a self contained service that software developers can include in, and distribute with, their applications in order to augment their apps with the power of AI.
★ 957C#updated 2025-07-14artificial-intelligencegenerative-aimlopsobject-detectiononnx
🔧 A curated list of awesome dataset tools
★ 935updated 2023-06-09annotation-toolannotationsawsomeawsome-listdatasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
★ 930updated 2025-07-08aigcartificial-intelligenceaudioaudio-effectaudio-generation
Enhance Images with Javascript and AI. Increase resolution, retouch, denoise, and more. Open Source, Browser & Node Compatible, MIT License.
★ 885TypeScriptupdated 2026-04-17aideblurringdehazingdenoisingderaining
Deep research agent to help you find the best GitHub repositories 🕵️!
★ 867Pythonupdated 2026-04-08agentdeep-researchgithub-searchlangchainlanggraph
Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.
★ 857Pythonupdated 2026-04-17api-restdatadatasetshuggingfacemachine-learning
A TensorFlow based wake word detection training framework using synthetic sample generation suitable for certain microcontrollers.
★ 819Pythonupdated 2025-12-21
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
★ 809JavaScriptupdated 2023-03-16expressjsgpulibretranslatemachine-learningnodejs
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
★ 759Pythonupdated 2026-04-17gptmachine-learningopenaipromptprompt-engineering
Visualise your Kedro data and machine-learning pipelines and track your experiments.
★ 748JavaScriptupdated 2026-04-22data-visualizationexperiment-trackinghacktoberfestkedrokedro-plugin
Orchestra is a human-in-the-loop AI system for orchestrating project teams of experts and machines.
★ 705Pythonupdated 2026-04-15experthuman-computer-interactionhuman-in-the-loop-machine-learningorchestraworkflow
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
★ 701Pythonupdated 2024-05-16aidata-engineeringembeddingsmachine-learningnlp
Papers, code and datasets about deep learning for 3D Object Detection.
★ 686updated 2025-09-173d-representationautonomous-drivingcomputer-visioncvprcvpr2020
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
★ 658Pythonupdated 2026-02-26audio-language-modeldeep-learninglarge-language-modelsmultimodal-large-language-modelsvision-language-model
A Python-based toolbox of various methods in decision making, uncertainty quantification and statistical emulation: multi-fidelity, experimental design, Bayesian optimisation, Bayesian quadrature, etc.
★ 655Pythonupdated 2026-02-22bayesian-optimizationbayesian-quadraturedecision-makingemulationexperimental-design
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
★ 650C++updated 2026-03-18androidasrautomatic-speech-recognitionembeddedmobile
🐝 Multi-agent swarm coordination for OpenCode with learning capabilities, agent issue tracking, and management
★ 645TypeScriptupdated 2026-02-23ai-agentsmachine-learningmulti-agentopencodeswarm
The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment
★ 634Pythonupdated 2026-04-05ai-acceleratorscomputer-visiondeep-learningedge-aihailo
Well tested & Multi-language evaluation framework for text summarization.
★ 626Pythonupdated 2026-04-13bleumachine-learningrougetext-summarization
Computer vision based ML training data generation tool :rocket:
★ 608JavaScriptupdated 2025-02-15aiannotationannotation-toolcococomputer-vision
Tock, the open source conversational AI toolkit.
★ 605Kotlinupdated 2026-04-17aialexaapple-business-chatassistantbot
ChatGPT PROMPTs Splitter. Tool for safely process chunks of up to 15,000 characters per request
★ 565Pythonupdated 2024-02-28ai-conversationscharacter-limitchatgptlanguage-modelnlp-tool
MAD: The first work to explore Multi-Agent Debate with Large Language Models :D
★ 556Pythonupdated 2025-12-16chatgptgpt-4large-language-modelsllmsnlp
Extract hardcoded subtitles from videos using machine learning
★ 547Pythonupdated 2024-01-30
🤖 AI browser extensions & userscripts to augment your web experience
★ 541JavaScriptupdated 2026-04-23aiamazonartificialintelligencebravechat
🤖 Awesome list of AGI Agents. Agents 精选资源合集.
★ 524updated 2023-10-31agentsagents-artagiagi-agentsai
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
★ 511Pythonupdated 2024-08-26artificial-intelligencefine-tuninggenerative-ailarge-language-modelsllm
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
★ 487Cupdated 2025-07-15androidaudio-processingdeep-neural-networksdnngmm
Python scripts performing object detection using the YOLOv8 model in ONNX.
★ 483Pythonupdated 2024-08-22computer-visiondeep-learningobject-detectiononnxonnxruntime
SDK libraries for Modal
★ 466Pythonupdated 2026-04-20aiclouddata-sciencedistributedgenai
No code AI agents
★ 448C++updated 2025-01-16aiai-agentsartificial-intelligencecligpt
A Community Open-Source Saas for Crafting/Building/Creating Chatbots with OpenAI's Assistant API that you can add to your website.
★ 422TypeScriptupdated 2024-10-24aiartificial-intelligenceartificial-neural-networksassistantassistant-app
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
★ 361Pythonupdated 2023-05-23asrjaxpytorchspeech-recognitiontransformers
Autocomplete your obsidian notes with AI, including ChatGPT, through a copilot-like interface.
★ 355TypeScriptupdated 2024-05-28aiai21labschatgptgroqgroq-ai
A curated list of awesome open source healthcare tools, algorithms, datasets and research papers.
★ 320updated 2025-01-20awesome-listawesome-listshealthcarehealthcare-applicationhealthcare-datasets
Home of the AI workforce - Multi-agent system, AI agents & tools
★ 279Pythonupdated 2026-01-15clusteringcomputer-visionembeddingsnatural-language-processingnlp
Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants
★ 275Pythonupdated 2026-04-16botbot-frameworkbotkitbotschatbot
Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.
★ 254Rupdated 2026-04-20agentaibusinessfeature-selectionfinance
This is a list of awesome articles about object detection from video.
★ 247updated 2019-07-01awesome-listcomputer-visiondeep-learningdeep-neural-networksobject-detection
🐸 - A general purpose model trainer, as flexible as it gets
★ 234Pythonupdated 2024-03-07aidata-sciencedeep-learningmachine-learningpytorch
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜
★ 224Pythonupdated 2025-03-05ag2aiaudio-generationautogenelevenlabs
AlphaSuite is an open-source quantitative analysis platform that gives you the power to build, test, and deploy professional-grade trading strategies. It's designed for traders and analysts who want to move beyond simple backtests and develop a genuine, data-driven edge in the financial markets.
★ 213Pythonupdated 2026-03-03algorithmic-tradingbacktestingdata-analysisfintechlangchain
Verify the authenticity of handwritten signatures through digital image processing and neural networks. ✍️
★ 209Pythonupdated 2022-11-12handwritten-signaturesneural-networkopencvsignature-recognitionsignature-verification
A toolkit for building computer use AI agents
★ 194Pythonupdated 2025-06-26agentsllmmachine-learningmultimodal
DeepSeek CLI, a command-line AI coding assistant that leverages the powerful DeepSeek Coder models
★ 191TypeScriptupdated 2025-08-25artificial-intelligenceclaudeclaude-aiclaude-codecursor
A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
★ 183Pythonupdated 2026-04-15aiartificial-intelligenceevaluation-frameworkllmmachine-learning
A modular framework built to simplify Computer Vision inference workloads.
★ 177Pythonupdated 2025-09-04computer-visiondeep-learningnodesobject-detectionobject-tracking
Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
★ 172Pythonupdated 2022-11-21noise-reductionspeech-separation
Open models for Coqui STT
★ 155updated 2023-05-09deep-learningmodelsspeech-to-text
The BEST music separation model with help of A.I. ... to my ears ! 👂👂
★ 149Pythonupdated 2024-06-10artificial-intelligenceaudioinstrumentalinstrumentalskaraoke
This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.
★ 141updated 2024-09-23awesome-listllmnlprole-playing
simplifies the process of creating and managing LLM workflows.
★ 115Pythonupdated 2024-10-21aillmnlp-librarynlp-machine-learningprompt-engineering
A comprehensive evaluation framework for AI agents and LLM applications.
★ 110Pythonupdated 2026-04-17agenticagentic-aiaievaluationmachine-learning
Gnome shell extension for accurate OFFLINE speech to text input in Linux using whisper.cpp. Input text from speech anywhere.
★ 105JavaScriptupdated 2025-04-09aiasrbloat-freedictatedictation
List of PostgreSQL® AI projects and resources
★ 92updated 2025-08-19aimachine-learningpostgresql
A wargaming platform compatible with reinforcement learning agents
★ 87TypeScriptupdated 2025-09-09agent-based-simulationaiartificial-intelligencegame-enginemachine-learning
☕ GPT-2 chatbot for daily conversation
★ 84Pythonupdated 2022-02-14chatbotgpt-2nlp
The project uses a computer vision model to extract structured features from floor plan images for a fire risk assessment.
★ 80Pythonupdated 2024-07-08deep-learningdeep-neural-networksdjangodjango-applicationfloorplan
Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.
★ 74Pythonupdated 2026-01-06agentic-aiagentsagnoai-agentsclaude
🤖🔎 STREAM: Search with Top Result Extraction & Answer Model 🔤📊 SEEKTOPIC 🚜📜 Tractor the Text Extractor 📈📝 REASON Docs Writing Agent
★ 72HTMLupdated 2025-12-29ai-searchautocompletehacktoberfestkeywordsknowledge-graph
A utility to inspect, validate, sign and verify machine learning model files.
★ 67Rustupdated 2025-02-05ggufonnxpytorchsafetensors
A curated list of all things awesome about OpenAI
★ 54Jupyter Notebookupdated 2024-07-19artificial-intelligencejavascriptjsonnlp-machine-learningopenai
Comprehensive guide to AI applications in OSINT workflows and intelligence analysis
★ 53updated 2025-07-14ai-toolsartificial-intelligenceautomated-analysiscomputer-visiondata-analysis
GitHub repository of the Introduction to Machine Learning course in the Hebrew University of Jerusalem. Includes code examples, labs, and exercise templates
★ 51Jupyter Notebookupdated 2025-01-24algorithmsmachine-learningpython3
Hebrew Diacritizer
★ 50Pythonupdated 2026-04-14diacritizationhebrewhebrew-niqqudmachine-learning
Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet
★ 40Pythonupdated 2025-01-05hebrewisraelpytorchtts
A dynamic NewsAI dashboard that uses NLP to analyze news articles, visualize sentiment trends, and extract insights through interactive data visualizations.
★ 38Pythonupdated 2026-04-03data-visualizationhacktoberfesthacktoberfest-acceptednewsaggnewsapi
A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local LLMs (via Ollama), speech-to-text (Vosk), and text-to-speech (Piper) for fast, wake-free voice interaction. No cloud. No APIs. Just Python, a mic, and your voice.
★ 37Pythonupdated 2026-04-20androidchatbotdeep-learningechoesp-idf
Speech-to-text, text-to-speech with ElevenLabs
★ 35Pythonupdated 2023-12-21elevenlabspyside6pytorchspeech-to-texttext-to-speeh
AgenticSeek is a fully local, voice-enabled AI assistant designed to autonomously browse the web, write code, and plan tasks while ensuring complete privacy by keeping all data on your device. Tailored for local reasoning models, it runs entirely on your hardware, eliminating any cloud dependency.
★ 30Pythonupdated 2025-08-27ai-agentsai-assistantautonomous-web-browsingchromedrivercoding-assistance
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
★ 28C++updated 2026-02-28apiaudio-transcriptiondockermachine-learningspeech-to-text
A dataset of global salaries in AI/ML and Big Data.
★ 27updated 2026-03-01aidata-sciencejobsmachine-learningml
WhisperX-powered voice transcription tool that types text directly at your cursor position. Hold F9 to record, release to transcribe.
★ 26Pythonupdated 2026-03-04pytorchtranscriptionvoice-commandswhisperwhisperx
🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for the scalable evaluation of speech and transcription accuracy.
★ 26Pythonupdated 2026-03-30asrasr-evaluationautomatic-speech-recognitionlevenshtein-distancemetrics
💹 StockSim: Multi-Agent LLM Financial Market Simulator — A realistic trading simulation platform for evaluating large language models in dynamic financial environments.
★ 25Pythonupdated 2025-07-15algorithmic-tradinganthropicasyncbacktestingdocker
This repository contains code for fine-tuning the Whisper speech-to-text model.
★ 23Jupyter Notebookupdated 2026-04-16fine-tuningnlpspeech-to-textwhisper
(NVIDIA) FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively.
★ 21Pythonupdated 2025-12-18pinokio
[AAAI 2024] Official pytorch implementation of “Learning Real-World Image De-Weathering with Imperfect Supervision”
★ 17Pythonupdated 2024-08-22
MultimodalHugs is an extension of Hugging Face that offers a generalized framework for training, evaluating, and using multimodal AI models with minimal code differences, ensuring seamless compatibility with Hugging Face pipelines.
★ 17Pythonupdated 2026-04-17huggingfacehuggingface-transformersmultimodalmultimodal-deep-learningmultimodal-large-language-models
🚀 Unleash AMD GPU Performance: Fix PyTorch ROCm detection for 4x AI/ML speedup on RX 6000/7000 series for Pinokio and developers / custom setups
★ 17Pythonupdated 2025-08-24
PyTorch docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
★ 14Shellupdated 2024-06-14aicudadockerjupytermachine-learning
A modern FastAPI-based web app for real-time object detection using YOLO models, supporting image and video uploads, model selection, live streaming, and interactive UI.
★ 9Pythonupdated 2025-06-28ai-projectback-endcomputer-visionfastapifront-end
The AI Assistant uses OpenAI's GPT models and Langchain for agent management and memory handling. With a Streamlit interface, it offers interactive responses and supports efficient document search with FAISS. Users can upload and search pdf, docx, and txt files, making it a versatile tool for answering questions and retrieving content.
★ 9Pythonupdated 2024-04-22artificial-intelligencedata-sciencefaisslangchainllms
This app allows users to take notes by recording and analyze the content using machine learning technology.
★ 7JavaScriptupdated 2019-05-13
About AI-Powered Medical Assistant 🏥🤖 The AI-Powered Medical Assistant is an intelligent healthcare platform that utilizes AI to assist users in symptom analysis, treatment recommendations, medical research, and patient management. By integrating advanced AI models and multiple innovative features, this project enhances healthcare accessibility,
★ 5HTMLupdated 2026-04-20bloggingdruidgraphhopehopenet
Modern NVR with object/motion/audio detection, push notifications, multi-location, and encrypted local and cloud-based storage support built in.
★ 4updated 2024-10-06aicamerahome-assistanthome-automationip-camera
GPU-accelerated speech-to-text service that types what you say, powered by OpenAI's Whisper AI
★ 3Pythonupdated 2025-10-09accessibilitycudadictationgpu-accelerationlinux
Your voice - VocalFlow dictation, harnessing Whisper and faster-whisper for real-time transcription, adaptive learning, and NLP. Built with Python, it spans Linux, Windows, and macOS, boosting productivity through voice-assisted workflows.
★ 3Pythonupdated 2025-09-08cross-platformdesktop-appdictationfaster-whisperlinux
A deep learning application that classifies the reason for a baby's cry (hunger, pain, etc.) from live or uploaded audio. Built with a TensorFlow/Keras CNN, Librosa for audio processing, and a responsive Flask web UI with real-time recording and visualization. Helps caregivers understand an infant's needs instantly.
★ 2updated 2025-08-01
Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home,…
★ 2updated 2025-05-24
🏡 Transform real estate searches with natural language queries; find contextually relevant listings effortlessly using ML embeddings and vector search.
★ 1Pythonupdated 2026-04-20bootstrapdialogdotfile-managerdotfilesdotfiles-linux
Product deduplication pipeline for Israeli price-comparison — Hebrew/English normalization, FAISS embeddings, LLM cluster refinement. Pair F1: 0.955
★ 1Pythonupdated 2026-04-07deduplicationembeddingsfaissnlpopenai
deep-learning-research-sub-agents for claude code
★ 1updated 2025-08-30
Automation of Whisper fine tuning using ClearML
★ 1Pythonupdated 2025-09-09clearmlclearml-servermlopsnlps3-storage
A repository to support the Leeds Data Science presentation
★ 1Pythonupdated 2023-07-03