Machine Learning & Deep Learning

326 repos

Sort by

An Open Source Machine Learning Framework for Everyone

★ 194,883C++updated 2026-04-20deep-learningdeep-neural-networksdistributedmachine-learningml

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

★ 160,697HTMLupdated 2026-04-20aiartificial-intelligenceawesome-listchatgptchatgpt-prompts

huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

★ 159,926Pythonupdated 2026-04-20audiodeep-learningdeepseekgemmaglm

Comfy-Org/ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

★ 110,102Pythonupdated 2026-04-20aicomfycomfyuipythonpytorch

rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

★ 91,441Jupyter Notebookupdated 2026-04-16aiartificial-intelligencechatbotchatgptdeep-learning

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 78,154Pythonupdated 2026-04-20amdblackwellcudadeepseekdeepseek-v3

dair-ai/Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

★ 73,813MDXupdated 2026-03-11agentagentsai-agentschatgptdeep-learning

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

★ 73,725C++updated 2026-04-19hacktoberfestlstmmachine-learningocrocr-engine

josephmisiti/awesome-machine-learning

A curated list of awesome Machine Learning frameworks, libraries and software.

★ 72,295Pythonupdated 2026-04-12

hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

★ 70,611Pythonupdated 2026-04-20agentaideepseekfine-tuninggemma

OpenBB-finance/OpenBB

Financial data platform for analysts, quants and AI agents.

★ 66,523Pythonupdated 2026-04-19aicryptoderivativeseconomicsequity

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

★ 59,641Pythonupdated 2026-03-09deep-learningpythonpytorchtensorflowtts

ultralytics/ultralytics

Ultralytics YOLO 🚀

★ 56,435Pythonupdated 2026-04-20clicomputer-visiondeep-learninghubimage-classification

apache/airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

★ 45,200Pythonupdated 2026-04-20airflowapacheapache-airflowautomationdag

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

★ 45,178Pythonupdated 2024-08-16deep-learningglow-ttshifiganmelganmulti-speaker-tts

streamlit/streamlit

Streamlit — A faster way to build and share data apps.

★ 44,356Pythonupdated 2026-04-20data-analysisdata-sciencedata-visualizationdeep-learningdeveloper-tools

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

★ 42,311Pythonupdated 2026-04-20data-sciencedeep-learningdeploymentdistributedhyperparameter-optimization

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

★ 41,379Pythonupdated 2026-04-13aibig-modeldata-parallelismdeep-learningdistributed-computing

microsoft/qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

★ 41,260Pythonupdated 2026-04-17algorithmic-tradingauto-quantdeep-learningfinancefintech

photoprism/photoprism

AI-Powered Photos App for the Decentralized Web 🌈💎✨

★ 39,573Goupdated 2026-04-20aigolanggoogle-photosmachine-learningphotography

google-research/google-research

Google Research

★ 37,792Jupyter Notebookupdated 2026-04-19aimachine-learningresearch

huggingface/pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

★ 36,710Pythonupdated 2026-04-17augmixconvnextdistributed-trainingefficientnetimage-classification

xinntao/Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

★ 35,193Pythonupdated 2024-08-06aminedenoiseesrganimage-restorationjpeg-compression

patchy631/ai-engineering-hub

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

★ 34,110Jupyter Notebookupdated 2026-03-23agentsaillmsmachine-learningmcp

ItzCrazyKns/Vane

Vane is an AI-powered answering engine.

★ 33,989TypeScriptupdated 2026-04-11ai-agentsai-search-engineanswering-engineartificial-intelligencellm

explosion/spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

★ 33,513Pythonupdated 2026-03-28aiartificial-intelligencecythondata-sciencedeep-learning

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

★ 33,452Pythonupdated 2026-04-18deep-learningdiffusionfluximage-generationimage2image

openai/CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

★ 33,327Jupyter Notebookupdated 2026-03-25deep-learningmachine-learning

lutzroeder/netron

Visualizer for neural network, deep learning and machine learning models

★ 32,805JavaScriptupdated 2026-04-20aicoremldeep-learningdeeplearningkeras

facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

★ 32,207Pythonupdated 2025-09-30artificial-intelligencepythonpytorch

blakeblackshear/frigate

NVR with realtime local object detection for IP cameras

★ 31,587TypeScriptupdated 2026-04-20aicameragoogle-coralhome-assistanthome-automation

Lightning-AI/pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

★ 31,080Pythonupdated 2026-04-20aiartificial-intelligencedata-sciencedeep-learningmachine-learning

qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

★ 30,698Rustupdated 2026-04-20ai-searchai-search-engineembeddings-similarityhnswhybrid-search

academic/awesome-datascience

:memo: An awesome Data Science repository to learn and apply for real world problems.

★ 28,891updated 2026-04-18analyticsawesome-listdata-miningdata-sciencedata-scientists

stanford-oval/storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

★ 28,130Pythonupdated 2025-09-30agentic-ragdeep-researchemnlp2024knowledge-curationlarge-language-models

HumanSignal/label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

★ 27,135TypeScriptupdated 2026-04-20annotationannotation-toolannotationsboundingboxcomputer-vision

NirDiamant/RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.

★ 26,982Jupyter Notebookupdated 2026-04-15aiembeddingslangchainllama-indexllm

srbhr/Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.

★ 26,818TypeScriptupdated 2026-04-17applicant-tracking-systematshacktoberfestmachine-learningnatural-language-processing

mlflow/mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

★ 25,566Pythonupdated 2026-04-20agentopsagentsaiai-governanceapache-spark

deepset-ai/haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

★ 24,992MDXupdated 2026-04-20agentagentsaigeminigenerative-ai

Anjok07/ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

★ 24,423Pythonupdated 2025-03-13audioinstrumentalkaraokekareokeemusic

sebastianruder/NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

★ 22,972Pythonupdated 2024-07-28dialoguemachine-learningmachine-translationnamed-entity-recognitionnatural-language-processing

spmallick/learnopencv

Learn OpenCV : C++ and Python Examples

★ 22,892Jupyter Notebookupdated 2026-04-16aicomputer-visioncomputervisiondeep-learningdeep-neural-networks

serengil/deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

★ 22,628Pythonupdated 2026-03-01age-predictionarcfacedeep-learningdeepfacedeepid

mlc-ai/mlc-llm

Universal LLM Deployment Engine with ML Compilation

★ 22,526Pythonupdated 2026-04-20language-modelllmmachine-learning-compilationtvm

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

★ 22,404Pythonupdated 2025-11-19deep-learninginferenceopenaiquantizationspeech-recognition

recommenders-team/recommenders

Best Practices on Recommendation Systems

★ 21,660Pythonupdated 2026-04-18aiartificial-intelligencedata-sciencedeep-learningjupyter-notebook

huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

★ 21,453Pythonupdated 2026-04-17aiartificial-intelligencecomputer-visiondataset-hubdatasets

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

★ 19,816Jupyter Notebookupdated 2026-04-19chatgptfinancefingptfintechlarge-language-models

k4yt3x/video2x

A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.

★ 19,670C++updated 2026-03-07anime4kframe-interpolationmachine-learningneural-networksrealcugan

Unity-Technologies/ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

★ 19,355C#updated 2026-04-17deep-learningdeep-reinforcement-learningmachine-learningneural-networksreinforcement-learning

Alibaba-NLP/DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

★ 18,746Pythonupdated 2026-02-27agentalibabaartificial-intelligencedeep-researchdeepresearch

screenpipe/screenpipe

Run agents that work for you based on what you do. AI finally knows what you are doing

★ 18,404Rustupdated 2026-04-20agentsagiaicomputer-visionllm

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

★ 18,303Jupyter Notebookupdated 2026-04-17aifinetuninglangchainllamallama2

alyssaxuu/screenity

The free and privacy-friendly screen recorder with no limits 🎥

★ 18,138JavaScriptupdated 2026-04-08annotationannotation-toolaudiocamerachrome-extension

arc53/DocsGPT

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

★ 17,849Pythonupdated 2026-04-20agent-builderagentsaichatgptdocsgpt

NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

★ 17,126Pythonupdated 2026-04-20asrdeeplearninggenerative-aimachine-translationneural-networks

camel-ai/camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

★ 16,793Pythonupdated 2026-04-19agentai-societiesartificial-intelligencecommunicative-aicooperative-ai

weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

★ 16,080Goupdated 2026-04-20approximate-nearest-neighbor-searchgenerative-searchgrpchnswhybrid-search

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

★ 15,852Pythonupdated 2026-03-17audio-visual-speech-recognitionconformerdfsmnparaformerpretrained-model

treeverse/dvc

🦉 Data Versioning and ML Experiments

★ 15,565Pythonupdated 2026-04-14aidata-sciencedata-version-controldeveloper-toolsmachine-learning

Fincept-Corporation/FinceptTerminal

FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.

★ 15,037Pythonupdated 2026-04-20ai-agentsalgorithmic-tradingbloomberg-terminalcppfinance

AliaksandrSiarohin/first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

★ 15,008Jupyter Notebookupdated 2024-11-14deep-learninggenerative-modelimage-animationmotion-retargeting

AI4Finance-Foundation/FinRL

FinRL®: Financial Reinforcement Learning. 🔥

★ 14,920Jupyter Notebookupdated 2026-04-05algorithmic-tradingdeep-reinforcement-learningdrl-algorithmsdrl-frameworkdrl-trading-agents

botpress/botpress

The open-source hub to build & deploy GPT/LLM Agents ⚡️

★ 14,652TypeScriptupdated 2026-04-17agentaibotpresschatbotchatgpt

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

★ 14,559HTMLupdated 2026-04-14data-pipelinesdeep-learningdocument-image-analysisdocument-image-processingdocument-parser

WongKinYiu/yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

★ 14,102Jupyter Notebookupdated 2024-08-19darknetpytorchscaled-yolov4yoloryolov3

visenger/awesome-mlops

A curated list of references for MLOps

★ 13,874updated 2024-11-21aidata-sciencedevopsengineeringfederated-learning

owainlewis/awesome-artificial-intelligence

A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.

★ 13,518updated 2025-08-12aiartificial-intelligencedeep-learningintelligent-machinesintelligent-systems

Lightning-AI/litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

★ 13,326Pythonupdated 2026-04-16aiartificial-intelligencedeep-learninglarge-language-modelsllm

alicevision/Meshroom

Node-based Visual Programming Toolbox

★ 12,688QMLupdated 2026-04-173d-reconstructionalicevisioncamera-trackingcomputer-visionhdr-imaging

neuml/txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

★ 12,424Pythonupdated 2026-04-14agentsaiai-agentsembeddingsinformation-retrieval

ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

★ 11,677Pythonupdated 2026-04-19computer-visiondata-centricdata-sciencedeepdeep-learning

colmap/colmap

COLMAP - Structure-from-Motion and Multi-View Stereo

★ 11,539C++updated 2026-04-19computer-visiongeometrymulti-view-stereoreconstructionstructure-from-motion

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

★ 11,475Pythonupdated 2026-04-03asraudioaudio-processingdeep-learninghuggingface

tensorzero/tensorzero

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

★ 11,278Rustupdated 2026-04-19aiai-engineeringanthropicartificial-intelligencedeep-learning

wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

★ 11,031Pythonupdated 2026-04-20aicollaborationdata-sciencedata-versioningdeep-learning

rhasspy/piper

A fast, local neural text to speech system

★ 10,858C++updated 2025-08-26speech-synthesistext-to-speechtts

huggingface/text-generation-inference

Large Language Model Text Generation Inference

★ 10,844Pythonupdated 2026-03-21bloomdeep-learningfalcongptinference

doccano/doccano

Open source annotation tool for machine learning practitioners.

★ 10,635Pythonupdated 2026-04-14annotation-tooldata-labelingdatasetdatasetsmachine-learning

triton-inference-server/server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

★ 10,605Pythonupdated 2026-04-20clouddatacenterdeep-learningedgegpu

openvinotoolkit/openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

★ 10,142C++updated 2026-04-20aicomputer-visiondeep-learningdeploy-aidiffusion-models

satellite-image-deep-learning/techniques

Techniques for deep learning with satellite & aerial imagery

★ 10,119updated 2026-04-15convolutional-neural-networksdatasetdatasetsdeep-learningdeep-neural-networks

bigscience-workshop/petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

★ 10,090Pythonupdated 2024-09-07bloomchatbotdeep-learningdistributed-systemsfalcon

Netflix/metaflow

Build, Manage and Deploy AI/ML Systems

★ 10,056Pythonupdated 2026-04-20agentsaiawsazurecost-optimization

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

★ 9,830Jupyter Notebookupdated 2026-04-16overlapped-speech-detectionpretrained-modelspytorchspeaker-change-detectionspeaker-diarization

drivendataorg/cookiecutter-data-science

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

★ 9,805Pythonupdated 2026-04-09aicookiecuttercookiecutter-data-sciencecookiecutter-templatedata-science

gorse-io/gorse

AI powered open source recommender system engine supports classical/LLM rankers and multimodal content via embedding

★ 9,619Goupdated 2026-04-22collaborative-filteringgoknnmachine-learningrecommender-system

onnx/models

A collection of pre-trained, state-of-the-art models in the ONNX format

★ 9,562Jupyter Notebookupdated 2026-03-09deep-learningdownloadmodelsonnxpretrained

sloria/TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

★ 9,516Pythonupdated 2026-04-21natural-language-processingnlpnltkpatternpython

replicate/cog

Containers for machine learning

★ 9,402Goupdated 2026-04-17aicontainerscudadockermachine-learning

yzhao062/anomaly-detection-resources

Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!

★ 9,270Pythonupdated 2026-03-02anomaly-detectionawesomeawesome-listdata-miningfraud

xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

★ 9,261Pythonupdated 2026-04-20artificial-intelligencechatglmdeploymentflan-t5gemma

activeloopai/deeplake

Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.

★ 9,100C++updated 2026-02-16agentagentic-ragaiclawbotcomputer-vision

snakers4/silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

★ 8,899Pythonupdated 2026-03-26onnxonnx-runtimeonnxruntimepytorchspeech

modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

★ 8,878Pythonupdated 2026-04-20cvdeep-learningmachine-learningmulti-modalnlp

mage-ai/mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

★ 8,710Pythonupdated 2026-04-02artificial-intelligencedatadata-engineeringdata-integrationdata-pipelines

YaoFANGUK/video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

★ 8,695Pythonupdated 2026-04-09deep-learningextracthardsubocrripper

firmai/financial-machine-learning

A curated list of practical financial machine learning tools and applications.

★ 8,530Pythonupdated 2025-01-03algorithmic-tradingcryptocurrencyfinanceinvestmentquant

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

★ 8,470Pythonupdated 2024-08-13aideep-learningemotionemotivoicemulti-speaker

nhivp/Awesome-Embedded

A curated list of awesome embedded programming.

★ 8,466updated 2026-03-18aiautosarawesomebeaglebonebootloader

alirezadir/Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

★ 8,066Jupyter Notebookupdated 2025-11-28agenticaiai-agentsai-engineeringdeep-learning

FunAudioLLM/SenseVoice

Multilingual Voice Understanding Model

★ 8,041Pythonupdated 2025-12-30aiaigcasraudio-event-classificationcross-lingual

LibrePhotos/librephotos

A self-hosted open source photo management service.

★ 7,980Pythonupdated 2026-04-17djangoexifhacktoberfestmachine-learningphoto

nadermx/backgroundremover

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

★ 7,852Pythonupdated 2026-03-21aibackground-removalbackground-removerbackgroundremoverphoto-editing

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

★ 7,806Pythonupdated 2026-04-20anonymizationdata-anonymizationdata-maskingdata-obfuscationdata-privacy

2FastLabs/agent-squad

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

★ 7,587Pythonupdated 2026-04-17agentic-aiagentsai-agentsai-agents-frameworkanthropic

amusi/awesome-object-detection

Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html

★ 7,501updated 2022-12-17computer-visiondeep-learningdetectionobject-detectionobject-localisation

google-deepmind/lab

A customisable 3D platform for agent-based AI research

★ 7,353Cupdated 2023-01-04artificial-intelligencedeep-learningmachine-learningneural-networks

alirezamika/autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

★ 7,146Pythonupdated 2025-06-09aiartificial-intelligenceautomationcrawlermachine-learning

vladmandic/sdnext

SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing

★ 7,075Pythonupdated 2026-04-20ai-artcaptiondiffusersgenerative-artpython

deeppavlov/DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

★ 6,975Pythonupdated 2025-08-06aiartificial-intelligencebotchatbotchitchat

vespa-engine/vespa

AI + Data, online. https://vespa.ai

★ 6,896Javaupdated 2026-04-20aibig-datajavamachine-learningrag

flwrlabs/flower

Flower: A Friendly Federated AI Framework

★ 6,854Pythonupdated 2026-04-19aiandroidartificial-intelligencecppdeep-learning

AI4Finance-Foundation/FinRobot

FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀

★ 6,777Jupyter Notebookupdated 2026-04-03aiagentchatgptfinancefingptlarge-language-models

MycroftAI/mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

★ 6,623Pythonupdated 2024-09-08aiarchartificial-intelligencefedorahacktoberfest

kyegomez/swarms

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

★ 6,543Pythonupdated 2026-04-20agentic-aiagentic-workflowagentsaiartificial-intelligence

yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

★ 6,243Pythonupdated 2024-08-10adversarial-trainingdeep-learningdiffusion-modelsganlatent-diffusion

multimodal-art-projection/YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

★ 6,167Pythonupdated 2025-06-04aiaudio-generationdeep-learningfoundation-modelsgpt

aimhubio/aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

★ 6,100Pythonupdated 2026-04-19aidata-sciencedata-visualizationexperiment-trackingmachine-learning

Trusted-AI/adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

★ 5,947Pythonupdated 2025-12-12adversarial-attacksadversarial-examplesadversarial-machine-learningaiartificial-intelligence

snakers4/silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

★ 5,888Jupyter Notebookupdated 2026-04-16armenianazerbaijanibelaruscolabgeorgian

promptslab/Awesome-Prompt-Engineering

This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc

★ 5,820TypeScriptupdated 2026-04-20chatgptchatgpt-apideep-learningfew-shot-learninggpt

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

★ 5,802Pythonupdated 2025-09-12article-extractorcorpus-buildercorpus-toolscrawlerhtml-to-markdown

lauragift21/awesome-learning-resources

🔥 Awesome list of resources on Web Development.

★ 5,723updated 2025-07-16aiawesome-listcssdeveloper-storiesjavascript

biolab/orange3

🍊 :bar_chart: :bulb: Orange: Interactive data analysis

★ 5,605Pythonupdated 2026-04-17classificationclusteringdata-miningdata-sciencedata-visualization

Eventual-Inc/Daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

★ 5,437Rustupdated 2026-04-20ai-engineeringai-pipelinearrowartificial-intelligencebig-data

zenml-io/zenml

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

★ 5,362Pythonupdated 2026-04-20agentopsagentsaiautomldata-science

superduper-io/superduper

Superduper: End-to-end framework for building custom AI applications and agents.

★ 5,271Pythonupdated 2025-09-01aichatbotdatadatabasedistributed-ml

business-science/ai-data-science-team

An AI-powered data science team of agents to help you perform common data science tasks 10X faster.

★ 5,178Pythonupdated 2026-01-28agentsaiai-engineerai-engineeringcopilot

timesler/facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

★ 5,144Pythonupdated 2025-09-16face-detectionface-identificationface-recognitionface-trackingfacial-recognition

Deci-AI/super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

★ 5,025Jupyter Notebookupdated 2026-02-24dependency-graph

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

★ 5,002Pythonupdated 2026-03-18action-recognitionavabenchmarkdeep-learningi3d

mahseema/awesome-ai-tools

A curated list of Artificial Intelligence Top Tools

★ 4,992updated 2025-12-31aiai-agentai-agentsai-assistantai-tools

sacridini/Awesome-Geospatial

Long list of geospatial tools and resources

★ 4,977updated 2026-04-22awesomeawesome-listdata-analysisdeep-learningearth-observation

Kodezi/Chronos

Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy, over six times better than GPT-4. Built with Adaptive Graph-Guided Retrieval and Persistent Debug Memory. Model available Q1 2026 via Kodezi OS.

★ 4,945Javaupdated 2025-11-12artificial-intelligenceautonomous-debuggingbenchmarkbenchmark-reportbug-fixing

argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

★ 4,945Pythonupdated 2026-04-13active-learningaiannotation-tooldeveloper-toolsgpt-4

thunlp/OpenPrompt

An Open-Source Framework for Prompt-Learning.

★ 4,857Pythonupdated 2024-07-16aideep-learningnatural-language-processingnatural-language-understandingnlp

Picovoice/porcupine

On-device wake word detection powered by deep learning

★ 4,798Pythonupdated 2026-04-17handsfreehotwordhotword-detectionhotword-detectorkeyword-spotter

remsky/Kokoro-FastAPI

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

★ 4,764Pythonupdated 2026-01-04fastapihuggingface-spaceskokorokokoro-ttsonnx

trigaten/Learn_Prompting

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

★ 4,688MDXupdated 2025-01-14chatgptchatgpt-apideep-learninggpt-3gpt-4

huggingface/speech-to-speech

Build local voice agents with open-source models

★ 4,686Pythonupdated 2026-04-20aiassistantlanguage-modelmachine-learningpython

Kanaries/Rath

Next generation of automated data exploratory analysis and visualization platform.

★ 4,634TypeScriptupdated 2026-03-10augmented-analyticsautomated-data-analysisautomated-visualizationautoviscausal-discovery

py-why/EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

★ 4,610Jupyter Notebookupdated 2026-04-13causal-inferencecausalityeconometricseconomicsmachine-learning

WhisperSpeech/WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

★ 4,595Jupyter Notebookupdated 2025-12-14pytorchspeech-synthesistts

promptslab/Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

★ 4,588Pythonupdated 2026-03-27chatgptchatgpt-apichatgpt-pythongpt-3gpt-3-prompts

huggingface/autotrain-advanced

🤗 AutoTrain Advanced

★ 4,571Pythonupdated 2026-04-17autotraindeep-learninghuggingfacemachine-learningnatural-language-processing

OpenNMT/CTranslate2

Fast inference engine for Transformer models

★ 4,450C++updated 2026-02-04avxavx2cppcudadeep-learning

microsoft/VoTT

Visual Object Tagging Tool: An electron app for building end to end Object Detection Models from Images and Videos.

★ 4,430TypeScriptupdated 2021-12-06annotation-toolcntkdeep-learningdetectiondetection-model

camel-ai/oasis

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

★ 4,412Pythonupdated 2026-04-17agent-based-frameworkagent-based-simulationai-societiesdeep-learninglarge-language-models

truefoundry/cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

★ 4,409Pythonupdated 2026-03-13agentaiapplicationdatadeep-learning

HumanSignal/awesome-data-labeling

A curated list of awesome data labeling tools

★ 4,314updated 2024-06-173d-annotationannotationannotation-toolaudio-annotationaudio-annotation-tool

zvtvz/zvt

modular quant framework.

★ 4,112Pythonupdated 2026-04-13algorithmic-tradingbacktestingcryptocurrencyfintechfundamental-analysis

modelscope/ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

★ 4,098Pythonupdated 2025-08-14audiobandwidth-extensiondeep-learningnoise-suppressionpytorch

Rikorose/DeepFilterNet

Noise supression using deep filtering

★ 4,095Pythonupdated 2024-10-17audiodeep-learningnoise-suppressionpytorchrust

open-compass/VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

★ 4,079Pythonupdated 2026-04-10chatgptclaudeclipcomputer-visionevaluation

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

★ 4,009C#updated 2026-04-18aicomfyuicsharpimage-generationjavascript

torchgeo/torchgeo

TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data

★ 4,004Pythonupdated 2026-04-22computer-visiondatasetsdeep-learningearth-observationgeospatial

opengeos/segment-geospatial

A Python package for segmenting geospatial data with the Segment Anything Model (SAM)

★ 3,974Pythonupdated 2026-04-20artificial-intelligencedeep-learninggeopythongeospatialmachine-learning

chrieke/awesome-satellite-imagery-datasets

🛰️ List of satellite image training datasets with annotations for computer vision and deep learning

★ 3,884updated 2022-07-14computer-visiondeep-learningearth-observationinstance-segmentationmachine-learning

CyberAlbSecOP/Awesome_GPT_Super_Prompting

ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.

★ 3,864HTMLupdated 2026-03-06adversarial-machine-learningagentaiassistantchatgpt

Nixtla/nixtla

TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.

★ 3,861Jupyter Notebookupdated 2026-04-13agentagentic-aianomaly-detectionartificial-intelligencedeep-learning

jtablesaw/tablesaw

Java dataframe and visualization library

★ 3,748Javaupdated 2026-03-02chartdata-analysisdata-framedata-sciencedata-visualization

MrForExample/ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)

★ 3,720Pythonupdated 2025-12-29comfycomfyuimachine-learning

facebookresearch/pytorchvideo

A deep learning library for video understanding research.

★ 3,554Pythonupdated 2026-01-12

avinashkranjan/Amazing-Python-Scripts

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

★ 3,530Jupyter Notebookupdated 2026-02-20artificial-intelligencehacktoberfestmachine-learningprojectspython

codefuse-ai/Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

★ 3,312updated 2026-04-10aiawesomedatasetsllmnlp

libAudioFlux/audioFlux

A library for audio and music analysis, feature extraction.

★ 3,302Cupdated 2026-03-06audioaudio-analysisaudio-featuresaudio-processingdeep-learning

edtechre/pybroker

Algorithmic Trading in Python with Machine Learning

★ 3,274Pythonupdated 2026-04-20aialgorithmic-tradingalgotradingartificial-intelligencebacktesting

truera/trulens

Evaluation and Tracking for LLM Experiments and AI Agents

★ 3,269Pythonupdated 2026-04-20agent-evaluationagentopsai-agentsai-monitoringai-observability

Djdefrag/QualityScaler

QualityScaler - image/video AI upscaler app

★ 3,036Pythonupdated 2026-04-05amdanimecompression-artifact-reductiondeep-learningdirectx-12

HeyWillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

★ 3,017Cupdated 2026-02-13alexadeep-learningechoesp-adfesp-idf

roflcoopter/viseron

Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home, office or any other place you want to monitor.

★ 3,011Pythonupdated 2026-04-17coralcudadarknetedgetpuface-recognition

bigscience-workshop/promptsource

Toolkit for creating, sharing and using natural language prompts.

★ 3,008Pythonupdated 2023-10-23machine-learningnatural-language-processingnlp

tensorflow/agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

★ 3,007Pythonupdated 2026-01-16banditscontextual-banditsdqnmulti-armed-banditsreinforcement-learning

zjunlp/LLMAgentPapers

Must-read Papers on LLM Agents.

★ 2,984updated 2026-04-17agentagentsawsome-listenvironmentin-context-learning

facebookresearch/habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

★ 2,957Pythonupdated 2026-02-21aicomputer-visiondeep-learningdeep-reinforcement-learningpython

opengeos/geoai

GeoAI: Artificial Intelligence for Geospatial Data

★ 2,913Pythonupdated 2026-04-15aidata-sciencedeep-learningearth-observationgeoai

pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

★ 2,869Pythonupdated 2026-04-20audioaudio-processingiomachine-learningpython

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

★ 2,803Pythonupdated 2025-09-09asrattention-is-all-you-needattention-mechanismattention-modelattention-network

jhj0517/Whisper-WebUI

A Web UI for easy subtitle using whisper model.

★ 2,764Pythonupdated 2025-12-29aigradioopen-sourcepythonpytorch

AmberSahdev/Open-Interface

Control Any Computer Using LLMs.

★ 2,659Pythonupdated 2026-02-25assistantassistant-computer-controlautomationgptgpt4

plexe-ai/plexe

✨ Build a machine learning model from a prompt

★ 2,564Pythonupdated 2026-03-06agentic-aiagentsaimachine-learningml

young-geng/EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

★ 2,520Pythonupdated 2024-08-13chatbotdeep-learningflaxjaxlanguage-model

ray-project/kuberay

A toolkit to run Ray applications on Kubernetes

★ 2,468Goupdated 2026-04-19apachedeep-learningkubernetesmachine-learningray

huggingface/evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

★ 2,444Pythonupdated 2026-04-17evaluationmachine-learning

fhamborg/news-please

news-please - an integrated web crawler and information extractor for news that just works

★ 2,441Pythonupdated 2026-04-14cc-newsccnewscommoncrawlcrawlerdata-gathering

Yutong-Zhou-cv/Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

★ 2,436updated 2026-02-07awseome-listgenerative-adversarial-networkimage-generationimage-manipulationimage-synthesis

ggozad/oterm

the terminal client for Ollama

★ 2,360Pythonupdated 2025-12-19llmllmsmachine-learningollamapython

uptrain-ai/uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

★ 2,344Pythonupdated 2024-08-18autoevaluationevaluationexperimentationhallucination-detectionjailbreak-detection

google-research-datasets/Objectron

Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes

★ 2,327Jupyter Notebookupdated 2026-03-063d3d-reconstruction3d-visionaiaugmented-reality

pydn/ComfyUI-to-Python-Extension

A powerful tool that translates ComfyUI workflows into executable Python code.

★ 2,310Pythonupdated 2026-04-19ai-artcomfyuigenerative-artimage-generationpytorch

azavea/raster-vision

An open source library and framework for deep learning on satellite and aerial imagery.

★ 2,215Pythonupdated 2025-09-29classificationcomputer-visiondeep-learninggeospatialmachine-learning

wenhwu/awesome-remote-sensing-change-detection

A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.

★ 2,199updated 2026-04-16awesomechange-detectiondatasetdeep-learningremote-sensing

kayba-ai/agentic-context-engine

🧠 Make your agents learn from experience. Now available as a hosted solution at kayba.ai

★ 2,176Pythonupdated 2026-04-25agent-learningagent-memoryagentsaiai-agents

brokermr810/QuantDinger

Open-source AI-driven quantitative trading platform for crypto, stocks, and forex with backtesting, live trading, market data, and multi-agent research.

★ 2,135Pythonupdated 2026-04-20ai-traderalgorithmic-trading-portfolioalgotradebacktesting-frameworkscryptocurrencies

UniversalDataTool/universal-data-tool

Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.

★ 2,060JavaScriptupdated 2025-03-15annotate-imagesannotation-toolclassificationcomputer-visioncsv

Cloud-CV/EvalAI

:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI

★ 2,016Pythonupdated 2026-04-20aiai-challengesangularjsartificial-intelligencechallenge

pykeen/pykeen

🤖 A Python library for learning and evaluating knowledge graph embeddings

★ 1,983Pythonupdated 2026-04-21cudadeep-learningknowledge-base-completionknowledge-graph-embeddingsknowledge-graphs

tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

★ 1,977Jupyter Notebookupdated 2025-08-09deep-learningevaluationfoundation-modelsinstruction-followinglarge-language-models

A9T9/RPA

Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.

★ 1,891JavaScriptupdated 2026-03-20anthropicanthropic-claudebrowser-automationbrowser-extensioncomputer-use

shcherbak-ai/contextgem

ContextGem: Effortless LLM extraction from documents

★ 1,823Pythonupdated 2026-03-16aicontract-analysisdata-extractiondocument-intelligencedocx

FL33TW00D/whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

★ 1,800TypeScriptupdated 2024-02-27audiomachine-learningrustspeech-recognitionwebgpu

coderonion/awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

★ 1,728updated 2025-05-31cudadatasetsdeepseekfew-shot-object-detectiongui

Andrew-Jang/RAGHub

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

★ 1,715updated 2026-04-17aiartificial-intelligencelarge-language-modelsllmmachine-learning

lyuchenyang/Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

★ 1,591Pythonupdated 2025-01-01deep-learninglanguage-modelmachine-learningmulti-modal-learningnatural-language-processing

kritiksoman/GIMP-ML

AI for GNU Image Manipulation Program

★ 1,542Pythonupdated 2024-10-12coloring-imagecomputer-visioncomputervisiondeblurringdeep-learning

semperai/amica

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

★ 1,492TypeScriptupdated 2025-07-23aiassistant-chat-botscomputer-visionllmspeech-recognition

web-arena-x/webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

★ 1,443Pythonupdated 2025-11-26agentnlp

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

★ 1,440Pythonupdated 2024-12-02deep-learningpythonpytorchtacotron2text-to-speech

EthicalML/awesome-artificial-intelligence-regulation

This repository aims to map the ecosystem of artificial intelligence guidelines, principles, codes of ethics, standards, regulation and beyond.

★ 1,428updated 2026-04-18aiai-ethicsai-guidelinesai-policydata-ethics

Capsize-Games/airunner

Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows

★ 1,314Pythonupdated 2026-02-10aiai-artartasset-generatorchatbot

NPC-Worldwide/npcpy

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

★ 1,311Pythonupdated 2026-04-20agentsaillmmcpmcp-client

GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere

★ 1,259Pythonupdated 2026-04-12blackwellchatbotdecentralized-inferencedeepseekdistributed-systems

yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

★ 1,211Cupdated 2025-12-17androidasrchinesectranslate2huggingface

maum-ai/voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

★ 1,201Pythonupdated 2024-07-25audio-separationpytorchsource-separationspeech-separationvoicefilter

devnen/Chatterbox-TTS-Server

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.

★ 1,187Pythonupdated 2026-04-02aiapi-serveraudio-generationchatterboxchatterbox-tts

xtreme1-io/xtreme1

Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.

★ 1,185TypeScriptupdated 2025-07-153d-annotationannotationannotation-toolcomputer-visionimage-annotation

modal-labs/modal-examples

Examples of programs built using Modal

★ 1,172Pythonupdated 2026-04-20clouddistributedgpumachine-learningmodal

Danielskry/Awesome-RAG

😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.

★ 1,170updated 2026-04-11artificial-intelligencegenerative-ailarge-language-modelsmachine-learningretrieval-augmented-generation

midas-research/audino

Open source audio annotation tool for humans

★ 1,133TypeScriptupdated 2026-02-03annotation-toolaudio-annotationaudio-processingdatasetsmachine-learning

satellite-image-deep-learning/datasets

Datasets for deep learning with satellite & aerial imagery

★ 1,131updated 2026-04-15datasetsearth-observationremote-sensingsatellite-datasatellite-imagery

JuergenFleiss/aTrain

A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.

★ 1,113Pythonupdated 2026-04-16

NVIDIA/audio-flamingo

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

★ 1,103updated 2025-12-15audio-captioningaudio-language-modelsaudio-question-answeringaudio-reasoningmultimodal-large-language-models

OSGeo/grass

GRASS - free and open-source geospatial processing engine

★ 1,102Cupdated 2026-04-22arraysdata-scienceearth-observationgeospatialgeospatial-analysis

yangheng95/PyABSA

Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial defense, etc.;

★ 1,096Jupyter Notebookupdated 2026-04-24adversarialaspect-based-sentiment-analysisaspect-sentiment-triplet-extractionaspect-term-extractionlcf-bert

mpaepper/llm_agents

Build agents which are controlled by LLMs

★ 1,040Pythonupdated 2025-06-23deep-learninglangchainllmsmachine-learning

pytorch/benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

★ 1,030Pythonupdated 2026-04-17benchmarkpytorch

victordibia/autogen-ui

Web UI for AutoGen (A Framework Multi-Agent LLM Applications)

★ 997TypeScriptupdated 2024-11-14agent-based-frameworkaiai-agentsautogenautogen-sample

whylabs/langkit

🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀

★ 984Jupyter Notebookupdated 2024-11-22large-language-modelsmachine-learningnlgnlpobservability

paschmann/rasa-ui

Rasa UI is a frontend for the Rasa Framework

★ 966JavaScriptupdated 2025-11-12angularmanage-botsnlpnlp-apisnlp-machine-learning

codeproject/CodeProject.AI-Server

CodeProject.AI Server is a self contained service that software developers can include in, and distribute with, their applications in order to augment their apps with the power of AI.

★ 957C#updated 2025-07-14artificial-intelligencegenerative-aimlopsobject-detectiononnx

jsbroks/awesome-dataset-tools

🔧 A curated list of awesome dataset tools

★ 935updated 2023-06-09annotation-toolannotationsawsomeawsome-listdatasets

Yuan-ManX/ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

★ 930updated 2025-07-08aigcartificial-intelligenceaudioaudio-effectaudio-generation

thekevinscott/UpscalerJS

Enhance Images with Javascript and AI. Increase resolution, retouch, denoise, and more. Open Source, Browser & Node Compatible, MIT License.

★ 885TypeScriptupdated 2026-04-17aideblurringdehazingdenoisingderaining

zamalali/DeepGit

Deep research agent to help you find the best GitHub repositories 🕵️!

★ 867Pythonupdated 2026-04-08agentdeep-researchgithub-searchlangchainlanggraph

huggingface/dataset-viewer

Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.

★ 857Pythonupdated 2026-04-17api-restdatadatasetshuggingfacemachine-learning

OHF-Voice/micro-wake-word

A TensorFlow based wake word detection training framework using synthetic sample generation suitable for certain microcontrollers.

★ 819Pythonupdated 2025-12-21

mayeaux/generate-subtitles

Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration

★ 809JavaScriptupdated 2023-03-16expressjsgpulibretranslatemachine-learningnodejs

MagnivOrg/prompt-layer-library

🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.

★ 759Pythonupdated 2026-04-17gptmachine-learningopenaipromptprompt-engineering

kedro-org/kedro-viz

Visualise your Kedro data and machine-learning pipelines and track your experiments.

★ 748JavaScriptupdated 2026-04-22data-visualizationexperiment-trackinghacktoberfestkedrokedro-plugin

b12io/orchestra

Orchestra is a human-in-the-loop AI system for orchestrating project teams of experts and machines.

★ 705Pythonupdated 2026-04-15experthuman-computer-interactionhuman-in-the-loop-machine-learningorchestraworkflow

dgarnitz/vectorflow

VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.

★ 701Pythonupdated 2024-05-16aidata-engineeringembeddingsmachine-learningnlp

vvincenttttt/Awesome-3D-Object-Detection

Papers, code and datasets about deep learning for 3D Object Detection.

★ 686updated 2025-09-173d-representationautonomous-drivingcomputer-visioncvprcvpr2020

NVlabs/OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

★ 658Pythonupdated 2026-02-26audio-language-modeldeep-learninglarge-language-modelsmultimodal-large-language-modelsvision-language-model

EmuKit/emukit

A Python-based toolbox of various methods in decision making, uncertainty quantification and statistical emulation: multi-fidelity, experimental design, Bayesian optimisation, Bayesian quadrature, etc.

★ 655Pythonupdated 2026-02-22bayesian-optimizationbayesian-quadraturedecision-makingemulationexperimental-design

vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

★ 650C++updated 2026-03-18androidasrautomatic-speech-recognitionembeddedmobile

joelhooks/swarm-tools

🐝 Multi-agent swarm coordination for OpenCode with learning capabilities, agent issue tracking, and management

★ 645TypeScriptupdated 2026-02-23ai-agentsmachine-learningmulti-agentopencodeswarm

hailo-ai/hailo_model_zoo

The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment

★ 634Pythonupdated 2026-04-05ai-acceleratorscomputer-visiondeep-learningedge-aihailo

chakki-works/sumeval

Well tested & Multi-language evaluation framework for text summarization.

★ 626Pythonupdated 2026-04-13bleumachine-learningrougetext-summarization

OvidijusParsiunas/myvision

Computer vision based ML training data generation tool :rocket:

★ 608JavaScriptupdated 2025-02-15aiannotationannotation-toolcococomputer-vision

theopenconversationkit/tock

Tock, the open source conversational AI toolkit.

★ 605Kotlinupdated 2026-04-17aialexaapple-business-chatassistantbot

jupediaz/chatgpt-prompt-splitter

ChatGPT PROMPTs Splitter. Tool for safely process chunks of up to 15,000 characters per request

★ 565Pythonupdated 2024-02-28ai-conversationscharacter-limitchatgptlanguage-modelnlp-tool

Skytliang/Multi-Agents-Debate

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

★ 556Pythonupdated 2025-12-16chatgptgpt-4large-language-modelsllmsnlp

apm1467/videocr

Extract hardcoded subtitles from videos using machine learning

★ 547Pythonupdated 2024-01-30

adamlui/ai-web-extensions

🤖 AI browser extensions & userscripts to augment your web experience

★ 541JavaScriptupdated 2026-04-23aiamazonartificialintelligencebravechat

yzfly/Awesome-AGI-Agents

🤖 Awesome list of AGI Agents. Agents 精选资源合集.

★ 524updated 2023-10-31agentsagents-artagiagi-agentsai

Leeroo-AI/mergoo

A library for easily merging multiple LLM experts, and efficiently train the merged LLM.

★ 511Pythonupdated 2024-08-26artificial-intelligencefine-tuninggenerative-ailarge-language-modelsllm

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

★ 487Cupdated 2025-07-15androidaudio-processingdeep-neural-networksdnngmm

ibaiGorordo/ONNX-YOLOv8-Object-Detection

Python scripts performing object detection using the YOLOv8 model in ONNX.

★ 483Pythonupdated 2024-08-22computer-visiondeep-learningobject-detectiononnxonnxruntime

modal-labs/modal-client

SDK libraries for Modal

★ 466Pythonupdated 2026-04-20aiclouddata-sciencedistributedgenai

turing-machines/mentals-ai

No code AI agents

★ 448C++updated 2025-01-16aiai-agentsartificial-intelligencecligpt

OpenAssistantGPT/OpenAssistantGPT

A Community Open-Source Saas for Crafting/Building/Creating Chatbots with OpenAI's Assistant API that you can add to your website.

★ 422TypeScriptupdated 2024-10-24aiartificial-intelligenceartificial-neural-networksassistantassistant-app

vasistalodagala/whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

★ 361Pythonupdated 2023-05-23asrjaxpytorchspeech-recognitiontransformers

rizerphe/obsidian-companion

Autocomplete your obsidian notes with AI, including ChatGPT, through a copilot-like interface.

★ 355TypeScriptupdated 2024-05-28aiai21labschatgptgroqgroq-ai

medtorch/awesome-healthcare-ai

A curated list of awesome open source healthcare tools, algorithms, datasets and research papers.

★ 320updated 2025-01-20awesome-listawesome-listshealthcarehealthcare-applicationhealthcare-datasets

RelevanceAI/relevanceai

Home of the AI workforce - Multi-agent system, AI agents & tools

★ 279Pythonupdated 2026-01-15clusteringcomputer-visionembeddingsnatural-language-processingnlp

digiteinfotech/kairon

Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants

★ 275Pythonupdated 2026-04-16botbot-frameworkbotkitbotschatbot

microsoft/finnts

Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.

★ 254Rupdated 2026-04-20agentaibusinessfeature-selectionfinance

zhanghengdev/awesome-video-object-detection

This is a list of awesome articles about object detection from video.

★ 247updated 2019-07-01awesome-listcomputer-visiondeep-learningdeep-neural-networksobject-detection

coqui-ai/Trainer

🐸 - A general purpose model trainer, as flexible as it gets

★ 234Pythonupdated 2024-03-07aidata-sciencedeep-learningmachine-learningpytorch

leopiney/neuralnoise

The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜

★ 224Pythonupdated 2025-03-05ag2aiaudio-generationautogenelevenlabs

rsandx/AlphaSuite

AlphaSuite is an open-source quantitative analysis platform that gives you the power to build, test, and deploy professional-grade trading strategies. It's designed for traders and analysts who want to move beyond simple backtests and develop a genuine, data-driven edge in the financial markets.

★ 213Pythonupdated 2026-03-03algorithmic-tradingbacktestingdata-analysisfintechlangchain

gnbaron/signature-recognition

Verify the authenticity of handwritten signatures through digital image processing and neural networks. ✍️

★ 209Pythonupdated 2022-11-12handwritten-signaturesneural-networkopencvsignature-recognitionsignature-verification

agentsea/surfkit

A toolkit for building computer use AI agents

★ 194Pythonupdated 2025-06-26agentsllmmachine-learningmultimodal

holasoymalva/deepseek-cli

DeepSeek CLI, a command-line AI coding assistant that leverages the powerful DeepSeek Coder models

★ 191TypeScriptupdated 2025-08-25artificial-intelligenceclaudeclaude-aiclaude-codecursor

microsoft/eureka-ml-insights

A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.

★ 183Pythonupdated 2026-04-15aiartificial-intelligenceevaluation-frameworkllmmachine-learning

aisingapore/PeekingDuck

A modular framework built to simplify Computer Vision inference workloads.

★ 177Pythonupdated 2025-09-04computer-visiondeep-learningnodesobject-detectionobject-tracking

meokz/looking-to-listen

Deep neural network (DNN) for noise reduction, removal of background music, and speech separation

★ 172Pythonupdated 2022-11-21noise-reductionspeech-separation

coqui-ai/STT-models

Open models for Coqui STT

★ 155updated 2023-05-09deep-learningmodelsspeech-to-text

Captain-FLAM/KaraFan

The BEST music separation model with help of A.I. ... to my ears ! 👂👂

★ 149Pythonupdated 2024-06-10artificial-intelligenceaudioinstrumentalinstrumentalskaraoke

HqWu-HITCS/Awesome-Personalized-LLM

This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.

★ 141updated 2024-09-23awesome-listllmnlprole-playing

tsterbak/promptmage

simplifies the process of creating and managing LLM workflows.

★ 115Pythonupdated 2024-10-21aillmnlp-librarynlp-machine-learningprompt-engineering

strands-agents/evals

A comprehensive evaluation framework for AI agents and LLM applications.

★ 110Pythonupdated 2026-04-17agenticagentic-aiaievaluationmachine-learning

QuantiusBenignus/blurt

Gnome shell extension for accurate OFFLINE speech to text input in Linux using whisper.cpp. Input text from speech anywhere.

★ 105JavaScriptupdated 2025-04-09aiasrbloat-freedictatedictation

ftisiot/postgresql-ai-projects

List of PostgreSQL® AI projects and resources

★ 92updated 2025-08-19aimachine-learningpostgresql

Panopticon-AI-team/panopticon

A wargaming platform compatible with reinforcement learning agents

★ 87TypeScriptupdated 2025-09-09agent-based-simulationaiartificial-intelligencegame-enginemachine-learning

xcapt0/gpt2_chatbot

☕ GPT-2 chatbot for daily conversation

★ 84Pythonupdated 2022-02-14chatbotgpt-2nlp

whchien/deep-floor-plan-recognition

The project uses a computer vision model to extract structured features from floor plan images for a fire risk assessment.

★ 80Pythonupdated 2024-07-08deep-learningdeep-neural-networksdjangodjango-applicationfloorplan

analyticalrohit/ai-blog-generator

Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.

★ 74Pythonupdated 2026-01-06agentic-aiagentsagnoai-agentsclaude

vtempest/ai-research-agent

🤖🔎 STREAM: Search with Top Result Extraction & Answer Model 🔤📊 SEEKTOPIC 🚜📜 Tractor the Text Extractor 📈📝 REASON Docs Writing Agent

★ 72HTMLupdated 2025-12-29ai-searchautocompletehacktoberfestkeywordsknowledge-graph

dreadnode/tensor-man

A utility to inspect, validate, sign and verify machine learning model files.

★ 67Rustupdated 2025-02-05ggufonnxpytorchsafetensors

Jaykef/awesome-openAI

A curated list of all things awesome about OpenAI

★ 54Jupyter Notebookupdated 2024-07-19artificial-intelligencejavascriptjsonnlp-machine-learningopenai

atlas-bear/osint-ai-guide

Comprehensive guide to AI applications in OSINT workflows and intelligence analysis

★ 53updated 2025-07-14ai-toolsartificial-intelligenceautomated-analysiscomputer-visiondata-analysis

GreenGilad/IML.HUJI

GitHub repository of the Introduction to Machine Learning course in the Hebrew University of Jerusalem. Includes code examples, labs, and exercise templates

★ 51Jupyter Notebookupdated 2025-01-24algorithmsmachine-learningpython3

elazarg/nakdimon

Hebrew Diacritizer

★ 50Pythonupdated 2026-04-14diacritizationhebrewhebrew-niqqudmachine-learning

thewh1teagle/israwave

Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet

★ 40Pythonupdated 2025-01-05hebrewisraelpytorchtts

Multiverse-of-Projects/NewsAI

A dynamic NewsAI dashboard that uses NLP to analyze news articles, visualize sentiment trends, and extract insights through interactive data visualizations.

★ 38Pythonupdated 2026-04-03data-visualizationhacktoberfesthacktoberfest-acceptednewsaggnewsapi

shashank2122/Local-Voice

A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local LLMs (via Ollama), speech-to-text (Vosk), and text-to-speech (Piper) for fast, wake-free voice interaction. No cloud. No APIs. Just Python, a mic, and your voice.

★ 37Pythonupdated 2026-04-20androidchatbotdeep-learningechoesp-idf

CyR1en/ElevenLabsS4TS

Speech-to-text, text-to-speech with ElevenLabs

★ 35Pythonupdated 2023-12-21elevenlabspyside6pytorchspeech-to-texttext-to-speeh

andrewstack-maker/agenticSeek

AgenticSeek is a fully local, voice-enabled AI assistant designed to autonomously browse the web, write code, and plan tasks while ensuring complete privacy by keeping all data on your device. Tailored for local reasoning models, it runs entirely on your hardware, eliminating any cloud dependency.

★ 30Pythonupdated 2025-08-27ai-agentsai-assistantautonomous-web-browsingchromedrivercoding-assistance

ErcinDedeoglu/WhisperDock

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.

★ 28C++updated 2026-02-28apiaudio-transcriptiondockermachine-learningspeech-to-text

foorilla/ai-jobs-net-salaries

A dataset of global salaries in AI/ML and Big Data.

★ 27updated 2026-03-01aidata-sciencejobsmachine-learningml

Aaronontheweb/witticism

WhisperX-powered voice transcription tool that types text directly at your cursor position. Hold F9 to record, release to transcribe.

★ 26Pythonupdated 2026-03-04pytorchtranscriptionvoice-commandswhisperwhisperx

analyticsinmotion/werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for the scalable evaluation of speech and transcription accuracy.

★ 26Pythonupdated 2026-03-30asrasr-evaluationautomatic-speech-recognitionlevenshtein-distancemetrics

harrypapa2002/StockSim

💹 StockSim: Multi-Agent LLM Financial Market Simulator — A realistic trading simulation platform for evaluating large language models in dynamic financial environments.

★ 25Pythonupdated 2025-07-15algorithmic-tradinganthropicasyncbacktestingdocker

i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

★ 23Jupyter Notebookupdated 2026-04-16fine-tuningnlpspeech-to-textwhisper

pinokiofactory/Frame-Pack

(NVIDIA) FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively.

★ 21Pythonupdated 2025-12-18pinokio

1180300419/imperfect-deweathering

[AAAI 2024] Official pytorch implementation of “Learning Real-World Image De-Weathering with Imperfect Supervision”

★ 17Pythonupdated 2024-08-22

GerrySant/multimodalhugs

MultimodalHugs is an extension of Hugging Face that offers a generalized framework for training, evaluating, and using multimodal AI models with minimal code differences, ensuring seamless compatibility with Hugging Face pipelines.

★ 17Pythonupdated 2026-04-17huggingfacehuggingface-transformersmultimodalmultimodal-deep-learningmultimodal-large-language-models

Painter3000/AMD-GPU-BOOST

🚀 Unleash AMD GPU Performance: Fix PyTorch ROCm detection for 4x AI/ML speedup on RX 6000/7000 series for Pinokio and developers / custom setups

★ 17Pythonupdated 2025-08-24

ai-dock/pytorch

PyTorch docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.

★ 14Shellupdated 2024-06-14aicudadockerjupytermachine-learning

Raafat-Nagy/YOLO-Object-Detection-App

A modern FastAPI-based web app for real-time object detection using YOLO models, supporting image and video uploads, model selection, live streaming, and interactive UI.

★ 9Pythonupdated 2025-06-28ai-projectback-endcomputer-visionfastapifront-end

sharonmordechai/ai-assistant-with-vector-store

The AI Assistant uses OpenAI's GPT models and Langchain for agent management and memory handling. With a Streamlit interface, it offers interactive responses and supports efficient document search with FAISS. Users can upload and search pdf, docx, and txt files, making it a versatile tool for answering questions and retrieving content.

★ 9Pythonupdated 2024-04-22artificial-intelligencedata-sciencefaisslangchainllms

tainanboy/VoiceNote

This app allows users to take notes by recording and analyze the content using machine learning technology.

★ 7JavaScriptupdated 2019-05-13

JBSEnovs/hope

About AI-Powered Medical Assistant 🏥🤖 The AI-Powered Medical Assistant is an intelligent healthcare platform that utilizes AI to assist users in symptom analysis, treatment recommendations, medical research, and patient management. By integrating advanced AI models and multiple innovative features, this project enhances healthcare accessibility,

★ 5HTMLupdated 2026-04-20bloggingdruidgraphhopehopenet

streamshuttle/docker-compose

Modern NVR with object/motion/audio detection, push notifications, multi-location, and encrypted local and cloud-based storage support built in.

★ 4updated 2024-10-06aicamerahome-assistanthome-automationip-camera

sanastasiou/dictation-service

GPU-accelerated speech-to-text service that types what you say, powered by OpenAI's Whisper AI

★ 3Pythonupdated 2025-10-09accessibilitycudadictationgpu-accelerationlinux

R3DK3LL/VocalFLow

Your voice - VocalFlow dictation, harnessing Whisper and faster-whisper for real-time transcription, adaptive learning, and NLP. Built with Python, it spans Linux, Windows, and macOS, boosting productivity through voice-assisted workflows.

★ 3Pythonupdated 2025-09-08cross-platformdesktop-appdictationfaster-whisperlinux

Binyameensn/AI-Powered-Infant-Cry-Detector

A deep learning application that classifies the reason for a baby's cry (hunger, pain, etc.) from live or uploaded audio. Built with a TensorFlow/Keras CNN, Librosa for audio processing, and a responsive Flask web UI with real-time recording and visualization. Helps caregivers understand an infant's needs instantly.

★ 2updated 2025-08-01

whoshero/rotary

★ 2updated 2025-05-24

poovarasan011/yad2-semantic-search

🏡 Transform real estate searches with natural language queries; find contextually relevant listings effortlessly using ML embeddings and vector search.

★ 1Pythonupdated 2026-04-20bootstrapdialogdotfile-managerdotfilesdotfiles-linux

TalKleinBgu/Zap

Product deduplication pipeline for Israeli price-comparison — Hebrew/English normalization, FAISS embeddings, LLM cluster refinement. Pair F1: 0.955

★ 1Pythonupdated 2026-04-07deduplicationembeddingsfaissnlpopenai

raktim-mondol/deep-learning-research-sub-agents

deep-learning-research-sub-agents for claude code

★ 1updated 2025-08-30

dmt-zh/STT-pipeline

Automation of Whisper fine tuning using ClearML

★ 1Pythonupdated 2025-09-09clearmlclearml-servermlopsnlps3-storage

StatsGary/dreambooth-fine-tuning-pytorch

A repository to support the Leeds Data Science presentation

★ 1Pythonupdated 2023-07-03