AI, LLMs & Data
246 repositories across 4 subcategories
All Repositories — sorted by stars
| Repository | Stars | Language | Description |
|---|---|---|---|
| tensorflow/tensorflow | ⭐ 195.0K | C++ | An Open Source Machine Learning Framework for Everyone |
| Significant-Gravitas/AutoGPT | ⭐ 184.0K | Python | AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters. |
| obra/superpowers | ⭐ 177.3K | Shell | An agentic skills framework & software development methodology that works. |
| ollama/ollama | ⭐ 170.6K | Go | Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. |
| huggingface/transformers | ⭐ 160.2K | Python | 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training. |
| langgenius/dify | ⭐ 140.0K | TypeScript | Production-ready platform for agentic workflow development. |
| langchain-ai/langchain | ⭐ 135.7K | Python | The agent engineering platform. Available in TypeScript! |
| open-webui/open-webui | ⭐ 135.4K | Python | User-friendly AI Interface (Supports Ollama, OpenAI API, ...) |
| anthropics/skills | ⭐ 127.7K | Python | Public repository for Agent Skills |
| ggml-org/llama.cpp | ⭐ 108.1K | C++ | LLM inference in C/C++ |
| google-gemini/gemini-cli | ⭐ 103.1K | TypeScript | An open-source AI agent that brings the power of Gemini directly into your terminal. |
| pytorch/pytorch | ⭐ 99.6K | Python | Tensors and Dynamic neural networks in Python with strong GPU acceleration |
| openai/whisper | ⭐ 98.8K | Python | Robust Speech Recognition via Large-Scale Weak Supervision |
| hacksider/Deep-Live-Cam | ⭐ 92.6K | Python | real time face swap and one-click video deepfake with only a single image |
| deepseek-ai/DeepSeek-R1 | ⭐ 92.0K | — | |
| browser-use/browser-use | ⭐ 91.9K | Python | 🌐 Make websites accessible for AI agents. Automate tasks online with ease. |
| opencv/opencv | ⭐ 87.3K | C++ | Open Source Computer Vision Library |
| openai/codex | ⭐ 79.8K | Rust | Lightweight coding agent that runs in your terminal |
| infiniflow/ragflow | ⭐ 79.6K | Python | RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs |
| nomic-ai/gpt4all | ⭐ 77.4K | C++ | GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. |
| PaddlePaddle/PaddleOCR | ⭐ 77.0K | Python | Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages. |
| tesseract-ocr/tesseract | ⭐ 73.9K | C++ | Tesseract Open Source OCR Engine (main repository) |
| CompVis/stable-diffusion | ⭐ 73.0K | Jupyter Notebook | A latent text-to-image diffusion model |
| paperclipai/paperclip | ⭐ 62.4K | TypeScript | Open-source orchestration for zero-human companies |
| CorentinJ/Real-Time-Voice-Cloning | ⭐ 59.7K | Python | Clone a voice in 5 seconds to generate arbitrary speech in real-time |
| gsd-build/get-shit-done | ⭐ 59.7K | JavaScript | A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES. |
| meta-llama/llama | ⭐ 59.4K | Python | Inference code for Llama models |
| 666ghj/MiroFish | ⭐ 59.0K | Python | A Simple and Universal Swarm Intelligence Engine, Predicting Anything. |
| zylon-ai/private-gpt | ⭐ 57.2K | Python | Interact with your documents using the power of GPT, 100% privately, no data leaks |
| code-yeongyu/oh-my-openagent | ⭐ 55.6K | TypeScript | omo; the best agent harness - previously oh-my-opencode |
| AntonOsika/gpt-engineer | ⭐ 55.2K | Python | CLI platform to experiment with codegen. Precursor to: https://lovable.dev |
| mem0ai/mem0 | ⭐ 54.7K | Python | Universal memory layer for AI Agents |
| upstash/context7 | ⭐ 54.4K | TypeScript | Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors |
| facebookresearch/segment-anything | ⭐ 54.1K | Jupyter Notebook | The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. |
| run-llama/llama_index | ⭐ 49.1K | Python | LlamaIndex is the leading document agent and OCR platform |
| pandas-dev/pandas | ⭐ 48.7K | Python | Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more |
| mudler/LocalAI | ⭐ 46.0K | Go | LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. |
| BerriAI/litellm | ⭐ 45.6K | Python | Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM] |
| apache/airflow | ⭐ 45.3K | Python | Apache Airflow - A platform to programmatically author, schedule, and monitor workflows |
| coqui-ai/TTS | ⭐ 45.2K | Python | 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
| Aider-AI/aider | ⭐ 44.3K | Python | aider is AI pair programming in your terminal |
| badlogic/pi-mono | ⭐ 44.2K | TypeScript | AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods |
| aaif-goose/goose | ⭐ 43.7K | Rust | an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM |
| HKUDS/nanobot | ⭐ 41.6K | Python | 🐈 nanobot: The Ultra-Lightweight Personal AI Agent |
| google-research/bert | ⭐ 40.0K | Python | TensorFlow code and pre-trained models for BERT |
| facebookresearch/faiss | ⭐ 39.9K | C++ | A library for efficient similarity search and clustering of dense vectors. |
| lm-sys/FastChat | ⭐ 39.5K | Python | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. |
| QuivrHQ/quivr | ⭐ 39.1K | Python | Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want. |
| suno-ai/bark | ⭐ 39.1K | Jupyter Notebook | 🔊 Text-Prompted Generative Audio Model |
| pola-rs/polars | ⭐ 38.4K | Rust | Extremely fast Query Engine for DataFrames, written in Rust |
| naptha/tesseract.js | ⭐ 38.0K | JavaScript | Pure Javascript OCR for more than 100 Languages 📖🎉🖥 |
| LAION-AI/Open-Assistant | ⭐ 37.4K | Python | OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so. |
| google/langextract | ⭐ 36.4K | Python | A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization. |
| stanfordnlp/dspy | ⭐ 34.2K | Python | DSPy: The framework for programming—not prompting—language models |
| Pythagora-io/gpt-pilot | ⭐ 33.8K | Python | The first real AI developer |
| explosion/spaCy | ⭐ 33.5K | Python | 💫 Industrial-strength Natural Language Processing (NLP) in Python |
| zeroclaw-labs/zeroclaw | ⭐ 31.0K | Rust | Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM |
| karpathy/llm.c | ⭐ 29.8K | Cuda | LLM training in simple, raw C/CUDA |
| sipeed/picoclaw | ⭐ 28.7K | Go | Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity |
| qwibitai/nanoclaw | ⭐ 28.6K | TypeScript | A lightweight alternative to OpenClaw that runs in containers for security |
| stanford-oval/storm | ⭐ 28.2K | Python | An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. |
| assafelovic/gpt-researcher | ⭐ 26.8K | Python | An autonomous agent that conducts deep research on any data using any LLM providers |
| mozilla/DeepSpeech | ⭐ 26.8K | C++ | DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. |
| ApolloAuto/apollo | ⭐ 26.6K | C++ | An open autonomous driving platform |
| langfuse/langfuse | ⭐ 26.5K | TypeScript | 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23 |
| microsoft/JARVIS | ⭐ 24.7K | Python | JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf |
| resemble-ai/chatterbox | ⭐ 24.6K | Python | SoTA open-source TTS |
| multica-ai/multica | ⭐ 24.5K | TypeScript | The open-source managed agents platform. Turn coding agents into real teammates |
| toon-format/toon | ⭐ 24.1K | TypeScript | 🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK. |
| vercel/ai | ⭐ 24.0K | TypeScript | The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents |
| oraios/serena | ⭐ 23.8K | Python | A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent |
| mastra-ai/mastra | ⭐ 23.5K | TypeScript | From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack. |
| ScrapeGraphAI/Scrapegraph-ai | ⭐ 23.4K | Python | Python scraper based on AI |
| deepseek-ai/DeepSeek-Coder | ⭐ 23.2K | Python | DeepSeek Coder: Let the Code Write Itself |
| mlc-ai/mlc-llm | ⭐ 22.6K | Python | Universal LLM Deployment Engine with ML Compilation |
| supermemoryai/supermemory | ⭐ 22.4K | TypeScript | Memory engine and app that is extremely fast, scalable. The Memory API for the AI era. |
| yoheinakajima/babyagi | ⭐ 22.3K | Python | |
| openai/chatgpt-retrieval-plugin | ⭐ 21.2K | Python | The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. |
| microsoft/onnxruntime | ⭐ 20.4K | C++ | ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator |
| tensorflow/tfjs | ⭐ 19.1K | TypeScript | A WebGL accelerated JavaScript library for training and deploying ML models. |
| transitive-bullshit/agentic | ⭐ 18.1K | TypeScript | Your API ⇒ Paid MCP. Instantly. |
| openai/tiktoken | ⭐ 18.1K | Python | tiktoken is a fast BPE tokeniser for use with OpenAI's models. |
| mlc-ai/web-llm | ⭐ 17.9K | TypeScript | High-performance In-browser LLM Inference Engine |
| justadudewhohacks/face-api.js | ⭐ 17.8K | TypeScript | JavaScript API for face detection and face recognition in the browser and nodejs with tensorflow.js |
| agentskills/agentskills | ⭐ 17.8K | Python | Specification and documentation for Agent Skills |
| langchain-ai/langchainjs | ⭐ 17.6K | TypeScript | The agent engineering platform |
| leon-ai/leon | ⭐ 17.2K | TypeScript | 🧠 Leon is your open-source personal assistant. |
| manaflow-ai/cmux | ⭐ 16.1K | Swift | Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents |
| huggingface/transformers.js | ⭐ 16.0K | JavaScript | State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! |
| Stability-AI/StableLM | ⭐ 15.7K | Jupyter Notebook | StableLM: Stability AI Language Models |
| official-stockfish/Stockfish | ⭐ 15.4K | C++ | A free and strong UCI chess engine |
| gpujs/gpu.js | ⭐ 15.4K | JavaScript | GPU Accelerated JavaScript |
| josdejong/mathjs | ⭐ 15.0K | JavaScript | An extensive math library for JavaScript and Node.js |
| BrainJS/brain.js | ⭐ 14.9K | TypeScript | 🤖 GPU accelerated Neural networks in JavaScript for Browsers and Node.js |
| neonbjb/tortoise-tts | ⭐ 14.8K | Jupyter Notebook | A multi-voice TTS system trained with an emphasis on quality |
| alphacep/vosk-api | ⭐ 14.7K | Jupyter Notebook | Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node |
| Unstructured-IO/unstructured | ⭐ 14.6K | HTML | Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding. |
| ggml-org/ggml | ⭐ 14.6K | C++ | Tensor library for machine learning |
| dottxt-ai/outlines | ⭐ 13.8K | Python | Structured Outputs |
| KittenML/KittenTTS | ⭐ 13.7K | Python | State-of-the-art TTS model under 25MB 😻 |
| nextapps-de/flexsearch | ⭐ 13.7K | JavaScript | Next-generation full-text search library for Browser and Node.js |
| microsoft/LoRA | ⭐ 13.5K | Python | Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" |
| google-deepmind/mujoco | ⭐ 13.3K | C++ | Multi-Joint dynamics with Contact. A general purpose physics simulator. |
| jupyter/notebook | ⭐ 13.1K | Jupyter Notebook | Jupyter Interactive Notebook |
| cocktailpeanut/dalai | ⭐ 12.9K | CSS | The simplest way to run LLaMA on your local machine |
| ShishirPatil/gorilla | ⭐ 12.9K | Python | Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) |
| neuml/txtai | ⭐ 12.5K | Python | 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows |
| sapientinc/HRM | ⭐ 12.4K | Python | Hierarchical Reasoning Model Official Release |
| nearai/ironclaw | ⭐ 12.1K | Rust | IronClaw is an Agent OS focused on privacy, security and extensibility |
| spencermountain/compromise | ⭐ 12.1K | JavaScript | modest natural-language processing |
| h2oai/h2ogpt | ⭐ 12.0K | Python | Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/ |
| 0xk1h0/ChatGPT_DAN | ⭐ 12.0K | — | ChatGPT DAN, Jailbreaks prompt |
| tambo-ai/tambo | ⭐ 11.1K | TypeScript | Generative UI SDK for React |
| artidoro/qlora | ⭐ 10.9K | Jupyter Notebook | QLoRA: Efficient Finetuning of Quantized LLMs |
| openai/openai-node | ⭐ 10.9K | TypeScript | Official JavaScript / TypeScript library for the OpenAI API |
| openai/DALL-E | ⭐ 10.9K | Python | PyTorch package for the discrete VAE used for DALL·E. |
| kedro-org/kedro | ⭐ 10.9K | Python | Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular. |
| browseros-ai/BrowserOS | ⭐ 10.7K | TypeScript | 🌐 The open-source Agentic browser; alternative to ChatGPT Atlas, Perplexity Comet, Dia. |
| huggingface/chat-ui | ⭐ 10.7K | TypeScript | The open source codebase powering HuggingChat |
| Turfjs/turf | ⭐ 10.4K | TypeScript | A modular geospatial engine written in JavaScript and TypeScript |
| mozilla/TTS | ⭐ 10.1K | Jupyter Notebook | 🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) |
| bigscience-workshop/petals | ⭐ 10.1K | Python | 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
| yzhao062/pyod | ⭐ 9.8K | Python | A Python library for anomaly detection across tabular, time series, graph, text, and image data. 60+ detectors, benchmark-backed ADEngine orchestration, and an agentic workflow for AI agents. |
| Arize-ai/phoenix | ⭐ 9.5K | Python | AI Observability & Evaluation |
| infinitered/nsfwjs | ⭐ 8.9K | TypeScript | NSFW detection on the client-side via TensorFlow.js |
| davidkimai/Context-Engineering | ⭐ 8.8K | Python | "Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspired by Karpathy and 3Blue1Brown for moving beyond prompt engineering to the wider discipline of context design, orchestration, and optimization. |
| ogx-ai/ogx | ⭐ 8.4K | Python | Open GenAI Stack |
| leanprover/lean4 | ⭐ 8.0K | Lean | Lean 4 programming language and theorem prover |
| google-deepmind/alphafold3 | ⭐ 7.9K | Python | AlphaFold 3 inference pipeline. |
| weaviate/Verba | ⭐ 7.7K | Python | Retrieval Augmented Generation (RAG) chatbot powered by Weaviate |
| openlm-research/open_llama | ⭐ 7.5K | — | OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
| nullclaw/nullclaw | ⭐ 7.4K | Zig | Fastest, smallest, and fully autonomous AI assistant infrastructure written in Zig |
| tailcallhq/forgecode | ⭐ 7.2K | Rust | AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models |
| traceloop/openllmetry | ⭐ 7.1K | Python | Open-source observability for your GenAI or LLM application, based on OpenTelemetry |
| deepseek-ai/DeepSeek-LLM | ⭐ 6.9K | Makefile | DeepSeek LLM: Let there be answers |
| hexgrad/kokoro | ⭐ 6.9K | JavaScript | https://hf.co/hexgrad/Kokoro-82M |
| google-deepmind/graphcast | ⭐ 6.6K | Python | |
| ml5js/ml5-library | ⭐ 6.6K | JavaScript | Friendly machine learning for the web! 🤖 |
| microsoft/TRELLIS.2 | ⭐ 6.6K | Python | Native and Compact Structured Latents for 3D Generation |
| axa-group/nlp.js | ⭐ 6.6K | JavaScript | An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and much more |
| supercollider/supercollider | ⭐ 6.5K | C++ | An audio server, programming language, and IDE for sound synthesis and algorithmic composition. |
| axa-group/Parsr | ⭐ 6.2K | JavaScript | Transforms PDF, Documents and Images into Enriched Structured Data |
| mnfst/manifest | ⭐ 6.0K | TypeScript | Smart Model Routing for Agents. Cut Costs up to 70% 🦚 |
| salesforce/BLIP | ⭐ 5.7K | Jupyter Notebook | PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation |
| Helicone/helicone | ⭐ 5.6K | TypeScript | 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓 |
| MrLesk/Backlog.md | ⭐ 5.5K | TypeScript | Backlog.md - A tool for managing project collaboration between humans and AI Agents in a git ecosystem |
| treeverse/lakeFS | ⭐ 5.3K | Go | lakeFS - Data version control for your data lake \| Git for data |
| katanaml/sparrow | ⭐ 5.2K | Python | Structured data extraction and instruction calling with ML, LLM and Vision LLM |
| google-deepmind/gemma | ⭐ 5.1K | Python | Gemma open-weight LLM library, from Google DeepMind |
| javascriptdata/danfojs | ⭐ 5.1K | TypeScript | Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data. |
| togethercomputer/RedPajama-Data | ⭐ 4.9K | Python | The RedPajama-Data repository contains code for preparing large datasets for training large language models. |
| h2oai/h2o-llmstudio | ⭐ 4.9K | Python | H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ |
| Nixtla/statsforecast | ⭐ 4.8K | Python | Lightning ⚡️ fast forecasting with statistical and econometric models. |
| neo4j-labs/llm-graph-builder | ⭐ 4.7K | Jupyter Notebook | Neo4j graph construction from unstructured data using LLMs |
| getzep/zep | ⭐ 4.5K | Python | Zep \| Examples, Integrations, & More |
| peterbraden/node-opencv | ⭐ 4.4K | C++ | OpenCV Bindings for node.js |
| ogx-ai/llama-stack-apps | ⭐ 4.3K | — | Agentic components of the Llama Stack APIs |
| Agenta-AI/agenta | ⭐ 4.1K | TypeScript | The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place. |
| CaviraOSS/OpenMemory | ⭐ 4.1K | TypeScript | Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc. |
| latitude-dev/latitude-llm | ⭐ 4.0K | TypeScript | Latitude is the open-source agent engineering platform |
| OHF-Voice/piper1-gpl | ⭐ 3.9K | C++ | Fast and local neural text-to-speech engine |
| mlc-ai/web-stable-diffusion | ⭐ 3.7K | Jupyter Notebook | Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support. |
| google/deepvariant | ⭐ 3.7K | Python | DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data. |
| vijishmadhavan/ArtLine | ⭐ 3.6K | Jupyter Notebook | A Deep Learning based project for creating line art portraits. |
| ploomber/ploomber | ⭐ 3.6K | Python | The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ |
| benjitaylor/agentation | ⭐ 3.5K | TypeScript | The visual feedback tool for agents. |
| simple-statistics/simple-statistics | ⭐ 3.5K | JavaScript | simple statistics for node & browser javascript |
| facebookresearch/sam-audio | ⭐ 3.5K | Python | The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. |
| hao-ai-lab/FastVideo | ⭐ 3.4K | Python | A unified inference and post-training framework for accelerated video generation. |
| pashpashpash/vault-ai | ⭐ 3.4K | JavaScript | OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend. |
| CodedotAl/gpt-code-clippy | ⭐ 3.3K | Python | Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57 |
| ob-f/OpenBot | ⭐ 3.3K | Swift | OpenBot leverages smartphones as brains for low-cost robots. We have designed a small electric vehicle that costs about $50 and serves as a robot body. Our software stack for Android smartphones supports advanced robotics workloads such as person following and real-time autonomous navigation. |
| deepseek-ai/DeepSeek-Math | ⭐ 3.3K | Python | DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models |
| langwatch/langwatch | ⭐ 3.2K | TypeScript | The platform for LLM evaluations and AI agent testing |
| jehna/humanify | ⭐ 3.2K | TypeScript | Deobfuscate Javascript code using ChatGPT |
| noahshinn/reflexion | ⭐ 3.1K | Python | [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning |
| LeelaChessZero/lc0 | ⭐ 3.1K | C++ | Open source neural network chess engine with GPU acceleration and broad hardware support. |
| agentclientprotocol/agent-client-protocol | ⭐ 3.0K | Rust | A protocol for connecting any editor to any agent |
| leeoniya/uFuzzy | ⭐ 3.0K | JavaScript | A tiny, efficient fuzzy search that doesn't suck |
| FreedomIntelligence/LLMZoo | ⭐ 2.9K | Python | ⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡ |
| tidalcycles/strudel | ⭐ 2.9K | — | MOVED TO CODEBERG - Web-based environment for live coding algorithmic patterns, incorporating a faithful port of TidalCycles to JavaScript |
| spiceai/spiceai | ⭐ 2.9K | Rust | A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents. |
| janhq/cortex.cpp | ⭐ 2.8K | C++ | Local AI API Platform |
| TanStack/ai | ⭐ 2.6K | TypeScript | 🤖 SDK that enhances your applications with AI capabilities |
| coqui-ai/STT | ⭐ 2.6K | C++ | 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. |
| jolibrain/deepdetect | ⭐ 2.5K | C++ | Deep Learning API and server in C++14, with support for PyTorch, TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE |
| kieler/elkjs | ⭐ 2.5K | JavaScript | ELK's layout algorithms for JavaScript |
| nicolaspanel/numjs | ⭐ 2.5K | JavaScript | Like NumPy, in JavaScript |
| sagemath/sage | ⭐ 2.4K | Python | Main repository of SageMath |
| n-riesco/ijavascript | ⭐ 2.3K | JavaScript | IJavascript is a javascript kernel for the Jupyter notebook |
| ymichael/open-codex | ⭐ 2.2K | TypeScript | Lightweight coding agent that runs in your terminal |
| shcherbak-ai/contextgem | ⭐ 1.8K | Python | ContextGem: Effortless LLM extraction from documents |
| yusufcanb/tlm | ⭐ 1.5K | Go | Local CLI Copilot, powered by Ollama. 💻🦙 |
| Yifan-Song793/RestGPT | ⭐ 1.4K | Python | An LLM-based autonomous agent controlling real-world applications via RESTful APIs |
| winkjs/wink-nlp | ⭐ 1.4K | JavaScript | Developer friendly Natural Language Processing ✨ |
| ThousandBirdsInc/chidori | ⭐ 1.3K | Rust | A reactive runtime for building durable AI agents |
| eduardoleao052/js-pytorch | ⭐ 1.2K | JavaScript | A JavaScript library like PyTorch, with GPU acceleration. |
| devnen/Chatterbox-TTS-Server | ⭐ 1.2K | Python | Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU. |
| mendersoftware/mender | ⭐ 1.2K | C++ | Mender over-the-air software updater client. |
| stereolabs/zed-sdk | ⭐ 1.2K | C++ | ⚡️The spatial perception framework for rapidly building smart robots and spaces |
| nmrugg/stockfish.js | ⭐ 1.2K | C++ | The Stockfish chess engine for web browsers |
| CASIA-LMC-Lab/AnomalyGPT | ⭐ 1.1K | Python | [AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models |
| davidedc/Algebrite | ⭐ 997 | TypeScript | Computer Algebra System in Javascript (Typescript) |
| SalesforceAIResearch/promptomatix | ⭐ 952 | Python | An Automatic Prompt Optimization Framework for Large Language Models |
| jonwiggins/optio | ⭐ 934 | TypeScript | Workflow orchestration for AI coding agents, from task to merged PR. |
| kardolus/chatgpt-cli | ⭐ 921 | Go | ChatGPT CLI is a powerful, multi-provider command-line interface for working with modern LLMs. It supports OpenAI, Azure, Perplexity, LLaMA, and more, with features like streaming, interactive chat, prompt files, image/audio I/O, MCP tool calls, and an experimental agent mode for safe, multi-step automation. |
| QuantGeekDev/mcp-framework | ⭐ 917 | TypeScript | The Typescript MCP Framework |
| xavctn/img2table | ⭐ 865 | Python | img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing |
| Atome-FE/llama-node | ⭐ 865 | Rust | Believe in AI democratization. Llama for Node.js, backed by llama-rs, llama.cpp and rwkv.cpp; works locally on your laptop CPU. Supports llama/alpaca/gpt4all/vicuna/rwkv models. |
| MagnivOrg/prompt-layer-library | ⭐ 762 | Python | 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions. |
| latitudegames/GPT-3-Encoder | ⭐ 722 | JavaScript | Javascript BPE Encoder Decoder for GPT-2 / GPT-3 |
| liveloveapp/hashbrown | ⭐ 686 | TypeScript | Hashbrown is a framework for building agents that run in the browser. Built for Angular and React. |
| gnu-octave/octave | ⭐ 606 | C++ | GNU Octave Mirror (https://www.octave.org/hg/octave). Report bugs and submit pull requests (patches) at https://bugs.octave.org |
| llm-tools/embedJs | ⭐ 605 | TypeScript | A NodeJS RAG framework to easily work with LLMs and embeddings |
| castorini/hedwig | ⭐ 597 | Python | PyTorch deep learning models for document classification |
| jiggzson/nerdamer | ⭐ 547 | JavaScript | a symbolic math expression evaluator for javascript |
| LeonLok/Deep-SORT-YOLOv4 | ⭐ 501 | Python | People detection and optional tracking with Tensorflow backend. |
| bhky/opennsfw2 | ⭐ 490 | Python | Keras implementation of the Yahoo Open-NSFW model |
| ricklupton/floweaver | ⭐ 464 | Python | View flow data as Sankey diagrams |
| okuvshynov/slowllama | ⭐ 450 | Python | Finetune llama2-70b and codellama on MacBook Air without quantization |
| simonmysun/ell | ⭐ 435 | Shell | A command-line interface for LLMs written in Bash. |
| zju-vipa/Odyssey | ⭐ 379 | Python | Odyssey: Empowering Minecraft Agents with Open-World Skills |
| RustCrypto/stream-ciphers | ⭐ 319 | Rust | Collection of stream cipher algorithms |
| AtomicFrontierCode/keyboards | ⭐ 299 | Julia | Simulated annealing code for video |
| zeyu2001/chess-ai | ⭐ 274 | JavaScript | Simple chess AI in JavaScript. Uses the chess.js and chessboard.js libraries. |
| quantastica/quantum-circuit | ⭐ 272 | JavaScript | Quantum Circuit Simulator implemented in JavaScript |
| yusufhilmi/client-vector-search | ⭐ 231 | TypeScript | A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs. |
| qminer/qminer | ⭐ 215 | C++ | Analytic platform for real-time large-scale streams containing structured and unstructured data. |
| Waikato/weka-3.8 | ⭐ 190 | Java | No longer updated mirror of the Weka 3.8 branch. |
| meilisearch/meilisearch-mcp | ⭐ 185 | Python | A Model Context Protocol (MCP) server for interacting with Meilisearch through LLM interfaces. |
| ShawnHymel/computer-vision-with-embedded-machine-learning | ⭐ 181 | Jupyter Notebook | |
| jamesporter/solandra | ⭐ 181 | TypeScript | A framework for algorithmic art. TypeScript first. Make drawing concepts part of framework. Make APIs for humans. |
| Anush008/fastembed-js | ⭐ 175 | TypeScript | Generate vector embeddings in NodeJS |
| josefjadrny/js-chess-engine | ⭐ 159 | TypeScript | Complete TypeScript chess engine with zero dependencies for Node.js >=24 and browsers. Features configurable AI (5 predefined difficulty levels), stateful/stateless APIs, and supports JSON and FEN formats. |
| phil65/agentpool | ⭐ 145 | Python | A unified agent orchestration hub that lets you configure and manage multiple AI agents (native, ACP, AGUI, Claude Code) via YAML, and exposes them through standardized protocols (ACP/OpenCode Server). |
| luczeng/HoughRectangle | ⭐ 118 | C++ | Rectangle detection using the Hough transform |
| nordwestt/ollama-ai-provider-v2 | ⭐ 100 | TypeScript | Vercel AI Provider for running LLMs locally using Ollama |
| panchishin/geneticalgorithm | ⭐ 100 | JavaScript | A fully generalized implementation of the Genetic Algorithm usable on any json based Phenotype |
| tkafka/node-elizabot | ⭐ 63 | JavaScript | |
| sbrsv/ai-embed-search | ⭐ 31 | TypeScript | Smart. Simple. Local. AI-powered semantic search in TypeScript using transformer embeddings. No cloud, no API keys — 100% offline. |
| fboerncke/bloom-ai-simple-starter-nodejs | ⭐ 31 | JavaScript | Simple starter BLOOM example showing how to access the web api to get something up and running in short time. |
| muthuspark/line-segmentation-handwritten-doc | ⭐ 22 | Jupyter Notebook | Line segmentation of handwritten documents based on the A* path-planning algorithm |
| anthonyray/littlebrain | ⭐ 16 | JavaScript | Multi-layer Neural Network in Javascript |
| ignaciomosca/id3-prolog | ⭐ 5 | Prolog | Prolog Implementation of the ID3 Algorithm |
| johnsonj561/Search-and-Classification-With-Natural | ⭐ 2 | JavaScript | Text pre-processing, tf-idf, cosine similarity, and classification using Node package 'Natural' |
| Cereceres/gradient-descent-js | ⭐ 1 | JavaScript | Gradient descent |