AI, LLMs & Data

246 repositories across 4 subcategories

All Repositories — sorted by stars

Repository Stars Language Description
tensorflow/tensorflow ⭐ 195.0K C++ An Open Source Machine Learning Framework for Everyone
Significant-Gravitas/AutoGPT ⭐ 184.0K Python AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
obra/superpowers ⭐ 177.3K Shell An agentic skills framework & software development methodology that works.
ollama/ollama ⭐ 170.6K Go Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
huggingface/transformers ⭐ 160.2K Python 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
langgenius/dify ⭐ 140.0K TypeScript Production-ready platform for agentic workflow development.
langchain-ai/langchain ⭐ 135.7K Python The agent engineering platform. Available in TypeScript!
open-webui/open-webui ⭐ 135.4K Python User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
anthropics/skills ⭐ 127.7K Python Public repository for Agent Skills
ggml-org/llama.cpp ⭐ 108.1K C++ LLM inference in C/C++
google-gemini/gemini-cli ⭐ 103.1K TypeScript An open-source AI agent that brings the power of Gemini directly into your terminal.
pytorch/pytorch ⭐ 99.6K Python Tensors and Dynamic neural networks in Python with strong GPU acceleration
openai/whisper ⭐ 98.8K Python Robust Speech Recognition via Large-Scale Weak Supervision
hacksider/Deep-Live-Cam ⭐ 92.6K Python real time face swap and one-click video deepfake with only a single image
deepseek-ai/DeepSeek-R1 ⭐ 92.0K
browser-use/browser-use ⭐ 91.9K Python 🌐 Make websites accessible for AI agents. Automate tasks online with ease.
opencv/opencv ⭐ 87.3K C++ Open Source Computer Vision Library
openai/codex ⭐ 79.8K Rust Lightweight coding agent that runs in your terminal
infiniflow/ragflow ⭐ 79.6K Python RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
nomic-ai/gpt4all ⭐ 77.4K C++ GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
PaddlePaddle/PaddleOCR ⭐ 77.0K Python Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
tesseract-ocr/tesseract ⭐ 73.9K C++ Tesseract Open Source OCR Engine (main repository)
CompVis/stable-diffusion ⭐ 73.0K Jupyter Notebook A latent text-to-image diffusion model
paperclipai/paperclip ⭐ 62.4K TypeScript Open-source orchestration for zero-human companies
CorentinJ/Real-Time-Voice-Cloning ⭐ 59.7K Python Clone a voice in 5 seconds to generate arbitrary speech in real-time
gsd-build/get-shit-done ⭐ 59.7K JavaScript A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES.
meta-llama/llama ⭐ 59.4K Python Inference code for Llama models
666ghj/MiroFish ⭐ 59.0K Python A Simple and Universal Swarm Intelligence Engine, Predicting Anything.
zylon-ai/private-gpt ⭐ 57.2K Python Interact with your documents using the power of GPT, 100% privately, no data leaks
code-yeongyu/oh-my-openagent ⭐ 55.6K TypeScript omo; the best agent harness - previously oh-my-opencode
AntonOsika/gpt-engineer ⭐ 55.2K Python CLI platform to experiment with codegen. Precursor to: https://lovable.dev
mem0ai/mem0 ⭐ 54.7K Python Universal memory layer for AI Agents
upstash/context7 ⭐ 54.4K TypeScript Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
facebookresearch/segment-anything ⭐ 54.1K Jupyter Notebook The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
run-llama/llama_index ⭐ 49.1K Python LlamaIndex is the leading document agent and OCR platform
pandas-dev/pandas ⭐ 48.7K Python Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
mudler/LocalAI ⭐ 46.0K Go LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
BerriAI/litellm ⭐ 45.6K Python Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]
apache/airflow ⭐ 45.3K Python Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
coqui-ai/TTS ⭐ 45.2K Python 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Aider-AI/aider ⭐ 44.3K Python aider is AI pair programming in your terminal
badlogic/pi-mono ⭐ 44.2K TypeScript AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
aaif-goose/goose ⭐ 43.7K Rust an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
HKUDS/nanobot ⭐ 41.6K Python 🐈 nanobot: The Ultra-Lightweight Personal AI Agent
google-research/bert ⭐ 40.0K Python TensorFlow code and pre-trained models for BERT
facebookresearch/faiss ⭐ 39.9K C++ A library for efficient similarity search and clustering of dense vectors.
lm-sys/FastChat ⭐ 39.5K Python An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
QuivrHQ/quivr ⭐ 39.1K Python Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
suno-ai/bark ⭐ 39.1K Jupyter Notebook 🔊 Text-Prompted Generative Audio Model
pola-rs/polars ⭐ 38.4K Rust Extremely fast Query Engine for DataFrames, written in Rust
naptha/tesseract.js ⭐ 38.0K JavaScript Pure Javascript OCR for more than 100 Languages 📖🎉🖥
LAION-AI/Open-Assistant ⭐ 37.4K Python OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
google/langextract ⭐ 36.4K Python A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
stanfordnlp/dspy ⭐ 34.2K Python DSPy: The framework for programming—not prompting—language models
Pythagora-io/gpt-pilot ⭐ 33.8K Python The first real AI developer
explosion/spaCy ⭐ 33.5K Python 💫 Industrial-strength Natural Language Processing (NLP) in Python
zeroclaw-labs/zeroclaw ⭐ 31.0K Rust Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM
karpathy/llm.c ⭐ 29.8K Cuda LLM training in simple, raw C/CUDA
sipeed/picoclaw ⭐ 28.7K Go Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity
qwibitai/nanoclaw ⭐ 28.6K TypeScript A lightweight alternative to OpenClaw that runs in containers for security
stanford-oval/storm ⭐ 28.2K Python An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
assafelovic/gpt-researcher ⭐ 26.8K Python An autonomous agent that conducts deep research on any data using any LLM providers
mozilla/DeepSpeech ⭐ 26.8K C++ DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
ApolloAuto/apollo ⭐ 26.6K C++ An open autonomous driving platform
langfuse/langfuse ⭐ 26.5K TypeScript 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
microsoft/JARVIS ⭐ 24.7K Python JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
resemble-ai/chatterbox ⭐ 24.6K Python SoTA open-source TTS
multica-ai/multica ⭐ 24.5K TypeScript The open-source managed agents platform. Turn coding agents into real teammates
toon-format/toon ⭐ 24.1K TypeScript 🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
vercel/ai ⭐ 24.0K TypeScript The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
oraios/serena ⭐ 23.8K Python A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent
mastra-ai/mastra ⭐ 23.5K TypeScript From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
ScrapeGraphAI/Scrapegraph-ai ⭐ 23.4K Python Python scraper based on AI
deepseek-ai/DeepSeek-Coder ⭐ 23.2K Python DeepSeek Coder: Let the Code Write Itself
mlc-ai/mlc-llm ⭐ 22.6K Python Universal LLM Deployment Engine with ML Compilation
supermemoryai/supermemory ⭐ 22.4K TypeScript Memory engine and app that is extremely fast, scalable. The Memory API for the AI era.
yoheinakajima/babyagi ⭐ 22.3K Python
openai/chatgpt-retrieval-plugin ⭐ 21.2K Python The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
microsoft/onnxruntime ⭐ 20.4K C++ ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
tensorflow/tfjs ⭐ 19.1K TypeScript A WebGL accelerated JavaScript library for training and deploying ML models.
transitive-bullshit/agentic ⭐ 18.1K TypeScript Your API ⇒ Paid MCP. Instantly.
openai/tiktoken ⭐ 18.1K Python tiktoken is a fast BPE tokeniser for use with OpenAI's models.
mlc-ai/web-llm ⭐ 17.9K TypeScript High-performance In-browser LLM Inference Engine
justadudewhohacks/face-api.js ⭐ 17.8K TypeScript JavaScript API for face detection and face recognition in the browser and nodejs with tensorflow.js
agentskills/agentskills ⭐ 17.8K Python Specification and documentation for Agent Skills
langchain-ai/langchainjs ⭐ 17.6K TypeScript The agent engineering platform
leon-ai/leon ⭐ 17.2K TypeScript 🧠 Leon is your open-source personal assistant.
manaflow-ai/cmux ⭐ 16.1K Swift Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents
huggingface/transformers.js ⭐ 16.0K JavaScript State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Stability-AI/StableLM ⭐ 15.7K Jupyter Notebook StableLM: Stability AI Language Models
official-stockfish/Stockfish ⭐ 15.4K C++ A free and strong UCI chess engine
gpujs/gpu.js ⭐ 15.4K JavaScript GPU Accelerated JavaScript
josdejong/mathjs ⭐ 15.0K JavaScript An extensive math library for JavaScript and Node.js
BrainJS/brain.js ⭐ 14.9K TypeScript 🤖 GPU accelerated Neural networks in JavaScript for Browsers and Node.js
neonbjb/tortoise-tts ⭐ 14.8K Jupyter Notebook A multi-voice TTS system trained with an emphasis on quality
alphacep/vosk-api ⭐ 14.7K Jupyter Notebook Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Unstructured-IO/unstructured ⭐ 14.6K HTML Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.
ggml-org/ggml ⭐ 14.6K C++ Tensor library for machine learning
dottxt-ai/outlines ⭐ 13.8K Python Structured Outputs
KittenML/KittenTTS ⭐ 13.7K Python State-of-the-art TTS model under 25MB 😻
nextapps-de/flexsearch ⭐ 13.7K JavaScript Next-generation full-text search library for Browser and Node.js
microsoft/LoRA ⭐ 13.5K Python Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
google-deepmind/mujoco ⭐ 13.3K C++ Multi-Joint dynamics with Contact. A general purpose physics simulator.
jupyter/notebook ⭐ 13.1K Jupyter Notebook Jupyter Interactive Notebook
cocktailpeanut/dalai ⭐ 12.9K CSS The simplest way to run LLaMA on your local machine
ShishirPatil/gorilla ⭐ 12.9K Python Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
neuml/txtai ⭐ 12.5K Python 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
sapientinc/HRM ⭐ 12.4K Python Hierarchical Reasoning Model Official Release
nearai/ironclaw ⭐ 12.1K Rust IronClaw is an Agent OS focused on privacy, security and extensibility
spencermountain/compromise ⭐ 12.1K JavaScript modest natural-language processing
h2oai/h2ogpt ⭐ 12.0K Python Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
0xk1h0/ChatGPT_DAN ⭐ 12.0K ChatGPT DAN, Jailbreaks prompt
tambo-ai/tambo ⭐ 11.1K TypeScript Generative UI SDK for React
artidoro/qlora ⭐ 10.9K Jupyter Notebook QLoRA: Efficient Finetuning of Quantized LLMs
openai/openai-node ⭐ 10.9K TypeScript Official JavaScript / TypeScript library for the OpenAI API
openai/DALL-E ⭐ 10.9K Python PyTorch package for the discrete VAE used for DALL·E.
kedro-org/kedro ⭐ 10.9K Python Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
browseros-ai/BrowserOS ⭐ 10.7K TypeScript 🌐 The open-source Agentic browser; alternative to ChatGPT Atlas, Perplexity Comet, Dia.
huggingface/chat-ui ⭐ 10.7K TypeScript The open source codebase powering HuggingChat
Turfjs/turf ⭐ 10.4K TypeScript A modular geospatial engine written in JavaScript and TypeScript
mozilla/TTS ⭐ 10.1K Jupyter Notebook 🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
bigscience-workshop/petals ⭐ 10.1K Python 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
yzhao062/pyod ⭐ 9.8K Python A Python library for anomaly detection across tabular, time series, graph, text, and image data. 60+ detectors, benchmark-backed ADEngine orchestration, and an agentic workflow for AI agents.
Arize-ai/phoenix ⭐ 9.5K Python AI Observability & Evaluation
infinitered/nsfwjs ⭐ 8.9K TypeScript NSFW detection on the client-side via TensorFlow.js
davidkimai/Context-Engineering ⭐ 8.8K Python "Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspired by Karpathy and 3Blue1Brown for moving beyond prompt engineering to the wider discipline of context design, orchestration, and optimization.
ogx-ai/ogx ⭐ 8.4K Python Open GenAI Stack
leanprover/lean4 ⭐ 8.0K Lean Lean 4 programming language and theorem prover
google-deepmind/alphafold3 ⭐ 7.9K Python AlphaFold 3 inference pipeline.
weaviate/Verba ⭐ 7.7K Python Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
openlm-research/open_llama ⭐ 7.5K OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
nullclaw/nullclaw ⭐ 7.4K Zig Fastest, smallest, and fully autonomous AI assistant infrastructure written in Zig
tailcallhq/forgecode ⭐ 7.2K Rust AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models
traceloop/openllmetry ⭐ 7.1K Python Open-source observability for your GenAI or LLM application, based on OpenTelemetry
deepseek-ai/DeepSeek-LLM ⭐ 6.9K Makefile DeepSeek LLM: Let there be answers
hexgrad/kokoro ⭐ 6.9K JavaScript https://hf.co/hexgrad/Kokoro-82M
google-deepmind/graphcast ⭐ 6.6K Python
ml5js/ml5-library ⭐ 6.6K JavaScript Friendly machine learning for the web! 🤖
microsoft/TRELLIS.2 ⭐ 6.6K Python Native and Compact Structured Latents for 3D Generation
axa-group/nlp.js ⭐ 6.6K JavaScript An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and more
supercollider/supercollider ⭐ 6.5K C++ An audio server, programming language, and IDE for sound synthesis and algorithmic composition.
axa-group/Parsr ⭐ 6.2K JavaScript Transforms PDF, Documents and Images into Enriched Structured Data
mnfst/manifest ⭐ 6.0K TypeScript Smart Model Routing for Agents. Cut Costs up to 70% 🦚
salesforce/BLIP ⭐ 5.7K Jupyter Notebook PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Helicone/helicone ⭐ 5.6K TypeScript 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
MrLesk/Backlog.md ⭐ 5.5K TypeScript Backlog.md - A tool for managing project collaboration between humans and AI Agents in a git ecosystem
treeverse/lakeFS ⭐ 5.3K Go lakeFS - Data version control for your data lake | Git for data
katanaml/sparrow ⭐ 5.2K Python Structured data extraction and instruction calling with ML, LLM and Vision LLM
google-deepmind/gemma ⭐ 5.1K Python Gemma open-weight LLM library, from Google DeepMind
javascriptdata/danfojs ⭐ 5.1K TypeScript Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
togethercomputer/RedPajama-Data ⭐ 4.9K Python The RedPajama-Data repository contains code for preparing large datasets for training large language models.
h2oai/h2o-llmstudio ⭐ 4.9K Python H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Nixtla/statsforecast ⭐ 4.8K Python Lightning ⚡️ fast forecasting with statistical and econometric models.
neo4j-labs/llm-graph-builder ⭐ 4.7K Jupyter Notebook Neo4j graph construction from unstructured data using LLMs
getzep/zep ⭐ 4.5K Python Zep | Examples, Integrations, & More
peterbraden/node-opencv ⭐ 4.4K C++ OpenCV Bindings for node.js
ogx-ai/llama-stack-apps ⭐ 4.3K Agentic components of the Llama Stack APIs
Agenta-AI/agenta ⭐ 4.1K TypeScript The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
CaviraOSS/OpenMemory ⭐ 4.1K TypeScript Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.
latitude-dev/latitude-llm ⭐ 4.0K TypeScript Latitude is the open-source agent engineering platform
OHF-Voice/piper1-gpl ⭐ 3.9K C++ Fast and local neural text-to-speech engine
mlc-ai/web-stable-diffusion ⭐ 3.7K Jupyter Notebook Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
google/deepvariant ⭐ 3.7K Python DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
vijishmadhavan/ArtLine ⭐ 3.6K Jupyter Notebook A Deep Learning based project for creating line art portraits.
ploomber/ploomber ⭐ 3.6K Python The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
benjitaylor/agentation ⭐ 3.5K TypeScript The visual feedback tool for agents.
simple-statistics/simple-statistics ⭐ 3.5K JavaScript simple statistics for node & browser javascript
facebookresearch/sam-audio ⭐ 3.5K Python The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
hao-ai-lab/FastVideo ⭐ 3.4K Python A unified inference and post-training framework for accelerated video generation.
pashpashpash/vault-ai ⭐ 3.4K JavaScript OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.
CodedotAl/gpt-code-clippy ⭐ 3.3K Python Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57
ob-f/OpenBot ⭐ 3.3K Swift OpenBot leverages smartphones as brains for low-cost robots. We have designed a small electric vehicle that costs about $50 and serves as a robot body. Our software stack for Android smartphones supports advanced robotics workloads such as person following and real-time autonomous navigation.
deepseek-ai/DeepSeek-Math ⭐ 3.3K Python DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
langwatch/langwatch ⭐ 3.2K TypeScript The platform for LLM evaluations and AI agent testing
jehna/humanify ⭐ 3.2K TypeScript Deobfuscate Javascript code using ChatGPT
noahshinn/reflexion ⭐ 3.1K Python [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
LeelaChessZero/lc0 ⭐ 3.1K C++ Open source neural network chess engine with GPU acceleration and broad hardware support.
agentclientprotocol/agent-client-protocol ⭐ 3.0K Rust A protocol for connecting any editor to any agent
leeoniya/uFuzzy ⭐ 3.0K JavaScript A tiny, efficient fuzzy search that doesn't suck
FreedomIntelligence/LLMZoo ⭐ 2.9K Python ⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
tidalcycles/strudel ⭐ 2.9K MOVED TO CODEBERG - Web-based environment for live coding algorithmic patterns, incorporating a faithful port of TidalCycles to JavaScript
spiceai/spiceai ⭐ 2.9K Rust A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
janhq/cortex.cpp ⭐ 2.8K C++ Local AI API Platform
TanStack/ai ⭐ 2.6K TypeScript 🤖 SDK that enhances your applications with AI capabilities
coqui-ai/STT ⭐ 2.6K C++ 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
jolibrain/deepdetect ⭐ 2.5K C++ Deep Learning API and Server in C++14 with support for PyTorch, TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE
kieler/elkjs ⭐ 2.5K JavaScript ELK's layout algorithms for JavaScript
nicolaspanel/numjs ⭐ 2.5K JavaScript Like NumPy, in JavaScript
sagemath/sage ⭐ 2.4K Python Main repository of SageMath
n-riesco/ijavascript ⭐ 2.3K JavaScript IJavascript is a javascript kernel for the Jupyter notebook
ymichael/open-codex ⭐ 2.2K TypeScript Lightweight coding agent that runs in your terminal
shcherbak-ai/contextgem ⭐ 1.8K Python ContextGem: Effortless LLM extraction from documents
yusufcanb/tlm ⭐ 1.5K Go Local CLI Copilot, powered by Ollama. 💻🦙
Yifan-Song793/RestGPT ⭐ 1.4K Python An LLM-based autonomous agent controlling real-world applications via RESTful APIs
winkjs/wink-nlp ⭐ 1.4K JavaScript Developer friendly Natural Language Processing ✨
ThousandBirdsInc/chidori ⭐ 1.3K Rust A reactive runtime for building durable AI agents
eduardoleao052/js-pytorch ⭐ 1.2K JavaScript A JavaScript library like PyTorch, with GPU acceleration.
devnen/Chatterbox-TTS-Server ⭐ 1.2K Python Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
mendersoftware/mender ⭐ 1.2K C++ Mender over-the-air software updater client.
stereolabs/zed-sdk ⭐ 1.2K C++ ⚡️The spatial perception framework for rapidly building smart robots and spaces
nmrugg/stockfish.js ⭐ 1.2K C++ The Stockfish chess engine for web browsers
CASIA-LMC-Lab/AnomalyGPT ⭐ 1.1K Python [AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
davidedc/Algebrite ⭐ 997 TypeScript Computer Algebra System in Javascript (Typescript)
SalesforceAIResearch/promptomatix ⭐ 952 Python An Automatic Prompt Optimization Framework for Large Language Models
jonwiggins/optio ⭐ 934 TypeScript Workflow orchestration for AI coding agents, from task to merged PR.
kardolus/chatgpt-cli ⭐ 921 Go ChatGPT CLI is a powerful, multi-provider command-line interface for working with modern LLMs. It supports OpenAI, Azure, Perplexity, LLaMA, and more, with features like streaming, interactive chat, prompt files, image/audio I/O, MCP tool calls, and an experimental agent mode for safe, multi-step automation.
QuantGeekDev/mcp-framework ⭐ 917 TypeScript The Typescript MCP Framework
xavctn/img2table ⭐ 865 Python img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing
Atome-FE/llama-node ⭐ 865 Rust Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp; works locally on your laptop CPU. Supports llama/alpaca/gpt4all/vicuna/rwkv models.
MagnivOrg/prompt-layer-library ⭐ 762 Python 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
latitudegames/GPT-3-Encoder ⭐ 722 JavaScript Javascript BPE Encoder Decoder for GPT-2 / GPT-3
liveloveapp/hashbrown ⭐ 686 TypeScript Hashbrown is a framework for building agents that run in the browser. Built for Angular and React.
gnu-octave/octave ⭐ 606 C++ GNU Octave Mirror (https://www.octave.org/hg/octave). Report bugs and submit pull requests (patches) at https://bugs.octave.org
llm-tools/embedJs ⭐ 605 TypeScript A NodeJS RAG framework to easily work with LLMs and embeddings
castorini/hedwig ⭐ 597 Python PyTorch deep learning models for document classification
jiggzson/nerdamer ⭐ 547 JavaScript a symbolic math expression evaluator for javascript
LeonLok/Deep-SORT-YOLOv4 ⭐ 501 Python People detection and optional tracking with Tensorflow backend.
bhky/opennsfw2 ⭐ 490 Python Keras implementation of the Yahoo Open-NSFW model
ricklupton/floweaver ⭐ 464 Python View flow data as Sankey diagrams
okuvshynov/slowllama ⭐ 450 Python Finetune llama2-70b and codellama on MacBook Air without quantization
simonmysun/ell ⭐ 435 Shell A command-line interface for LLMs written in Bash.
zju-vipa/Odyssey ⭐ 379 Python Odyssey: Empowering Minecraft Agents with Open-World Skills
RustCrypto/stream-ciphers ⭐ 319 Rust Collection of stream cipher algorithms
AtomicFrontierCode/keyboards ⭐ 299 Julia Simulated annealing code for video
zeyu2001/chess-ai ⭐ 274 JavaScript Simple chess AI in JavaScript. Uses the chess.js and chessboard.js libraries.
quantastica/quantum-circuit ⭐ 272 JavaScript Quantum Circuit Simulator implemented in JavaScript
yusufhilmi/client-vector-search ⭐ 231 TypeScript A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.
qminer/qminer ⭐ 215 C++ Analytic platform for real-time large-scale streams containing structured and unstructured data.
Waikato/weka-3.8 ⭐ 190 Java No longer updated mirror of the Weka 3.8 branch.
meilisearch/meilisearch-mcp ⭐ 185 Python A Model Context Protocol (MCP) server for interacting with Meilisearch through LLM interfaces.
ShawnHymel/computer-vision-with-embedded-machine-learning ⭐ 181 Jupyter Notebook
jamesporter/solandra ⭐ 181 TypeScript A framework for algorithmic art. TypeScript first. Make drawing concepts part of framework. Make APIs for humans.
Anush008/fastembed-js ⭐ 175 TypeScript Generate vector embeddings in NodeJS
josefjadrny/js-chess-engine ⭐ 159 TypeScript Complete TypeScript chess engine with zero dependencies for Node.js >=24 and browsers. Features configurable AI (5 predefined difficulty levels), stateful/stateless APIs, and supports JSON and FEN formats.
phil65/agentpool ⭐ 145 Python A unified agent orchestration hub that lets you configure and manage multiple AI agents (native, ACP, AGUI, Claude Code) via YAML, and exposes them through standardized protocols (ACP/OpenCode Server).
luczeng/HoughRectangle ⭐ 118 C++ Rectangle detection using the Hough transform
nordwestt/ollama-ai-provider-v2 ⭐ 100 TypeScript Vercel AI Provider for running LLMs locally using Ollama
panchishin/geneticalgorithm ⭐ 100 JavaScript A fully generalized implementation of the Genetic Algorithm usable on any json based Phenotype
tkafka/node-elizabot ⭐ 63 JavaScript
sbrsv/ai-embed-search ⭐ 31 TypeScript Smart. Simple. Local. AI-powered semantic search in TypeScript using transformer embeddings. No cloud, no API keys — 100% offline.
fboerncke/bloom-ai-simple-starter-nodejs ⭐ 31 JavaScript Simple starter BLOOM example showing how to access the web api to get something up and running in short time.
muthuspark/line-segmentation-handwritten-doc ⭐ 22 Jupyter Notebook Line segmentation of handwritten documents based on the A* path planning algorithm
anthonyray/littlebrain ⭐ 16 JavaScript Multi-layer Neural Network in Javascript
ignaciomosca/id3-prolog ⭐ 5 Prolog Prolog Implementation of the ID3 Algorithm
johnsonj561/Search-and-Classification-With-Natural ⭐ 2 JavaScript Text pre-processing, tf-idf, cosine similarity, and classification using Node package 'Natural'
Cereceres/gradient-descent-js ⭐ 1 JavaScript Gradient descent
