🔭AI Tools Scout·Open signals for AI builders

Best Open Source AI Inference Projects

95 inference projects ranked by GitHub stars, weekly growth, and maintenance health.

Project data last synced 9d ago. Check before relying on time-sensitive rankings.

Showing 1-50 of 95 projects

# | Project | Category | Stars | Weekly | Health | Language | Updated
1
Ollama
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
⚡ Inference | 171.2K | +450 | 95 | Go | 9d ago
2
Prompts.chat
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
⚡ Inference | 162.0K | 0 | 57 | HTML | 10d ago
3
llama.cpp
LLM inference in C/C++
⚡ Inference | 109.6K | +1.2K | 100 | C++ | 9d ago
4
vLLM
A high-throughput and memory-efficient inference and serving engine for LLMs
⚡ Inference | 79.7K | +606 | 93 | Python | 9d ago
5
Llm Course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
⚡ Inference | 79.2K | 0 | 30 | - | 3mo ago
6
Llamafactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
⚡ Inference | 71.1K | 0 | 56 | Python | 13d ago
7
Caveman
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
⚡ Inference | 58.3K | 0 | 64 | JavaScript | 11d ago
8
Trendradar
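⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts. 🎯 Say goodbye to information overload with your AI public-opinion monitoring assistant and trending-topic filter. Aggregates trending topics from multiple platforms plus RSS subscriptions, with precise keyword filtering. AI-filtered news, AI translation, and AI analysis briefings are pushed straight to your phone; it can also connect to the MCP architecture to enable natural-language conversational analysis, sentiment insight, and trend prediction. Supports Docker, with data self-hosted locally or in the cloud. Integrates smart push notifications through WeChat, Feishu, DingTalk, Telegram, email, ntfy, bark, Slack, and other channels.
⚡ Inference | 57.3K | 0 | 39 | Python | 21d ago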
9
Context7
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
⚡ Inference | 55.0K | 0 | 66 | TypeScript | 9d ago
10
Mempalace
The best-benchmarked open-source AI memory system. And it's free.
⚡ Inference | 52.0K | 0 | 80 | Python | 10d ago
11
Pi Mono
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
⚡ Inference | 48.2K | 0 | 73 | TypeScript | 10d ago
12
LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
⚡ Inference | 46.2K | +130 | 91 | Go | 9d ago
13
Milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
⚡ Inference | 44.2K | 0 | 75 | Go | 9d ago
14
Kong
🦍 The API and AI Gateway
⚡ Inference | 43.4K | 0 | 40 | Lua | 1mo ago
15
Jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
⚡ Inference | 42.5K | +88 | 80 | TypeScript | 10d ago
16
Lightrag
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
⚡ Inference | 35.0K | 0 | 80 | Python | 10d ago
17
Graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
⚡ Inference | 32.9K | 0 | 56 | Python | 9d ago
18
New Api
A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥
⚡ Inference | 32.5K | 0 | 74 | Go | 10d ago
19
Self Llm
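"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.
⚡ Inference | 30.4K | 0 | 37 | Jupyter Notebook | 27d ago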
20
Void
⚡ Inference | 28.7K | 0 | 36 | TypeScript | 4mo ago
21
Sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
⚡ Inference | 27.7K | 0 | 77 | Python | 9d ago
22
Gitleaks
Find secrets with Gitleaks 🔑
⚡ Inference | 26.8K | 0 | 33 | Go | 1mo ago
23
Awesome Generative Ai Guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
⚡ Inference | 26.6K | 0 | 42 | HTML | 12d ago
24
Hands On Large Language Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
⚡ Inference | 26.2K | 0 | 32 | Jupyter Notebook | 27d ago
25
Llmfit
Hundreds of models & providers. One command to find what runs on your hardware.
⚡ Inference | 25.8K | 0 | 70 | Rust | 11d ago
26
Scrapegraph Ai
Python scraper based on AI
⚡ Inference | 25.0K | 0 | 60 | Python | 11d ago
27
llamafile
Distribute and run LLMs with a single file.
⚡ Inference | 24.4K | +44 | 65 | C++ | 16d ago
28
Llm Action
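This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications).
⚡ Inference | 24.3K | 0 | 30 | HTML | 11d ago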
29
MLC LLM
Universal LLM Deployment Engine with ML Compilation
⚡ Inference | 22.6K | +36 | 62 | Python | 9d ago
30
Awesome Chinese LLM
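A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.
⚡ Inference | 22.6K | 0 | 41 | - | 11d ago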
31
Unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
⚡ Inference | 22.1K | 0 | 43 | Python | 3mo ago
32
Skyvern
Automate browser based workflows with AI
⚡ Inference | 21.6K | 0 | 68 | Python | 9d ago
33
Datasets
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
⚡ Inference | 21.5K | 0 | 60 | Python | 10d ago
34
Free Llm Api Resources
A list of free LLM inference resources accessible via API.
⚡ Inference | 21.3K | 0 | 30 | Python | 11d ago
35
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
⚡ Inference | 21.1K | 0 | 46 | Python | 2mo ago
36
Peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
⚡ Inference | 21.1K | 0 | 60 | Python | 10d ago
37
Heretic
Fully automatic censorship removal for language models
⚡ Inference | 20.8K | 0 | 54 | Python | 12d ago
38
Dyad
Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it!
⚡ Inference | 20.3K | 0 | 72 | TypeScript | 9d ago
39
Llama Cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family on various provider services.
⚡ Inference | 18.3K | 0 | 37 | Jupyter Notebook | 29d ago
40
Web Llm
High-performance In-browser LLM Inference Engine
⚡ Inference | 18.0K | 0 | 46 | TypeScript | 15d ago
41
Ml Engineering
Machine Learning Engineering Open Book
⚡ Inference | 17.9K | 0 | 36 | Python | 2mo ago
42
Airllm
AirLLM 70B inference with single 4GB GPU
⚡ Inference | 17.7K | 0 | 31 | Jupyter Notebook | 2mo ago
43
Qbot
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
⚡ Inference | 17.3K | 0 | 30 | Jupyter Notebook | 2mo ago
44
Code Review Graph
Local knowledge graph for Claude Code. Builds a persistent map of your codebase so Claude reads only what matters — 6.8× fewer tokens on reviews and up to 49× on daily coding tasks.
⚡ Inference | 16.1K | 0 | 77 | Python | 13d ago
45
RWKV LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
⚡ Inference | 14.5K | 0 | 23 | Python | 13d ago
46
Easy Dataset
A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval
⚡ Inference | 14.2K | 0 | 56 | JavaScript | 20d ago
47
Outlines
Structured Outputs
⚡ Inference | 13.8K | 0 | 52 | Python | 17d ago
48
Omlx
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
⚡ Inference | 13.6K | 0 | 78 | Python | 10d ago
49
Awesome Generative Ai
A curated list of modern Generative Artificial Intelligence projects and services
⚡ Inference | 12.0K | 0 | 43 | - | 15d ago
50
Tensorzero
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
⚡ Inference | 11.4K | 0 | 72 | Rust | 9d ago