95 inference projects ranked by GitHub stars, weekly growth, and maintenance health.
Showing 51-95 of 95 projects
| # | Project | Category | Stars | Weekly | Trend | Health | Language | Updated |
|---|---|---|---|---|---|---|---|---|
| 51 | Llm Engineer Toolkit A curated list of 120+ LLM libraries category wise. | ⚡ Inference | 10.4K | 0 | 39 | - | 1mo ago | |
| 52 | Openvino OpenVINO™ is an open source toolkit for optimizing and deploying AI inference | ⚡ Inference | 10.2K | 0 | 72 | C++ | 9d ago | |
| 53 |
Get the fastest-growing projects, useful MCP servers, and technical reads in one weekly email.
| ⚡ Inference |
| 9.5K |
| 0 |
| 75 |
| C# |
| 16d ago |
| 56 | Prompt Master A Claude skill that writes the accurate prompts for any AI tool. Zero tokens or credits wasted. Full context and memory retention | ⚡ Inference | 7.4K | 0 | 42 | - | 18d ago |
| 57 | Transformer Explainer Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization | ⚡ Inference | 7.3K | 0 | 28 | JavaScript | 1mo ago |
| 59 | Openllmetry Open-source observability for your GenAI or LLM application, based on OpenTelemetry | ⚡ Inference | 7.1K | 0 | 51 | Python | 10d ago |
| 60 | Vespa AI + Data, online. https://vespa.ai | ⚡ Inference | 6.9K | 0 | 66 | Java | 9d ago |
| 62 | Learning A log of things I'm learning | ⚡ Inference | 6.9K | 0 | 30 | - | 18d ago |
| 63 | LTX 2 Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model. | ⚡ Inference | 6.6K | 0 | 31 | Python | 10d ago |
| 64 | Firecrawl Mcp Server 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients. | ⚡ Inference | 6.3K | 0 | 37 | JavaScript | 14d ago |
| 65 | Sqlbot 🔥 基于大模型和 RAG 的智能问数系统,对话式数据分析神器。Text-to-SQL Generation via LLMs using RAG. | ⚡ Inference | 6.1K | 0 | 65 | JavaScript | 10d ago |
| 66 | Pgai A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL | ⚡ Inference | 5.8K | 0 | 34 | PLpgSQL | 3mo ago |
| 67 | Taxhacker Self-hosted AI accounting app. LLM analyzer for receipts, invoices, transactions with custom prompts and categories | ⚡ Inference | 5.6K | 0 | 40 | TypeScript | 1mo ago |
| 68 | Alignment Handbook Robust recipes to align language models with human and AI preferences | ⚡ Inference | 5.6K | 0 | 37 | Python | 1mo ago |
| 69 | Ultrarag A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines | ⚡ Inference | 5.5K | 0 | 43 | Python | 10d ago |
| 70 | Chronos Forecasting Chronos: Pretrained Models for Time Series Forecasting | ⚡ Inference | 5.3K | 0 | 41 | Python | 1mo ago |
| 72 | Sparrow Structured data extraction and instruction calling with ML, LLM and Vision LLM | ⚡ Inference | 5.2K | 0 | 43 | Python | 12d ago |
| 73 | Transformerlab App The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters. | ⚡ Inference | 4.9K | 0 | 74 | Python | 9d ago |
| 74 | Bifrost Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS. | ⚡ Inference | 4.8K | 0 | 74 | Go | 9d ago |
| 75 | Shimmy ⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever. | ⚡ Inference | 4.8K | 0 | 44 | Rust | 1mo ago |
| 76 | Claude Obsidian Claude + Obsidian knowledge companion. Persistent, compounding wiki vault based on Karpathy's LLM Wiki pattern. /wiki /save /autoresearch | ⚡ Inference | 4.8K | 0 | 54 | Python | 27d ago |
| 77 | Mlx Vlm MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. | ⚡ Inference | 4.7K | 0 | 68 | Python | 9d ago |
| 78 | Vllm Omni A framework for efficient model inference with omni-modality models | ⚡ Inference | 4.7K | 0 | 74 | Python | 10d ago |
| 79 | Llm Twin Course 🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴 | ⚡ Inference | 4.3K | 0 | 28 | Python | 1mo ago |
| 80 | LLM RL Visualized 🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps ) | ⚡ Inference | 4.3K | 0 | 34 | Python | 12d ago |
| 81 | Spark Nlp State of the Art Natural Language Processing | ⚡ Inference | 4.1K | 0 | 48 | Scala | 11d ago |
| 82 | Lemonade Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk | ⚡ Inference | 3.9K | 0 | 69 | C++ | 10d ago |
| 83 | Scikit Llm Seamlessly integrate LLMs into scikit-learn. | ⚡ Inference | 3.5K | 0 | 29 | Python | 19d ago |
| 84 | Optimum 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools | ⚡ Inference | 3.4K | 0 | 42 | Python | 14d ago |
| 85 | Horizon 📡 Your own AI-powered news radar. Generates daily briefings in English & Chinese. | 用 AI 构建你专属的新闻雷达 | ⚡ Inference | 3.4K | 0 | 54 | Python | 9d ago |
| 86 | Hallucination Leaderboard Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents | ⚡ Inference | 3.2K | 0 | 32 | Python | 10d ago |
| 87 | Landppt 一个基于LLM的演示文稿生成平台,能够自动将文档内容转换为专业的PPT演示文稿。平台支持多种AI模型,提供丰富的模板和样式选择,让用户能够创建高质量的演示文稿。 | ⚡ Inference | 3.2K | 0 | 38 | Python | 25d ago |
| 89 | Aix DB Aix-DB 基于 LangChain/LangGraph 框架,结合 MCP Skills 多智能体协作架构,实现自然语言到数据洞察的端到端转换。 | ⚡ Inference | 2.1K | 0 | 49 | JavaScript | 1mo ago |
| 91 | Lucebox Hub Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware. | ⚡ Inference | 1.9K | 0 | 63 | C++ | 10d ago |
| 92 | Detikzify Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ. | ⚡ Inference | 1.8K | 0 | 18 | Python | 3mo ago |
| 93 | Mindpipe A powerful model compression framework for LLMs and LVLMs, adapted for NVIDIA GPUs and Huawei Ascend NPUs. | ⚡ Inference | 1.0K | 0 | 43 | Python | 10d ago |
| 94 | Llm Internals Learn LLM internals step by step - from tokenization to attention to inference optimization. | ⚡ Inference | 978 | 0 | 21 | - | 10d ago |
| 95 | Vllm Studio Control panel for VLLM, Sglang, llama.cpp, exllamav3 | ⚡ Inference | 908 | 0 | 45 | TypeScript | 10d ago |