llm-server

Star

Here are 17 public repositories matching this topic...

Quazmoz / npu-windows

Sponsor

Star

Please see the newer: https://github.com/Quazmoz/openvino-windows-llm

python windows npu openai-api ai-inference local-llm llm-server openwebui ipex-llm intel-npu

Updated May 20, 2026
Python

wannaphong / android-hostai

Sponsor

Star

HostAI - Android LLM API Server

android android-app llm llm-server

Updated Apr 11, 2026
Kotlin

teremterem / litellm-server-boilerplate

Star

A lightweight LiteLLM server boilerplate pre-configured with uv and Docker for hosting your own OpenAI- and Anthropic-compatible endpoints. Includes LibreChat as an optional web UI.

Updated Dec 8, 2025
Python

shakfu / chimera

Star

single-executable / library which combines llama.cpp, whisper.cpp, and stable-diffusion.cpp

sqlite rag whisper-cpp llama-cpp llm-inference llm-server stable-diffusion-cpp

Updated Jun 26, 2026
C++

pikocloud / pikobrain

Star

Function-calling API for LLM from multiple providers

api gemini openai rag function-calling ollama aws-bedrock llm-server

Updated Aug 10, 2024
Go

acari-git / MLXServerManager

Star

macOS GUI for managing pure mlx_lm.server on Apple Silicon in Direct Mode.

macos mlx swiftui apple-silicon local-llm llm-server agent-tools mlx-lm openai-compatible hermes-agent

Updated Jun 25, 2026
Swift

raketenkater / opencode-dcp-dynamic-limits

Star

opencode dcp llama-cpp local-llm context-window gguf llm-server opencode-plugin ik-llama-cpp

Updated May 27, 2026
TypeScript

A complete, menu-driven AI model interface for Windows that simplifies running local GGUF language models with llama.cpp. This tool automatically manages dependencies, provides multiple interaction modes, and prioritizes user privacy through fully offline operation.

Updated Jan 30, 2026
PowerShell

Quazmoz / openvino-windows-llm

Sponsor

Star

Windows-first OpenAI-compatible local LLM server powered by OpenVINO GenAI for Intel CPU/GPU/NPU, with chat UI, model conversion, and setup scripts.

python windows openvino n8n openai-api ai-inference local-llm llm-server openwebui intel-npu

Updated Jun 21, 2026
Python

danielcorin / llm-api

Star

API server for `llm` CLI tool

ai openai llm llm-server llm-api llm-api-server

Updated Aug 12, 2025
Python

Slyracoon23 / llm_server

Star

A flexible FastAPI-based framework for handling AI tasks using Large Language Models (LLMs). Supports multiple providers, extensible tasks and routers, Redis caching, and OpenAI integration. Easily scalable for various LLM-based applications.

llm llm-server

Updated Sep 3, 2024
Python

sw30labs / mlx-responses-api-server

Star

OpenAI-compatible local inference server for Apple Silicon using MLX. FastAPI server with Chat Completions and Responses APIs, multi-turn conversations, and streaming support.

mlx fastapi apple-silicon openai-api local-inference chat-completions llm-server responses-api

Updated Mar 7, 2026
Python

Cellphonemega-LLC / 0llama-Server

Star

PHP Frontend for Hosting local LLM's (run via VSCode or basic php execution methods/ add to project)

llama local-llm ollama ollama-api llm-server run-local-llm php-llm-server

Updated Jul 13, 2025
PHP

Pixie-sh / mlxer

Star

Headless CLI for managing local MLX language-model HTTP servers on Apple Silicon Macs. Supports model discovery, server lifecycle management, performance benchmarking, and provider integration with OpenCode, Claude Code, and LiteLLM.

cli mlx apple-silicon local-llm llm-server mlx-lm openai-compatible

Updated Jun 26, 2026
Python

alextra-lab / slm_server

Star

Unified simple LLM server wrapper with intelligent routing based on model ID

python machine-learning language-model mlx fastapi huggingface apple-silicon openai-api llm llm-server

Updated Jun 28, 2026
Python

EricApgar / llm-server

Star

Host an LLM and make it accessible on a network via API.

server self-hosted fastapi llm large-language-model llm-server

Updated May 12, 2026
Python

movieonlyemail4 / vscode-local-llm-

Star

Run local AI models in VS Code with automatic model detection, server start, and built-in MCP endpoint—no cloud or manual setup required.

chat open-source ai code mcp vscode-extension llama grammar-checker llm openmetadata code-assistant llamacpp llama-cpp ollama gguf ollama-api llm-server run-local-llm

Updated Jun 29, 2026
PHP

Improve this page

Add a description, image, and links to the llm-server topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-server topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-server

Here are 17 public repositories matching this topic...

Quazmoz / npu-windows

wannaphong / android-hostai

teremterem / litellm-server-boilerplate

shakfu / chimera

pikocloud / pikobrain

acari-git / MLXServerManager

raketenkater / opencode-dcp-dynamic-limits

xsukax / xsukax-GGUF-Runner

Quazmoz / openvino-windows-llm

danielcorin / llm-api

Slyracoon23 / llm_server

sw30labs / mlx-responses-api-server

Cellphonemega-LLC / 0llama-Server

Pixie-sh / mlxer

alextra-lab / slm_server

EricApgar / llm-server

movieonlyemail4 / vscode-local-llm-

Improve this page

Add this topic to your repo