mlx-openai-server

mlx-openai-server

cubist38

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.

315 Stars
55 Forks
315 Watchers
Python Language
mit License
100 SrcLog Score
Cost to Build
$1.72M
Market Value
$6.64M

Growth over time

6 data points  ·  2025-09-20 → 2026-04-22
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about mlx-openai-server

Question copied to clipboard

What is the cubist38/mlx-openai-server GitHub project? Description: "A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone mlx-openai-server

Clone via HTTPS

git clone https://github.com/cubist38/mlx-openai-server.git

Clone via SSH

[email protected]:cubist38/mlx-openai-server.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the mlx-openai-server issue tracker:

Open GitHub Issues