A plug-and-play server that speaks OpenAI's Responses API, no matter which AI backend you're running.
Ollama? vLLM? LiteLLM? Even OpenAI itself?
This server bridges them all to the OpenAI Chat Completions and Responses API interfaces.
In plain words:
👉 Want to run OpenAI's Coding Assistant (Codex) or other OpenAI API clients against your own models?
👉 Want to experiment with self-hosted LLMs while keeping OpenAI API compatibility?
This project makes it happen.
It handles stateful chat, tool calls, and upcoming features such as file search and code interpreter, all behind the familiar OpenAI API.
⸻
✅ Acts as a drop-in replacement for OpenAI's Responses API.
✅ Lets you run any AI backend (Ollama, vLLM, Groq, etc.) with OpenAI-compatible clients.
✅ MCP support for both the Chat Completions and Responses APIs.
✅ Supports OpenAI's Codex coding assistant, which requires the Responses API.
✅ Built for innovators, researchers, and OSS enthusiasts.
✅ Enterprise-ready: scalable, reliable, and secure for production workloads.
⸻
🔥 What's in & what's next?
✅ Done · 🚧 Coming soon
⸻
Latest release on PyPI:
pip install open-responses-server
Or install from source:
uv venv
uv pip install .
uv pip install -e ".[dev]" # dev dependencies
Run the server:
# Using CLI tool (after installation)
otc start
# Or directly from source
uv run src/open_responses_server/cli.py start
Docker deployment:
# Run with Docker
docker run -p 8080:8080 \
-e OPENAI_BASE_URL_INTERNAL=http://your-llm-api:8000 \
-e OPENAI_BASE_URL=http://localhost:8080 \
-e OPENAI_API_KEY=your-api-key \
ghcr.io/teabranch/open-responses-server:latest
Works great with docker-compose.yaml for Codex + your own model; a sketch is shown below.
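A minimal docker-compose.yaml sketch, assuming Ollama as the backend; the service names, ports, and environment values are illustrative, so adjust them to your setup:
services:
  ollama:
    image: ollama/ollama:latest            # example backend; swap in vLLM, LiteLLM, etc.
    ports:
      - "11434:11434"
  open-responses-server:
    image: ghcr.io/teabranch/open-responses-server:latest
    ports:
      - "8080:8080"
    environment:
      - OPENAI_BASE_URL_INTERNAL=http://ollama:11434   # backend address on the compose network
      - OPENAI_BASE_URL=http://localhost:8080          # address clients such as Codex will call
      - OPENAI_API_KEY=sk-mockapikey123456789          # mock key tunneled to the backend
    depends_on:
      - ollama
Point Codex or any other OpenAI-compatible client at http://localhost:8080 with the same mock key.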
⸻
Minimal config to connect your AI backend:
OPENAI_BASE_URL_INTERNAL=http://localhost:11434 # Ollama, vLLM, Groq, etc.
OPENAI_BASE_URL=http://localhost:8080 # This server's endpoint
OPENAI_API_KEY=sk-mockapikey123456789 # Mock key tunneled to backend
MCP_SERVERS_CONFIG_PATH=./mcps.json # Path to MCP servers JSON file (see example below)
Server binding:
API_ADAPTER_HOST=0.0.0.0
API_ADAPTER_PORT=8080
Optional logging:
LOG_LEVEL=INFO
LOG_FILE_PATH=./log/api_adapter.log
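MCP_SERVERS_CONFIG_PATH points at a JSON file listing the MCP servers to expose. The snippet below is a hypothetical sketch that follows the common MCP client convention of one command-plus-arguments entry per server; check the repository's examples for the exact schema this project expects:
{
  "mcpServers": {
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/allowed/dir"]
    }
  }
}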
Configure with CLI tool:
# Interactive configuration setup
otc configure
Verify setup:
# Check if the server is working
curl http://localhost:8080/v1/models
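If the model list comes back, you can also exercise the Responses endpoint directly; the model name below is a placeholder for whatever your backend actually serves:
# Send a minimal request through the Responses API
curl http://localhost:8080/v1/responses \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-mockapikey123456789" \
  -d '{"model": "llama3.1", "input": "Say hello in one sentence."}'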
⸻
If you think this is cool:
⭐ Star the repo.
🐛 Open an issue if something's broken.
🤝 Suggest a feature or submit a pull request!
This is early-stage but already usable in real-world demos.
Let's build something powerful, together.
⸻
@software{open-responses-server,
author = {TeaBranch},
title = {open-responses-server: Open-source server bridging any AI provider to OpenAI's Responses API},
year = {2025},
publisher = {GitHub},
journal = {GitHub Repository},
howpublished = {\url{https://github.com/teabranch/open-responses-server}},
commit = {use the commit hash you're working with}
}
TeaBranch. (2025). open-responses-server: Open-source server that serves any AI provider with OpenAI Chat Completions as OpenAI's Responses API and hosted tools [Computer software]. GitHub. https://github.com/teabranch/open-responses-server
This repository has changed names: