Do not leave out --tool-server when you want to use MCP tools with the Responses API. The flag is required to register your MCP tool server(s) with vLLM at startup, so the backend knows how to route the tool calls specified in your JSON payload: each MCP tool definition in a request (such as {"type": "mcp", ...}) is matched against the tool servers registered via --tool-server at launch. If you omit the flag, vLLM cannot connect to or invoke the MCP tools you specify, and tool calls will fail or be silently ignored. This behavior is confirmed by the vllm/entrypoints/tool_server.py implementation and the official documentation.
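To make the pairing concrete, here is a minimal sketch of a Responses API request body whose tools entry refers to an MCP server registered at launch. The model name, server label, and the comment's launch command are illustrative assumptions, not exact vLLM invocations; check your deployment for the real values.

```python
# Sketch: a Responses API payload with one MCP tool entry. The entry is
# matched against the tool server(s) vLLM registered at startup, e.g. via
#   vllm serve <model> --tool-server <host:port>   (illustrative)

def mcp_payload(model: str, prompt: str, server_label: str) -> dict:
    """Build a Responses API request body that enables one MCP tool server."""
    return {
        "model": model,
        "input": prompt,
        "tools": [
            {
                "type": "mcp",                 # routed to a registered MCP server
                "server_label": server_label,  # must correspond to a server
                                               # registered via --tool-server
            }
        ],
    }

payload = mcp_payload("my-model", "What is 2 + 2?", "my-tools")
```

If no registered server corresponds to the tool entry, this is exactly the failure mode described above: the request is well-formed JSON, but the backend has nowhere to route the call.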
Regarding environment variables: MCP tools require no special environment variables in general, unless your specific tool server needs authentication (e.g., API keys) or other configuration. For built-in tools like the Python interpreter, you may need to set PYTHON_EXECUTION_BACKEND=dangerously_use_uv to avoid Docker-based execution, as described in the gpt-oss tool use guide. For MCP tools, just make sure your tool server is running and reachable at the address you pass to --tool-server.
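For example, if you launch vLLM from a wrapper script, the interpreter-backend setting can be injected into the child process environment like this. The variable name comes from the gpt-oss tool use guide; the commented-out launch command is an illustrative assumption.

```python
import os

# Sketch: prepare an environment for a vLLM launch that runs the built-in
# Python tool without Docker. Only PYTHON_EXECUTION_BACKEND is from the
# gpt-oss guide; the launch command below is illustrative.
env = os.environ.copy()
env["PYTHON_EXECUTION_BACKEND"] = "dangerously_use_uv"  # skip Docker-based execution

# import subprocess
# subprocess.run(["vllm", "serve", "<model>", "--tool-server", "<host:port>"], env=env)
```

As the backend name suggests, this trades Docker's sandboxing for convenience, so treat it as a development-only setting.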
Yes, the mcp Python package (installed via pip install mcp) is required for MCP tool support in vLLM. If it is missing, vLLM raises an ImportError instructing you to install it, as shown in the MCPToolServer class in vllm/entrypoints/tool_server.py and the official API docs.
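The guard that MCPToolServer effectively performs can be sketched as a fail-fast optional-import check. The helper names and message wording below are illustrative, not vLLM's exact code.

```python
import importlib.util

def require_package(name: str) -> None:
    """Raise an actionable ImportError if a package is not importable."""
    if importlib.util.find_spec(name) is None:
        raise ImportError(
            f"The '{name}' package is required; install it with `pip install {name}`."
        )

def require_mcp() -> None:
    # Mirrors the check vLLM performs before constructing an MCP tool server.
    require_package("mcp")
```

Failing at construction time, rather than on the first tool call, surfaces the missing dependency immediately at server startup.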