fix: prevent server crash when clients send 'tools' param (fixes #530) #538
Open

octo-patch wants to merge 1 commit into microsoft:main from
Conversation
fix: prevent server crash when clients send 'tools' param (fixes microsoft#530)

When clients like Open WebUI send a POST to /v1/chat/completions with a 'tools' parameter (for function calling), the llama-server binary crashes with SIGABRT instead of returning a proper error response.

Root cause: oaicompat_completion_params_parse() throws std::runtime_error for unsupported params ('tools', 'tool_choice'), but the exception is not caught in handle_chat_completions(), so it propagates past httplib's exception handler and terminates the process.

Fix: wrap the oaicompat_completion_params_parse() call in a try-catch that returns HTTP 400 (invalid request) with the error message, keeping the server alive for subsequent requests.

Change: update the 3rdparty/llama.cpp submodule to octo-patch/llama.cpp at commit 7525084, which contains the fix in examples/server/server.cpp.
Fixes #530
Problem
When clients like Open WebUI send a `POST /v1/chat/completions` request with a `tools` parameter (standard OpenAI function-calling API), the llama-server binary crashes with `SIGABRT` instead of returning a proper error response.

Root Cause
In `examples/server/server.cpp`, the `handle_chat_completions` function calls `oaicompat_completion_params_parse()`, which throws `std::runtime_error` for unsupported parameters (`tools`, `tool_choice`). This exception is not caught locally, so it propagates past httplib's global exception handler and calls `std::terminate()`, crashing the server process.

Solution
Wrap the `oaicompat_completion_params_parse()` call in a `try`-`catch` block that:

- catches any `std::exception` thrown during parameter parsing
- returns HTTP 400 (invalid request) with the error message, keeping the server alive for subsequent requests

The change is minimal (7 lines in `examples/server/server.cpp`) and does not affect normal request handling.

Changes
- Update the `3rdparty/llama.cpp` submodule to `octo-patch/llama.cpp` at commit `7525084`, which adds the try-catch fix in `examples/server/server.cpp`

Testing
Verified that the fix pattern matches how other error conditions are handled in the same file (e.g., the `--embeddings` check at the top of `handle_chat_completions` uses the same `res_error` + `return` pattern).

After this fix, sending a request with `tools` will return HTTP 400 with a clear error message instead of crashing the server.
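For reference, this is the shape of OpenAI-style payload that previously triggered the crash (the model name and tool definition are placeholders, not values from the PR):

```python
import json

# Minimal chat-completions request carrying a 'tools' entry, as a
# function-calling client such as Open WebUI would send it.
payload = {
    "model": "llama",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                },
            },
        }
    ],
}

body = json.dumps(payload)
print(body)
```

POSTing a body like this to `/v1/chat/completions` (for example with curl) should now produce an HTTP 400 JSON error instead of a dead server process.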