Commit 11fc254
**Description**
This PR addresses limitations when evaluating models served via remote
OpenAI-compatible endpoints (e.g., vLLM deployed on cloud GPU clusters,
RunPod, or behind enterprise gateways).
Previously, the handler assumed a rigid host:port structure and lacked
authentication support. Additionally, when connecting to a remote
endpoint, the handler would fail to load the tokenizer if it tried to
access a path that only exists on the remote server.
**Key Changes**
Custom Authentication & Routing:
Added support for REMOTE_OPENAI_BASE_URL to allow full control over the
endpoint URL (resolves issues with SSL/HTTPS and custom sub-paths).
Added support for REMOTE_OPENAI_API_KEY to enable authentication for
secured endpoints.
Remote Tokenizer Support:
Added REMOTE_OPENAI_TOKENIZER_PATH. Since the OSSHandler needs to load
the tokenizer locally for prompt formatting, this variable allows users
to point to a local Hugging Face path or model ID, preventing OSError
when the handler tries to load a non-existent local path derived from
the remote server configuration.
Documentation:
Updated .env.example and README.md to document these new configuration
options.
**Related Issue** Fixes #1280
**Type of Change**
- [x] New feature (non-breaking change which adds functionality)
- [x] Documentation update
**Checklist**
- [x] My code follows the style guidelines of this project
- [x] I have performed a self-review of my own code
- [x] I have updated the documentation accordingly
- [x] Existing local server setups remain backward compatible
---------
Co-authored-by: Dawei DW12 Zhang <zhangdw12@Lenovo.com>
Co-authored-by: zhangdw <zhangdw.cs@gmail.com>
1 parent 9b8a520 commit 11fc254
File tree
3 files changed
+42
-5
lines changed- berkeley-function-call-leaderboard
- bfcl_eval
- model_handler/local_inference
3 files changed
+42
-5
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
245 | 245 | | |
246 | 246 | | |
247 | 247 | | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
248 | 256 | | |
249 | 257 | | |
250 | 258 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
47 | 53 | | |
48 | | - | |
| 54 | + | |
Lines changed: 27 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
42 | 42 | | |
43 | 43 | | |
44 | 44 | | |
45 | | - | |
46 | | - | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
47 | 50 | | |
48 | 51 | | |
49 | 52 | | |
| |||
111 | 114 | | |
112 | 115 | | |
113 | 116 | | |
114 | | - | |
115 | | - | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
116 | 139 | | |
117 | 140 | | |
118 | 141 | | |
| |||
0 commit comments