Commit 2f654ad
[serve] Downgrade multiplex model loading/unloading logs from INFO to DEBUG (#60985)
## Why are these changes needed?
The "Loading model '...'" / "Successfully loaded model '...'" /
"Unloading model '...'" / "Successfully unloaded model '...'" messages
fire on every request when using multiplexed model IDs (e.g.,
session-aware routing). At high concurrency this floods stdout with
hundreds of lines per second, drowning out application logs.
Since model loading/unloading metrics are already tracked via counters
(`models_load_counter`, `models_unload_counter`) and histograms
(`model_load_latency_ms`, `model_unload_latency_ms`), these log lines
are only useful for debugging. Downgrade to DEBUG.
---------
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Lonnie Liu <95255098+aslonnie@users.noreply.github.com>1 parent b14f098 commit 2f654ad
1 file changed
+4
-4
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
212 | 212 | | |
213 | 213 | | |
214 | 214 | | |
215 | | - | |
| 215 | + | |
216 | 216 | | |
217 | 217 | | |
218 | 218 | | |
| |||
222 | 222 | | |
223 | 223 | | |
224 | 224 | | |
225 | | - | |
| 225 | + | |
226 | 226 | | |
227 | 227 | | |
228 | 228 | | |
| |||
242 | 242 | | |
243 | 243 | | |
244 | 244 | | |
245 | | - | |
| 245 | + | |
246 | 246 | | |
247 | 247 | | |
248 | 248 | | |
| |||
254 | 254 | | |
255 | 255 | | |
256 | 256 | | |
257 | | - | |
| 257 | + | |
258 | 258 | | |
259 | 259 | | |
260 | 260 | | |
0 commit comments