Commit 54d13bb

docs: Add and expand guides for CLI, config, and data formats
Created new guides for configuration options (environment variables, `.env` files) and supported data formats, and updated the main CLI reference to link to the new guides.
1 parent 79d91d3 commit 54d13bb

3 files changed (+106, -8 lines)

docs/guides/cli.md

Lines changed: 25 additions & 5 deletions
@@ -4,14 +4,34 @@ This page provides a reference for the `guidellm` command-line interface. For mo

## `guidellm benchmark run`

-This command is the primary entrypoint for running benchmarks.
+This command is the primary entrypoint for running benchmarks. It has many options that can be specified on the command line or in a scenario file.

-### Target Configuration
+### Scenario Configuration
+
+| Option | Description |
+| --- | --- |
+| `--scenario <PATH or NAME>` | The name of a builtin scenario or path to a scenario configuration file. Options specified on the command line will override the scenario file. |
+
+### Target and Backend Configuration

These options configure how `guidellm` connects to the system under test.

| Option | Description |
| --- | --- |
-| `--target <URL>` | **Required.** The endpoint of the target system, e.g., `http://localhost:8080`. |
-| `--target-header <HEADER>` | A header to send with requests to the target. This option can be specified multiple times to send multiple headers. The header should be in the format `"Header-Name: Header-Value"`. For example: `--target-header "Authorization: Bearer my-secret-token"` |
-| `--target-skip-ssl-verify` | A flag to disable SSL certificate verification when connecting to the target. This is useful for development environments with self-signed certificates, but should be used with caution in production. |
+| `--target <URL>` | **Required.** The endpoint of the target system, e.g., `http://localhost:8080`. Can also be set with the `GUIDELLM__OPENAI__BASE_URL` environment variable. |
+| `--target-header <HEADER>` | A header to send with requests to the target. Can be specified multiple times. Example: `--target-header "Authorization: Bearer my-secret-token"`. |
+| `--target-skip-ssl-verify` | A flag to disable SSL certificate verification when connecting to the target. |
+| `--backend-type <TYPE>` | The type of backend to use. Defaults to `openai_http`. |
+| `--model <NAME>` | The ID of the model to benchmark within the backend. |
+
+### Data and Request Configuration
+
+These options define the data to be used for benchmarking and how requests will be generated.
+
+| Option | Description |
+| --- | --- |
+| `--data <SOURCE>` | The data source. This can be a HuggingFace dataset ID, a path to a local data file, or a synthetic data configuration. See the [Data Formats Guide](./data_formats.md) for more details. |
+| `--rate-type <TYPE>` | The type of request generation strategy to use (e.g., `constant`, `poisson`, `sweep`). |
+| `--rate <NUMBER>` | The rate of requests per second for `constant` or `poisson` strategies, or the number of steps for a `sweep`. |
+| `--max-requests <NUMBER>` | The maximum number of requests to run for each benchmark. |
+| `--max-seconds <NUMBER>` | The maximum number of seconds to run each benchmark for. |
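Putting the options above together, a minimal sketch of an invocation might look like the following. The endpoint, token, model ID, and data file are placeholders chosen for illustration; only the flags themselves come from the tables in this guide.

```bash
# Placeholders throughout; the flags are the ones documented in docs/guides/cli.md.
guidellm benchmark run \
  --target "http://localhost:8080" \
  --target-header "Authorization: Bearer my-secret-token" \
  --model "my-model" \
  --data "./prompts.jsonl" \
  --rate-type constant \
  --rate 5 \
  --max-requests 1000
```

The same settings could instead be collected in a scenario file and passed with `--scenario`, with any flags given on the command line overriding the values from the file.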

docs/guides/configuration.md

Lines changed: 19 additions & 3 deletions
@@ -19,24 +19,40 @@ For example, to set the `api_key` for the `openai` backend, you would use the fo
export GUIDELLM__OPENAI__API_KEY="your-api-key"
```

-### Target Configuration
+### Target and Backend Configuration

You can configure the connection to the target system using environment variables. This is an alternative to using the `--target-*` command-line flags.

| Environment Variable | Description | Example |
| --- | --- | --- |
-| `GUIDELLM__OPENAI__HEADERS` | A JSON string representing a dictionary of headers to send to the target. These headers will override any default headers (like `Authorization` from `api_key`). | `export GUIDELLM__OPENAI__HEADERS='{"Authorization": "Bearer my-token", "X-Custom-Header": "value"}'` |
+| `GUIDELLM__OPENAI__BASE_URL` | The endpoint of the target system. Equivalent to the `--target` CLI option. | `export GUIDELLM__OPENAI__BASE_URL="http://localhost:8080"` |
+| `GUIDELLM__OPENAI__API_KEY` | The API key to use for bearer token authentication. | `export GUIDELLM__OPENAI__API_KEY="your-secret-api-key"` |
+| `GUIDELLM__OPENAI__BEARER_TOKEN` | The full bearer token to use for authentication. | `export GUIDELLM__OPENAI__BEARER_TOKEN="Bearer your-secret-token"` |
+| `GUIDELLM__OPENAI__HEADERS` | A JSON string representing a dictionary of headers to send to the target. These headers will override any default headers. | `export GUIDELLM__OPENAI__HEADERS='{"Authorization": "Bearer my-token"}'` |
| `GUIDELLM__OPENAI__ORGANIZATION` | The OpenAI organization to use for requests. | `export GUIDELLM__OPENAI__ORGANIZATION="org-12345"` |
| `GUIDELLM__OPENAI__PROJECT` | The OpenAI project to use for requests. | `export GUIDELLM__OPENAI__PROJECT="proj-67890"` |
| `GUIDELLM__OPENAI__VERIFY_SSL` | Set to `false` or `0` to disable SSL certificate verification. | `export GUIDELLM__OPENAI__VERIFY_SSL=false` |
+| `GUIDELLM__OPENAI__MAX_OUTPUT_TOKENS` | The default maximum number of tokens to request for completions. | `export GUIDELLM__OPENAI__MAX_OUTPUT_TOKENS=2048` |
+
+### General HTTP Settings
+
+These settings control the behavior of the underlying HTTP client.
+
+| Environment Variable | Description |
+| --- | --- |
+| `GUIDELLM__REQUEST_TIMEOUT` | The timeout in seconds for HTTP requests. Defaults to 300. |
+| `GUIDELLM__REQUEST_HTTP2` | Set to `true` or `1` to enable HTTP/2 support. Defaults to true. |
+| `GUIDELLM__REQUEST_FOLLOW_REDIRECTS` | Set to `true` or `1` to allow the client to follow redirects. Defaults to true. |

### Using a `.env` file

You can also place these variables in a `.env` file in your project's root directory:

```dotenv
# .env file
+GUIDELLM__OPENAI__BASE_URL="http://localhost:8080"
GUIDELLM__OPENAI__API_KEY="your-api-key"
GUIDELLM__OPENAI__HEADERS='{"Authorization": "Bearer my-token"}'
GUIDELLM__OPENAI__VERIFY_SSL=false
```
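As a quick illustration of how these variables combine with the CLI, here is a hedged sketch. The endpoint, key, and model ID are placeholders; the variable names are exactly those documented in the tables above, and `GUIDELLM__OPENAI__BASE_URL` stands in for the `--target` flag.

```bash
# Placeholder values; variable names come from the configuration guide above.
export GUIDELLM__OPENAI__BASE_URL="http://localhost:8080"   # equivalent to --target
export GUIDELLM__OPENAI__API_KEY="your-api-key"
export GUIDELLM__REQUEST_TIMEOUT=120                        # override the 300 s default
export GUIDELLM__REQUEST_HTTP2=false                        # fall back to HTTP/1.1 for this run

guidellm benchmark run --model "my-model" --data "prompt_tokens=256,output_tokens=128"
```

The same values could live in a `.env` file instead of the shell environment, as shown in the guide.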

docs/guides/data_formats.md

Lines changed: 62 additions & 0 deletions
@@ -0,0 +1,62 @@

# Data Formats

The `--data` argument for the `guidellm benchmark run` command accepts several different formats for specifying the data to be used for benchmarking.

## Local Data Files

You can provide a path to a local data file in one of the following formats:

- **CSV (.csv)**: A comma-separated values file. The loader will attempt to find a column with a common name for the prompt (e.g., `prompt`, `text`, `instruction`).
- **JSON (.json)**: A JSON file. The structure should be a list of objects, where each object represents a row of data.
- **JSON Lines (.jsonl)**: A file where each line is a valid JSON object.
- **Text (.txt)**: A plain text file, where each line is treated as a separate prompt.

If the prompt column cannot be automatically determined, you can specify it using the `--data-args` option:

```bash
--data-args '{"text_column": "my_custom_prompt_column"}'
```
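For instance, a small JSON Lines file and the matching invocation might look like the sketch below. The file name and the `question` column are invented for this example, which is exactly the situation where `--data-args` is needed; the target URL is a placeholder as well.

```bash
# prompts.jsonl (hypothetical) -- one JSON object per line, e.g.:
#   {"question": "What is the capital of France?"}
#   {"question": "Summarize the plot of Hamlet."}
# Point --data at the file and name the prompt column explicitly:
guidellm benchmark run \
  --target "http://localhost:8080" \
  --data "./prompts.jsonl" \
  --data-args '{"text_column": "question"}'
```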
## Synthetic Data

You can generate synthetic data on the fly by providing a configuration string or file.

### Configuration Options

| Parameter | Description |
| --- | --- |
| `prompt_tokens` | **Required.** The average number of tokens for the generated prompts. |
| `output_tokens` | **Required.** The average number of tokens for the generated outputs. |
| `samples` | The total number of samples to generate. Defaults to 1000. |
| `source` | The source text to use for generating the synthetic data. Defaults to a built-in copy of "Pride and Prejudice". |
| `prompt_tokens_stdev` | The standard deviation of the tokens generated for prompts. |
| `prompt_tokens_min` | The minimum number of text tokens generated for prompts. |
| `prompt_tokens_max` | The maximum number of text tokens generated for prompts. |
| `output_tokens_stdev` | The standard deviation of the tokens generated for outputs. |
| `output_tokens_min` | The minimum number of text tokens generated for outputs. |
| `output_tokens_max` | The maximum number of text tokens generated for outputs. |

### Configuration Formats

You can provide the synthetic data configuration in one of three ways:

1. **Key-Value String:**

   ```bash
   --data "prompt_tokens=256,output_tokens=128,samples=500"
   ```

2. **JSON String:**

   ```bash
   --data '{"prompt_tokens": 256, "output_tokens": 128, "samples": 500}'
   ```

3. **YAML or Config File:**

   Create a file (e.g., `my_config.yaml`):

   ```yaml
   prompt_tokens: 256
   output_tokens: 128
   samples: 500
   ```

   And use it with the `--data` argument:

   ```bash
   --data my_config.yaml
   ```
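To tie this guide back to the CLI reference, a final hedged sketch: reuse the `my_config.yaml` defined above and drive it with the rate options documented in `docs/guides/cli.md`. The endpoint and the benchmark limits are placeholders.

```bash
# Endpoint and limits are placeholders; my_config.yaml is the file defined above.
guidellm benchmark run \
  --target "http://localhost:8080" \
  --data my_config.yaml \
  --rate-type sweep \
  --rate 10 \
  --max-seconds 60
```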
