Skip to content

Commit 2d12c28

Browse files
Merge pull request #220 from microsoft/update-model-capacity-similar-to-bicep
fix: updated model capacity to minimum 100
2 parents aaf336a + 4cf3d45 commit 2d12c28

File tree

2 files changed

+6
-6
lines changed

2 files changed

+6
-6
lines changed

docs/quota_check.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
## Check Quota Availability Before Deployment
22

33
Before deploying the accelerator, **ensure sufficient quota availability** for the required model.
4-
> **For Global Standard | GPT-4o - the capacity to at least 30K tokens for optimal performance.**
4+
> **For Global Standard | GPT-4o - the capacity to at least 100K tokens for optimal performance.**
55
66
### Login if you have not done so already
77
```
@@ -11,7 +11,7 @@ azd auth login
1111

1212
### 📌 Default Models & Capacities:
1313
```
14-
gpt-4o:30
14+
gpt-4o:100
1515
```
1616
### 📌 Default Regions:
1717
```
@@ -37,19 +37,19 @@ eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southc
3737
```
3838
✔️ Check specific model(s) in default regions:
3939
```
40-
./quota_check_params.sh --models gpt-4o:30
40+
./quota_check_params.sh --models gpt-4o:100
4141
```
4242
✔️ Check default models in specific region(s):
4343
```
4444
./quota_check_params.sh --regions eastus,westus
4545
```
4646
✔️ Passing Both models and regions:
4747
```
48-
./quota_check_params.sh --models gpt-4o:30 --regions eastus,westus2
48+
./quota_check_params.sh --models gpt-4o:100 --regions eastus,westus2
4949
```
5050
✔️ All parameters combined:
5151
```
52-
./quota_check_params.sh --models gpt-4:30 --regions eastus,westus --verbose
52+
./quota_check_params.sh --models gpt-4:100 --regions eastus,westus --verbose
5353
```
5454

5555
### **Sample Output**

infra/scripts/quota_check_params.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ log_verbose() {
4747
}
4848

4949
# Default Models and Capacities (Comma-separated in "model:capacity" format)
50-
DEFAULT_MODEL_CAPACITY="gpt-4o:30"
50+
DEFAULT_MODEL_CAPACITY="gpt-4o:100"
5151
# Convert the comma-separated string into an array
5252
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"
5353

0 commit comments

Comments
 (0)