-If you are unsure which chat model to use, we currently recommend Mistral Small 3.1 24B Instruct (`mistral-small-3.1-24b-instruct-2503`) to get started.
+If you are unsure which chat model to use, we currently recommend Mistral Small 3.2 24B Instruct (`mistral-small-3.2-24b-instruct-2506`) to get started.
 </Message>

 ## Vision models
@@ -61,7 +62,8 @@ Deprecated models should not be queried anymore. We recommend to use newer model
 | Provider | Model string | End of Life (EOL) date
 |`bge-multilingual-gemma2`| No | No | English, French, Chinese, Japanese, Korean |
 |`sentence-t5-xxl`| No | No | English |
@@ -98,6 +104,22 @@ google/gemma-3-27b-it:bf16
 - Pan & Scan is not yet supported for Gemma 3 images. High-resolution images are currently resized to 896x896, which may introduce artifacts and reduce accuracy.

+### Mistral-small-3.2-24b-instruct-2506
+Mistral-small-3.2-24b-instruct-2506 is an improved version of Mistral-small-3.1 that performs better on tool calling.
+This model was optimized for dense knowledge and faster token throughput relative to its size.
 Mistral-small-3.1-24b-instruct-2503 is a model developed by Mistral to perform text processing and image analysis in many languages.
 This model was optimized for dense knowledge and faster token throughput relative to its size.
@@ -112,6 +134,7 @@ This model was optimized to have a dense knowledge and faster tokens throughput
 #### Model names
 ```
 mistral/mistral-small-3.1-24b-instruct-2503:bf16
+mistral/mistral-small-3.1-24b-instruct-2503:fp8
 ```

 - Bitmap (or raster) image formats, which store images as grids of individual pixels, are supported. Vector image formats (SVG, PSD) are not supported, nor are PDFs or videos.
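The model names in this diff all appear to follow a `provider/model:quantization` pattern (for example `:bf16` and `:fp8` variants of the same model). The sketch below splits such a string into its parts. The pattern is inferred from the names shown here rather than from a stated specification, and `ModelName` and `parse_model_name` are hypothetical helper names.

```python
from typing import NamedTuple, Optional


class ModelName(NamedTuple):
    """Parts of a model string (hypothetical helper type)."""
    provider: str
    model: str
    quantization: Optional[str]


def parse_model_name(value: str) -> ModelName:
    """Split 'provider/model[:quantization]' into its components.

    The layout is an assumption inferred from names such as
    'mistral/mistral-small-3.1-24b-instruct-2503:fp8'.
    """
    provider, slash, rest = value.partition("/")
    if not slash or not provider or not rest:
        raise ValueError(f"expected 'provider/model[:quantization]', got {value!r}")
    # Everything after the first ':' is treated as the quantization suffix.
    model, _, quantization = rest.partition(":")
    if not model:
        raise ValueError(f"missing model name in {value!r}")
    return ModelName(provider, model, quantization or None)


if __name__ == "__main__":
    for name in (
        "mistral/mistral-small-3.1-24b-instruct-2503:bf16",
        "mistral/mistral-small-3.1-24b-instruct-2503:fp8",
    ):
        print(parse_model_name(name))
```

Grouping parsed names by `(provider, model)` is one way to list the quantization variants available for a given model.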
@@ -147,16 +170,31 @@ allenai/molmo-72b-0924:fp8
 ## Text models

-### Qwen3-235b-a22b-instruct-2507
+### Gpt-oss-120b
 Released July 23, 2025, Qwen 3 235B A22B is an open-weight model that is competitive with Gemini 2.5 Pro and GPT-4.5 on multiple benchmarks (such as [LM Arena for text use cases](https://lmarena.ai/leaderboard)).

 | Attribute | Value |
 |-----------|-------|
 | Supports parallel tool calling | Yes |

+
+
 #### Model name
 ```
-qwen/qwen3-235b-a22b-instruct-2507:awq
+openai/gpt-oss-120b:fp4
+```
+
+### Gpt-oss-120b
+Released August 5, 2025, GPT OSS 120B is an open-weight model offering high throughput and strong reasoning capabilities.
+Currently, this model should be used through the Responses API, as the Chat Completions API does not yet support tool calling for this model.
+
+| Attribute | Value |
+|-----------|-------|
+| Supports parallel tool calling | Yes |
+
+#### Model name
+```
+openai/gpt-oss-120b:fp4
 ```

 ### Llama-3.3-70b-instruct
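The Gpt-oss-120b note in this hunk recommends the Responses API for tool calling. As a rough illustration, the sketch below only builds the JSON body of such a request and sends nothing over the network. The field names (`model`, `input`, `tools`, and the flat function-tool shape) follow the OpenAI Responses API. Whether the endpoint expects the exact model string shown here is an assumption, and `get_weather`, `function_tool`, and `build_responses_request` are hypothetical names.

```python
import json


def function_tool(name, description, parameters):
    """Describe one function tool in the flat shape used by the Responses API."""
    return {
        "type": "function",
        "name": name,
        "description": description,
        "parameters": parameters,  # JSON Schema describing the arguments
    }


def build_responses_request(model, user_input, tools=()):
    """Build a request body for a Responses-style endpoint (no network call)."""
    body = {"model": model, "input": user_input}
    if tools:
        body["tools"] = list(tools)
    return body


# Hypothetical tool, used only to illustrate the payload shape.
weather_tool = function_tool(
    "get_weather",
    "Return the current weather for a city.",
    {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
)

# The model string is copied from the diff; the value a given deployment
# expects may differ (assumption).
request_body = build_responses_request(
    "openai/gpt-oss-120b:fp4",
    "What is the weather in Paris?",
    [weather_tool],
)

if __name__ == "__main__":
    print(json.dumps(request_body, indent=2))
```

The resulting dictionary could then be posted to the Responses endpoint with any HTTP client or passed as keyword arguments to an OpenAI-compatible SDK.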
@@ -333,6 +371,19 @@ kyutai/moshika-0.1-8b:fp8
 ## Code models

+### Qwen3-coder-30b-a3b-instruct
+Qwen3-coder is an improved version of Qwen2.5 with better accuracy and throughput.
+Thanks to its a3b architecture, only a subset of its weights is activated for a given generation, which makes input and output token processing much faster and suits code completion.
+
+| Attribute | Value |
+|-----------|-------|
+| Supports parallel tool calling | Yes |
+
+#### Model name
+```
+qwen/qwen3-coder-30b-a3b-instruct:fp8
+```
+
 ### Qwen2.5-coder-32b-instruct
 Qwen2.5-coder is your intelligent programming assistant familiar with more than 40 programming languages.
 With Qwen2.5-coder deployed at Scaleway, your company can benefit from code generation, AI-assisted code repair, and code reasoning.
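The Qwen3-coder description above says that only a subset of the weights is activated for a given generation. One common way to achieve this is top-k routing over a set of expert sub-networks. The toy sketch below illustrates that general idea in plain Python. It is not taken from the model's implementation: the expert functions, the gate logits, and the names `route_top_k` and `moe_forward` are all hypothetical.

```python
import math


def route_top_k(gate_logits, k):
    """Return {expert_index: weight} for the k highest-scoring experts."""
    if not 0 < k <= len(gate_logits):
        raise ValueError("k must be between 1 and the number of experts")
    # Indices of the k largest logits (ties keep their original order).
    top = sorted(range(len(gate_logits)), key=lambda i: gate_logits[i], reverse=True)[:k]
    # Softmax over the selected logits only, shifted by the max for stability.
    peak = max(gate_logits[i] for i in top)
    exps = {i: math.exp(gate_logits[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


def moe_forward(x, experts, gate_logits, k):
    """Evaluate only the k routed experts and mix their outputs."""
    weights = route_top_k(gate_logits, k)
    # Experts outside the top k are never called, which is where the
    # compute saving comes from.
    return sum(w * experts[i](x) for i, w in weights.items())


if __name__ == "__main__":
    # Four toy "experts"; real experts would be feed-forward blocks.
    experts = [lambda x: x, lambda x: 2 * x, lambda x: 3 * x, lambda x: 4 * x]
    print(moe_forward(1.0, experts, [0.0, 5.0, 1.0, 5.0], k=2))
```

With `k` much smaller than the number of experts, the cost per token scales with the active experts rather than with the full parameter count.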