[Education] Offline benchmark performance of Qwen3-0.6B on MLX (CPU) and Modal (GPU) #40

lamng3 · 2025-12-23T09:35:59Z

Offline benchmark performance of Qwen3-0.6B on MLX (CPU) and Modal (GPU)

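This PR compares offline benchmark performance of the Qwen3-0.6B model on two platforms: GPU inference on Modal (A10G) and CPU inference with MLX (Apple Silicon M1). The MLX run uses a 4-bit quantized checkpoint and the two runs report different token totals, so the throughput figures below are indicative rather than a like-for-like comparison.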

Modal

GPU: A10G
Model: Qwen/Qwen3-0.6B
Total: 133966 tok
Time: 44.10 s
Throughput: 3037.56 tok/s

MLX

CPU: Apple Silicon M1
Model: mlx-community/Qwen3-0.6B-4bit
Total: 140435 tok
Time: 1017.77 s
Throughput: 137.98 tok/s
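
As a sanity check on the figures above, here is a minimal sketch that recomputes throughput as total tokens divided by wall-clock time and derives the Modal-to-MLX ratio. The dictionary keys and function names are illustrative and are not taken from the benchmark scripts in this PR; the token counts and times are copied from the results above. Because the reported times are rounded to two decimals, the recomputed values may differ slightly from the reported throughput.

```python
# Figures copied from the Modal and MLX results reported in this PR.
# The key names below are hypothetical labels, not identifiers from the repo.
RESULTS = {
    "modal_a10g": {"tokens": 133966, "seconds": 44.10},
    "mlx_m1": {"tokens": 140435, "seconds": 1017.77},
}


def throughput(tokens: int, seconds: float) -> float:
    """Return tokens per second for one benchmark run."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return tokens / seconds


def speedup(fast: dict, slow: dict) -> float:
    """Return the ratio of the two runs' throughputs (fast / slow)."""
    return throughput(**fast) / throughput(**slow)


if __name__ == "__main__":
    for name, run in RESULTS.items():
        print(f"{name}: {throughput(**run):.2f} tok/s")
    ratio = speedup(RESULTS["modal_a10g"], RESULTS["mlx_m1"])
    print(f"Modal / MLX throughput ratio: {ratio:.1f}x")
```

On these numbers the ratio comes out at roughly 22x in favor of the A10G run. It should be read with the caveats already visible in the results: the two runs processed different token totals, and the MLX run used a 4-bit quantized checkpoint.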

Signed-off-by: ltn18 <[email protected]>

lamng3 added 12 commits December 22, 2025 23:47

setup modal GPU image run; benchmark offline tested with A100

6e518b5

Signed-off-by: ltn18 <[email protected]>

Add Modal benchmark setup for profiling prepare_for_replay()

3e52745

update

5198aa4

Signed-off-by: ltn18 <[email protected]>

running output

87a7a50

Signed-off-by: ltn18 <[email protected]>

update README

3f00e6e

Signed-off-by: ltn18 <[email protected]>

updates

8d44132

Signed-off-by: ltn18 <[email protected]>

update

146680f

Signed-off-by: ltn18 <[email protected]>

move to tests

c603138

Signed-off-by: ltn18 <[email protected]>

rerun mlx

115ba0d

Signed-off-by: ltn18 <[email protected]>

Delete .gitignore

d46f62b

Delete uv.lock

ab843a8

Update README.md

5b6471d
