Skip to content

Commit 6569f8d

Browse files
authored
Add DataFrame Service sizing example for DFS3 (#303)
* Add row for DFS3 * An M80 is actually what we were recommending here * 60% of the DFS3 tables have 1M rows * Break table shapes out into a separate table
1 parent 750765a commit 6569f8d

File tree

2 files changed

+67
-6
lines changed

2 files changed

+67
-6
lines changed

getting-started/templates/sizing-examples/dataframe-service/README.md

Lines changed: 23 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -28,9 +28,25 @@ file.
2828

2929
## Scale Profiles
3030

31-
| Scale identifier | Peak concurrent writers | Peak tables ingested per hour | Table shape mix | Peak query concurrency | Total tables ingested per year |
32-
| ---------------- | ----------------------- | ----------------------------- | ---------------------------------------------------------------------- | ---------------------- | ------------------------------ |
33-
| DFS1 | 10 | 10 | 100,000 rows and 25 columns - 80%, 1,000,000 rows and 25 columns - 20% | 5 | 10,000 |
31+
| Scale identifier | Peak concurrent writers | Peak tables ingested per hour | Table shape mix | Peak query concurrency | Total tables ingested per year |
32+
| ---------------- | ----------------------- | ----------------------------- | ------------------------------------------- | ---------------------- | ------------------------------ |
33+
| DFS1 | 10 | 10 | See [DFS1 Table Shapes](#dfs1-table-shapes) | 5 | 10,000 |
34+
| DFS3 | 100 | 800 | See [DFS3 Table Shapes](#dfs3-table-shapes) | 20 | 1,000,000 |
35+
36+
### DFS1 Table Shapes
37+
38+
| Rows | Columns | Share |
39+
| --------- | ------- | ----- |
40+
| 100,000 | 25 | 80% |
41+
| 1,000,000 | 25 | 20% |
42+
43+
### DFS3 Table Shapes
44+
45+
| Rows | Columns | Share |
46+
| ----------- | ------- | ----- |
47+
| 100,000 | 100 | 20% |
48+
| 1,000,000 | 100 | 60% |
49+
| 500,000,000 | 100 | 20% |
3450

3551
### Assumptions
3652

@@ -59,6 +75,7 @@ of scale. This table also specifies the AWS node types. If your system runs on
5975
Azure or on-premises hardware, approximate the specs of the AWS nodes as closely
6076
as possible. Network and storage bandwidth are very important for scaling.
6177

62-
| Scale identifier | Number of nodes | Node type | Storage per node (GB) | Dedicated MongoDB instance recommended? | Recommended database resources | Anticipated average data storage growth per hour | YAML resource template |
63-
| ---------------- | ------------------------------------------- | ------------------------------------------------- | ------------------------------------------------ | --------------------------------------- | ------------------------------ | ------------------------------------------------ | ---------------------------------------- |
64-
| DFS1 | 1 (general node pool), 4 (Dremio node pool) | `m6a.2xlarge` (general), `m5ad.4xlarge` (Dremio)` | 0 (general node pool), 356 GB (Dremio node pool) | Yes | Atlas M30 or equivalent | 0.4 GiB (S3), 0.19 GiB (MongoDB) | [`dfs1-values.yaml`](./dfs1-values.yaml) |
78+
| Scale identifier | Number of nodes | Node type | Storage per node (GiB) | Dedicated MongoDB instance recommended? | Recommended database resources | Anticipated average data storage growth per hour | YAML resource template |
79+
| ---------------- | -------------------------------------------- | ------------------------------------------------- | ------------------------------------------------- | --------------------------------------- | ------------------------------ | ------------------------------------------------ | ---------------------------------------- |
80+
| DFS1 | 1 (general node pool), 4 (Dremio node pool) | `m6a.2xlarge` (general), `m5ad.4xlarge` (Dremio)` | 0 (general node pool), 356 GiB (Dremio node pool) | Yes | Atlas M30 or equivalent | 46.8 MiB (S3), 14.01 KiB (MongoDB) | [`dfs1-values.yaml`](./dfs1-values.yaml) |
81+
| DFS3 | 5 (general node pool), 15 (Dremio node pool) | `m6a.2xlarge` (general), `m5ad.8xlarge` (Dremio)` | 0 (general node pool), 500 GiB (Dremio node pool) | Yes | Atlas M80 or equivalent | 315.6 GiB (S3), 5.47 MiB (MongoDB) | [`dfs3-values.yaml`](./dfs3-values.yaml) |
Lines changed: 44 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,44 @@
1+
dataframeservice:
2+
replicaCount: 12
3+
autoscaling:
4+
enabled: false
5+
logLevel: "warning"
6+
icebergOperations:
7+
tableWritesPerCompaction: 25
8+
9+
queryEngine:
10+
workloadManagement:
11+
highCostUserQueriesQueue:
12+
concurrencyLimit: 24
13+
writeQueue:
14+
concurrencyLimit: 20
15+
optimizeQueue:
16+
concurrencyLimit: 20
17+
18+
nessie:
19+
replicaCount: 4
20+
autoscaling:
21+
enabled: false
22+
logLevel: "warning"
23+
24+
sldremio:
25+
coordinator:
26+
cpu: 4
27+
memory: 16384
28+
volumeSize: 150Gi
29+
executor:
30+
count: 10
31+
cpu: 22
32+
memory: 122800
33+
volumeSize: 400Gi
34+
engines:
35+
- default
36+
- iceberg
37+
engineOverride:
38+
iceberg:
39+
count: 5
40+
cpu: 22
41+
memory: 68719
42+
heapMemoryOverride: 40000
43+
directMemoryOverride: 20000
44+
volumeSize: 20Gi

0 commit comments

Comments
 (0)