Skip to content

Commit 316719d

Browse files
kszucslhoestq
andauthored
post: Parquet Content-Defined Chunking (#2987)
* post: Parquet Content-Defined Chunking * update the pyarrow documentation link to point to the latest release * address review comments * Update date in _blog.yml Co-authored-by: Quentin Lhoest <[email protected]> --------- Co-authored-by: Quentin Lhoest <[email protected]>
1 parent 5f88ef0 commit 316719d

File tree

3 files changed

+731
-0
lines changed

3 files changed

+731
-0
lines changed

_blog.yml

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6427,3 +6427,16 @@
64276427
- multimodal
64286428
- open-source
64296429
- benchmark
6430+
6431+
- local: parquet-cdc
6432+
title: "Parquet Content-Defined Chunking"
6433+
author: kszucs
6434+
date: July 25, 2025
6435+
tags:
6436+
- data
6437+
- datasets
6438+
- dedupe
6439+
- hub
6440+
- parquet
6441+
- storage
6442+
- xet

assets/parquet-cdc/thumbnail.png

349 KB
Loading

0 commit comments

Comments
 (0)