Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
createTables.sql	createTables.sql
populate.sql	populate.sql
query1.sql	query1.sql
query2.sql	query2.sql
query3.sql	query3.sql
query4.sql	query4.sql
query5.sql	query5.sql
query6.sql	query6.sql
query7.sql	query7.sql

Name

Last commit message

Last commit date

ClickHouse as a Stream Processor - Part 1: SQL Logic With Static Data

Description

Example ClickHouse SQL code for the blog "All I want for Christmas is...another real-time stream processing technology — ClickHouse!"

This is an experiment using ClickHouse as a stream processing system instead of Kafka Streams or RisingWave. It implements the stream processing logic from the earlier blog An Apache Kafka and RisingWave Stream Processing Christmas Special.

This first part focuses on the SQL logic (joins and time windows) using static data. A follow-up part will integrate with Kafka topics for a complete real-time pipeline.

Getting Started

Set up a ClickHouse instance (see Prerequisites).

Connect to ClickHouse using the client:

clickhouse-client --host <your-host> --port 9000 --user <user> --password <password>

Run the SQL scripts in order:

createTables.sql  - Creates the 'toys' and 'boxes' tables
populate.sql      - Inserts 300 sample rows per table with random data
query1.sql        - Basic JOIN matching toys to boxes by type (no time window)
query2.sql        - JOIN with 60-second time window constraint
query3.sql        - Ranked matches selecting the closest box per toy type
query4.sql        - Same as query3 filtered to recent data only (last 60 seconds)
query5.sql        - Tumble window function for fixed 60-second windows
query6.sql        - Tumble window with start/end time boundaries
query7.sql        - Hop (sliding) window function with 60s window and 30s hop

Prerequisites

ClickHouse - Either open source ClickHouse installed locally or NetApp Instaclustr managed ClickHouse
A ClickHouse client (CLI or GUI) for executing SQL queries

Deployment

No special deployment required. These are standalone SQL scripts that run directly against any ClickHouse instance.

For creating an Instaclustr managed ClickHouse cluster, see the ClickHouse documentation.

Authors

Paul Brebner - Initial work - NetApp Instaclustr

See also the list of MAINTAINERS who participated in projects in this repository.

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

ClickHouse as a Stream Processor - Part 1: SQL Logic With Static Data

Description

Getting Started

Prerequisites

Deployment

Authors

License

FilesExpand file tree

part 1

Directory actions

More options

Directory actions

More options

Latest commit

History

part 1

Folders and files

parent directory

README.md

ClickHouse as a Stream Processor - Part 1: SQL Logic With Static Data

Description

Getting Started

Prerequisites

Deployment

Authors

License