graphops/substreams-sink-mq

Substreams Message Queue Sinks

A production-ready toolkit for streaming Substreams output into message queues. The repository contains a shared core library plus two specialized sinks: NATS JetStream and Kafka/Redpanda. Fanout routing, protobuf analysis, and deployment automation are delivered through a unified Python toolchain.

Highlights

  • Separate sink implementations for NATS and Redpanda/Kafka
  • Universal protobuf analysis with semantic type support
  • Dynamic fanout routing and topic/subject generation
  • Integration-ready message envelopes for downstream consumers
  • Comprehensive Python and Rust test suites
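
The "integration-ready message envelope" mentioned above could look like the sketch below. The field names and JSON serialization here are assumptions for illustration, not the sinks' actual wire format:

```python
import json
from dataclasses import dataclass, asdict

@dataclass
class MessageEnvelope:
    """Hypothetical envelope a sink could wrap around each Substreams output."""
    block_number: int
    block_hash: str
    module: str          # Substreams module that produced the payload
    payload_type: str    # fully qualified protobuf type name
    payload_b64: str     # base64-encoded protobuf bytes

def encode(envelope: MessageEnvelope) -> bytes:
    # Serialize as JSON so downstream consumers can read block metadata
    # without needing the protobuf schema.
    return json.dumps(asdict(envelope)).encode("utf-8")

def decode(raw: bytes) -> MessageEnvelope:
    return MessageEnvelope(**json.loads(raw))
```

Carrying block number and hash alongside the payload lets consumers detect gaps and reorgs without decoding the protobuf body.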

Architecture

Substreams API → Specialized Sink → Message Queue

Core crates:

  • substreams-core/: dynamic router, protobuf utilities, shared logic
  • substreams-nats-sink/: JetStream client with dot-notation subjects
  • substreams-kafka-sink/: Kafka/Redpanda producer with topic routing

Key scripts (all Python):

  • scripts/setup_universal_pipeline.py: end-to-end fanout pipeline bootstrap
  • scripts/analyze_protobuf_schema.py: schema + semantic type inspection
  • scripts/nats_stream_manager.py / scripts/kafka_topic_manager.py: infrastructure provisioning
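
To illustrate topic/subject generation, the naming scheme below (lower-cased protobuf path, dots kept for NATS subjects, dashes for Kafka topics) is an assumption for illustration; the actual convention is implemented in scripts/setup_universal_pipeline.py:

```python
def nats_subject(proto_full_name: str) -> str:
    """Derive a dot-notation JetStream subject from a protobuf type name.

    e.g. "sf.ethereum.type.v2.Block" -> "sf.ethereum.type.v2.block"
    """
    return proto_full_name.lower()

def kafka_topic(proto_full_name: str) -> str:
    """Derive a Kafka topic name from the same protobuf type name.

    Kafka permits dots in topic names, but dashes avoid the known
    collision between '.' and '_' in Kafka metric names.
    """
    return proto_full_name.lower().replace(".", "-")
```

A deterministic mapping like this keeps the fanout configuration reproducible: rerunning the analysis on the same schema always yields the same queue resources.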

Prerequisites

  • Docker & Docker Compose
  • Python 3.10+ with uv
  • Rust 1.85+ (edition 2024)
  • Substreams API token from StreamingFast (store in secrets/substreams_token.txt)

Getting Started

git clone <repo-url>
cd substreams-sink-mq
uv venv .venv
source .venv/bin/activate
uv pip install -r requirements.txt

Put your Substreams token at secrets/substreams_token.txt (not tracked by git).
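
A script consuming that file might load the token along these lines (a minimal sketch; the helper name is hypothetical, not part of the repository's scripts):

```python
from pathlib import Path

def load_substreams_token(path: str = "secrets/substreams_token.txt") -> str:
    """Read the StreamingFast API token, stripping the trailing newline
    most editors append."""
    token = Path(path).read_text(encoding="utf-8").strip()
    if not token:
        raise ValueError(f"empty token file: {path}")
    return token
```

Keeping the token in a file under secrets/ (ignored by git) avoids leaking it through shell history or committed environment files.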

Bootstrap a Pipeline

Use the combined setup command to analyze protobufs, generate fanout configuration, and optionally create queue resources.

# Backend = nats | kafka
docker compose -f docker-compose-<backend>.yaml --profile <profile> up -d
uv run scripts/setup_universal_pipeline.py <backend> path/to/schema.proto --deploy

--deploy provisions NATS or Kafka resources. Omit the flag to inspect generated files under configs/.

Manual Steps (if needed)

uv run scripts/analyze_protobuf_schema.py path/to/schema.proto configs/generated-analysis.json
uv run scripts/nats_stream_manager.py configs/generated-fanout.yaml    # NATS
uv run scripts/kafka_topic_manager.py configs/generated-fanout.yaml    # Kafka/Redpanda

Start the sinks after provisioning:

# NATS infrastructure only
docker compose -f docker-compose-nats.yaml --profile nats-basic up -d
# NATS sink + dependencies
docker compose -f docker-compose-nats.yaml up -d substreams-nats-sink

# Kafka infrastructure (Redpanda service) only
docker compose -f docker-compose-kafka.yaml --profile kafka-basic up -d
# Kafka sink + dependencies
docker compose -f docker-compose-kafka.yaml up -d substreams-kafka-sink

Monitoring

  • Message queues: nats stream list, docker exec redpanda-0 rpk topic list
  • Container logs: docker logs <service> --follow
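
For spot-checking messages programmatically, a minimal nats-py consumer sketch (the subject filter and durable name are placeholders; requires `pip install nats-py` and a running JetStream server). The `subject_matches` helper implements standard NATS wildcard semantics, where `*` matches one token and `>` matches one or more trailing tokens:

```python
import asyncio

try:
    import nats  # nats-py client; optional so the helper below stays importable
except ImportError:
    nats = None

def subject_matches(pattern: str, subject: str) -> bool:
    """NATS-style wildcard match: '*' matches one token, '>' the rest."""
    p_tokens, s_tokens = pattern.split("."), subject.split(".")
    for i, p in enumerate(p_tokens):
        if p == ">":
            return len(s_tokens) > i  # '>' must match at least one token
        if i >= len(s_tokens):
            return False
        if p != "*" and p != s_tokens[i]:
            return False
    return len(p_tokens) == len(s_tokens)

async def tail(subject_filter: str = "sf.>") -> None:
    """Print subject and size of each message on the filter."""
    nc = await nats.connect("nats://localhost:4222")
    js = nc.jetstream()
    sub = await js.subscribe(subject_filter, durable="readme-tail")
    while True:
        msg = await sub.next_msg(timeout=10)
        print(msg.subject, len(msg.data), "bytes")
        await msg.ack()

if __name__ == "__main__":
    asyncio.run(tail())
```

The durable consumer name means re-running the script resumes where it left off rather than replaying the stream from the start.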

Testing & Quality

# Rust workspace
cargo fmt --all
cargo clippy --all-targets --all-features
cargo test --workspace

# Python suite
uv run scripts/run_tests.py            # default (unit + integration)
uv run scripts/run_tests.py --type unit
uv run scripts/run_tests.py --type integration

Testing standards and coverage requirements are documented in docs/TESTING_STANDARDS.md.

Documentation

  • docs/REORG_HANDLING_ANALYSIS.md & docs/REORG_HANDLING_TECHNICAL_SPEC.md: queue-focused reorg handling roadmap and specification
  • CLAUDE.md: assistant workflow guidelines (applies to Codex and other LLM collaborators)

Troubleshooting

  • Missing NATS stream: uv run scripts/nats_stream_manager.py configs/generated-fanout.yaml
  • Missing Kafka topic: uv run scripts/kafka_topic_manager.py configs/generated-fanout.yaml
  • Schema mismatches: regenerate analysis + schema with the scripts above
  • Resource limits: adjust Docker Compose profiles or backend retention settings as needed

Contributing

  1. Create a feature branch.
  2. Update docs/tests alongside code.
  3. Run the quality checks listed above.
  4. Submit a PR describing changes and verification steps.

License information is pending and will be added once a license is chosen.
