Description
Is your feature request related to a problem? Please describe.
Currently, the only way to stream changes from PostgreSQL into Pathway is through pw.io.debezium.read, which requires running Debezium + Kafka as middleware. For many deployments — especially multi-tenant SaaS where each client has its own PostgreSQL instance — this adds significant operational overhead (provisioning and managing Kafka brokers + Debezium connectors per instance).
Describe the solution you'd like
A native PostgreSQL CDC input connector (e.g. pw.io.postgres.read) that uses PostgreSQL's built-in logical replication (logical decoding via replication slots + pgoutput or wal2json plugin) to stream changes directly into Pathway — without requiring Debezium or Kafka.
Pathway already has native Rust-level PostgreSQL connectivity for the output connector (pw.io.postgres.write). Extending this to a read/CDC connector using logical replication seems like a natural next step.
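To make the request more concrete, here is a minimal sketch of the decoding step such a connector would need, assuming the wal2json plugin with its format-version 1 output (one JSON document per transaction, with a `change` array). The function name `wal2json_to_diffs` and the `(table, row, diff)` output shape are hypothetical and chosen only for illustration; they are not part of Pathway's API. Reading the replication slot itself is out of scope here.

```python
import json
from typing import Any, Iterator


def wal2json_to_diffs(payload: str) -> Iterator[tuple[str, dict[str, Any], int]]:
    """Turn one wal2json (format-version 1) message into (table, row, diff) triples.

    diff is +1 for a row that now exists and -1 for a row that was retracted.
    This output shape is an assumption for illustration only.
    """
    for change in json.loads(payload).get("change", []):
        kind = change.get("kind")
        if kind not in ("insert", "update", "delete"):
            continue  # skip entries that do not describe a row change
        table = f'{change["schema"]}.{change["table"]}'
        if kind in ("update", "delete"):
            # With the default replica identity, oldkeys carries only the
            # key columns, so the retraction identifies the row by key.
            old = change["oldkeys"]
            yield table, dict(zip(old["keynames"], old["keyvalues"])), -1
        if kind in ("insert", "update"):
            row = dict(zip(change["columnnames"], change["columnvalues"]))
            yield table, row, +1
```

An update is emitted as a retraction followed by an insertion, so a downstream consumer only has to handle two event types. A real connector would also need to track the confirmed LSN and handle schema changes, which this sketch ignores.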
Describe alternatives you've considered
- Using pw.io.debezium.read with the full Debezium + Kafka stack: works, but adds infrastructure complexity
- Polling PostgreSQL with periodic queries: not real-time, defeats the purpose of streaming
- Using other CDC tools (e.g. Conduit, RisingWave) that already support direct PG logical replication without Kafka
Additional context
PostgreSQL's logical replication protocol is well-documented and stable. Other streaming frameworks (e.g. RisingWave, Materialize) already consume PG WAL directly without Kafka middleware. A direct connector would significantly lower the barrier to adoption for PostgreSQL-heavy deployments.