Skip to content

Feature Request: Direct PostgreSQL CDC connector without Debezium/Kafka #186

@Git-Faisal

Description

@Git-Faisal

Is your feature request related to a problem? Please describe.

Currently, the only way to stream changes from PostgreSQL into Pathway is through pw.io.debezium.read, which requires running Debezium + Kafka as middleware. For many deployments — especially multi-tenant SaaS where each client has its own PostgreSQL instance — this adds significant operational overhead (provisioning and managing Kafka brokers + Debezium connectors per instance).

Describe the solution you'd like

A native PostgreSQL CDC input connector (e.g. pw.io.postgres.read) that uses PostgreSQL's built-in logical replication (logical decoding via replication slots + pgoutput or wal2json plugin) to stream changes directly into Pathway — without requiring Debezium or Kafka.

Pathway already has native Rust-level PostgreSQL connectivity for the output connector (pw.io.postgres.write). Extending this to a read/CDC connector using logical replication seems like a natural next step.

Describe alternatives you've considered

  • Using pw.io.debezium.read with full Debezium + Kafka stack — works but adds infrastructure complexity
  • Polling PostgreSQL with periodic queries — not real-time, defeats the purpose of streaming
  • Using other CDC tools (e.g. Conduit, RisingWave) that already support direct PG logical replication without Kafka

Additional context

PostgreSQL's logical replication protocol is well-documented and stable. Other streaming frameworks (e.g. RisingWave, Materialize) already consume PG WAL directly without Kafka middleware. A direct connector would significantly lower the barrier to adoption for PostgreSQL-heavy deployments.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions