Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.

getdozer/dozer

Overview

Dozer is a real-time data movement tool that uses change data capture (CDC) to move data from various sources to multiple sinks.

Dozer is orders of magnitude faster than Debezium+Kafka and natively supports stateless transformations. It is primarily used for moving data into warehouses. In our own application, we move data to ClickHouse and build data APIs and integrations with LLMs.

How to use it

Dozer runs with a single configuration file like the following:

app_name: dozer-bench
version: 1
connections:
  - name: pg_1
    config: !Postgres
      user: user
      password: postgres
      host: localhost
      port: 5432
      database: customers
sinks:
  - name: customers
    config: !Dummy
      table_name: customers
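
When the same configuration is needed for several environments, the file can be generated rather than edited by hand. The following is a minimal sketch in Python that uses only the keys shown in the sample above; the function name `render_config`, its parameters, and the output file name `dozer-config.yaml` are hypothetical and not part of Dozer. Values are inserted verbatim, so anything containing YAML special characters would need quoting.

```python
from string import Template

# Mirrors the sample configuration above; no keys beyond those in the
# sample are assumed. The $placeholders are filled in by render_config().
CONFIG_TEMPLATE = Template("""\
app_name: $app_name
version: 1
connections:
  - name: $connection_name
    config: !Postgres
      user: $user
      password: $password
      host: $host
      port: $port
      database: $database
sinks:
  - name: $sink_name
    config: !Dummy
      table_name: $table_name
""")


def render_config(app_name, connection_name, user, password, host, port,
                  database, sink_name, table_name):
    """Return the configuration text with the given values filled in.

    Hypothetical helper: values are substituted as-is, without YAML quoting.
    """
    return CONFIG_TEMPLATE.substitute(
        app_name=app_name,
        connection_name=connection_name,
        user=user,
        password=password,
        host=host,
        port=port,
        database=database,
        sink_name=sink_name,
        table_name=table_name,
    )


if __name__ == "__main__":
    text = render_config(
        app_name="dozer-bench",
        connection_name="pg_1",
        user="user",
        password="postgres",
        host="localhost",
        port=5432,
        database="customers",
        sink_name="customers",
        table_name="customers",
    )
    # The output file name is an assumption made for this example.
    with open("dozer-config.yaml", "w", encoding="utf-8") as f:
        f.write(text)
```

Run as a script, this writes a file with the same contents as the sample configuration; changing the arguments yields a variant for another database or sink name.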

Full documentation can be found here

Supported Sources

Connector Extraction Resuming Enterprise
Postgres
MySQL
Snowflake
Kafka 🚧
MongoDB 🎯
Amazon S3 🎯
Google Cloud Storage 🎯
Oracle Enterprise Only
Aerospike Enterprise Only

Supported Sinks

Database Connectivity Enterprise
ClickHouse
Postgres
MySQL
BigQuery
Oracle Enterprise Only
Aerospike Enterprise Only