Skip to content

maksenius/conduit-connector-sap-hana

 
 

Repository files navigation

conduit-connector-sap-hana

General

The SAP HANA connector is one of Conduit plugins. It provides the source SAP HANA connector.

Prerequisites

How to build it

Run make build.

Testing

Run make test to run all the unit and integration tests.

Source

The SAP HANA source connects to the database using the provided connection and starts creating records for each table row and each detected change. It supports gets snapshot from table and catches CDC (Change Data Captured) changes.

Configuration options

Name Description Required Example By default
table The name of a table in the database that the connector should read from. true users
orderingColumn The name of a column that the connector will use for ordering rows. Its values must be unique and suitable for sorting, otherwise, the snapshot won't work correctly. true id
primaryKeys Comma separated list of column names that records could use for their Key fields. By default connector uses primary keys from table if they are not exist connector will use ordering column. false id
snapshot Whether or not the plugin will take a snapshot of the entire table before starting cdc mode, by default true. false false
batchSize Size of rows batch. false 100 1000
auth.mechanism Mechanism type of auth. Valid types: DSN, Basic, JWT, X509. false JWT DSN
auth.dsn DSN connection string Required for DSN auth type. hdb://user:password@host443?TLSServerName=name
auth.host Sap Hana database host. Required for Basic, JWT, X509 auth types. hdb://hanacloud.ondemand.com:443
auth.username Sap Hana user Required for Basic type. hbadmin
auth.password Sap Hana password Required for Basic type. pass
auth.token JWT token Required for JWT type. jwt_token
auth.clientCertFilePath Path for certification file Required for X509 type. /tmp/file.cert
auth.clientKeyFilePath Path for key file Required for X509 type. /tmp/key.cert

Snapshot

By default when the connector starts for the first time, snapshot mode is enabled, which means that existing data will be read. To skip reading existing, change config parameter snapshot to false. All rows which exist in a table at the time the snapshot started, are considered part of snapshot. When all snapshot records are returned, the connector switches to the CDC iterator.

Change Data Capture (CDC)

This connector implements CDC features for DB2 by adding a tracking table and triggers to populate it. The tracking table has the same name as a target table with the prefix CONDUIT_ and suffix from time when pipeline started on format "hhmmss". For example for table PRODUCTS the tracking table's name will be looks like CONDUIT_PRODUCTS_213315. The tracking table has all the same columns as the target table plus two additional columns:

name description
CONDUIT_TRACKING_ID Autoincrement index for the position.
CONDUIT_OPERATION_TYPE Operation type: INSERT, UPDATE, or DELETE.

The connector saves information about update, delete, insert table operations inside tracking table. For example if user inserts new row into table connector will save all new columns values inside tracking table
with CONDUIT_OPERATION_TYPE = INSERT

Triggers have name pattern CD_{{TABLENAME}}_{{OPERATION_TYPE}}_{{SUFFIXNAME}}. For example: CD_PRODUCTS_INSERT_213315

Queries to retrieve change data from a tracking table are very similar to queries in a Snapshot iterator, but with CONDUIT_TRACKING_ID ordering column.

The connector cleans up the tracking table every 5 seconds.

Iterator saves the last CONDUIT_TRACKING_ID to the position from the last successfully recorded row.

If connector stops, it will parse position from the last record and will try to get row where {{CONDUIT_TRACKING_ID}} > {{position.CDCLastID}}.

Please pay attention

The tracking table and the triggers are not automatically removed when a pipeline is deleted. That needs to be done manually, for example by using the following commands:

  DROP TABLE CONDUIT_{{YOUR_TABLE_NAME}}_{{SUFFIXNAME}};
  DROP TRIGGER CD_{{TABLENAME}}_INSERT_{{SUFFIXNAME}};
  DROP TRIGGER CD_{{TABLENAME}}_UPDATE_{{SUFFIXNAME}};
  DROP TRIGGER CD_{{TABLENAME}}_DELETE_{{SUFFIXNAME}};

CDC FAQ

Is it possible to add/remove/rename column to table?

Yes. You have to stop the pipeline and do the same with conduit tracking table. For example:

ALTER TABLE CLIENTS
ADD COLUMN phone VARCHAR(18);

ALTER TABLE CONDUIT_CLIENTS_{suffix}
    ADD COLUMN phone VARCHAR(18);

I accidentally removed tracking table.

You have to restart pipeline, tracking table will be recreated by connector.

About

Conduit connector for SAP HANA database

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 99.2%
  • Makefile 0.8%