Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: Scalability #9

Open
igorng opened this issue Oct 14, 2022 · 0 comments
Open

Question: Scalability #9

igorng opened this issue Oct 14, 2022 · 0 comments

Comments

@igorng
Copy link

igorng commented Oct 14, 2022

Hello, thank you for this great connector.

We are using this to push data from kafka to Celonis.
Data has primary keys that allows us to deduplicate via the connector feature when exporting to Celonis.
The connector is deployed in a standalone mode, and with basic configration (1 task, pkeys, some transform on dates)
Now we want to push huge table, around 30 Millions of records. With one instance of the connector, exports to celonis takes too much time (more than 4hours).

We tried to scale to 3 instances (we have kafka topics with 3 partitions), no error in logs (aside some connexion timeout to celonis from time to time). But when we checked our data in Celonis the count did not match. We had less than expected , deduplication tken into account.

Question is, what is the scalability status of this connector? Is there any specific point to watch over or any particular configuration to make?

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

1 participant