YotpoLtd/metorikku

CDC for multiple Tables

rubenssoto opened this issue ยท 1 comments

Hello ๐Ÿ‘

I read your article about CDC and metorikku, great article.
I have a case that I have 200 tables that arrives in parquet format in my datalake, metorikku could process more than one table in the same spark context in any paralalel way? For example with threads.

thank you

Hi! thanks for opening the issue.
I started dabbling with this here:
#310

But encountered some issues with the avro deserialization lib we're using...
Maybe I'll take another go at this soon.
Just FYI we started moving away from hudi for CDC recently as it became too expensive and complex and started using https://www.upsolver.com/ and we couldn't be happier :)