Feature Store

Overall data pipeline architecture:

Description:

  • In this repo, we deploy a data pipeline having one flow for batch data and one flow for streaming data. Depend on each flow, we use different services to serve these flows. Some services we can mention are Pyspark, PostgreSQL, Flink, Kafka, Airflow...