Pinned Repositories
apache-airflow-study
Python code that implement simple etl on Apache Airflow
Hive_emr_python
Hive streaming using Python and Hive transform function
kafka-stream-poc
POC on how to use kafka-stream that read AVRO from Kafka topic and filter only the desire value to print to console.
kafka_connect_rds_to_s3_json
Retrieve the data from Posgresql on RDS (non CDC) and ingest to AWS S3 as Json String.
kafka_flink_deduplicate1
This project consume the message from Kafka topic using Flink and do deduplication on the incoming message.
kafka_publisher_json_to_azure_event_hub
This demo is for test kafka publisher that publish json string to Azure Event Hub (enable Kafka support)
kstreams
martingale_ea_improvement
To improve the forex robot that use martingale strategy
poc_streaming_twitter_to_kafka_to_spark_to_hdfs
I try to build the data pipeline that read the twitter stream and store tweet data into HDFS
pyspark_read_write_to_hive
Correct way to read the json file on AWS S3 with Pyspark
jitkasempin's Repositories
jitkasempin/kafka_connect_rds_to_s3_json
Retrieve the data from Posgresql on RDS (non CDC) and ingest to AWS S3 as Json String.
jitkasempin/3commas-Smart-Trades-helpers
3commas Smart Trade with Risk Management & AUTO TP's & Auto Stoploss & Telegram based scripts & TradingView Webhook to Telegram
jitkasempin/airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
jitkasempin/analytics
code snippet for analytics sessions
jitkasempin/Backfill-GA4-to-BigQuery
Backfill-GA4-to-BigQuery" repository offers a solution for users to backfill their GA4 data into BigQuery. This is useful for those who need historical data from the start of their GA4 property, as GA4 data is typically only available in BigQuery after linking the two services. Our solution provides a complete backfill of data to BigQuery
jitkasempin/bq-customer-segmentation
Use BQML (k-means) to generate customer segments (from BigQuery public data) and then calls Gemini in Vertex AI through SQL to create human readable headlines and summaries for each segment.
jitkasempin/bq-lineage-tool
BigQuery Column Lineage parser
jitkasempin/Building-OLAP-Dimensional-Model-using-BigQuery-and-DBT
jitkasempin/course_advanced_dbt
This is the repository for Bingeflix dbt Project (Uplimit Advanced dbt course)
jitkasempin/data-diff
Efficiently diff rows across two different databases.
jitkasempin/deb-application
This repository contains application code for the Wizeline Data Engineering Bootcamp (DEB) 2023. It is one of two repositories for the DEB. The other houses the infrastructure code.
jitkasempin/evidence
Evidence enables analysts to deliver a polished business intelligence system using SQL and markdown
jitkasempin/FinTwit_Bot
A Discord bot to keep track of your favorite financial influencers on Twitter
jitkasempin/frappe
Low code web framework for real world applications, in Python and Javascript
jitkasempin/Messenger-Chatbot-API
This repository contains a custom messenger chatbot built using Node.js. The chatbot utilizes the Messenger Platform APIs provided by Facebook to enable automated messaging and interactions with users on the Facebook Messenger platform.
jitkasempin/nocodb
🔥 🔥 🔥 Open Source Airtable Alternative
jitkasempin/OpenLineage
An Open Standard for lineage metadata collection
jitkasempin/patroni
A template for PostgreSQL High Availability with Etcd, Consul, ZooKeeper, or Kubernetes
jitkasempin/premier-league
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
jitkasempin/PyDrive2
Google Drive API Python wrapper library. Maintained fork of PyDrive.
jitkasempin/python-analytics-data
jitkasempin/reactpy
It's React, but in Python
jitkasempin/recap
Recap is a dead simple data catalog for engineers
jitkasempin/simple_dbt_project
Code for dbt tutorial
jitkasempin/smol-dev-dong
jitkasempin/sqllineage
SQL Lineage Analysis Tool powered by Python
jitkasempin/Telegram-TradingView-Gmail-Bot-Google-Script-Version-
Telegram Gmail Bot (Google Script Version) The most stable, Simplest, No maintaining Time, No Python Server Host, Easiest, No Python,
jitkasempin/tradingapp
AI Trading App
jitkasempin/tradingview-futu-setup
jitkasempin/TradingViewTelegram
📊 Send TradingView alerts to Telegram, Discord, Slack, Twitter and Email.