Pinned Repositories
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
aws-serverless-streaming
AWS serverless streaming of changes in a RDBS.
cptr-vision-transformer
Implementation of the CPTR model by https://arxiv.org/pdf/2101.10804.pdf
datahub
The Metadata Platform for the Modern Data Stack
DistributedProcessWatcher
Fully distributed and asynchronous Process Monitoring library
e2e_spark_streamlib_af_cp
Complete run for the generic spark streaming library available @ https://github.com/jsoft88/structured-streaming-lib
hadoop-ozone
Scalable, redundant, and distributed object store for Apache Hadoop
pyspark-conda-k8s
A running example of how to run a pyspark application on k8s with a conda environment
spark-csv2mongodb
An extensible library which allows to load csv data into a mongodb using spark 2.4.6
jsoft88's Repositories
jsoft88/cptr-vision-transformer
Implementation of the CPTR model by https://arxiv.org/pdf/2101.10804.pdf
jsoft88/pyspark-conda-k8s
A running example of how to run a pyspark application on k8s with a conda environment
jsoft88/spark-csv2mongodb
An extensible library which allows to load csv data into a mongodb using spark 2.4.6
jsoft88/DistributedProcessWatcher
Fully distributed and asynchronous Process Monitoring library
jsoft88/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
jsoft88/aws-serverless-streaming
AWS serverless streaming of changes in a RDBS.
jsoft88/datahub
The Metadata Platform for the Modern Data Stack
jsoft88/e2e_spark_streamlib_af_cp
Complete run for the generic spark streaming library available @ https://github.com/jsoft88/structured-streaming-lib
jsoft88/hadoop-ozone
Scalable, redundant, and distributed object store for Apache Hadoop
jsoft88/Impala
Real-time Query for Hadoop
jsoft88/JCAvroSchemaClassBuilder
Generates a class file based on an avro schema.
jsoft88/JCAvroToKudu
Implementation of a kafka application to take data from avro data files and insert them into a kudu table.
jsoft88/JCClassBuilder
Library for generating classes from schema. It provides interfaces for making process easy.
jsoft88/JCGenericKafka
This a generic kafka producer-consumer java library.
jsoft88/joyn-challenge
My design for the challenge
jsoft88/k8s-helper-ui
Drag 'n drop style helper for building k8s concepts, based on predefined templates and custom k8s concepts components
jsoft88/KuduJDBHelper
Helper library for inserting data into a Kudu table or querying the information contained in a table.
jsoft88/MapReduceJSqoopHelper
Library for spliting data imported with sqoop before exporting.
jsoft88/play-scala
Starting interface for Sentiment Analysis
jsoft88/SentimentAnalysis
Twitter sentiment analysis
jsoft88/Spark-MLlib-Twitter-Sentiment-Analysis
:star2: :sparkles: Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
jsoft88/structured-streaming-lib
A spark structured lib which can be extended to accommodate different sources, different transformation algorithms and different sinks - with a sample out of the box
jsoft88/WebDPWManager