Pinned Repositories
spark-tpcds-datagen
All the things about TPC-DS in Apache Spark
vagrantfiles
jia3857's Repositories
jia3857/spark-tpcds-datagen
All the things about TPC-DS in Apache Spark
jia3857/vagrantfiles
jia3857/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
jia3857/Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
jia3857/CDPDCTrial
Steps to create a Cloudera CDP DC Trial environment from scratch
jia3857/chollinger-blog
Repository for all posts over at chollinger.com/blog/.
jia3857/doubledrops
jia3857/dremio-oss
Dremio - the missing link in modern data
jia3857/FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
jia3857/gmall2020-mock
jia3857/iceberg
Apache Iceberg
jia3857/macstats
Mac OS X Statistics - Battery, Fans, CPU
jia3857/Makefile.test
A makefile used for running test executables
jia3857/mazes
A comprehensive library of maze generation algorithms.
jia3857/Miscellaneous
Includes notes on Apache Spark, Spark for Physics, Jupyter notebook examples for Spark and Oracle.
jia3857/my-flink-project
jia3857/omni-engineer
jia3857/openstack-ansible
Ansible playbooks for deploying OpenStack.
jia3857/ranger
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
jia3857/ReactJS-Spring-Boot-CRUD-Full-Stack-App
Learn how to develop a full-stack CRUD application using React as frontend and spring boot as backend.
jia3857/s-k-l
jia3857/scala-spark-workload
jia3857/spark-playground
Code snippets used in demos recorded for the blog.
jia3857/SQL_practice
A collection of SQL practice problems for interviews
jia3857/starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
jia3857/TheKingOfBigData
🚀🚀🚀优质的历史文章,大数据高频考点,Java一线大厂知识考点,更有精美简历模板,简历指导手册和上百本技术书籍,最重要的就是被全网下载上千次的我自己花精力去画的大数据生态圈,Kafka,Spark,Scala的思维导图...这是一个你在大数据学习路上不能错过的宝藏项目!
jia3857/tiny-db
Tiny Database: Query Engine, Storage Engine, Calcite, ANTLR
jia3857/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
jia3857/vagrant-docker-provider
Build a docker image that can be used in vagrant as a development environment
jia3857/vagrantfiles-1