weinanlee's Stars
teamclairvoyant/airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
cch123/elasticsql
Convert SQL to Elasticsearch DSL in Go (golang)
ricklamers/gridstudio
Grid studio is a web-based application for data science with full integration of open source data science frameworks and languages.
gto76/python-cheatsheet
Comprehensive Python Cheatsheet
datastacktv/data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
vinta/awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
jcustenborder/kafka-connect-transform-xml
Transformation for converting XML data to Structured data.
strimzi/strimzi-kafka-operator
Apache Kafka® running on Kubernetes
streamthoughts/kafka-connect-file-pulse
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
josonle/Realtime-Data-Warehouse
Building a real-time data warehouse
Parsely/pykafka
Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance.
doocs/advanced-java
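😮 Core Interview Questions & Answers For Experienced Java (Backend) Developers | A complete primer on advanced knowledge for internet Java engineers, covering high concurrency, distributed systems, high availability, microservices, massive data processing, and related areas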
ksingh7/twitter_streaming_app_on_openshift_OCS
A demo Twitter streaming and sentiment analysis app to showcase RHT AMQ Streams (Kafka) and MongoDB, served through a Python backend API and a JavaScript frontend. This app runs on OpenShift and gets persistence from OpenShift Container Storage (rook-ceph).
aisuhua/restful-api-design-references
A list of reference literature on RESTful API design that can help you understand REST-style interface design more thoroughly.
dvu4/udacity-data-streaming-project-1
Udacity Data Streaming - Project Optimizing Public Transportation
shabie/streaming_nd
Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions
oneryalcin/udacity-Data-Streaming-ND-Kafka
Udacity Kafka Streams Course
aaronstone007/Udacity-Data-Streaming
Projects from Udacity Data Streaming Nanodegree
Esri/spatial-framework-for-hadoop
The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis.
Esri/gis-tools-for-hadoop
The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data.
whirlys/BigData-In-Practice
Big data practice projects: Hadoop, Spark, Kafka, HBase, Flink...
xkcoding/spring-boot-demo
🚀 A project for learning Spring Boot in depth and putting it into practice.
prakhar1989/docker-curriculum
🐬 A comprehensive tutorial on getting started with Docker!
aws-samples/aws-refarch-wordpress
This reference architecture provides best practices and a set of YAML CloudFormation templates for deploying WordPress on AWS.
AlexIoannides/pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
UrbanInstitute/pyspark-tutorials
Code snippets and tutorials for working with social science data in PySpark
LuckyZXL2016/Movie_Recommend
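A Spark-based movie recommendation system, including a web crawler project, a website, an admin back-end system, and the Spark recommendation system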
spring-projects/spring-data-examples
Spring Data Example Projects
ageron/handson-ml
⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.