juntaozhang's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
mli/paper-reading
深度学习经典、新论文逐段精读
datawhalechina/pumpkin-book
《机器学习》(西瓜书)公式详解
zhisheng17/flink-learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
datahub-project/datahub
The Metadata Platform for your Data and AI Stack
wangzhiwubigdata/God-Of-BigData
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
roboticcam/machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
KeKe-Li/book
:books: All programming languages books
flink-china/flink-training-course
Flink 中文视频课程(持续更新...)
apache/sedona
A cluster computing framework for processing large-scale geospatial data
logpai/loghub
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
reata/sqllineage
SQL Lineage Analysis Tool powered by Python
qingsongedu/awesome-AI-for-time-series-papers
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
hydro-project/fluent
A data-driven compute platform
WeBankFinTech/WeDataSphere
WeDataSphere is a financial grade, one-stop big data platform suite.
ververica/flink-training-exercises
ververica/sql-training
LiDan456/MAD-GANs
Applied generative adversarial networks (GANs) to do anomaly detection for time series data
Geek-Organization/geek-programming-books
Free programing ebooks
hortonworks-spark/spark-atlas-connector
A Spark Atlas connector to track data lineage in Apache Atlas
ashwin711/georaptor
Python Geohash Compression Tool
jlff/tf2_notes
(Unoffical)人工智能实践:Tensorflow笔记
firmai/business-analytics-and-mathematics-python-book
Advanced Business Analytics and Mathematics with Python (by @firmai)
NetManAIOps/AIOps-Challenge-2020-Data
The published dataset of AIOps Challenge 2020
LinMingQiang/spark-utils
:boom: :alien: :hotsprings::rocket:Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等
msteindorfer/memory-measurer
Clone of Google's memory measurer, with slight additions.
udger/udger-java
Java agent string parser based on Udger https://udger.com/products/local_parser