Pinned Repositories
Agile_Data_Code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Android
GitHub上最火的Android开源项目,所有开源项目都有详细资料和配套视频
architect-awesome
后端架构师技术图谱
architecture.taobao-alibaba
互联网公司架构: 淘宝技术架构,阿里巴巴技术架构
dataop
NB Operation
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
hudi
Upserts, Deletes And Incremental Processing on Big Data.
iceberg
Apache Iceberg
incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
spark
Apache Spark
Run-Lin's Repositories
Run-Lin/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Run-Lin/AiLearning
AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP
Run-Lin/Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
Run-Lin/emqx
EMQ X Broker - Scalable Distributed MQTT Message Broker for IoT in 5G Era
Run-Lin/fluid
Fluid, elastic data abstraction layer for BigData/AI applications in cloud native systems
Run-Lin/gin-vue-admin
基于gin+vue搭建的后台管理系统框架,集成jwt鉴权,权限管理,动态路由,分页封装,多点登录拦截,资源权限,上传下载,代码生成器,表单生成器等基础功能,更多功能正在开发中,欢迎issue和pr~
Run-Lin/graph-learn
graph-learn
Run-Lin/incubator-sedona
A cluster computing framework for processing large-scale geospatial data
Run-Lin/juicefs
A shared POSIX file system built on top of Redis and S3.
Run-Lin/k9s
🐶 Kubernetes CLI To Manage Your Clusters In Style!
Run-Lin/kubernetes
Production-Grade Container Scheduling and Management
Run-Lin/mmlspark
Microsoft Machine Learning for Apache Spark
Run-Lin/nebula
A high performance distributed Graph Database
Run-Lin/orc
Mirror of Apache Orc
Run-Lin/piflow
πflow is a big data flow engine with spark support
Run-Lin/PublicCMS
CMS written in Java,Safe and fast,Easy support 10 million data, 10 million PV; Currently has 0.0002% of the world's users.Language support English,中文,繁體,日本語
Run-Lin/PySparkDemo
PySpark算子及空间应用的各个Demo
Run-Lin/RemoteShuffleService
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
Run-Lin/ros_hadoop
Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop Spark and other HDFS compatible systems.
Run-Lin/scala
Scala 2 compiler and standard library. For bugs, see scala/bug
Run-Lin/sealos
一条命令安装kubernetes,超全版本,支持国产化,生产环境中稳如老狗,99年证书,0依赖,去haproxy keepalived,v1.20支持containerd!
Run-Lin/sevntu.checkstyle
Additional Checkstyle checks, that could be added as extension to EclipseCS plugin and maven-checkstyle-plugin, Sonar checkstyle plugin, extension for CheckStyle IDEA plugin.
Run-Lin/Shift-AI-models-to-real-world-products
Share some useful guides and references about how to shift AI models to real world products or projects.
Run-Lin/spark-examples
[ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples
Run-Lin/spark-jobserver
REST job server for Apache Spark
Run-Lin/Spark-ML
通过Spark引擎进行机器学习,全文基于Spark 2.4.3版本
Run-Lin/SparkCube
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
Run-Lin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Run-Lin/TDengine
An open-source big data platform designed and optimized for the Internet of Things (IoT).
Run-Lin/TileDB
The Universal Storage Engine