Pinned Repositories
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
hadoop
Mirror of Apache Hadoop
hbase
Mirror of Apache Hadoop HBase
incubator-ratis
Mirror of Apache Ratis (Incubating)
kubernetes
Production-Grade Container Scheduling and Management
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
spark
Mirror of Apache Spark
tensorflow
Computation using data flow graphs for scalable machine learning
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
vcap
Cloud Foundry - the open platform as a service project
JunpingDu's Repositories
JunpingDu/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
JunpingDu/incubator-ratis
Mirror of Apache Ratis (Incubating)
JunpingDu/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
JunpingDu/ray
A fast and simple framework for building and running distributed applications.
JunpingDu/alc-site
JunpingDu/ambari-vagrant
Vagrant setup for creating Ambari development/test virtual machines
JunpingDu/artigraph
Artigraph is a tool to improve the authorship, management, and quality of data. It emphasizes that the core deliverable of a data pipeline or workflow is the data, not the tasks.
JunpingDu/datahub
The Metadata Platform for the Modern Data Stack
JunpingDu/kubernetes
Production-Grade Container Scheduling and Management
JunpingDu/spark
Mirror of Apache Spark
JunpingDu/tensorflow
Computation using data flow graphs for scalable machine learning
JunpingDu/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
JunpingDu/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
JunpingDu/deltacat
A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
JunpingDu/exercises-stdlib
Scala Exercises' lessons for the standard library
JunpingDu/flink
Mirror of Apache Flink
JunpingDu/guava
Google Core Libraries for Java
JunpingDu/incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
JunpingDu/incubator-tubemq
Apache TubeMQ
JunpingDu/isa-l
Intelligent Storage Acceleration Library
JunpingDu/kafka
Mirror of Apache Kafka
JunpingDu/kubernetes-ec2-autoscaler
A batch-optimized scaling manager for Kubernetes
JunpingDu/nifi
Mirror of Apache NiFi
JunpingDu/osr
"开源雨林" - 开源合规通识
JunpingDu/protobuf
Protocol Buffers - Google's data interchange format
JunpingDu/pulsar
Apache Pulsar - distributed pub-sub messaging system
JunpingDu/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
JunpingDu/tools
JunpingDu/TubeMQ
TubeMQ focuses on high-performance storage and transmission of massive data in large data scenarios
JunpingDu/YiVal
Your Automatic Prompt Engineering Assistant for GenAI Applications