yangwang166's Stars
josephmisiti/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
mlflow/mlflow
Open source platform for the machine learning lifecycle
kubeflow/kubeflow
Machine Learning Toolkit for Kubernetes
ritchieng/the-incredible-pytorch
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
MicrosoftDocs/azure-docs
Open source documentation of Microsoft Azure
NirantK/awesome-project-ideas
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
mrdbourke/machine-learning-roadmap
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
snowplow/snowplow
The leader in Next-Generation Customer Data Infrastructure
microsoft/SynapseML
Simple and Distributed Machine Learning
elyase/awesome-gpt3
jitsucom/jitsu
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
minivision-ai/photo2cartoon
人像卡通化探索项目 (photo-to-cartoon translation project)
google-research/football
Check out the new game server:
databricks/koalas
Koalas: pandas API on Apache Spark
com-lihaoyi/mill
Mill is a fast JVM build tool that supports Java and Scala. 2-4x faster than Gradle and 5-10x faster than Maven for common workflows, Mill aims to make your project’s build process performant, maintainable, and flexible
databricks/spark-deep-learning
Deep Learning Pipelines for Apache Spark
databricks/LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
adafruit/Adafruit_Python_DHT
Python library to read the DHT series of humidity and temperature sensors on a Raspberry Pi or Beaglebone Black.
com-lihaoyi/requests-scala
A Scala port of the popular Python Requests HTTP client: flexible, intuitive, and straightforward to use.
databricks/devrel
This repository contains the notebooks and presentations we use for our Databricks Tech Talks
handsonscala/handsonscala
Discussion and and code examples for the book Hands-on Scala Programming
dmatrix/mlflow-workshop-part-1
Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this three part series, we will cover MLflow Tracking, Projects, Models, and Model Registry.
databrickslabs/cicd-templates
Manage your Databricks deployments and CI with code.
cloudera-labs/envelope
Build configuration-driven ETL pipelines on Apache Spark
dmatrix/mlflow-workshop-part-2
Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four part series, we will cover MLflow Tracking, Projects, Models, and Model Registry.
hortonworks/efm
cegganesh84/cdp-azure-tools
Cloudera CDP Tools for Microsoft Azure
odeshmane/cdp-azure-tools
Cloudera CDP Tools for Microsoft Azure