windpiger's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
microsoft/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
apache/hive
Apache Hive
JerryLead/SparkInternals
Notes talking about the design and implementation of Apache Spark
elyase/awesome-gpt3
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
apache/carbondata
High performance data store solution
graphframes/graphframes
databricks/spark-avro
Avro Data Source for Apache Spark
krb5/krb5
mirror of MIT krb5 repository
boundary/high-scale-lib
A fork of Cliff Click's High Scale Library. Improved with bug fixes and a real build system.
ankurdave/color-identifiers-mode
Emacs minor mode to highlight each source code identifier uniquely based on its name
bytedance/primus
rahulsom/lgtmin
Say 'Looks good to me' with pictures
aliyun/aliyun-emapreduce-datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
scala-records/scala-records
Labeled records for Scala based on structural refinement types and macros.
alibaba/SparkCube
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
apache/spark-website
Apache Spark Website
apache/directory-kerby
Mirror of Apache Directory Kerby
hortonworks-spark/spark-llap
yanboliang/spark-vlbfgs
Vector-free L-BFGS implementation for Spark MLlib
JoshRosen/hive
Mirror of Apache Hive
TritonNetworking/themis_tritonsort
Themis MapReduce and TritonSort