Pinned Repositories
aliyun-emapreduce-sdk
Hadoop/Spark on Aliyun, supporting interactions with Aliyun's base services.
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
awesome-gpt3
aws-doc-sdk-examples
Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.md file below.
aws-sdk-cpp
AWS SDK for C++
benchmarks
A place in which we publish scripts for reproducible benchmarks.
botocore
The low-level, core functionality of boto3 and the AWS CLI.
chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
connectors
Connectors for Delta Lake
sparkstudy
windpiger's Repositories
windpiger/antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
windpiger/awesome-gpt3
windpiger/aws-doc-sdk-examples
Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.md file below.
windpiger/aws-sdk-cpp
AWS SDK for C++
windpiger/botocore
The low-level, core functionality of boto3 and the AWS CLI.
windpiger/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
windpiger/connectors
Connectors for Delta Lake
windpiger/connectors-2
Connectors for Delta Lake
windpiger/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
windpiger/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
windpiger/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
windpiger/eel-sdk
Big Data Toolkit for the JVM
windpiger/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
windpiger/grammars-v4
Grammars written for ANTLR v4; expectation that the grammars are free of actions.
windpiger/hive
Mirror of Apache Hive
windpiger/io
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
windpiger/kyuubi
Kyuubi is an enhanced editon of Apache Spark's primordial Thrift JDBC/ODBC Server.
windpiger/migrating-to-cloud-native-application-architectures
《迁移到云原生应用架构》中文版 http://jimmysong.io/migrating-to-cloud-native-application-architectures/
windpiger/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
windpiger/mysql-server
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.
windpiger/parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
windpiger/presto-exercise-debug
debug presto by myself to learn presto
windpiger/s3fs
S3 Filesystem
windpiger/smart-data-lake
Framework to quickly build and maintain Smart Data Lakes
windpiger/spark
Mirror of Apache Spark
windpiger/spark-adaptive
windpiger/spark-deep-learning
Deep Learning Pipelines for Apache Spark
windpiger/spark-llap
windpiger/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
windpiger/tensorflow
An Open Source Machine Learning Framework for Everyone