Pinned Repositories
3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Ammonite
Scala Scripting
arrow
Mirror of Apache Arrow
arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
atlasdb
Transactional Distributed Database Layer
Awesome-Chinese-LLM
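A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, domain-specific fine-tuned models and applications, datasets, and tutorials.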
bce-qianfan-sdk
Provides best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform.
git
spark
Mirror of Apache Spark
LuciferYang's Repositories
LuciferYang/commons-crypto
Apache Commons Crypto
LuciferYang/spark
Mirror of Apache Spark
LuciferYang/3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
LuciferYang/Ammonite
Scala Scripting
LuciferYang/arrow
Mirror of Apache Arrow
LuciferYang/bce-qianfan-sdk
Provides best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform.
LuciferYang/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
LuciferYang/geronimo-xbean
Mirror of Apache Geronimo xbean
LuciferYang/gravitino
A high-performance, geo-distributed and federated metadata lake
LuciferYang/hive
Apache Hive
LuciferYang/iceberg
Apache Iceberg
LuciferYang/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
LuciferYang/jcasbin
An authorization library that supports access control models like ACL, RBAC, ABAC in Java
LuciferYang/kyuubi
Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
LuciferYang/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
LuciferYang/nifi
Apache NiFi
LuciferYang/official-images
Primary source of truth for the Docker "Official Images" program
LuciferYang/orc
Mirror of Apache ORC
LuciferYang/paimon-trino
Trino Connector for Apache Paimon.
LuciferYang/parquet-mr
Mirror of Apache Parquet
LuciferYang/polaris
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
LuciferYang/ranger
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
LuciferYang/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
LuciferYang/spark-connect-swift
Apache Spark Connect Client for Swift
LuciferYang/spark-docker
Official Dockerfile for Apache Spark
LuciferYang/spark-kubernetes-operator
Apache Spark Kubernetes Operator
LuciferYang/spark-upgrade
Magic to help Spark pipelines upgrade
LuciferYang/spark-website
Apache Spark Website
LuciferYang/unitycatalog
Open, Multi-modal Catalog for Data & AI
LuciferYang/xxHash
Extremely fast non-cryptographic hash algorithm