Pinned Repositories
Ammonite
Scala Scripting
arrow
Mirror of Apache Arrow
arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
atlasdb
Transactional Distributed Database Layer
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
bigflow
brpc
Most common RPC framework used throughout Baidu, with 600,000+ instances and 500+ kinds of services, called "baidu-rpc" inside Baidu.
brpc-java
Java implementation for Baidu RPC, multi-protocol & high performance RPC.
git
spark
Mirror of Apache Spark
LuciferYang's Repositories
LuciferYang/spark
Mirror of Apache Spark
LuciferYang/arrow
Mirror of Apache Arrow
LuciferYang/arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
LuciferYang/chill
Scala extensions for the Kryo serialization library
LuciferYang/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
LuciferYang/elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
LuciferYang/gluten
LuciferYang/gravitino
A high-performance, geo-distributed and federated metadata lake
LuciferYang/hudi
Upserts, Deletes And Incremental Processing on Big Data.
LuciferYang/iceberg
Apache Iceberg
LuciferYang/incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
LuciferYang/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
LuciferYang/kryo
Java serialization and cloning: fast, efficient, automatic
LuciferYang/kubernetes-client
Java client for Kubernetes & OpenShift
LuciferYang/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
LuciferYang/llama
Inference code for LLaMA models
LuciferYang/llama_index
LlamaIndex is a data framework for your LLM applications
LuciferYang/logging-log4j2
Apache Log4j 2 is a versatile, feature-rich, efficient logging API and backend for Java.
LuciferYang/Megatron-LM
Ongoing research training transformer models at scale
LuciferYang/metrics
:chart_with_upwards_trend: Capturing JVM- and application-level metrics. So you know what's going on.
LuciferYang/nimble
New file format for storage of large columnar datasets.
LuciferYang/onetable
OneTable is an omni-directional converter for table formats that facilitates interoperability across data processing systems and query engines.
LuciferYang/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
LuciferYang/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
LuciferYang/simdjson-java
A Java version of simdjson
LuciferYang/spark-docker
Official Dockerfile for Apache Spark
LuciferYang/spark-kubernetes-operator
Apache Spark Kubernetes Operator
LuciferYang/sparklyr
R interface for Apache Spark
LuciferYang/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
LuciferYang/unitycatalog
Open, Multi-modal Catalog for Data & AI