Pinned Repositories
airflow
Apache Airflow
bigo-presto
Official home of Presto, the distributed SQL query engine for big data
brpc
Most common RPC framework used throughout Baidu, with 600,000+ instances and 500+ kinds of services, called "baidu-rpc" inside Baidu.
ClickHouse
ClickHouse is a free analytic DBMS for big data.
gluten
hive
Apache Hive
hnsw
HNSW header-only C++/python lib, 200M SIFT experiments from the paper
incubator-druid
Apache Druid (Incubating) - Column oriented distributed data store ideal for powering interactive applications
kyuubi
Kyuubi is an enhanced editon of Apache Spark's primordial Thrift JDBC/ODBC Server.
x-deeplearning
An industrial deep learning framework for high-dimension sparse data
BIGO's Repositories
bigo-sg/brpc
Most common RPC framework used throughout Baidu, with 600,000+ instances and 500+ kinds of services, called "baidu-rpc" inside Baidu.
bigo-sg/ClickHouse
ClickHouse is a free analytic DBMS for big data.
bigo-sg/bigo-presto
Official home of Presto, the distributed SQL query engine for big data
bigo-sg/gluten
bigo-sg/hive
Apache Hive
bigo-sg/kyuubi
Kyuubi is an enhanced editon of Apache Spark's primordial Thrift JDBC/ODBC Server.
bigo-sg/spark-hive1.2.1
bigo-sg/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
bigo-sg/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. L
bigo-sg/atlas-monitor
atlas monitor
bigo-sg/ByConity
ByConity is an open source cloud-native data warehouse
bigo-sg/dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available `out of the box`.
bigo-sg/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
bigo-sg/dpdk
Mirror of Data Plane Development Kit, git://dpdk.org/dpdk (http://dpdk.org)
bigo-sg/gporca
A modular query optimizer for big data
bigo-sg/isa-l
Intelligent Storage Acceleration Library
bigo-sg/libhdfs3
HDFS file read access for ClickHouse
bigo-sg/libmaxminddb
C library for the MaxMind DB file format
bigo-sg/NNAnalytics
NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.
bigo-sg/orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
bigo-sg/presto-yarn
bigo-sg/ranger
Mirror of Apache Ranger
bigo-sg/rate
Golang rate limiter for distributed system
bigo-sg/redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs, Bitmaps.
bigo-sg/robin-hood-hashing
Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20
bigo-sg/rttr
C++ Reflection Library
bigo-sg/seastar
High performance server-side application framework
bigo-sg/spark
Apache Spark
bigo-sg/sysroot
Files for cross-compilation
bigo-sg/trino-hadoop-apache
Shaded version of Apache Hadoop for Trino