lxynov

Bay Area

lxynov's Stars

awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
61.5k 2.3k 11710k
minio/minio
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
Language:Go49.1k 628 7.5k5.6k
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python34.7k 479 19.1k5.9k
zsh-users/zsh-syntax-highlighting
Fish shell like syntax highlighting for Zsh.
Language:Shell20.4k 136 6961.3k
mlflow/mlflow
Open source platform for the machine learning lifecycle
Language:Python19.1k 305 4k4.3k
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Language:Python16.6k 187 14.7k4.2k
helm/charts
⚠️(OBSOLETE) Curated applications for Kubernetes
Language:Go15.5k 386 6.3k16.8k
open-policy-agent/opa
Open Policy Agent (OPA) is an open source, general-purpose policy engine.
Language:Go9.8k 128 2.7k1.4k
crossplane/crossplane
The Cloud Native Control Plane
Language:Go9.7k 148 2.2k975
spinnaker/spinnaker
Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.
Language:Shell9.4k 342 5.5k1.2k
bitnami/charts
Bitnami Helm Charts
Language:Smarty9.1k 102 8.7k9.3k
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Language:Python7.2k 76 1.1k797
lancedb/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
Language:Python5.1k 32 805356
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
Language:Rust4.1k 43 1.1k236
facebookincubator/velox
A composable and fully extensible C++ execution engine library for data management systems.
Language:C++3.6k 114 2.2k1.2k
apache/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Language:Scala2.1k 63 2.3k920
uber-common/jvm-profiler
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Language:Java1.8k 103 38342
Intel-bigdata/HiBench
HiBench is a big data benchmark suite.
Language:Java1.5k 126 369766
apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Language:Scala1.2k 41 2.5k446
substrait-io/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
Language:Python1.2k 42 174161
apache/ranger
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
Language:Java917 74 0987
apache/incubator-livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Language:Scala895 59 20603
aws/aws-graviton-getting-started
Helping developers to use AWS Graviton2, Graviton3, and Graviton4 processors which power the 6th, 7th, and 8th generation of Amazon EC2 instances (C6g[d], M6g[d], R6g[d], T4g, X2gd, C6gn, I4g, Im4gn, Is4gen, G5g, C7g[d][n], M7g[d], R7g[d], R8g).
Language:Python895 46 55201
trinodb/trino-python-client
Python client for Trino
Language:Python340 16 183170
apple/batch-processing-gateway
The gateway component to make Spark on K8s much easier for Spark users.
Language:Java183 15 638
IBM/spark-tpc-ds-performance-test
Use the TPC-DS benchmark to test Spark SQL performance
Language:TSQL176 38 1696
martint/jmxutils
Exporting JMX mbeans made easy
Language:Java173 10 1547
Lewuathe/docker-trino-cluster
Multiple node presto cluster on docker container
Language:Makefile124 5 646
DataDog/jmxfetch
Export JMX metrics
Language:Java98 128 9170
datapunchorg/punch
This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like Apache Spark.
Language:Go53 6 75

lxynov

lxynov's Stars

awesomedata/awesome-public-datasets

minio/minio

ray-project/ray

zsh-users/zsh-syntax-highlighting

mlflow/mlflow

airbytehq/airbyte

helm/charts

open-policy-agent/opa

crossplane/crossplane

spinnaker/spinnaker

bitnami/charts

bentoml/BentoML

lancedb/lancedb

lancedb/lance

facebookincubator/velox

apache/kyuubi

uber-common/jvm-profiler

Intel-bigdata/HiBench

apache/incubator-gluten

substrait-io/substrait

apache/ranger

apache/incubator-livy

aws/aws-graviton-getting-started

trinodb/trino-python-client

apple/batch-processing-gateway

IBM/spark-tpc-ds-performance-test

martint/jmxutils

Lewuathe/docker-trino-cluster

DataDog/jmxfetch

datapunchorg/punch