Pinned Repositories
eventsim
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
fastdict
Research code for binary code indexing and search
flickr_fetcher
Research code for image interestingness
grafana-presto
Grafana with Presto support
iceberg-rs
Rust implementation of Apache Iceberg
moedict-web
MoeDict (萌典) Web Client and REST API
parquet-tools
parquet-tools and dependency jar files
puppet-hadoop
Puppet module for deploying Hadoop MapReduce Next Generation (MRv2)
ros-driver-techman-robot
ROS driver for Techman Robot
SparkAffinityPropagation
Affinity Propagation on Spark
viirya's Repositories
viirya/eventsim
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
viirya/arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
viirya/arrow-rs
Official Rust implementation of Apache Arrow
viirya/spark-1
Mirror of Apache Spark
viirya/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
viirya/arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
viirya/arrow-datafusion-python
Apache Arrow DataFusion Python Bindings
viirya/Bend
A massively parallel, high-level programming language
viirya/hadoop
Apache Hadoop
viirya/iceberg
Apache Iceberg
viirya/iceberg-rust
Apache Iceberg
viirya/incubator-opendal
Apache OpenDAL: Access data freely, painlessly, and efficiently.
viirya/lance
Modern columnar data format for ML implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming.
viirya/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
viirya/llama-rs
Run LLaMA inference on CPU, with Rust 🦀🚀🦙
viirya/llama2.rs
A fast llama2 decoder in pure Rust.
viirya/lst-bench
LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
viirya/materialize
Materialize is a fast, distributed SQL database built on streaming internals.
viirya/mlx-examples
Examples in the MLX framework
viirya/pyspark-ai
English SDK for Apache Spark
viirya/qdrant
Qdrant - Vector Database for the next generation of AI applications. Also available in the cloud https://cloud.qdrant.io/
viirya/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
viirya/ray-llm
RayLLM - LLMs on Ray
viirya/ray-serve-text-ml
Finetune and serve t5-small model with text-to-sql dataset using Ray
viirya/risingwave
RisingWave: A Distributed SQL Database for Stream Processing
viirya/rjvm
A tiny JVM written in Rust. Learning project.
viirya/spark-cassandra-connector
DataStax Spark Cassandra Connector
viirya/spark-website
Apache Spark Website
viirya/sqllogictest-rs
Sqllogictest parser and runner in Rust.
viirya/sqlparser-rs
Extensible SQL Lexer and Parser for Rust