Pinned Repositories
deltacat
A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
ray_beam_runner
Ray-based Apache Beam runner
allennlp
An open-source NLP research library, built on PyTorch.
graph-partition
implement different partition algorithm using Networkx python library
h5boss
Exploratory tools for reformatting BOSS spectra as hdf5 files
h5py
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
h5spark
Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark
sci-swift
Scientific Object Store based on Openstack Swift
tomosynthesis
3D Medical Image Reconstruction Platform
valiantljk's Repositories
valiantljk/h5spark
Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark
valiantljk/allennlp
An open-source NLP research library, built on PyTorch.
valiantljk/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
valiantljk/code-prettify
An embeddable script that makes source-code snippets in HTML prettier.
valiantljk/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
valiantljk/cvat
Powerful and efficient Computer Vision Annotation Tool (CVAT)
valiantljk/docker-nginx
Official NGINX Dockerfiles
valiantljk/fluid
Fluid, elastic data abstraction layer for BigData/AI applications in cloud native systems
valiantljk/icml20-smp
[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"
valiantljk/kafka-python
Python client for Apache Kafka
valiantljk/katib
Repository for hyperparameter tuning
valiantljk/kubernetes
Production-Grade Container Scheduling and Management
valiantljk/ncem_io
explore ncem io on nersc
valiantljk/oap-raydp
RayDP: Distributed data processing library on Ray by running popular big data frameworks like Apache Spark on Ray. RayDP seamlessly integrates with other Ray libraries to make it simple to build E2E data analytics and AI pipeline.
valiantljk/pipelines
Machine Learning Pipelines for Kubeflow
valiantljk/py4j
Py4J enables Python programs to dynamically access arbitrary Java objects
valiantljk/pyprob
A PyTorch-based library for probabilistic programming and inference compilation
valiantljk/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
valiantljk/ray_beam_runner
Ray-based Apache Beam runner
valiantljk/redis-logger
simple logger based on redis, used in gRPC service on K8S
valiantljk/rust_fun
Rust program for fun
valiantljk/service_logger
Simple Logger for K8S services based on Redis
valiantljk/storage_ppl
Probabilistic Programming for Storage and I/O
valiantljk/tf-operator
Tools for ML/Tensorflow on Kubernetes.
valiantljk/tinyml
valiantljk/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
valiantljk/deltacat
A Pythonic Data Catalog powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
valiantljk/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
valiantljk/langchain-ray
Examples on how to use LangChain and Ray
valiantljk/petals
🌸 Run 100B+ language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading