Pinned Repositories
deltacat
A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
ray_beam_runner
Ray-based Apache Beam runner
allennlp
An open-source NLP research library, built on PyTorch.
graph-partition
Implements different graph partition algorithms using the NetworkX Python library
h5boss
Exploratory tools for reformatting BOSS spectra as hdf5 files
h5py
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
h5spark
Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark
sci-swift
Scientific Object Store based on OpenStack Swift
tomosynthesis
3D Medical Image Reconstruction Platform
valiantljk's Repositories
valiantljk/h5boss
Exploratory tools for reformatting BOSS spectra as hdf5 files
valiantljk/h5py
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
valiantljk/sci-swift
Scientific Object Store based on OpenStack Swift
valiantljk/cython_exercise
Exercising Cython with IO codes
valiantljk/dfgsheet
Python Dataframe from Google Sheet
valiantljk/GOTCHA-tracer
Tracer generator example using GOTCHA
valiantljk/h5boss-dev
The h5boss Python code base, used for multiple purposes
valiantljk/h5py_ioprof
Profiling H5py IO performance with Darshan
valiantljk/mpi4py-examples
mpi4py examples
valiantljk/alga-blooms
Website with database for analyzing water quality data in Texas
valiantljk/app-resilience
valiantljk/atgtools
Various tools created for my work in NERSC ATG
valiantljk/build_hdf5
Scripts for building HDF5 for various platforms and compilers
valiantljk/CAPTCHA
CAPTCHA classification implemented in TensorFlow
valiantljk/ceph
Ceph is a distributed object, block, and file storage platform
valiantljk/collectiveIO-profile
valiantljk/das-ml
valiantljk/deepsee
valiantljk/distribute-data-MPI
Redistribute a large data set into random subsets using one-sided MPI
valiantljk/docker
Dockerfiles for building
valiantljk/dockerfiles
Collection of docker images
valiantljk/gpu_study
Learning the basics of GPU programming with CUDA and using GPUs in deep learning
valiantljk/mpitutorial
MPI programming lessons in C and executable code examples
valiantljk/nersc-python-bench
valiantljk/pickledb
pickleDB is an open source key-value store using Python's simplejson module.
valiantljk/pravega
Pravega - Streaming as a new software defined storage primitive
valiantljk/productive-IO
A deep dive into parallel h5py, analyzing the performance gap between h5py and HDF5
valiantljk/survey-crawling
Simple Python code for crawling a survey website to extract all user information
valiantljk/tensorfx
TensorFlow framework for training and serving machine learning models
valiantljk/train
Resources for user training exercises