seungjinn's Stars
microsoft/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
prestodb/presto
The official home of the Presto distributed SQL query engine for big data
rapidsai/cudf
cuDF - GPU DataFrame Library
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
rust-unofficial/patterns
A catalogue of Rust design patterns, anti-patterns and idioms
riscv-collab/riscv-gnu-toolchain
GNU toolchain for RISC-V, including GCC
google-research/t5x
BlazingDB/blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.
Wenzel/awesome-virtualization
Collection of resources about Virtualization
lizrice/ebpf-beginners
The beginner's guide to eBPF
kaist-cp/cs431
google/tensorstore
Library for reading and writing large multi-dimensional arrays.
WebAssembly/wasi-sdk
WASI-enabled WebAssembly C/C++ toolchain
sched-ext/scx
sched_ext schedulers and tools
eunomia-bpf/bpftime
Userspace eBPF runtime for Observability, Network & General Extensions Framework
erpc-io/eRPC
Efficient RPCs for datacenter networks
iovisor/ubpf
Userspace eBPF VM
rapidsai/raft
RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high-performance applications.
delimitrou/DeathStarBench
Open-source benchmark suite for cloud microservices
1duo/awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
OpenMPDK/SMDK
SMDK (Scalable Memory Development Kit) is developed for the Samsung CXL (Compute Express Link) Memory Expander to enable a full-stack Software-Defined Memory system
MoatLab/Pond
Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS'23)
ipdk-io/ipdk
Infrastructure Programmer Development Kit (IPDK) is an open-source, vendor-agnostic framework of drivers and APIs for infrastructure offload and management that runs on a CPU, IPU, DPU, or switch.
cambridgehackers/connectal
Connectal is a framework for software-driven hardware development.
google/minimalloc
A lightweight memory allocator for hardware-accelerated machine learning
kernelci/kernelci-core
Core KernelCI tools
MLBazaar/MLPrimitives
Primitives for machine learning and data science.
shao-hua-li/UBGen
UBGen can generate programs with undefined behaviors (e.g., buffer overflow, use-after-free)