Cuda-Chen
Image Processing, Machine learning, Parallel Computing
Seeking for opportunitiesTaipei, Taiwan
Cuda-Chen's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
slint-ui/slint
Slint is a declarative GUI toolkit to build native user interfaces for Rust, C++, or JavaScript apps.
valkey-io/valkey
A new project to resume development on the formerly open-source Redis project. We're calling it Valkey, since it's a twist on the key-value datastore.
microsoft/garnet
Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication features. Garnet can work with existing Redis clients.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
apple/ml-mgie
apache/kvrocks
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
Netflix/bpftop
bpftop provides a dynamic real-time view of running eBPF programs. It displays the average runtime, events per second, and estimated total CPU % for each program.
ashvardanian/StringZilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖
facebookincubator/oomd
A userspace out-of-memory killer
rsta2/circle
A C++ bare metal environment for Raspberry Pi with USB (32 and 64 bit)
oreboot/oreboot
oreboot is a fork of coreboot, with C removed, written in Rust.
GoogleCloudPlatform/localllm
intel/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
kolodny/safetest
stclib/STC
A modern, user friendly, generic, type-safe and fast C99 container library: String, Vector, Sorted and Unordered Map and Set, Deque, Forward List, Smart Pointers, Bitset and Random numbers.
tidwall/neco
Concurrency library for C (coroutines)
ashvardanian/SimSIMD
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
asterinas/asterinas
Asterinas is a secure, fast, and general-purpose OS kernel, written in Rust and providing Linux-compatible ABI.
nihui/ruapu
Detect CPU features with single-file
attractivechaos/plb2
A programming language benchmark
fuchuanpu/HyperVision
Flow Interaction Graph based attack traffic detection system.
GPUOpen-LibrariesAndSDKs/HIPRT
sysprog21/concurrency-primer
Concurrency Primer
tidwall/bluebox
Redis clone using Neco
sysprog21/vsnd
Virtual Linux soundcard driver
realwujing/rnnoise