Pinned Repositories
baby-llm
RedisXANN
redis x ANN vss
RedisXLM
redis x language model inference (load trained model); size (tiny|t, small|s, medium|m, large|l) with quantization; NOTE: redis embedded language model, available for stand-alone version only
RedisXSlot
redis x slot module, async block migrate/restore, don't block or less block other cmd.
craftsman
a craftsman, try best to give a best practicable solution for biz~
doraemon-nb
ipython notebooks do some sample experiments , make some idea
geo
php 扩展(地图相关的操作)
iowrapper
io_uring lib or syscall wrapper; add some benchmark for io_uring experimental study
perf-book-cn
https://github.com/dendibakh/perf-book gitbook在线电子书,翻译成中文原始markdown文档
weedge's Repositories
weedge/cccl
CUDA C++ Core Libraries
weedge/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
weedge/ClickHouse
ClickHouse® is a free analytics DBMS for big data
weedge/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ====>>>> embedding vector data
weedge/cuCollections
weedge/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
weedge/cudf
cuDF - GPU DataFrame Library
weedge/DeepSpeed-Kernels
view: how to optimize
weedge/DrTopKSC
weedge/duckdb
DuckDB is an in-process SQL OLAP Database Management System
weedge/faster-whisper
Faster Whisper transcription with CTranslate2
weedge/FasterTransformer
Transformer related optimization, including BERT, GPT
weedge/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
weedge/generative-models
Generative Models by Stability AI, https://stability.ai/research/stable-video-diffusion-scaling-latent-video-diffusion-models-to-large-datasets (use it generative video demo for my baby hanghang~)
weedge/go-ds-crdt
A distributed go-datastore implementation using Merkle-CRDTs. for wedis add crdt with ipfs
weedge/go-plugin
Golang plugin system over RPC. diff idea~
weedge/gpu_selection
Parallel selection on GPUs. bucket-based selection algorithm on GPUs
weedge/HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of Merlin-KV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It also can be used as a generic key-value storage.
weedge/HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training; learn CTR for RecSys on GPU
weedge/jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
weedge/Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
weedge/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
weedge/metric
weedge/openmp-tutorial
Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"
weedge/RAPIDS-RAFT
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
weedge/surrealml
weedge/trpc-go
A pluggable, high-performance RPC framework written in golang, oho~ learn plugin impl to wedis, maybe impl a trpc-go plugin~~
weedge/trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
weedge/unordered_dense
A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion
weedge/usearch
Fastest Search & Clustering engine × for Vectors & Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, Wolfram, and WAI 🔍 ... 🐢 fork this to compare with faiss lib with more cases 🐢 code is wonderful~ head-only.