Notherthing's Stars
ggerganov/llama.cpp
LLM inference in C/C++
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
taskflow/taskflow
A General-purpose Task-parallel Programming System using Modern C++
pytorch/tutorials
PyTorch tutorials.
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
microsoft/SPTAG
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.
PlatformLab/NanoLog
Nanolog is an extremely performant nanosecond scale logging system for C++ that exposes a simple printf-like API.
ChunelFeng/CGraph
【A common used C++ DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流
efficient/libcuckoo
A high-performance, concurrent hash table
linux-rdma/rdma-core
RDMA core userspace libraries and daemons
microsoft/DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
tjumcw/6.824
MIT 6.824 distributed system C++Version
rapidsai/raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
intelligent-machine-learning/glake
GLake: optimizing GPU memory management and IO transmission.
OpenMPDK/SMDK
SMDK, Scalable Memory Development Kit, is developed for Samsung CXL(Compute Express Link) Memory Expander to enable full-stack Software-Defined Memory system
oap-project/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
claudebarthels/infinity
A lightweight C++ RDMA library for InfiniBand networks.
CUDA-Tutorial/CodeSamples
Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"
gpudirect/libibverbs
Mellanox libibverbs
howardlau1999/rdmapp
C++ interfaces for RDMA access
wxdwfc/rlib
RLib is a header-only library for easier usage of RDMA.
wxdwfc/rlibv2
A mirror of RLib from lab.
sunkx109/llama.cpp
llama 2 Inference
RavenLite/CompactChineseLaTeXResume
📋 A compact Chinese resume template using LaTeX. | 一个使用 LaTeX 编写的简单清晰的中文简历模板。
xiatwhu/baidu_topk
Notherthing/Recommendation-Systems
智能推荐系统作业
Notherthing/STI2_GPUtopK
STI2_GPUtopK,百度搜索比赛,GPUtopK代码