hitywt's Stars
pytorch/torchrec
Pytorch domain library for recommendation systems
greg7mdp/parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
bytedance/sonic-cpp
A fast JSON serializing & deserializing library, accelerated by SIMD.
cloudwu/coroutine
A asymmetric coroutine library for C.
TIM168/technical_books
:books:🔥收集全网最热门的技术书籍 (GO、黑客、Android、计算机原理、人工智能、大数据、机器学习、数据库、PHP、java、架构、消息队列、算法、python、爬虫、操作系统、linux、C语言),不间断更新中:hotsprings:
izenecloud/sf1r-lite
Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search
izenecloud/izenelib
General purpose C++ library for iZENECloud
ChunelFeng/CGraph
【A common used C++ DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流
taskflow/taskflow
A General-purpose Task-parallel Programming System using Modern C++
chenyahui/AnnotatedCode
知名开源代码库的注释版:C++、Golang等
Tencent/libco
libco is a coroutine library which is widely used in wechat back-end service. It has been running on tens of thousands of machines since 2013.
pytorch/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
dragonflydb/dragonfly
A modern replacement for Redis and Memcached
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
meta-llama/llama
Inference code for Llama models
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
horance-liu/tensorflow-internals
It is open source ebook about TensorFlow kernel and implementation mechanism.
ArchaeaSoftware/cudahandbook
Source code that accompanies The CUDA Handbook.
google/nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
NVIDIA/nccl-tests
NCCL Tests
dataprofessor/infographic
Infographic
ai-shifu/ChatALL
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
hitywt/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
fsgo/smart-go-dl
Multi Go version management