Pinned Repositories
BabelStream
STREAM, for lots of devices written in many programming models
benchmark
benchmark profiling
benchmark-1
book
Deep Learning 101 with PaddlePaddle
caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
callCoreMLFromCppOrPython
example of using CoreML from c++
cld
Language Detection based on Chromium's Compact Language Detector library
concurrentqueue
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
cpp-subprocess
popen() -like C++ library with iostream support for stdio forwarding
CTranslate2
Optimized inference engine for OpenNMT models
chengduoZH's Repositories
chengduoZH/benchmark-1
chengduoZH/callCoreMLFromCppOrPython
example of using CoreML from c++
chengduoZH/concurrentqueue
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
chengduoZH/cpp-subprocess
popen() -like C++ library with iostream support for stdio forwarding
chengduoZH/CTranslate2
Optimized inference engine for OpenNMT models
chengduoZH/DeepLearningFrameworks
Demo of running NNs across different frameworks
chengduoZH/dlrm
An implementation of a deep learning recommendation model (DLRM)
chengduoZH/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
chengduoZH/FluidDoc
Documentations for PaddlePaddle
chengduoZH/grpc-go-pool
grpc connection pool
chengduoZH/librime
Rime Input Method Engine, the core library
chengduoZH/llama.cpp
LLM inference in C/C++
chengduoZH/llama2.c
Inference Llama 2 in one file of pure C
chengduoZH/llm.c
LLM training in simple, raw C/CUDA
chengduoZH/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
chengduoZH/marian
Fast Neural Machine Translation in C++
chengduoZH/models
Model configurations
chengduoZH/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
chengduoZH/oneflow
OneFlow is a performance-centered and open-source deep learning framework.
chengduoZH/open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
chengduoZH/Paddle
PArallel Distributed Deep LEarning
chengduoZH/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
chengduoZH/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
chengduoZH/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
chengduoZH/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
chengduoZH/TensorRT
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
chengduoZH/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
chengduoZH/TurboTransformers
a fast and user-friendly tool for transformer inference on CPU and GPU
chengduoZH/vearch
A distributed system for efficient similarity search of embedding vectors
chengduoZH/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs