Pinned Repositories
caffe-bvlc
Caffe: a fast open framework for deep learning.
caffe-intel
hjchen2.github.io
individual blog
paddle-mobile-benchmark
PaddleMobile
oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
oneflow-xrt
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Paddle-Lite
PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
hjchen2's Repositories
hjchen2/paddle-mobile
This research aims at simply deploying deeplearning on mobile and embedded devices, with low complexity and high speed. old name mobile deep learning.
hjchen2/personal
hjchen2/tensorflow
Computation using data flow graphs for scalable machine learning
hjchen2/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
hjchen2/hjchen2.github.io
individual blog
hjchen2/Paddle
PArallel Distributed Deep LEarning
hjchen2/albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
hjchen2/armnn
Arm NN ML Software. The code here is a read-only mirror of https://review.mlplatform.org/admin/repos/ml/armnn
hjchen2/chroutine
C++ coroutine framework
hjchen2/ci_demo
hjchen2/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
hjchen2/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
hjchen2/docker-ubuntu-desktop
Docker Image for Ubuntu Desktop which support HW GPU accelerated GUI apps. you can access the Container with ssh or remote desktop, just like Cloud VM.
hjchen2/incubator-brpc
Industrial-grade RPC framework used throughout Baidu, with 1,000,000+ instances and thousands kinds of services, called "baidu-rpc" inside Baidu.
hjchen2/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
hjchen2/llvm-tutorial
hjchen2/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
hjchen2/numba
NumPy aware dynamic Python compiler using LLVM
hjchen2/oneflow
OneFlow is a performance-centered and open-source deep learning framework.
hjchen2/OpenArkCompiler
the source code of OpenArkCompiler(Mirror Repo)
hjchen2/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
hjchen2/pytorch_backend
The Triton backend for the PyTorch TorchScript models.
hjchen2/Serving
A flexible, high-performance serving system for machine learning models(『飞桨』服务器端部署库)
hjchen2/sgemm_hsw
This is an implementation of sgemm_kernel on L1d cache.
hjchen2/souper
A superoptimizer for LLVM IR
hjchen2/taichi
The Taichi programming language
hjchen2/TASO
A Tensor Algebra SuperOptimizer for Deep Learning
hjchen2/tiramisu
A polyhedral compiler for expressing fast and portable data parallel algorithms
hjchen2/triton
Development repository for the Triton language and compiler
hjchen2/tvm-1
TVM integration into PyTorch