hjchen2

beijing, China

Pinned Repositories

caffe-bvlc
Caffe: a fast open framework for deep learning.
Language:C++10
caffe-intel
Language:C++1 2 00
hjchen2.github.io
individual blog
Language:HTML00
paddle-mobile-benchmark
Language:Python3 3 04
PaddleMobile
Language:C++32
oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Language:C++5.8k 146 957654
oneflow-xrt
Language:C++22 36 23
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）
Language:C++21.7k 721 18k5.5k
Paddle-Lite
PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎）
Language:C++6.9k 340 2.4k1.6k
onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
Language:Python1.3k 38 33779

hjchen2's Repositories

hjchen2/paddle-mobile
This research aims at simply deploying deeplearning on mobile and embedded devices, with low complexity and high speed. old name mobile deep learning.
Language:C++1 2 00
hjchen2/personal
Language:JavaScript1 1 1
hjchen2/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++1 2 00
hjchen2/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:C++1 2 0
hjchen2/hjchen2.github.io
individual blog
Language:HTML00
hjchen2/Paddle
PArallel Distributed Deep LEarning
Language:C++0 2 00
hjchen2/albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
hjchen2/armnn
Arm NN ML Software. The code here is a read-only mirror of https://review.mlplatform.org/admin/repos/ml/armnn
hjchen2/chroutine
C++ coroutine framework
hjchen2/ci_demo
Language:C++2 0
hjchen2/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Language:Python1 0
hjchen2/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
Language:C++1 0
hjchen2/docker-ubuntu-desktop
Docker Image for Ubuntu Desktop which support HW GPU accelerated GUI apps. you can access the Container with ssh or remote desktop, just like Cloud VM.
hjchen2/incubator-brpc
Industrial-grade RPC framework used throughout Baidu, with 1,000,000+ instances and thousands kinds of services, called "baidu-rpc" inside Baidu.
Language:C++1 0
hjchen2/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python1 0
hjchen2/llvm-tutorial
Language:C++2 0
hjchen2/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
hjchen2/numba
NumPy aware dynamic Python compiler using LLVM
Language:Python1 0
hjchen2/oneflow
OneFlow is a performance-centered and open-source deep learning framework.
Language:C++1 0
hjchen2/OpenArkCompiler
the source code of OpenArkCompiler（Mirror Repo）
Language:C++1 0
hjchen2/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:C++2 0
hjchen2/pytorch_backend
The Triton backend for the PyTorch TorchScript models.
Language:C++0 0
hjchen2/Serving
A flexible, high-performance serving system for machine learning models（『飞桨』服务器端部署库）
hjchen2/sgemm_hsw
This is an implementation of sgemm_kernel on L1d cache.
Language:Assembly1 0
hjchen2/souper
A superoptimizer for LLVM IR
Language:C++1 0
hjchen2/taichi
The Taichi programming language
Language:C++1 0
hjchen2/TASO
A Tensor Algebra SuperOptimizer for Deep Learning
Language:C++1 0
hjchen2/tiramisu
A polyhedral compiler for expressing fast and portable data parallel algorithms
Language:Jupyter Notebook1 0
hjchen2/triton
Development repository for the Triton language and compiler
hjchen2/tvm-1
TVM integration into PyTorch