chenxfeng's Stars
CyC2018/CS-Notes
:books: Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, and system design
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
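A minimal sketch of the workflow this blurb describes (tensors, define-by-run autograd, optional GPU acceleration); the shapes and device check below are illustrative assumptions, not anything taken from this listing:

```python
import torch

# Fall back to CPU when no CUDA device is available (assumption for illustration).
device = "cuda" if torch.cuda.is_available() else "cpu"

x = torch.randn(4, 3, device=device, requires_grad=True)  # random input tensor
w = torch.randn(3, 2, device=device, requires_grad=True)  # weight tensor

y = (x @ w).relu().sum()  # the graph is built dynamically as operations execute
y.backward()              # autograd computes gradients for x and w

print(w.grad.shape)  # torch.Size([3, 2])
```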
521xueweihan/GitHub520
:kissing_heart: Makes you "love" GitHub by fixing broken images and slow page loads. (No installation required)
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the core framework of PaddlePaddle/飞桨: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)
apache/mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with a Dynamic, Mutation-aware Dataflow Dependency Scheduler; for Python, R, Julia, Scala, Go, JavaScript and more
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
zhisheng17/flink-learning
Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, internals, hands-on practice, performance tuning, and source-code analysis. Includes learning examples for Flink Connectors, Metrics, Libraries, the DataStream API, and the Table API & SQL, as well as large production case studies of Flink in the field (PV/UV counting, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for the author's column "Big Data Real-Time Compute Engine Flink in Action and Performance Optimization" is welcome.
apache/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
openvinotoolkit/openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
NervanaSystems/neon
Intel® Nervana™ reference deep learning framework committed to best performance on all hardware
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
alibaba/Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
ARM-software/ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
soumith/convnet-benchmarks
Easy benchmarking of all publicly accessible implementations of convnets
ispc/ispc
Intel® Implicit SPMD Program Compiler
msys2/msys2
A software distro and building platform for Windows
intelxed/xed
The X86 Encoder Decoder (XED) is a software library for encoding and decoding X86 (IA32 and Intel64) instructions
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
NVIDIA/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
NVIDIA/caffe
Caffe: a fast open framework for deep learning.
jmeubank/tdm-gcc
TDM-GCC is a cleverly disguised GCC compiler for Windows!
keystone-enclave/keystone
Keystone Enclave (QEMU + HiFive Unleashed)
numactl/numactl
NUMA support for Linux
nascab/nascab-web
daadaada/turingas
Assembler for NVIDIA Volta and Turing GPUs
xingyul/sparse-winograd-cnn
Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)