Flowingsun007's Stars
torvalds/linux
Linux kernel source tree
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
geekan/HowToLiveLonger
A programmer's guide to living longer
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
taichi-dev/taichi
Productive, portable, and performant GPU programming in Python.
WeNeedHome/SummaryOfLoanSuspension
A summary of loan-suspension notices from provinces and cities nationwide
rui314/mold
Mold: A Modern Linker 🦠
zhaoolee/ChineseBQB
🇨🇳 Chinese sticker pack, more joy / A museum of sticker packs, the most addictive repository on GitHub, a big collection of ** sticker packs, gathering joy~
apache/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
xournalpp/xournalpp
Xournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets.
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
microsoft/mimalloc
mimalloc is a compact general purpose allocator with excellent performance.
microsoft/STL
MSVC's implementation of the C++ Standard Library.
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
meituan/YOLOv6
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
purocean/yn
A highly extensible Markdown editor. Version control, AI Copilot, mind map, documents encryption, code snippet running, integrated terminal, chart embedding, HTML applets, Reveal.js, plug-in, and macro replacement.
ROCm/HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
LinuxSuRen/remote-jobs-in-china
** companies that support remote work
wsdjeg/Learn-Vim_zh_cn
Learn Vim the smart way
zuoqing1988/ZQCNN
An inference framework that also comes with many useful demos; please star it if you find it useful
THUDM/CogDL
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
Tencent/flare
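Flare is a modern C++ development framework widely deployed in the Tencent Ads backend, comprising a base library, RPC, various clients, and more. Its main strengths are ease of use and low tail latency.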
alibaba/BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
PhilJd/contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
Oneflow-Inc/models
Models and examples built with OneFlow