jy02414216's Stars
baidu/babylon
High-Performance C++ Fundamental Library
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
baidu/puck
Puck is a high-performance ANN search engine
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
lllyasviel/ControlNet
Let us control diffusion models!
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
bytedance/matxscript
A high-performance, extensible Python AOT compiler.
triton-lang/triton
Development repository for the Triton language and compiler
CVCUDA/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
alibaba/havenask
PaddlePaddle/FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
BBuf/tvm_mlir_learn
A collection of compiler learning resources.
pytorch/TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
PaddlePaddle/CINN
Compiler Infrastructure for Neural Networks
NervanaSystems/maxas
Assembler for NVIDIA Maxwell architecture
shenweichen/GraphEmbedding
Implementation and experiments of graph embedding algorithms.
dmlc/dgl
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
baidu/braft
An industrial-grade C++ implementation of the RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly available distributed systems.
kakao/n2
TOROS N2 - lightweight approximate Nearest Neighbor library which runs fast even with large datasets
kharchenkolab/conos
R package for the joint analysis of multiple single-cell RNA-seq datasets
google-research/google-research
Google Research
onnx/onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
ronaldo8210/brpc_source_code_analysis
brpc source code analysis