jinqiua's Stars
replicate/replicate-python
Python client for Replicate
AIDC-AI/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
ztxz16/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
hnsywangxin/controlnet_stable_tensorrt
stable diffusion, controlnet, tensorrt, accelerate
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
zhanzy178/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Rayrtfr/FasterTransformer
Transformer related optimization, including BERT, GPT
arkingc/note
学习笔记整理📚
youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
HuangJunJie2017/BEVDet
Official code base of the BEVDet series .
JulianWww/GPU-CUDA-MergeSort
traveller59/spconv
Spatial Sparse Convolution Library
kweisamx/TensorFlow-ESPCN
TensorFlow implementation of the Efficient Sub-Pixel Convolutional Neural Network
xxxwuwq/SRCNN-REPRODUCTION
The reproduction of SRCNN method for super-resolution
drakelevy/ESPCN-TensorFlow
An implementation of the Efficient Sub-Pixel Convolutional Neural Network in TensorFlow