jinqiua

jinqiua's Stars

replicate/replicate-python
Python client for Replicate
Language:Python749217
AIDC-AI/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Language:Python43625
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
2.7k182
ztxz16/fastllm
纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行
Language:C++3.3k335
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python11.7k3.5k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15k1.4k
hnsywangxin/controlnet_stable_tensorrt
stable diffusion, controlnet, tensorrt, accelerate
Language:Python538
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python33.4k5.7k
Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
Language:C++1.5k197
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.5k606
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.8k1.3k
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python4.4k399
zhanzy178/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python84
Rayrtfr/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++171
arkingc/note
学习笔记整理📚
Language:C++2k629
youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀
Language:Shell51.2k11.4k
HuangJunJie2017/BEVDet
Official code base of the BEVDet series .
Language:Python1.4k264
JulianWww/GPU-CUDA-MergeSort
Language:Cuda1
traveller59/spconv
Spatial Sparse Convolution Library
Language:Python1.9k363
kweisamx/TensorFlow-ESPCN
TensorFlow implementation of the Efficient Sub-Pixel Convolutional Neural Network
Language:Python5215
xxxwuwq/SRCNN-REPRODUCTION
The reproduction of SRCNN method for super-resolution
Language:Python3124
drakelevy/ESPCN-TensorFlow
An implementation of the Efficient Sub-Pixel Convolutional Neural Network in TensorFlow
Language:Python12448