pmixer

Time tourist, having fun in between Math & Physics using Python & CUDA

NVIDIA, Shanghai, China

pmixer's Stars

NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Language: C 15.4k 180 3691.3k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ 9.1k 97 2.1k 1.1k
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language: C++ 6k 63 625894
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language: Python 4.3k 90 1.1k 1.1k
CVCUDA/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Language: C++ 2.4k 47 173217
umlet/umlet
Free UML Tool for Fast UML Diagrams
Language: JavaScript 1.5k 61 646306
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
1.4k 38 1220
kourgeorge/arxiv-style
A LaTeX style and template for paper preprints (based on NIPS style)
Language: TeX 1.2k 14 20323
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
Language: Python 740 24 511112
Tlntin/Qwen-TensorRT-LLM
Language: Python 590 6 11552
bytedance/effective_transformer
Running BERT without Padding
Language: C++ 466 8 453
pmixer/SASRec.pytorch
PyTorch(1.6+) implementation of https://github.com/kang205/SASRec
Language: Python 367 5 4498
daadaada/turingas
Assembler for NVIDIA Volta and Turing GPUs
Language: Python 203 12 1040
NVIDIA/GMAT
A toolkit showing GPU's all-round capability in video processing
Language: C 182 10 1142
TrojanXu/onnxparser-trt-plugin-sample
A sample for onnxparser working with trt user defined plugins for TRT7.0
Language: C++ 166 7 1836
stasi009/NumpyWDL
Implement Wide & Deep algorithm by using NumPy
Language: Python 151 7 142
YellowOldOdd/SDBI
Simple Dynamic Batching Inference
Language: Python 145 4 217
chaytonmin/DeepMVS
3D reconstruction project with MVSNets for depth inferring.
Language: Python 140 4 1025
NVIDIA-Merlin/HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It also can be used as a generic key-value storage.
Language: Cuda 135 19 2526
megvii-research/TreeEnergyLoss
[CVPR2022] Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation
Language: Python 104 3 158
caojiangxia/BiGI
[WSDM 2021]Bipartite Graph Embedding via Mutual Information Maximization
Language: Python 74 5 813
yuekaizhang/Triton-ASR-Client
ASR client for Triton ASR Service
Language: Python 23 2 45
claws-lab/petgen
A PyTorch implementation of the ACM SIGKDD 2021 paper titled "PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models"
Language: Python 16 2 12
leimao/ONNX-Python-Examples
ONNX Python Examples
Language: Dockerfile 16 3 06
EdVince/whisper-trtllm
Whisper in TensorRT-LLM
Language: C++ 15 3 02
claws-lab/DAIN
Code for the ACM CIKM 2021 paper "Influence-guided Data Augmentation for Neural Tensor Completion"
Language: Python 9 2 01
cunxi1992/turtle_American_shield
A Turtle drawing tutorial that also draws two attractive patterns: the Captain America shield and a pattern made up of 360 squares.
Language: PostScript 6 3 02
TrojanXu/GTC_S21736_materials
Extended materials for GTC2020 talk S21736
Language: Python 6 2 20
pmixer/TiSASRec.debug
Based on https://github.com/JiachengLi1995/TiSASRec, replace negative sampling based evaluation with all-item based evaluation and try to make it better for ranking all items.
Language: Python 5 1 00
DC-Shi/cudaNppSample
Language: C 1 1 01
