Pinned Repositories
aistore
AIStore: scalable storage for AI applications
cuopt
GPU-accelerated decision optimization
cuopt-examples
NVIDIA cuOpt examples for decision optimization
DeepLearningExamples
State-of-the-art deep learning scripts organized by model, easy to train and deploy, with reproducible accuracy and performance on enterprise-grade infrastructure.
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Megatron-LM
Ongoing research on training transformer models at scale
nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
nvidia-docker
Build and run Docker containers leveraging NVIDIA GPUs
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
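A minimal sketch of the open Python API: building a serialized engine from an ONNX model, assuming a TensorRT 8.x-style explicit-batch workflow; "model.onnx" is a placeholder file.

    import tensorrt as trt

    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    # Explicit-batch networks are the standard mode in TensorRT 8.x
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, logger)
    with open("model.onnx", "rb") as f:        # placeholder model file
        parser.parse(f.read())
    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.FP16)      # allow FP16 kernels where profitable
    engine = builder.build_serialized_network(network, config)
    with open("model.engine", "wb") as f:
        f.write(engine)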
NVIDIA Corporation's Repositories
NVIDIA/TensorRT-LLM
TensorRT-LLM provides an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations for efficient inference on NVIDIA GPUs. It also contains components to create Python and C++ runtimes that orchestrate inference execution in a performant way.
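A minimal sketch of the high-level LLM API described above; the model identifier and sampling settings are illustrative.

    from tensorrt_llm import LLM, SamplingParams

    # The Hugging Face model ID is a placeholder; TensorRT-LLM compiles it
    # into an optimized engine on first use.
    llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
    params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

    for output in llm.generate(["What is the capital of France?"], params):
        print(output.outputs[0].text)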
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
NVIDIA/garak
The LLM vulnerability scanner
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
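A minimal sketch of wrapping an LLM with programmable rails, assuming a rails configuration directory (config.yml plus Colang flows) at the placeholder path "./config".

    from nemoguardrails import LLMRails, RailsConfig

    config = RailsConfig.from_path("./config")   # placeholder config directory
    rails = LLMRails(config)

    # generate() runs the guarded conversation flow around the underlying LLM
    response = rails.generate(messages=[{"role": "user", "content": "Hello!"}])
    print(response["content"])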
NVIDIA/Isaac-GR00T
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
NVIDIA/cuda-python
CUDA Python: Performance meets Productivity
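A minimal sketch using the low-level driver-API bindings; each call returns a tuple whose first element is a CUresult error code.

    from cuda import cuda

    (err,) = cuda.cuInit(0)
    assert err == cuda.CUresult.CUDA_SUCCESS

    err, count = cuda.cuDeviceGetCount()
    err, device = cuda.cuDeviceGet(0)
    err, name = cuda.cuDeviceGetName(128, device)   # 128-byte name buffer
    print(f"{count} device(s); device 0: {name.decode().rstrip(chr(0))}")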
NVIDIA/nv-ingest
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images for use in downstream generative applications.
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including support for 8-bit floating point (FP8) precision on Hopper, Ada, and Blackwell GPUs, providing better performance with lower memory utilization in both training and inference.
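A minimal sketch of FP8 execution with Transformer Engine's PyTorch API, assuming an FP8-capable GPU (Hopper or newer); the layer sizes are illustrative.

    import torch
    import transformer_engine.pytorch as te
    from transformer_engine.common.recipe import DelayedScaling

    # Drop-in replacement for torch.nn.Linear
    layer = te.Linear(1024, 1024, bias=True).cuda()
    x = torch.randn(16, 1024, device="cuda")

    # Matmuls inside the context run in FP8 using delayed scaling factors
    with te.fp8_autocast(enabled=True, fp8_recipe=DelayedScaling()):
        y = layer(x)
    y.sum().backward()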
NVIDIA/gpu-operator
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
NVIDIA/cccl
CUDA Core Compute Libraries
NVIDIA/TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
NVIDIA/NeMo-Agent-Toolkit
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
NVIDIA/Q2RTX
NVIDIA’s implementation of RTX ray-tracing in Quake II
NVIDIA/cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
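A minimal sketch of the Python side of the programming model: a Bell-state kernel sampled on the default simulator target.

    import cudaq

    @cudaq.kernel
    def bell():
        qubits = cudaq.qvector(2)
        h(qubits[0])                  # Hadamard on the first qubit
        x.ctrl(qubits[0], qubits[1])  # CNOT entangling the pair
        mz(qubits)                    # measure in the computational basis

    counts = cudaq.sample(bell, shots_count=1000)
    print(counts)  # expect roughly even counts for '00' and '11'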
NVIDIA/NeMo-Skills
A project to improve the skills of large language models
NVIDIA/jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
NVIDIA/bionemo-framework
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
NVIDIA/cuopt
GPU-accelerated decision optimization
NVIDIA/cuda-checkpoint
CUDA checkpoint and restore utility
NVIDIA/Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
NVIDIA/nvshmem
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmers to perform one-sided communication from within CUDA kernels and on CUDA streams.
NVIDIA/cuEquivariance
cuEquivariance is a math library providing low-level primitives and tensor ops to accelerate widely used models based on equivariant neural networks, such as DiffDock, MACE, Allegro, and NEQUIP. It also includes kernels for accelerated structure prediction.
NVIDIA/VisRTX
NVIDIA OptiX based implementation of ANARI
NVIDIA/numba-cuda
The CUDA target for Numba
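A minimal sketch of a kernel compiled by Numba's CUDA target; the axpy example is hypothetical.

    import numpy as np
    from numba import cuda

    @cuda.jit
    def axpy(out, a, x, y):
        i = cuda.grid(1)              # global thread index
        if i < out.size:
            out[i] = a * x[i] + y[i]

    n = 1 << 20
    x = np.random.rand(n).astype(np.float32)
    y = np.random.rand(n).astype(np.float32)
    out = np.zeros_like(x)

    threads = 256
    blocks = (n + threads - 1) // threads
    axpy[blocks, threads](out, np.float32(2.0), x, y)  # NumPy arrays are transferred automatically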
NVIDIA/cuDecomp
An Adaptive Pencil Decomposition Library for NVIDIA GPUs
NVIDIA/topograph
A toolkit for discovering cluster network topology.
NVIDIA/spark-rapids-jni
RAPIDS Accelerator JNI For Apache Spark
NVIDIA/cloud-native-docs
Documentation repository for NVIDIA Cloud Native Technologies
NVIDIA/nvidia-dlfw-inspect
A tool that facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs with NVIDIA libraries such as Transformer Engine, Megatron-LM, and NeMo.