verdimrc's Stars
stas00/ml-engineering
Machine Learning Engineering Open Book
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
NVIDIA/trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
jrfiedler/causal_inference_python_code
Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins
NVIDIA/deepops
Tools for building GPU clusters
mwouts/itables
Pandas DataFrames as Interactive DataTables
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
st-tech/zr-obp
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
ptrblck/pytorch_misc
Code snippets created for the PyTorch discussion board
NVIDIA/workbench-example-hybrid-rag
An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)
aws-samples/awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
ppwwyyxx/RAM-multiprocess-dataloader
Demystify RAM Usage in Multi-Process Data Loaders
xbresson/CS5242_2021
Neural Networks and Deep Learning, NUS CS5242, 2021
awslabs/aws-ai-solution-kit
Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.
NVIDIA/metropolis-nim-workflows
Collection of reference workflows for building intelligent agents with NIMs
NVIDIA/nim-anywhere
Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench
aws-samples/amazon-textract-transformer-pipeline
Post-process Amazon Textract results with Hugging Face transformer models for document understanding
aws-samples/aws-hpc-recipes
Contains example recipes that demonstrate how to build HPC systems using AWS services and solutions.
aws-samples/aws-do-eks
Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference examples.
aws-samples/aws-efa-nccl-baseami-pipeline
EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers
awslabs/amazon-accessible-rl-sdk
A2RL is a Python library for offline reinforcement learning
aws-samples/aws-parallelcluster-monitoring
Monitoring Dashboard for AWS ParallelCluster
samir-souza/laboratory
Some crazy experiments
aws-samples/aws-distributed-training-workshop-eks
Create an Amazon EKS cluster and run a distributed training example
aws-samples/aws-parallelcluster-post-install-scripts
Scripts to customize AWS ParallelCluster
awslabs/aws-cyclone-solution
aws-solutions-library-samples/distributed-compute-on-aws-with-cross-regional-dask
Perform I/O intensive workloads on high-volume data sparsely located across multiple AWS regions through the use of Dask.
aws-samples/ec2-topology-aware-for-slurm
shimomut/sagemaker-solutions
josiahdavis/getting-started-batch
How to use AWS Batch Array Jobs with Python and S3 for input/output.