Pinned Repositories
fast_upsampling
functional_ml_science_nlp_deeplearn-playground
Playground of functional code for machine learning, science, math, natural language processing, deep learning, and other related numerical processing.
mxnet_seq2seq
Simple MXNet sequence-to-sequence model (neural machine translation)
scala-data-science
Data Science in Scala - Conf. Talk Repo
word2vec_meetup
Word2Vec example
mkolod's Repositories
mkolod/fast_upsampling
mkolod/nimble
mkolod/open-gpu-doc
Documentation of NVIDIA chip/hardware interfaces
mkolod/tensorrt_python_samples
mkolod/awesome-reMarkable
A curated list of projects related to the reMarkable tablet
mkolod/bazel-examples
Examples of Bazel use
mkolod/bdf
Avnet Board Definition Files
mkolod/brevitas
Brevitas: quantization-aware training in Pytorch
mkolod/data-parallel-CPP
Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xinmin Tian (Apress, 2020).
mkolod/direwolf
Dire Wolf is a software "soundcard" AX.25 packet modem/TNC and APRS encoder/decoder. It can be used stand-alone to observe APRS traffic, as a tracker, digipeater, APRStt gateway, or Internet Gateway (IGate). For more information, look at the bottom 1/4 of this page and in https://github.com/wb2osz/direwolf/blob/dev/doc/README.md
mkolod/Get_Moving_With_Alveo
For publishing the source for UG1352 "Get Moving with Alveo"
mkolod/gradient-checkpointing
Make huge neural nets fit in memory
mkolod/iree
👻
mkolod/MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
mkolod/nandland
All code found on nandland is here. underconstruction.gif
mkolod/NVBit
mkolod/nvidia_libs_test
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
mkolod/psp
mkolod/raytracinginoneweekendincuda
The code for the ebook Ray Tracing in One Weekend by Peter Shirley translated to CUDA by Roger Allen. This work is in the public domain.
mkolod/rpi-gpio-dma-demo
Performance writing to GPIO with CPU and DMA on the Raspberry Pi
mkolod/rules_cuda
Starlark implementation of bazel rules for CUDA.
mkolod/rules_cuda_examples
This repo holds the extended examples for rules_cuda.
mkolod/spconv
Spatial Sparse Convolution Library
mkolod/TensorRT
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
mkolod/torch2trt
An easy to use PyTorch to TensorRT converter
mkolod/torch_custom_op
mkolod/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
mkolod/Vitis-AI-Tutorials
mkolod/Vitis-Tutorials
mkolod/Vitis_Accel_Examples
Vitis_Accel_Examples