ShadenSmith
Technical Staff @ Microsoft AI. Passionate about high performance computing and machine learning.
@Microsoft
Bellevue, Washington
Pinned Repositories
frostt-tensor.github.io
FROSTT: the Formidable Repository of Open Sparse Tensors and Tools.
tensor_parser
A package for constructing sparse tensors from CSV-like data sources.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
advent-2023
csvsorter
For sorting CSV files on disk that do not fit into memory
LLFI
LLFI is an LLVM-based fault injection tool that injects faults into the LLVM IR of the application source code. The faults can be injected into specific program points, and the effect can be easily tracked back to the source code. LLFI is typically used to map fault characteristics back to source code, and hence to understand source-level or program characteristics for various kinds of fault outcomes. Please refer to the paper below for more details: Anna Thomas, Karthik Pattabiraman, LLFI: An Intermediate code-level fault injector, in Workshop on Silicon Errors in Logic, System Effects (SELSE), 2013.
shaden-io
Professional website.
splatt
The Surprisingly ParalleL spArse Tensor Toolkit.
splatt-ipdps17
SPLATT source code used in our IPDPS '17 paper.
splatt-stream
A streaming implementation of the CPD published in SDM'18.
ShadenSmith's Repositories
ShadenSmith/splatt
The Surprisingly ParalleL spArse Tensor Toolkit.
ShadenSmith/splatt-stream
A streaming implementation of the CPD published in SDM'18.
ShadenSmith/csvsorter
For sorting CSV files on disk that do not fit into memory
ShadenSmith/advent-2023
ShadenSmith/splatt-ipdps17
SPLATT source code used in our IPDPS '17 paper.
ShadenSmith/advent-2022-rust
Advent of Code 2022
ShadenSmith/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
ShadenSmith/ghpages-test
Playing around with GitHub pages.
ShadenSmith/speedy-softmax
ShadenSmith/spmv-rs
Noodling with sparse matrix-vector multiplication in Rust
ShadenSmith/torch-blocksparse
Block-sparse primitives for PyTorch
ShadenSmith/zpart
A simple Zoltan frontend for partitioning hypergraphs.
ShadenSmith/shaden-io
Professional website.
ShadenSmith/amx-rs
Rust wrapper for Apple Matrix Coprocessor (AMX) instructions
ShadenSmith/candle
Minimalist ML framework for Rust
ShadenSmith/collisionless_tests
Exploring phase-space methods for collisionless dark matter simulations
ShadenSmith/deepspeed-test-worker
ShadenSmith/DeepSpeedExamples
Example models using DeepSpeed
ShadenSmith/DSE
ShadenSmith/firehose
The main purpose of the FireHose Streaming Benchmarks is to enable comparison of streaming software and hardware, both quantitatively vis-a-vis the rate at which they can process data, and qualitatively by judging the effort involved to implement and run the benchmarks.
ShadenSmith/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
ShadenSmith/ModelStepper
Musings on debugging DeepSpeed codes.
ShadenSmith/practice-azure-pipelines
Getting started with Azure Pipelines
ShadenSmith/PuLP
ShadenSmith/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
ShadenSmith/ShadenSmith.github.io
ShadenSmith/SimTensor
SimTensor: Tensor data generator for evaluation of tensor factorization algorithms
ShadenSmith/sparse
Sparse multi-dimensional arrays for the PyData ecosystem
ShadenSmith/transformers
🤗Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
ShadenSmith/triton-packaging-test
Prototyping Triton source code packaging.