ShadenSmith

Technical Staff @ Inflection AI. Passionate about high performance computing and machine learning.

@InflectionAI Bellevue, Washington

Pinned Repositories

frostt-tensor.github.io
FROSTT: the Formidable Repository of Open Sparse Tensors and Tools.
Language: HTML 11 8 53
tensor_parser
A package for constructing sparse tensors from CSV-like data sources.
Language: Python 9 3 115
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python 33.5k 341 2.6k 3.9k
advent-2023
Language: Rust 2 2 00
csvsorter
For sorting CSV files on disk that do not fit into memory
Language: Python 5 3 02
LLFI
LLFI is an LLVM-based fault injection tool that injects faults into the LLVM IR of the application source code. The faults can be injected into specific program points, and their effects can be easily traced back to the source code. LLFI is typically used to map fault characteristics back to source code, and hence to understand source-level or program characteristics for various kinds of fault outcomes. Please refer to the paper below for more details: Anna Thomas, Karthik Pattabiraman, LLFI: An Intermediate code-level fault injector, in Workshop on Silicon Errors in Logic, System Effects (SELSE), 2013.
Language: C++ 2 4 00
shaden-io
Professional website.
Language: TeX 0 3 00
splatt
The Surprisingly ParalleL spArse Tensor Toolkit.
Language: C 66 12 1829
splatt-ipdps17
SPLATT source code used in our IPDPS '17 paper.
Language: C 2 3 10
splatt-stream
A streaming implementation of the CPD published in SDM'18.
Language: C 7 4 01

ShadenSmith's Repositories

ShadenSmith/splatt
The Surprisingly ParalleL spArse Tensor Toolkit.
Language: C 66 12 1829
ShadenSmith/splatt-stream
A streaming implementation of the CPD published in SDM'18.
Language: C 7 4 01
ShadenSmith/csvsorter
For sorting CSV files on disk that do not fit into memory
Language: Python 5 3 02
ShadenSmith/advent-2023
Language: Rust 2 2 00
ShadenSmith/splatt-ipdps17
SPLATT source code used in our IPDPS '17 paper.
Language: C 2 3 10
ShadenSmith/advent-2022-rust
Advent of Code 2022
Language: Rust 1 2 00
ShadenSmith/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Language: Python 1 3 01
ShadenSmith/ghpages-test
Playing around with GitHub pages.
Language: Python 1 5 02
ShadenSmith/speedy-softmax
Language: Rust 1 2 0
ShadenSmith/torch-blocksparse
Block-sparse primitives for PyTorch
Language: Python 1 3 0
ShadenSmith/zpart
A simple Zoltan frontend for partitioning hypergraphs.
Language: C 1 4 0
ShadenSmith/shaden-io
Professional website.
Language: TeX 0 3 00
ShadenSmith/amx-rs
Rust wrapper for Apple Matrix Coprocessor (AMX) instructions
Language: Rust 1 0
ShadenSmith/candle
Minimalist ML framework for Rust
Language: Rust 1 0
ShadenSmith/collisionless_tests
Exploring phase-space methods for collisionless dark matter simulations
Language: C++ 3 0
ShadenSmith/deepspeed-test-worker
Language: Shell 4 01
ShadenSmith/DeepSpeedExamples
Example models using DeepSpeed
Language: Python 2 0
ShadenSmith/DSE
2 0
ShadenSmith/firehose
The main purpose of the FireHose Streaming Benchmarks is to enable comparison of streaming software and hardware, both quantitatively vis-a-vis the rate at which they can process data, and qualitatively by judging the effort involved to implement and run the benchmarks.
Language: C++ 3 0
ShadenSmith/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language: Python 2 0
ShadenSmith/ModelStepper
Musings on debugging DeepSpeed codes.
Language: Python 5 1
ShadenSmith/practice-azure-pipelines
Getting started with Azure Pipelines
Language: Python 3 0
ShadenSmith/PuLP
Language: C++ 1 0
ShadenSmith/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language: Python 2 0
ShadenSmith/ShadenSmith.github.io
Language: HTML 3 0
ShadenSmith/SimTensor
SimTensor: Tensor data generator for evaluation of tensor factorization algorithms
Language: Matlab 3 01
ShadenSmith/sparse
Sparse multi-dimensional arrays for the PyData ecosystem
Language: Python 4 0
ShadenSmith/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
Language: Python 2 0
ShadenSmith/travis-test
Language: CMake 3 01
ShadenSmith/triton-packaging-test
Prototyping Triton source code packaging.
Language: Python 3 0