Pinned Repositories
AccDNN
A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.
ares
Ares: A framework for quantifying the resilience of deep neural networks
BabelStream
STREAM, for lots of devices written in many programming models
beautiful-jekyll
:sparkles: Build a beautiful and simple website in literally minutes. Demo at http://deanattali.com/beautiful-jekyll
Bi3D
bwa
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
CS_838_Low-light-object-detection
cuda-profiler
Tools and extensions for CUDA profiling
tejashah94's Repositories
tejashah94/CS_838_Low-light-object-detection
tejashah94/AccDNN
A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.
tejashah94/ares
Ares: A framework for quantifying the resilience of deep neural networks
tejashah94/BabelStream
STREAM, for lots of devices written in many programming models
tejashah94/beautiful-jekyll
:sparkles: Build a beautiful and simple website in literally minutes. Demo at http://deanattali.com/beautiful-jekyll
tejashah94/Bi3D
tejashah94/bwa
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
tejashah94/bwa-mem2
The next version of bwa-mem
tejashah94/casita
tejashah94/ccbench
Memory System Microbenchmarks
tejashah94/ChampSim
ChampSim repository
tejashah94/CS231n
My assignment solutions for CS231n - Convolutional Neural Networks for Visual Recognition
tejashah94/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
tejashah94/DeepBench
Benchmarking Deep Learning operations on different hardware
tejashah94/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
tejashah94/gapbs
GAP Benchmark Suite
tejashah94/gem5
This is an read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews should be submitted to https://gem5-review.googlesource.com/. The mirrors are synchronized every 15 minutes.
tejashah94/gem5-cache-partitioning
tejashah94/gem5_docker
Run gem5 in Docker, avoiding issues with gem5 in newer OS and gcc versions
tejashah94/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.
tejashah94/HashingDeepLearning
Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"
tejashah94/hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
tejashah94/HIP-Examples
Examples for HIP
tejashah94/LSTM_Sentiment-Analysis
tejashah94/prim-benchmarks
PrIM (Processing-In-Memory benchmarks) is the first benchmark suite for a real-world processing-in-memory (PIM) architecture. PrIM is developed to evaluate, analyze, and characterize the first publicly-available real-world PIM architecture, the UPMEM PIM architecture. Described by Gómez-Luna et al. (preliminary version at https://arxiv.org/abs/2105.03814).
tejashah94/pypop
Python Tools for the POP Metrics
tejashah94/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
tejashah94/stack-distance
Utility to simulate cache behavior with Mattson's Stack Algorithm.
tejashah94/synaptic
architecture-free neural network library for node.js and the browser
tejashah94/visdom
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.