Pinned Repositories
AccDNN
A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.
ares
Ares: A framework for quantifying the resilience of deep neural networks
BabelStream
STREAM, for lots of devices written in many programming models
beautiful-jekyll
:sparkles: Build a beautiful and simple website in literally minutes. Demo at http://deanattali.com/beautiful-jekyll
Bi3D
bwa
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
CS_838_Low-light-object-detection
cuda-profiler
Tools and extensions for CUDA profiling
tejashah94's Repositories
tejashah94/cuda-profiler
Tools and extensions for CUDA profiling
tejashah94/CCT
Chuck's CUDA Toolkit
tejashah94/clstm
A small C++ implementation of LSTM networks, focused on OCR.
tejashah94/CS537
Projects of CS-537: Intro to Operating Systems (Spring 2019) at University of Wisconsin-Madison using xv6 Operating System
tejashah94/cublas-demos
Demonstrations of cuBLAS usage in several contexts
tejashah94/cuDNNLSTMInvestigation
tejashah94/detmem
tejashah94/DNNMark
Benchmarks of Deep Neural Networks
tejashah94/ECE_552-Project-Processor-Design
WISC-F18 ISA based processor design for the course Introduction to Computer Architecture at UW-Madison.
tejashah94/ECS201A
Materials for ECS 201A
tejashah94/extension-cpp
C++ extensions in PyTorch
tejashah94/FinalProject_759
Main repo for EP759 final project. Group collaborators include: Alex Swenson, YoungHui Park, and Nancy Granda-Duarte
tejashah94/GaloisGPU
LonestarGPU: Irregular algorithms parallelized for GPUs
tejashah94/gem5-cache-partitioning-1
Gem5 L2 Cache Partitioning
tejashah94/gem5-gpu
tejashah94/Gem5-Simulator
This is a simulator, we used for our project Comparision Cache Replacement Policies using Gem5 Simulator.
tejashah94/hw
RTL, Cmodel, and testbench for NVDLA
tejashah94/lstm-rnn
long short-term memory for recursive neural network model
tejashah94/LSTMs
tejashah94/mesa
A modified Mesa 3D graphics library for gem5-graphics
tejashah94/microbench
Extremely Simple Microbenchmarks
tejashah94/mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP)
tejashah94/N3LP
C++ implementation for Neural Network-based NLP, such as LSTM machine translation!
tejashah94/Optimal_Cache_design_Analysis
This project is aimed at optimizing various cache design choices by making sensible tradeoff between cost and performance of the CPU. To show the variations, the L1 caches (data and instruction) are compared to L2 cache through the data of 5 benchmarks.
tejashah94/ppg
Low-power PPG waveform estimation using compressive sampling and hardware-oriented optimization
tejashah94/pycachesim
Python Cache Hierarchy Simulator
tejashah94/Python-Decision-Tree-Using-ID3
Problem : Write a program to demonstrate the working of the decision tree based ID3 algorithm. Use an appropriate data set for building the decision tree and apply this knowledge to classify a new sample.
tejashah94/pytorch-gpgpu-sim
Modified version of PyTorch able to work with changes to GPGPU-Sim
tejashah94/unet
unet for image segmentation
tejashah94/wgsim
Reads simulator