Pinned Repositories
BlockchainLibrary
useful documents and scientific papers about Blockchain & cryptocurrencies. I've read them all, now it's your turn ;)
CXXGraph
Header-Only C++ Library for Graph Representation and Algorithms
DeepLearningBookCode-Volume1
Python/Jupyter notebooks for Volume 1 of "Deep Learning - From Basics to Practice" by Andrew Glassner
DeepLearningBookCode-Volume2
Python/Jupyter notebooks for Volume 2 of "Deep Learning - From Basics to Practice" by Andrew Glassner
HackerNews
A Hacker News reader iOS app written in Swift.
MonotonicCubicPy
A python implementation of monotonic cubic/bicubic interpolation
Optix-PathTracer
Simple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
UnderstandingUnixLinuxProgramming
source code for the book
VHCudaFluid
A Cuda Fluid Simulator for MAYA
bssrdf's Repositories
bssrdf/UnderstandingUnixLinuxProgramming
source code for the book
bssrdf/ggml
Tensor library for machine learning
bssrdf/pyleet
leet code training
bssrdf/avx2-examples
Short examples illustrating AVX2 intrinsics for simple tasks.
bssrdf/bcnn
Minimalist Convolutional Neural Networks in C and Cuda
bssrdf/clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
bssrdf/Cpp-Concurrency-in-Action-2ed
C++11/14/17/20 multithreading, involving operating system principles and concurrent programming technology.
bssrdf/cuda-1brc
My CUDA solution to the 1BRC
bssrdf/CUDA-Based-Image-Convolution
Developed and optimized a CUDA kernel for 2D convolution, accommodating a 2D input tensor and a 2D filter tensor, with transposed filter application.
bssrdf/CUDA_Freshman
bssrdf/CUDA_gemm
A simple high performance CUDA GEMM implementation.
bssrdf/CUDALibrarySamples
CUDA Library Samples
bssrdf/cutlass
CUDA Templates for Linear Algebra Subroutines
bssrdf/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
bssrdf/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
bssrdf/kgraph
A library for k-nearest neighbor search
bssrdf/LeNet-5_Speed_Up
Utilize OpenMP and CUDA to speed up LeNet-5 digit recognition CNN. In OpneMP, training with 11x speed up and 11x in testing. With the help of CUDA, the training is speed up by 3x and 57x speed up in testing.
bssrdf/llama.cpp
Port of Facebook's LLaMA model in C/C++
bssrdf/moderngpu
Design patterns for GPU computing
bssrdf/openCNN
A Winograd Minimal Filter Implementation in CUDA
bssrdf/PMPP
Solution of Programming Massively Parallel Processors
bssrdf/PMPP4th
bssrdf/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
bssrdf/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
bssrdf/SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
bssrdf/sshfs
A network filesystem client to connect to SSH servers
bssrdf/stable-diffusion.cpp
Stable Diffusion in pure C/C++
bssrdf/stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
bssrdf/udlbook
Understanding Deep Learning - Simon J.D. Prince
bssrdf/x86-simd-sort
C++ template library for high performance SIMD based sorting algorithms