mhaseeb123
Senior Software Engineer @NVIDIA (@RAPIDSAI) | Accelerating Data science and AI | C++, CUDA, Python
@NVIDIA
Santa Clara, CA
Pinned Repositories
ADEPT
Revamping ADEPT from scratch to make it more usable in library form
al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
argparse
A Simple Argument Parser for C++
cuCollections
cudf
cuDF - GPU DataFrame Library
gcb
GCB includes a suite of benchmarks and basic tests for CUDA-aware MPI and C++ compilers.
gicops
HiCOPS: Computational framework for peptide identification from MS data through accelerated database search
nvstdpar
C++26 powered GPU-accelerated scientific apps
stdexec
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
timemory
Modular, multilingual (C, C++, CUDA, Fortran, Python) utility for performance measurement and analysis
mhaseeb123's Repositories
mhaseeb123/gcb
GCB includes a suite of benchmarks and basic tests for CUDA-aware MPI and C++ compilers.
mhaseeb123/nvstdpar
C++26 powered GPU-accelerated scientific apps
mhaseeb123/ADEPT
Revamping ADEPT from scratch to make it more usable in library form
mhaseeb123/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
mhaseeb123/argparse
A Simple Argument Parser for C++
mhaseeb123/cuCollections
mhaseeb123/cudf
cuDF - GPU DataFrame Library
mhaseeb123/cugraph
cuGraph - RAPIDS Graph Analytics Library
mhaseeb123/gicops
HiCOPS: Computational framework for peptide identification from MS data through accelerated database search
mhaseeb123/hpcpp
mhaseeb123/stdexec
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
mhaseeb123/timemory
Modular, multilingual (C, C++, CUDA, Fortran, Python) utility for performance measurement and analysis
mhaseeb123/cuml
cuML - RAPIDS Machine Learning Library
mhaseeb123/cuspatial
CUDA-accelerated GIS and spatiotemporal algorithms
mhaseeb123/hicops
HiCOPS: Computational framework for peptide identification from MS data through accelerated database search
mhaseeb123/hip-training-series
Repository with examples and exercises for OLCF and AMD's HIP training series
mhaseeb123/Instruction_roofline_scripts
mhaseeb123/kvikio
KvikIO - High Performance File IO
mhaseeb123/magic_enum
Static reflection for enums (to string, from string, iteration) for modern C++; works with any enum type without any macro or boilerplate code
mhaseeb123/mhaseeb123
Config files for my GitHub profile.
mhaseeb123/mpich
Official MPICH Repository
mhaseeb123/parquet-bloom-filter-analysis
Generate Parquet Files
mhaseeb123/proteomicslfq
Proteomics label-free quantification (LFQ) analysis pipeline
mhaseeb123/raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
mhaseeb123/rmm
RAPIDS Memory Manager
mhaseeb123/sender-examples
Example code for C++Now talk
mhaseeb123/thread-pool
BS::thread_pool: a fast, lightweight, and easy-to-use C++17 thread pool library
mhaseeb123/ucxx
mhaseeb123/WarpX
WarpX is an advanced, time-based electromagnetic & electrostatic Particle-In-Cell code.
mhaseeb123/wholegraph
WholeGraph - large scale Graph Neural Networks