Pinned Repositories
argparse
Argument Parser for Modern C++
BLAS-Benchmark
ceres-solver
A large scale non-linear optimization library
rlkit-pmoe
robosuite-benchmark-pmoe
SGEMM-SASS-Annotation
ThreadPool
This project was originally written to learn the C++20 standard in practice, so make sure your compiler supports C++20 or later.
Timer
A powerful C++ timer; requires a compiler that supports C++14 or later.
MegBA
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
JieRen98's Repositories
JieRen98/SGEMM-SASS-Annotation
JieRen98/rlkit-pmoe
JieRen98/ThreadPool
This project was originally written to learn the C++20 standard in practice, so make sure your compiler supports C++20 or later.
JieRen98/BLAS-Benchmark
JieRen98/Timer
A powerful C++ timer; requires a compiler that supports C++14 or later.
JieRen98/argparse
Argument Parser for Modern C++
JieRen98/ceres-solver
A large scale non-linear optimization library
JieRen98/robosuite-benchmark-pmoe
JieRen98/core_scheduler
CoreScheduler: A High-Performance Scheduler for Large Model Training
JieRen98/cublasLt_profile
JieRen98/cutlass
CUDA Templates for Linear Algebra Subroutines
JieRen98/Dagger.jl
A framework for out-of-core and parallel execution
JieRen98/DaggerGPU.jl
GPU integrations for Dagger.jl
JieRen98/Flops
How many FLOPS can you achieve?
JieRen98/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
JieRen98/hicma
HiCMA: Hierarchical Computations on Manycore Architectures
JieRen98/JieRen98.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JieRen98/MegBA
MegBA: A Distributed High-Performance Library for Large-Scale Bundle Adjustment with GPUs
JieRen98/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
JieRen98/plasma
JieRen98/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
JieRen98/selective-amnesia
JieRen98/stars-h
Software for Testing Accuracy, Reliability and Scalability of Hierarchical computations.
JieRen98/SuiteSparse
SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support
JieRen98/TorchOpt
TorchOpt is a high-performance optimizer library built upon PyTorch for easy implementation of functional optimization and gradient-based meta-learning.