dleunji

Sogang UniversitySeoul, Korea

Pinned Repositories

AdvancedSoftwarePractices
고급소프트웨어실습 아카이빙
Language:C++0 2 00
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
Language:Python00
Awesome-Federated-Machine-Learning
Everything about federated learning, including research papers, books, codes, tutorials, videos and beyond
0 0 00
bitcoin-chart
📈Real-time Bitcoin Chart by Upbit
Language:JavaScript7 1 01
BSA-SpMM_EURO-PAR-2024
Official Artfifact for Accelerated Block-Sparsity-Aware Matrix Reordering for Leveraging Tensor Cores in Sparse Matrix-Multivector Multiplication (Euro-Par 2024)
Language:C++0 1 01
celery-redis-queue
Celery and Redis Queue in FastAPI
Language:Python0 1 00
chatbot_api
chatbot Swagger
Language:Jupyter Notebook0 2 00
codingTest
백준 문제풀이
0 3 00
kant
Using GPT-2, create a philosophical paper like Immanuel Kant
Language:CSS3 2 01
KoGPT2-chatbot
Simple Chit-Chat based on KoGPT2
Language:Jupyter Notebook1 1 00

dleunji's Repositories

dleunji/bitcoin-chart
📈Real-time Bitcoin Chart by Upbit
Language:JavaScript7 1 01
dleunji/kant
Using GPT-2, create a philosophical paper like Immanuel Kant
Language:CSS3 2 01
dleunji/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
Language:Python00
dleunji/Awesome-Federated-Machine-Learning
Everything about federated learning, including research papers, books, codes, tutorials, videos and beyond
0 0 00
dleunji/BSA-SpMM_EURO-PAR-2024
Official Artfifact for Accelerated Block-Sparsity-Aware Matrix Reordering for Leveraging Tensor Cores in Sparse Matrix-Multivector Multiplication (Euro-Par 2024)
Language:C++0 1 01
dleunji/celery-redis-queue
Celery and Redis Queue in FastAPI
Language:Python0 1 00
dleunji/CUDA-TC
Language:C++0 1 00
dleunji/cuda_til
Language:Cuda0 1 00
dleunji/cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Language:Cuda0 0
dleunji/CUDATeaching
CUDA based GPU Programming
Language:Jupyter Notebook0 0
dleunji/curious-ui
Q&A Board 'Curious'의 UI
Language:JavaScript1 0
dleunji/dleunji.github.io
dleunji.github.io
Language:HTML1 0
dleunji/Federated-Averaging-PyTorch
An unofficial PyTorch implementation of a federated learning algorithm, FedAvg.
Language:Python0 0
dleunji/Federated-Learning-Research
An implementation of federated learning research baseline methods based on FedML-core, which can be deployed on real distributed cluster and help researchers to explore more problems existing in real FL systems.
Language:Python0 0
dleunji/Learn-CUDA-Programming
Learn CUDA Programming, published by Packt
Language:Cuda0 0
dleunji/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Language:Python0 01
dleunji/lmquant
Language:Python
dleunji/Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
Language:C++0 0
dleunji/Misc-Cheatsheet
대학원 생활을 하며 사용하는 작고 소중한 코딩팁 (linux 명령어 등)
Language:Vim Script0 0
dleunji/mongoDB-test
To master mongoDB and pymongo, clone social media platform
Language:Python1 0
dleunji/Parallel-Sudoku-Solver
🔢 A parallelized Sudoku solver implemented with various solving algorithms in C++.
Language:C++0 0
dleunji/ppopp20_spmm_artifact
Language:C++0 0
dleunji/pytorchviz
A small package to create visualizations of PyTorch execution graphs
Language:Jupyter Notebook0 0
dleunji/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Language:Python
dleunji/TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
Language:Python0 0
dleunji/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++
dleunji/test-nethereum
Connecting .NET with Solidity
Language:C#2 0
dleunji/vectorSparse-custom
Language:Cuda1 0
dleunji/wmma_tensorcore_sample
Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)
Language:Cuda0 0
dleunji/WWW23_ODE_custom
Language:Python1 0