ccs96307
I’m passionate about reading, playing video games, and exploring ideas about the world and AI. I’m always curious about how we can push the boundaries of tech
Taipei
Pinned Repositories
fast-llm-inference
Accelerating LLM inference with techniques like speculative decoding, quantization, and kernel fusion, focusing on implementing state-of-the-art research papers.
font-to-png
Use font file to plot a character image.
forward-forward-algorithm-pytorch-implemented
gan-mnist-pytorch-implemented
A simple test for GAN
highlight_code_convert_html
A simple script to convert your code to highlight code, and display as html format.
LeetCode
LeetCode Record
LeetGPU
llm-kernel-foundry
Optimized CUDA Kernels
MHWI_Monsters_Weakness
A Monster Hunter World: Iceborne GUI application
Sudoku
A solver of Sudoku.
ccs96307's Repositories
ccs96307/fast-llm-inference
Accelerating LLM inference with techniques like speculative decoding, quantization, and kernel fusion, focusing on implementing state-of-the-art research papers.
ccs96307/gan-mnist-pytorch-implemented
A simple test for GAN
ccs96307/font-to-png
Use font file to plot a character image.
ccs96307/LeetCode
LeetCode Record
ccs96307/forward-forward-algorithm-pytorch-implemented
ccs96307/highlight_code_convert_html
A simple script to convert your code to highlight code, and display as html format.
ccs96307/unity-snake
A mobile version of classic snake game and it can move any direction
ccs96307/ai-remove-background-website
Using AI model for background removing
ccs96307/LeetGPU
ccs96307/llm-kernel-foundry
Optimized CUDA Kernels
ccs96307/Algorithm
ccs96307/bert-pytorch-implemented
ccs96307/ccs96307
ccs96307/CUDA-Practice
ccs96307/FlagEmbedding
Dense Retrieval and Retrieval-augmented LLMs
ccs96307/gemma-pytorch-implementation
ccs96307/github-dir-dl
This is a command-line tool to download GitHub directories
ccs96307/github-readme-stats
:zap: Dynamically generated stats for your github readmes
ccs96307/gpt2-pytorch-implemented
ccs96307/Latent-Self-Reflection-Model
A novel train method for improve model instruction following and reduce hallucinate
ccs96307/llm-decode-filter-special-words
ccs96307/machine-learning-cupy-implemented
ccs96307/mimi
Where your treasure is, there will your heart be also.
ccs96307/mistral-pytorch-implemented
ccs96307/sglang
SGLang is a fast serving framework for large language models and vision language models.
ccs96307/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
ccs96307/TimeScheduler
This is a time scheduling application developed using React.js for the front-end and Python Flask for the back-end.
ccs96307/trl
Train transformer language models with reinforcement learning.
ccs96307/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ccs96307/website_crawler
It is a repository that records some website I crawled