derekwin

EngD student @ SDU, focusing on cloud native, HPC network protocols, GPUs, distributed memory, eBPF, congestion control, reinforcement learning, etc.

CS@SDU, Qingdao, China

derekwin's Stars

commaai/openpilot
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Language:Python52.9k 1.3k 2.9k9.6k
numpy/numpy
The fundamental package for scientific computing with Python.
Language:Python29.1k 598 13.2k10.6k
Byaidu/PDFMathTranslate
PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/Docker/Zotero
Language:Python19.1k 75 6261.6k
pybind/pybind11
Seamless operability between C++11 and Python
Language:C++16.3k 249 2.1k2.1k
huggingface/smolagents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Language:Python15.5k 116 3931.4k
triton-lang/triton
Development repository for the Triton language and compiler
Language:MLIR14.9k 196 1.7k1.9k
richards199999/Thinking-Claude
Let your Claude be able to think
Language:TypeScript14.8k 107 331.7k
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python10.7k 103 2521.2k
cupy/cupy
NumPy & SciPy for GPU
Language:Python10k 127 2.4k892
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9.8k 115 2.3k1.2k
cpp-best-practices/cppbestpractices
Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.
8.3k 365 57898
mixxxdj/mixxx
Mixxx is Free DJ software that gives you everything you need to perform live mixes.
Language:C++5.2k 134 7.5k1.4k
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Language:C++2.9k 31 69185
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Language:Jupyter Notebook2.4k 18 32240
punica-ai/punica
Serving multiple LoRA finetuned LLM as one
Language:Python1k 12 3949
NVIDIA/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Language:C++1k 55 196154
eunomia-bpf/bpftime
Userspace eBPF runtime for Observability, Network & General Extensions Framework
Language:C++918 19 18185
Mellanox/libvma
Linux user space library for network socket acceleration based on RDMA compatible network adaptors
Language:C++628 58 144157
p12tic/cppreference-doc
C++ standard library reference
Language:HTML433 34 41110
facebookincubator/dynolog
Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
Language:C++303 15 3153
IBM/tensorflow-large-model-support
Large Model Support in Tensorflow
202 21 4038
AIFM-sys/AIFM
AIFM: High-Performance, Application-Integrated Far Memory
Language:C119 4 2137
hyperai/triton-cn
Triton documentation in Simplified Chinese
Language:TypeScript60 4 56
Sys-KU/DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
Language:C++57 2 58
0x5ec1ab/gpu-tlb
Language:C++48 1 312
DataManagementLab/RDMA_synchronization
This is the source code for our (Tobias Ziegler, Jacob Nelson-Slivon, Carsten Binnig and Viktor Leis) published paper at SIGMOD’23: Design Guidelines for Correct, Efficient, and Scalable Synchronization using One-Sided RDMA
Language:C++26 5 12
wangchenxi7/Atlas
Language:C14 1 11
0x5ec1ab/invalidate-compare
Language:C10 2 00
joonspk-research/gabm-stanford-main
Language:Python3 1 00
liuweiseu/400GbE_Demo
Language:C2 1 10
