Pinned Repositories
aioquic-1
QUIC and HTTP/3 implementation in Python
astra-network-analytical
astra-network-ns3
astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
llama
Inference code for Llama models
llama2-chakra-assests
The figures of llama2 chakra traces
nccl
Optimized primitives for collective multi-GPU communication
ns3-rdma
NS3 simulator for RDMA over Converged Ethernet v2 (RoCEv2), including the implementation of DCQCN, TIMELY, PFC, ECN and shared buffer switch
pytorch-transformer
Attention is all you need implementation
Transformers-Recipe
🧠 A study guide to learn about Transformers
qyysjtu's Repositories
qyysjtu/astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
qyysjtu/llama
Inference code for Llama models
qyysjtu/aioquic-1
QUIC and HTTP/3 implementation in Python
qyysjtu/astra-network-analytical
qyysjtu/astra-network-ns3
qyysjtu/AStream
A DASH segment size aware rate adaptation model for DASH
qyysjtu/DeepLearning
Deep Learning introduction and its application in various fields
qyysjtu/gst-rtsp-server
RTSP server based on GStreamer
qyysjtu/llama2-chakra-assests
The figures of llama2 chakra traces
qyysjtu/media-server
RTSP/RTP/RTMP/FLV/HLS/MPEG-TS/MPEG-PS/MPEG-DASH/MP4/fMP4
qyysjtu/multicast-test
Python Multicast Test Tool (Python2/3)
qyysjtu/nccl
Optimized primitives for collective multi-GPU communication
qyysjtu/ns3-rdma
NS3 simulator for RDMA over Converged Ethernet v2 (RoCEv2), including the implementation of DCQCN, TIMELY, PFC, ECN and shared buffer switch
qyysjtu/proto-quic
qyysjtu/pytorch-transformer
Attention is all you need implementation
qyysjtu/Transformers-Recipe
🧠 A study guide to learn about Transformers
qyysjtu/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
qyysjtu/DCTrafficGen
Data Center Traffic Generator Library
qyysjtu/Dragonfly_topology_path_exploitation
qyysjtu/HeliosData
Helios Traces from SenseTime
qyysjtu/Infiniband-Simulation
Using OMNeT++ for Infiniband Network data flow mechanism simulation, performance & bottleneck exploration, specifically for distributed machine-learning systems in data center
qyysjtu/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
qyysjtu/quic-go
A QUIC implementation in pure go
qyysjtu/quic-py
QUIC protocol implementation in python
qyysjtu/QuicRtmp
Rtmp with Quic based transport medium
qyysjtu/RapidNetSim
qyysjtu/RTP
Implementation of a RTP server that sends video stream (H.264/HEVC) using the Real-time Transport Protocol(RTP) based on Linux/MacOS. 一个基于Linux/MacOS平台的可以发送携带H.264/HEVC媒体类型的RTP视频流的示例程序。
qyysjtu/RTSP-Client-Server
Implementation of a streaming video server and client that communicate using the Real-Time Streaming Protocol (RTSP) and send data using the Realtime Transfer Protocol (RTP)
qyysjtu/Transformers-for-NLP
Transformers 3rd Edition
qyysjtu/Video-Streaming-Server-and-Client