Pinned Repositories
algorithms-cuda
Parallel algorithms based on CUDA
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
atex
A TensorFlow Extension: GPU performance tools for TensorFlow.
bats-core
Bash Automated Testing System
blog
cudnn_fe_fp8
cudnn_fe_fp8 playground
cutlass_benchmark
cutlass playground
instance_norm
JAX-fp8
transformer_benchmark_fp8
wenscarl's Repositories
wenscarl/cudnn_fe_fp8
cudnn_fe_fp8 playground
wenscarl/instance_norm
wenscarl/JAX-fp8
wenscarl/transformer_benchmark_fp8
wenscarl/algorithms-cuda
Parallel algorithms based on CUDA
wenscarl/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
wenscarl/atex
A TensorFlow Extension: GPU performance tools for TensorFlow.
wenscarl/bats-core
Bash Automated Testing System
wenscarl/cutlass_benchmark
cutlass playground
wenscarl/FBPINNs
Solve forward and inverse problems related to partial differential equations using finite basis physics-informed neural networks (FBPINNs)
wenscarl/flax
Flax is a neural network library for JAX that is designed for flexibility.
wenscarl/fp8_gemm_test
wenscarl/fucking-algorithm
Solving algorithm problems is all about patterns; labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
wenscarl/hexo-theme-yilia
A simple and elegant theme for hexo.
wenscarl/keras
Deep Learning for humans
wenscarl/leetcode
Repository for recording LeetCode problems and solutions
wenscarl/tensorflow
An Open Source Machine Learning Framework for Everyone
wenscarl/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
wenscarl/models
Models and examples built with TensorFlow
wenscarl/paxml
Pax is a JAX-based machine learning framework for training large-scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry-leading model FLOP utilization rates.
wenscarl/praxis
wenscarl/saxml
wenscarl/t5x
wenscarl/tensorflow2_cpp
Build the TensorFlow C++ API, load a SavedModel and serve predictions
wenscarl/tensorflowbook
Source code for each chapter of the TensorFlow tutorial
wenscarl/tf_op_graph
A visualization tool to display TF-Grappler optimized op graph
wenscarl/transformers
🤗Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
wenscarl/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
wenscarl/wenscarl.github.io
wenscarl/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators