Pinned Repositories
annotated_deep_learning_paper_implementations
🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
betty
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
FuRL
jrlzoo
A collection of RL baselines in Jax.
RepL4RL
Representation Learning for RL
RIQL
rlmc
TemporalOT
fuyw's Repositories
fuyw/FuRL
fuyw/jrlzoo
A collection of RL baselines in Jax.
fuyw/TemporalOT
fuyw/anon-kode
koding with any LLMs
fuyw/cocotb
cocotb, a coroutine based cosimulation library for writing VHDL and Verilog testbenches in Python
fuyw/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
fuyw/diffusion-literature-for-robotics
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
fuyw/Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
fuyw/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
fuyw/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
fuyw/Logic-RL
Reproduce R1 Zero on Logic Puzzle
fuyw/minimind
「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!
fuyw/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
fuyw/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
fuyw/RAGEN
RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.
fuyw/ReasonFlux
ReasonFlux beats o1-preview and DeepSeek-V3 with hierarchical RL and 500 thought templates
fuyw/rStar
fuyw/scaling-book
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
fuyw/sgcrl
fuyw/SWELancer-Benchmark
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
fuyw/TD7
Author's PyTorch implementation of TD7 for online and offline RL
fuyw/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
fuyw/Time-Series-Library
A Library for Advanced Deep Time Series Models.
fuyw/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
fuyw/TinyZero
fuyw/unsloth
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
fuyw/vivado-on-silicon-mac
Installs Vivado on M1/M2 macs
fuyw/WCCommon
Frequently used small tools and functions
fuyw/workbench-example-nemotron-finetune
An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model
fuyw/Xilinx-FPGA-PCIe-XDMA-Tutorial
Xilinx FPGA PCIe 保姆级教程 ——基于 PCIe XDMA IP核