Pinned Repositories
bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
farbox-template
Farbox 2 支持自动同步模板仓库
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Go-001
Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).
interview
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
imerdell-55's Repositories
imerdell-55/bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
imerdell-55/curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
imerdell-55/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
imerdell-55/farbox-template
Farbox 2 支持自动同步模板仓库
imerdell-55/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
imerdell-55/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
imerdell-55/Go-001
imerdell-55/Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
imerdell-55/HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).
imerdell-55/interview
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
imerdell-55/notion_widgets
A set of HTML widgets that could be embedded into Notion.so https://www.notion.so/ pages. For more see https://blog.shorouk.dev/notion-widgets-gallery/
imerdell-55/LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
imerdell-55/options-hierarchical-rl
imerdell-55/overleaf-thesis-template
latex thesis template on overleaf
imerdell-55/project-based-learning
Curated list of project-based tutorials
imerdell-55/reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
imerdell-55/reinforcement_learning_robocup
Implementation of Correlated-Q Learning on RoboCup Game
imerdell-55/ROS-Dynamic-Window-Approach
imerdell-55/smooth
imerdell-55/Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
imerdell-55/strategitica
Displays Habitica tasks in calendar format, along with some other helpful info and a sleep toggle.
imerdell-55/superset
Apache Superset is a Data Visualization and Data Exploration Platform
imerdell-55/test-wdl
imerdell-55/unified-hrl
Unified Model-Free Hierarchical Reinforcement Learning Framework