imerdell-55

Pinned Repositories

bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
Language:Python0 0 00
curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
Language:Python0 0 00
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python0 0 00
farbox-template
Farbox 2 支持自动同步模板仓库
Language:Python0 0 00
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python0 0 00
fucking-algorithm
刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.
0 0 00
Go-001
Language:Go0 0 00
Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
Language:Python0 0 00
HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).
Language:Python0 0 00
interview
📚 C/C++ 技术面试基础知识总结，包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
Language:C++0 0 00

imerdell-55's Repositories

imerdell-55/bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
Language:Python0 0 00
imerdell-55/curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
Language:Python0 0 00
imerdell-55/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python0 0 00
imerdell-55/farbox-template
Farbox 2 支持自动同步模板仓库
Language:Python0 0 00
imerdell-55/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python0 0 00
imerdell-55/fucking-algorithm
刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.
0 0 00
imerdell-55/Go-001
Language:Go0 0 00
imerdell-55/Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
Language:Python0 0 00
imerdell-55/HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).
Language:Python0 0 00
imerdell-55/interview
📚 C/C++ 技术面试基础知识总结，包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
Language:C++0 0 00
imerdell-55/notion_widgets
A set of HTML widgets that could be embedded into Notion.so https://www.notion.so/ pages. For more see https://blog.shorouk.dev/notion-widgets-gallery/
Language:HTML0 0 00
imerdell-55/LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
imerdell-55/options-hierarchical-rl
Language:Jupyter Notebook0 0
imerdell-55/overleaf-thesis-template
latex thesis template on overleaf
imerdell-55/project-based-learning
Curated list of project-based tutorials
0 0
imerdell-55/reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
Language:Python0 0
imerdell-55/reinforcement_learning_robocup
Implementation of Correlated-Q Learning on RoboCup Game
Language:Python0 0
imerdell-55/ROS-Dynamic-Window-Approach
Language:Python0 0
imerdell-55/smooth
Language:HTML1 0
imerdell-55/Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
Language:Python0 0
imerdell-55/strategitica
Displays Habitica tasks in calendar format, along with some other helpful info and a sleep toggle.
Language:JavaScript0 0
imerdell-55/superset
Apache Superset is a Data Visualization and Data Exploration Platform
Language:TypeScript0 0
imerdell-55/test-wdl
Language:wdl
imerdell-55/unified-hrl
Unified Model-Free Hierarchical Reinforcement Learning Framework
Language:Python0 0