huggingface/deep-rl-class

๐ŸŒ [i18n-KO] Translating rl-course to Korean

wonhyeongseo opened this issue ยท 1 comments

Hi!

Let's bring the reinforcement learning course to all the Korean-speaking community ๐ŸŒ (currently 9 out of 77 complete)

Would you want to translate? Please follow the ๐Ÿค— TRANSLATING guide. Here is a list of the files ready for translation. Let us know in this issue if you'd like to translate any, and we'll add your name to the list.

Some notes:

  • Please translate using an informal tone (imagine you are talking with a friend about transformers ๐Ÿค—).
  • Please translate in a gender-neutral way.
  • Add your translations to the folder called ko inside the source folder.
  • Register your translation in ko/_toctree.yml; please follow the order of the English version.
  • Once you're finished, open a pull request and tag this issue by including #issue-number in the description, where issue-number is the number of this issue. Please ping @simoninithomas for review.

์•ˆ๋…•ํ•˜์„ธ์š”!

ํ•œ๊ตญ์–ด๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๋ชจ๋‘๊ฐ€ ๊ฐ•ํ™”ํ•™์Šต ์ฝ”์Šค๋ฅผ ์ฝ์„ ์ˆ˜ ์žˆ๊ฒŒ ํ•ด๋ณด์•„์š” ๐ŸŒ

๋ฒˆ์—ญ์— ์ฐธ์—ฌํ•˜๊ณ  ์‹ถ์œผ์‹ ๊ฐ€์š”? ๐Ÿค— ๋ฒˆ์—ญ ๊ฐ€์ด๋“œ๋ฅผ ๋จผ์ € ์ฝ์–ด๋ณด์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค. ๋ ๋ถ€๋ถ„์— ๋ฒˆ์—ญํ•ด์•ผํ•  ํŒŒ์ผ๋“ค์ด ๋‚˜์—ด๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. ์ž‘์—…ํ•˜๊ณ  ๊ณ„์‹  ํŒŒ์ผ์ด ์žˆ๋‹ค๋ฉด ์—ฌ๊ธฐ์— ๊ฐ„๋‹จํžˆ ์•Œ๋ ค์ฃผ์„ธ์š”. ์ค‘๋ณต๋˜์ง€ ์•Š๋„๋ก ์ž‘์—…์ค‘์œผ๋กœ ํ‘œ์‹œํ•ด๋‘˜๊ฒŒ์š”.

์ฐธ๊ณ  ์‚ฌํ•ญ:

  • ๊ธฐ์ˆ  ๋ฌธ์„œ์ด์ง€๋งŒ (์นœ๊ตฌ์—๊ฒŒ ์„ค๋ช… ๋“ฃ๋“ฏ์ด) ์‰ฝ๊ฒŒ ์ฝํžˆ๋ฉด ์ข‹๊ฒ ์Šต๋‹ˆ๋‹ค. ์กด๋Œ“๋ง ๋กœ ์จ์ฃผ์‹œ๋ฉด ๊ฐ์‚ฌํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค.
  • ์„ฑ๋ณ„์€ ์ผ๋ถ€ ์–ธ์–ด(์ŠคํŽ˜์ธ์–ด, ํ”„๋ž‘์Šค์–ด ๋“ฑ)์—๋งŒ ์ ์šฉ๋˜๋Š” ์‚ฌํ•ญ์œผ๋กœ, ํ•œ๊ตญ์–ด์˜ ๊ฒฝ์šฐ ๋ฒˆ์—ญ๊ธฐ๋ฅผ ์‚ฌ์šฉํ•˜์‹  ํ›„ ๋ฌธ์žฅ ๊ธฐํ˜ธ์™€ ์กฐ์‚ฌ ๋“ฑ์ด ์•Œ๋งž๋Š”์ง€ ํ™•์ธํ•ด์ฃผ์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค.
  • ์†Œ์Šค ํด๋” ์•„๋ž˜ ko ํด๋”์— ๋ฒˆ์—ญ๋ณธ์„ ๋„ฃ์–ด์ฃผ์„ธ์š”.
  • ๋ชฉ์ฐจ(ko/_toctree.yml)๋„ ํ•จ๊ป˜ ์—…๋ฐ์ดํŠธํ•ด์ฃผ์„ธ์š”. ์˜์–ด ๋ชฉ์ฐจ์™€ ์ˆœ์„œ๊ฐ€ ๋™์ผํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.
  • ๋ชจ๋‘ ๋งˆ์น˜์…จ๋‹ค๋ฉด, ๊ธฐ๋ก์ด ์›ํ™œํ•˜๋„๋ก PR์„ ์—ฌ์‹ค ๋•Œ ํ˜„์žฌ ์ด์Šˆ(``)๋ฅผ ๋‚ด์šฉ์— ๋„ฃ์–ด์ฃผ์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค. ๋ฆฌ๋ทฐ ์š”์ฒญ์€ @simoninithomas ๋‹˜๊ป˜ ์š”์ฒญํ•ด์ฃผ์„ธ์š”.
  • ๐Ÿ™‹ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ๋งˆ์Œ๊ป ํ™๋ณดํ•ด์ฃผ์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค! ๐Ÿค— ํฌ๋Ÿผ์— ์˜ฌ๋ฆฌ์…”๋„ ์ข‹์•„์š”.
  • Unit 0. Welcome to the course
    • Welcome to the course ๐Ÿค—
    • Setup
    • Discord 101
  • Unit 1. Introduction to Deep Reinforcement Learning
    • Introduction
    • What is Reinforcement Learning?
    • The Reinforcement Learning Framework
    • The type of tasks
    • The Exploration/ Exploitation tradeoff
    • The two main approaches for solving RL problems
    • The โ€œDeepโ€ in Deep Reinforcement Learning
    • Summary
    • Glossary
    • Hands-on
    • Quiz
    • Conclusion
    • Additional Readings
  • Bonus Unit 1. Introduction to Deep Reinforcement Learning with Huggy
    • Introduction
    • How Huggy works?
    • Train Huggy
    • Play with Huggy
    • Conclusion
  • Live 1. How the course work, Q&A, and playing with Huggy
    • Live 1. How the course work, Q&A, and playing with Huggy ๐Ÿถ
  • Unit 2. Introduction to Q-Learning
    • Introduction
    • What is RL? A short recap
    • The two types of value-based methods
    • The Bellman Equation, simplify our value estimation
    • Monte Carlo vs Temporal Difference Learning
    • Mid-way Recap
    • Mid-way Quiz
    • Introducing Q-Learning
    • A Q-Learning example
    • Q-Learning Recap
    • Glossary
    • Hands-on
    • Q-Learning Quiz
    • Conclusion
    • Additional Readings
  • Unit 3. Deep Q-Learning with Atari Games
    • Introduction
    • From Q-Learning to Deep Q-Learning
    • The Deep Q-Network (DQN)
    • The Deep Q Algorithm
    • Glossary
    • Hands-on
    • Quiz
    • Conclusion
    • Additional Readings
  • Bonus Unit 2. Automatic Hyperparameter Tuning with Optuna
    • Introduction
    • Optuna
    • Hands-on
  • Unit 4. Policy Gradient with PyTorch
    • Introduction
    • What are the policy-based methods?
    • The advantages and disadvantages of policy-gradient methods
    • Diving deeper into policy-gradient
    • (Optional) the Policy Gradient Theorem
    • Hands-on
    • Quiz
    • Conclusion
    • Additional Readings
  • Unit 5. Introduction to Unity ML-Agents
    • Introduction
    • How ML-Agents works?
    • The SnowballTarget environment
    • The Pyramids environment
    • (Optional) What is curiosity in Deep Reinforcement Learning?
    • Hands-on
    • Bonus. Learn to create your own environments with Unity and MLAgents
    • Conclusion
  • Unit 6. Actor Critic methods with Robotics environments
    • Introduction
    • The Problem of Variance in Reinforce
    • Advantage Actor Critic (A2C)
    • Hands-on: Advantage Actor Critic (A2C) using Robotics Simulations with PyBullet and Panda-Gym ๐Ÿค–
    • Conclusion
    • Additional Readings
  • Unit 7. Introduction to Multi-Agents and AI vs AI
    • Introduction
    • An introduction to Multi-Agents Reinforcement Learning (MARL)
    • Designing Multi-Agents systems
    • Self-Play
    • Hands-on: Let's train our soccer team to beat your classmates' teams (AI vs. AI)
    • Conclusion
    • Additional Readings
  • Unit 8. Part 1 Proximal Policy Optimization (PPO)
    • Introduction
    • The intuition behind PPO
    • Introducing the Clipped Surrogate Objective Function
    • Visualize the Clipped Surrogate Objective Function
    • PPO with CleanRL
    • Conclusion
    • Additional Readings
  • Unit 8. Part 2 Proximal Policy Optimization (PPO) with Doom
    • Introduction
    • PPO with Sample Factory and Doom
    • Conclusion
  • Bonus Unit 3. Advanced Topics in Reinforcement Learning
    • Introduction
    • Model-Based Reinforcement Learning
    • Offline vs. Online Reinforcement Learning
    • Reinforcement Learning from Human Feedback
    • Decision Transformers

Hey there๐Ÿ‘‹ thanks for your work. As mentionned in Discord we don't support other languages for this course for now contrary to the transformer course.

However, what we can do is I can create a moon-ci-docs.huggingface.co link that will allow you to share to people who want to follow the course in Korean (and also see what the course looks like).

Currently there's an error in the Build PR documentation so I can't provide you this link (check the Failing error, from what I see is because you don't have a table of contents

Have a nice day ๐Ÿค—