Add topic "Q-learning" Accepted
Changes: 6
-
Add Intro to Q-learning in RL
- Title
-
- Unchanged
- Intro to Q-learning in RL
- Type
-
- Unchanged
- Interactive
- Created
-
- Unchanged
- 2021-04-30
- Description
-
- Unchanged
- In this tutorial, we aim to provide readers with a high-level overview of the fundamentals of RL as well as example code in Python, introducing the OpenAI Gym library. We begin with building intuitions about what is considered an RL problem and we introduce formal definitions as well as key terminologies that are used to describe and model an RL application. In parallel, we will focus on solving a concrete example of an RL problem (CartPole) using a classic RL algorithm called Q-learning.
- Link
-
- Unchanged
- https://deepnote.com/@ken-e7bd/Intro-to-Q-learning-in-RL-4R450s6_RVKJIC2xiqs71g
- Identifier
-
- Unchanged
- no value
Resource | v1 | current (v1) -
Add Q-learning
- Title
-
- Unchanged
- Q-learning
- Description
-
- Unchanged
- Q-learning is a model-free reinforcement learning algorithm to learn the value of an action in a particular state. It does not require a model of the environment (hence "model-free"), and it can handle problems with stochastic transitions and rewards without requiring adaptations. For any finite Markov decision process (FMDP), Q-learning finds an optimal policy in the sense of maximizing the expected value of the total reward over any and all successive steps, starting from the current state. Q-learning can identify an optimal action-selection policy for any given FMDP, given infinite exploration time and a partly-random policy. "Q" refers to the function that the algorithm computes - the expected rewards for an action taken in a given state.
- Link
-
- Unchanged
- https://en.wikipedia.org/?curid=1281850
Topic | v1 | current (v1) -
Add Reinforcement learning
- Title
-
- Unchanged
- Reinforcement learning
- Description
-
- Unchanged
- Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Reinforcement learning differs from supervised learning in not needing labelled input/output pairs be presented, and in not needing sub-optimal actions to be explicitly corrected. Instead the focus is on finding a balance between exploration (of uncharted territory) and exploitation (of current knowledge).
- Link
-
- Unchanged
- https://en.wikipedia.org/?curid=66294
Topic | v1 | current (v1) -
Add Q-learning treated in Intro to Q-learning in RL
- Current
- treated in
Topic to resource relation | v1 -
Add Deep learning parent of Reinforcement learning
- Current
- parent of
Topic to topic relation | v1 -
Add Q-learning used by Reinforcement learning
- Current
- used by
Topic to topic relation | v1