I want to train a agent to place a polyomino (only one for example a square of the measure of 2x2) on a 9x9 field, that is either empty or already contains multiple OTHER (not the 2x2 square one) polyominos. So the observation space would not be continuous. Is this a proper use case for RL?
Can I use Reinforcment Learning for a problem that has a non continous observation space?
48 Views Asked by FlorianFusseder At
1
There are 1 best solutions below
Related Questions in ALGORITHM
- XQuery: Select a node in the context of a variable
- Extract value of a input hidden DOMXpath php
- There is no element matching XPath "//html" (Behat\Mink\Exception\DriverException)
- Xpath rule template is missing in SonarQube 5.1
- Xpath having elements with and without prefixes in java
- Traversing with XPath?
- Unable to locate element using cssSelector and xpath
- BaseX XQuery error: root(): no context value bound
- How to add spaces between nodes when using string() on a tree in XPath
- grabbing value from html by xpath in python
Related Questions in MACHINE-LEARNING
- XQuery: Select a node in the context of a variable
- Extract value of a input hidden DOMXpath php
- There is no element matching XPath "//html" (Behat\Mink\Exception\DriverException)
- Xpath rule template is missing in SonarQube 5.1
- Xpath having elements with and without prefixes in java
- Traversing with XPath?
- Unable to locate element using cssSelector and xpath
- BaseX XQuery error: root(): no context value bound
- How to add spaces between nodes when using string() on a tree in XPath
- grabbing value from html by xpath in python
Related Questions in REINFORCEMENT-LEARNING
- XQuery: Select a node in the context of a variable
- Extract value of a input hidden DOMXpath php
- There is no element matching XPath "//html" (Behat\Mink\Exception\DriverException)
- Xpath rule template is missing in SonarQube 5.1
- Xpath having elements with and without prefixes in java
- Traversing with XPath?
- Unable to locate element using cssSelector and xpath
- BaseX XQuery error: root(): no context value bound
- How to add spaces between nodes when using string() on a tree in XPath
- grabbing value from html by xpath in python
Related Questions in MACHINE-LEARNING-MODEL
- XQuery: Select a node in the context of a variable
- Extract value of a input hidden DOMXpath php
- There is no element matching XPath "//html" (Behat\Mink\Exception\DriverException)
- Xpath rule template is missing in SonarQube 5.1
- Xpath having elements with and without prefixes in java
- Traversing with XPath?
- Unable to locate element using cssSelector and xpath
- BaseX XQuery error: root(): no context value bound
- How to add spaces between nodes when using string() on a tree in XPath
- grabbing value from html by xpath in python
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Sure, why not? The simplest versions of reinforcement learning algorithms use a discrete state space (and, indeed, assume for convergence that the agent is able to visit each state sufficiently many times). Even if there are too many states and you have to replace the Q function by a learned approximation (probably a neural net), you can use a one-hot encoding for the input.