Specifying observation space for Q-Mix in ray

252 Views Asked by ckorzhik At 28 July 2025 at 11:20

I see that I have to define per-agent observations to use QMIX + LSTM, as described here https://github.com/ray-project/ray/issues/8407#issuecomment-627401186 or as in this example https://github.com/ray-project/ray/blob/master/rllib/examples/two_step_game.py#L81

However, I don't understand what I should put into ENV_STATE.

Is this field for the states that a player may be in? Are there any restrictions on them? Are they connected in any way with the observations (the obs field next to it)?


ckorzhik On 01 December 2022 at 08:58

ENV_STATE describes the space (shape) of the global environment state, and obs describes the space of each agent's own observations.

However, it will not magically work for any environment. You have to wrap your observations and the environment state in a dictionary, as in this example https://github.com/ray-project/ray/blob/1.11.1/rllib/examples/env/two_step_game.py#L85 so that your environment returns that dictionary after every step and on reset().
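To make the dictionary layout concrete, here is a minimal sketch in plain Python rather than real RLlib code. ToyTwoAgentEnv and its state layout are hypothetical, and the local ENV_STATE constant is assumed to mirror the value of RLlib's ENV_STATE ("state"). In a real setup you would subclass MultiAgentEnv and declare a matching Dict observation space.

```python
import random

# Assumption: stands in for RLlib's ENV_STATE constant (believed to be "state").
ENV_STATE = "state"


class ToyTwoAgentEnv:
    """Hypothetical two-agent environment, used only to show the dict layout."""

    def __init__(self, seed=0):
        self._rng = random.Random(seed)
        self._state = [0, 0, 0]

    def _obs(self):
        # Each agent gets its own local "obs" plus a copy of the same global state.
        return {
            agent_id: {
                "obs": [self._state[agent_id], agent_id],
                ENV_STATE: list(self._state),
            }
            for agent_id in (0, 1)
        }

    def reset(self):
        self._state = [0, 0, 0]
        return self._obs()

    def step(self, action_dict):
        # Toy dynamics: the global state records both actions plus one random bit.
        self._state = [action_dict[0], action_dict[1], self._rng.randint(0, 1)]
        rewards = {0: 0.0, 1: 0.0}
        dones = {"__all__": False}
        infos = {0: {}, 1: {}}
        return self._obs(), rewards, dones, infos
```

The point is that both reset() and step() return the same nested structure for every agent, so the observation space you declare has to describe that whole dictionary, not just the obs part.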

After that, you can group the agents with with_agent_groups.
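As a rough illustration of what grouping does to the observations, the sketch below combines per-agent dictionaries into one tuple per group. group_observations is a hypothetical helper written for this answer, not RLlib's implementation. In RLlib itself you would pass a similar grouping dict to with_agent_groups, together with Tuple observation and action spaces for the group.

```python
def group_observations(per_agent_obs, groups):
    """Combine per-agent observation dicts into one tuple per group.

    `groups` maps a group id to an ordered list of agent ids.
    """
    return {
        group_id: tuple(per_agent_obs[agent_id] for agent_id in agent_ids)
        for group_id, agent_ids in groups.items()
    }


# Same shape of mapping that two_step_game.py uses for its single group.
grouping = {"group_1": [0, 1]}

per_agent_obs = {
    0: {"obs": [0, 0], "state": [0, 0, 0]},
    1: {"obs": [0, 1], "state": [0, 0, 0]},
}

grouped = group_observations(per_agent_obs, grouping)
```

After grouping, the policy sees one observation per group whose elements are the individual agents' dictionaries, which is why the group's observation space is a Tuple of the per-agent Dict spaces.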

As you can see from the QMIX sources, you can also define action masks in the same dictionary: https://github.com/ray-project/ray/blob/1.11.1/rllib/agents/qmix/qmix_policy.py#L93
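A small sketch of adding such a mask to one agent's dictionary follows. The with_action_mask helper is hypothetical. The "action_mask" key name and the requirement that the mask has one entry per action reflect my reading of the linked qmix_policy.py, so check them against the version you use.

```python
def with_action_mask(agent_obs, n_actions, allowed_actions):
    """Return a copy of agent_obs with a 0/1 mask, one entry per action."""
    mask = [1 if action in allowed_actions else 0 for action in range(n_actions)]
    return {**agent_obs, "action_mask": mask}


agent_obs = {"obs": [0, 1], "state": [0, 0, 0]}
masked_obs = with_action_mask(agent_obs, n_actions=3, allowed_actions={0, 2})
```

If you add the mask, the declared observation space needs a matching "action_mask" entry as well.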
