What reward should you give in Q-learning for impossible actions (for example, an illegal move in chess)?
Or does Q-learning not work for environments where this happens?
Give it a negative reward.
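A minimal sketch of what that could look like with tabular Q-learning. Everything here is a made-up toy for illustration, not chess: a 5-cell corridor where moving left from cell 0 is the "impossible" action. The names `step`, `train`, and `INVALID_PENALTY`, and the penalty value of -1.0, are assumptions. The impossible action leaves the state unchanged and returns the penalty, so its Q-value ends up below that of the legal action.

```python
import random

# Hypothetical toy environment: a corridor with cells 0..4, goal at cell 4.
# Actions: 0 = left, 1 = right. Moving left from cell 0 is "impossible".
N_STATES = 5
ACTIONS = (0, 1)
GOAL = N_STATES - 1
INVALID_PENALTY = -1.0  # assumed value, tune it for your problem


def step(state, action):
    """Return (next_state, reward, done).

    An impossible action keeps the state unchanged and is penalised.
    """
    if action == 0 and state == 0:
        return state, INVALID_PENALTY, False
    next_state = state - 1 if action == 0 else state + 1
    if next_state == GOAL:
        return next_state, 1.0, True
    return next_state, 0.0, False


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Plain tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap the episode length
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            next_state, reward, done = step(state, action)
            # Standard Q-learning target: no bootstrapping from terminal states.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = train()
# In cell 0, the impossible "left" action ends up with the lower Q-value.
print(q_table[0])
```

With this setup the greedy policy in cell 0 picks "right", because the penalised action's Q-value stays negative while the legal action's Q-value never drops below zero.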