All the reinforcement learning algorithms I’ve read about are usually applied to a single

Asked: May 24, 2026

All the reinforcement learning algorithms I’ve read about are usually applied to a single agent that has a fixed number of actions. Are there any reinforcement learning algorithms for making a decision while taking into account a variable number of actions? For example, how would you apply an RL algorithm in a computer game where a player controls N soldiers, and each soldier has a varying number of actions based on its condition? You can’t formulate a fixed number of actions for a global decision maker (i.e. “the general”) because the available actions are continually changing as soldiers are created and killed. And you can’t formulate a fixed number of actions at the soldier level, since the soldier’s actions are conditional on its immediate environment. If a soldier sees no opponents, then it might only be able to walk, whereas if it sees 10 opponents, then it has 10 new possible actions, one for attacking each of the 10 opponents.

1 Answer

Editorial Team · Answer 1 · 2026-05-24T01:29:04+00:00

What you describe is nothing unusual. Reinforcement learning is a way of finding the value function, and from it a good policy, of a Markov Decision Process. In an MDP, every state has its own set of available actions, so the action set is allowed to vary from state to state. To apply reinforcement learning, you first have to define clearly what the states, actions, and rewards are in your problem.
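To make the "every state has its own set of actions" point concrete, here is a minimal sketch of tabular Q-learning in which the legal actions are looked up per state. Everything in it is an assumption made up for illustration: the state is just the number of enemies a single soldier currently sees, `available_actions`, `step`, `q_learning`, and `greedy_action` are hypothetical helper names, and the dynamics and rewards are a toy model, not anything from a real game. It also does not address coordinating N soldiers; it only shows that a varying action set needs no special algorithm.

```python
import random
from collections import defaultdict


def available_actions(state):
    """Hypothetical action set: the soldier can always walk, and can attack
    each enemy it currently sees. `state` is the number of visible enemies."""
    return ["walk"] + [("attack", i) for i in range(state)]


def step(state, action, rng):
    """Toy dynamics, assumed for illustration only."""
    if action == "walk":
        # Walking reveals between 0 and 3 enemies and earns no reward.
        return rng.randint(0, 3), 0.0
    # Attacking removes one visible enemy and earns a reward of 1.
    return state - 1, 1.0


def q_learning(episodes=500, horizon=20, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    rng = random.Random(seed)
    # Q-values keyed by (state, action); unseen pairs default to 0.0.
    q = defaultdict(float)
    for _ in range(episodes):
        state = rng.randint(0, 3)
        for _ in range(horizon):
            # Only actions that are legal in the current state are considered.
            actions = available_actions(state)
            if rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                action = max(actions, key=lambda a: q[(state, a)])
            next_state, reward = step(state, action, rng)
            # The max in the update ranges over the next state's own actions.
            best_next = max(q[(next_state, a)]
                            for a in available_actions(next_state))
            target = reward + gamma * best_next
            q[(state, action)] += alpha * (target - q[(state, action)])
            state = next_state
    return q


def greedy_action(q, state):
    """Best known action among those legal in `state`."""
    return max(available_actions(state), key=lambda a: q[(state, a)])


if __name__ == "__main__":
    q = q_learning()
    for s in range(4):
        print(s, greedy_action(q, s))
```

The only change from the fixed-action textbook version is that both the action selection and the max in the update iterate over `available_actions(state)` instead of a global action list. With many soldiers or a large state space a table like this stops being practical, and you would likely need function approximation that scores (state, action) pairs, but the per-state action set carries over unchanged.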
