[RLlib] Enhance ConnectorV2 `ObservationPreprocessor` APIs (add multi-agent support; add `episode` arg). #54209

sven1977 · 2025-06-30T15:14:18Z

Enhance ConnectorV2 ObservationPreprocessor APIs:

- Add multi-agent support.
- Add an `episode` arg (so that, for example, rewards/actions can be added to an observation during preprocessing).
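
To make the `episode` use case concrete, here is a minimal sketch of a preprocessing function that appends the most recent reward to the observation. `ToyEpisode`, `last_reward`, and the free function `preprocess` are hypothetical stand-ins written for illustration; they are not RLlib classes, and the real episode object exposes its own accessors.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ToyEpisode:
    """Hypothetical stand-in for an episode object (not RLlib's episode class)."""

    rewards: List[float] = field(default_factory=list)

    def last_reward(self) -> float:
        # Right after a reset no reward exists yet, so fall back to 0.0.
        return self.rewards[-1] if self.rewards else 0.0


def preprocess(observation: List[float], episode: ToyEpisode) -> List[float]:
    """Return a copy of the observation with the most recent reward appended."""
    return list(observation) + [episode.last_reward()]


# One episode that has just been reset and one that is already running.
fresh = ToyEpisode()
running = ToyEpisode(rewards=[0.5, 1.0])

obs_at_reset = preprocess([0.1, 0.2], fresh)
obs_mid_episode = preprocess([0.3, 0.4], running)
```

Note that appending a value changes the observation's shape, so a real preprocessor of this kind would presumably also have to report an enlarged observation space.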

Why are these changes needed?

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: sven1977 <[email protected]>

…_redo_connector_v2 Signed-off-by: sven1977 <[email protected]> # Conflicts: # rllib/connectors/common/add_states_from_episodes_to_batch.py # rllib/connectors/module_to_env/remove_single_ts_time_rank_from_batch.py

…_redo_connector_v2 Signed-off-by: sven1977 <[email protected]> # Conflicts: # doc/source/rllib/package_ref/index.rst

Copilot

Pull Request Overview

This PR enhances the ConnectorV2 ObservationPreprocessor APIs to add multi-agent support and introduces an extra episode argument to the preprocess function. Key changes include:

- Renaming the existing ObservationPreprocessor to SingleAgentObservationPreprocessor and adding a new MultiAgentObservationPreprocessor.
- Updating the preprocess method signature and call implementations to pass the corresponding episode object.
- Adjusting several connector components (e.g., numpy_to_tensor, add_time_dim_to_batch_and_zero_pad, add_states_from_episodes_to_batch) to safely handle cases when rl_module is None and updating documentation accordingly.
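
As an illustration of what a multi-agent preprocessing step might look like, the sketch below maps a dict of per-agent observations to a new dict in which every agent also sees the other agents' observations. The function name `preprocess_multi_agent`, the agent IDs, and the unused `episode` parameter are assumptions made for this example; this is not the signature of `MultiAgentObservationPreprocessor`.

```python
from typing import Any, Dict, List, Optional


def preprocess_multi_agent(
    observations: Dict[str, List[float]],
    episode: Optional[Any] = None,  # Hypothetical: kept only to mirror the added `episode` arg.
) -> Dict[str, List[float]]:
    """Give each agent its own observation followed by all other agents' observations."""
    result: Dict[str, List[float]] = {}
    for agent_id, own_obs in observations.items():
        # Visit the other agents in sorted order so the layout is deterministic.
        others = [
            value
            for other_id in sorted(observations)
            if other_id != agent_id
            for value in observations[other_id]
        ]
        result[agent_id] = list(own_obs) + others
    return result


joint = preprocess_multi_agent({"agent_0": [1.0], "agent_1": [2.0, 3.0]})
```

Operating on the whole dict at once is what distinguishes this from a single-agent step, which would only ever see one agent's observation.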

Reviewed Changes

Copilot reviewed 12 out of 21 changed files in this pull request and generated no comments.

Show a summary per file

| File | Description |
| --- | --- |
| `rllib/connectors/env_to_module/observation_preprocessor.py` | Renamed and extended preprocessor APIs for single-agent and multi-agent setups. |
| `rllib/connectors/common/numpy_to_tensor.py` | Removed the as_learner_connector parameter and added a conditional check on rl_module. |
| `rllib/connectors/common/add_time_dim_to_batch_and_zero_pad.py` | Updated rl_module check to account for None values. |
| `rllib/connectors/common/add_states_from_episodes_to_batch.py` | Updated rl_module check to account for None values. |
| `doc/source/rllib/*.rst` | Updated documentation to include connector-v2 and related new APIs. |
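
The "account for None values" entries describe a common guard pattern, sketched below under stated assumptions. `ToyModule`, `maybe_add_state`, and the `state_in` key are hypothetical names chosen for this example; the actual checks in the listed connector files may look different.

```python
from typing import Any, Dict, Optional


class ToyModule:
    """Hypothetical module that only reports whether it is stateful."""

    def __init__(self, stateful: bool) -> None:
        self._stateful = stateful

    def is_stateful(self) -> bool:
        return self._stateful


def maybe_add_state(
    batch: Dict[str, Any], rl_module: Optional[ToyModule] = None
) -> Dict[str, Any]:
    """Add a placeholder state entry only when a stateful module is present."""
    # Check for None first so that no method is called on a missing module.
    if rl_module is None or not rl_module.is_stateful():
        return batch
    extended = dict(batch)  # Copy so the caller's batch is left unchanged.
    extended["state_in"] = [0.0]
    return extended


without_module = maybe_add_state({"obs": [1.0]})
with_stateless = maybe_add_state({"obs": [1.0]}, ToyModule(stateful=False))
with_stateful = maybe_add_state({"obs": [1.0]}, ToyModule(stateful=True))
```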

Comments suppressed due to low confidence (2)

rllib/connectors/env_to_module/observation_preprocessor.py:162

Consider implementing a proper 'set_observations' API for multi-agent episodes instead of relying on the single-agent workaround.

            # TODO (sven): Implement set_observations API for multi-agent episodes.

rllib/connectors/common/numpy_to_tensor.py:60

The removal of the 'as_learner_connector' parameter may affect users expecting that configuration. Ensure that the associated documentation and any dependent code reflect this change.

*,
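
To show why removing a keyword argument can affect callers, here is a small sketch using a keyword-only constructor (the `*,` marker in the snippet above). `ToyConnector`, its `device` option, and the `build` helper are hypothetical and only stand in for a connector class; this is not the constructor of the real `numpy_to_tensor` connector.

```python
from typing import Any, Optional, Tuple


class ToyConnector:
    """Hypothetical connector whose options must be passed by keyword."""

    def __init__(self, *, device: Optional[str] = None) -> None:
        self.device = device


def build(**kwargs: Any) -> Tuple[Optional[ToyConnector], Optional[str]]:
    """Try to construct the connector and return (instance, error message)."""
    try:
        return ToyConnector(**kwargs), None
    except TypeError as exc:
        # Python raises TypeError for keyword arguments the signature does not accept.
        return None, str(exc)


connector, _ = build(device="cpu")
rejected, error = build(as_learner_connector=True)
```

Code that still passes a removed keyword fails at construction time rather than silently ignoring it, which is why the review comment asks for documentation and dependent code to be checked.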

simonsays1980

LGTM.

sven1977 added 28 commits August 13, 2024 10:16

wip

c5dcc11

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

bf7a10e

…_redo_connector_v2

wip

32b33ec

Signed-off-by: sven1977 <[email protected]>

wip

e7a6f24

Signed-off-by: sven1977 <[email protected]>

wip

30272c1

Signed-off-by: sven1977 <[email protected]>

wip

f2a2359

Signed-off-by: sven1977 <[email protected]>

wip

bba9fbb

Signed-off-by: sven1977 <[email protected]>

wip

328e0c1

Signed-off-by: sven1977 <[email protected]>

wip

345ee78

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

ebdd51b

…_redo_connector_v2 Signed-off-by: sven1977 <[email protected]> # Conflicts: # doc/source/rllib/package_ref/index.rst

wip

5a822cb

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

e7468dd

…_redo_connector_v2

wip

2550c03

Signed-off-by: sven1977 <[email protected]>

LINT

bc9fb66

Signed-off-by: sven1977 <[email protected]>

LINT

b870d4f

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

9df3bdf

…_redo_connector_v2

wip

d513f61

Signed-off-by: sven1977 <[email protected]>

wip

f273b72

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

7bfb131

…_redo_connector_v2

wip

b96eaa0

Signed-off-by: sven1977 <[email protected]>

wip

8b9a981

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

bf6a086

…_redo_connector_v2

wip

68f48a6

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into docs…

30bde82

…_redo_connector_v2

Merge branch 'master' of https://github.com/ray-project/ray into docs…

f7df27d

…_redo_connector_v2

wip

99da912

Signed-off-by: sven1977 <[email protected]>

wip

ad3c2eb

Signed-off-by: sven1977 <[email protected]>

Copilot AI review requested due to automatic review settings June 30, 2025 15:14

sven1977 requested a review from a team as a code owner June 30, 2025 15:14

Copilot AI reviewed Jun 30, 2025

sven1977 assigned simonsays1980 Jun 30, 2025

sven1977 added 2 commits June 30, 2025 18:14

wip

ac04d49

Signed-off-by: sven1977 <[email protected]>

wip

801549a

Signed-off-by: sven1977 <[email protected]>

simonsays1980 approved these changes Jun 30, 2025

wip

befdb79

Signed-off-by: sven1977 <[email protected]>

sven1977 enabled auto-merge (squash) July 1, 2025 15:16

github-actions bot added the `go` label (add ONLY when ready to merge, run all tests) Jul 1, 2025

sven1977 merged commit 1bffb1f into ray-project:master Jul 1, 2025
6 of 7 checks passed

elliot-barn pushed a commit that referenced this pull request Jul 2, 2025

[RLlib] Enhance ConnectorV2 ObservationPreprocessor APIs (add multi…

2618a79

…-agent support; add `episode` arg). (#54209) Signed-off-by: elliot-barn <[email protected]>

sven1977 deleted the enhance_preprocessor_apis branch July 2, 2025 06:00
