[Feature Request] Add a next_observations field to RolloutBufferSamples #1328

### 🚀 Feature

When sampling from a `RolloutBuffer`, we return `RolloutBufferSamples` containing tensors of observations, actions, etc.

https://github.com/DLR-RM/stable-baselines3/blob/69b94dd6a8f93cf0b9d2201dcae9c146b8a9c75d/stable_baselines3/common/buffers.py#L473-L479

It would be nice if `RolloutBufferSamples` could also contain a batch of _next_ observations (alongside a mask that, for each observation, tells us whether that observation has a successor).
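To make the request concrete, here is a minimal sketch of how next observations and the mask could be derived before sampling. It is not the code from the PR. It assumes the buffer stores observations in an array of shape `(n_steps, n_envs, *obs_shape)` together with an `episode_starts` array of shape `(n_steps, n_envs)`; the function name `next_observations_and_mask` is hypothetical.

```python
import numpy as np


def next_observations_and_mask(observations, episode_starts):
    """Pair each stored observation with its successor, if any.

    observations:   array of shape (n_steps, n_envs, *obs_shape) (assumed layout)
    episode_starts: array of shape (n_steps, n_envs); nonzero where the
                    observation at that step begins a new episode
    """
    next_obs = np.zeros_like(observations)
    has_next = np.zeros(episode_starts.shape, dtype=bool)

    # Step t has a successor in the buffer when step t + 1 exists and
    # does not start a new episode.
    next_obs[:-1] = observations[1:]
    has_next[:-1] = episode_starts[1:] == 0

    # Zero out entries without a successor so stale data is never used.
    next_obs[~has_next] = 0
    return next_obs, has_next


# Toy rollout: 4 steps, 1 environment, scalar observations.
obs = np.array([[[1.0]], [[2.0]], [[3.0]], [[4.0]]])
starts = np.array([[1.0], [0.0], [1.0], [0.0]])  # a new episode begins at step 2
next_obs, has_next = next_observations_and_mask(obs, starts)
```

In this sketch the last step of the rollout is always masked out, because its successor is not stored in the arrays passed in.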

### Motivation

I'm implementing an RL pipeline in which I extend PPO with a custom loss. For this custom loss, I need access to (observation, next observation) pairs.

In the PPO implementation 

https://github.com/DLR-RM/stable-baselines3/blob/69b94dd6a8f93cf0b9d2201dcae9c146b8a9c75d/stable_baselines3/ppo/ppo.py#L192-L197

each batch of rollout data over which we compute the PPO loss is a `RolloutBufferSamples` instance. Because each batch is a random subset of the observations in the `RolloutBuffer`, it does not carry enough information to recover the next observation for each observation in the batch.

### Pitch

I have already implemented this feature and submitted it as a PR [to be linked after submission].

### Alternatives

Alternatively, we could return the indices of the sampled elements with respect to the original buffer. While this would allow more general buffer manipulation, it feels less pleasant to use.
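For comparison, a rough sketch of what the index-based alternative would ask of the caller. It assumes the buffer has been flattened env-major (flat index = `env * n_steps + step`) and that a flattened `episode_starts` array is available; both the layout and the helper name `next_from_indices` are assumptions made for illustration.

```python
import numpy as np


def next_from_indices(flat_obs, flat_starts, batch_inds, n_steps):
    """Look up successors for sampled indices into a flattened buffer.

    Assumes an env-major flattening: flat index = env * n_steps + step.
    """
    step = batch_inds % n_steps
    in_range = step < n_steps - 1
    # Clamp so the lookup stays in bounds; clamped entries are masked out below.
    next_inds = np.where(in_range, batch_inds + 1, batch_inds)
    has_next = in_range & (flat_starts[next_inds] == 0)
    return flat_obs[next_inds], has_next


# Toy buffer: 2 environments, 3 steps each, scalar observations 0..5.
flat_obs = np.arange(6.0).reshape(6, 1)
flat_starts = np.array([1.0, 0.0, 0.0, 1.0, 0.0, 1.0])
batch_inds = np.array([4, 0, 2, 5])
next_obs, has_next = next_from_indices(flat_obs, flat_starts, batch_inds, n_steps=3)
```

Every caller would need to repeat this bookkeeping (boundary checks, episode starts, layout knowledge), which is the main reason a ready-made `next_observations` field seems nicer.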

### Additional context

_No response_

### Checklist

- [X] I have checked that there is no similar [issue](https://github.com/DLR-RM/stable-baselines3/issues) in the repo

Code referenced by the first permalink above (`stable_baselines3/common/buffers.py`, lines 473-479):

    def _get_samples(
        self,
        batch_inds: np.ndarray,
        env: Optional[VecNormalize] = None,
    ) -> RolloutBufferSamples:  # type: ignore[signature-mismatch] #FIXME
        data = (
            self.observations[batch_inds],

Code referenced by the second permalink above (`stable_baselines3/ppo/ppo.py`, lines 192-197):

    # train for n_epochs epochs
    for epoch in range(self.n_epochs):
        approx_kl_divs = []
        # Do a complete pass on the rollout buffer
        for rollout_data in self.rollout_buffer.get(self.batch_size):
            actions = rollout_data.actions
