🚀 Feature
Change the param_groups handling in the state dict so that it follows the default PyTorch assumptions more closely
https://pytorch.org/docs/stable/optim.html#torch.optim.Optimizer.state_dict
Motivation
- Some users may assume that fairscale/OSS respects the default PyTorch optimizer interface with respect to the state dict
- People familiar with PyTorch optimizers would have an easier learning curve when peeking into OSS
Pitch
Rewrite the exposed state dict so that it returns "state" and "param_groups" in accordance with PyTorch expectations, without duplication
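For reference, a minimal sketch of the state-dict layout that PyTorch's built-in optimizers produce and that OSS would mirror; the keys come from the linked documentation, while the model and optimizer below are only illustrative:

```python
import torch

# Illustrative model and optimizer; any torch.optim optimizer follows this contract.
model = torch.nn.Linear(4, 2)
opt = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

# One forward/backward/step pass so that per-parameter state (momentum buffers) exists.
model(torch.randn(3, 4)).sum().backward()
opt.step()

sd = opt.state_dict()

# Per the linked docs, the state dict has exactly two top-level keys:
#   "state"        - per-parameter optimization state, keyed by parameter index
#   "param_groups" - list of dicts holding hyperparameters plus the "params" indices
assert set(sd.keys()) == {"state", "param_groups"}
print(sd["param_groups"])        # e.g. [{'lr': 0.1, 'momentum': 0.9, ..., 'params': [0, 1]}]
print(list(sd["state"].keys()))  # parameter indices, e.g. [0, 1]
```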
Alternatives
- rely on the Python/PyTorch memory model to remove duplicates in memory and while serializing
- add wrappers on the user side
Additional context