Collapse reference+learner hydra heads when using LoRA #320

@jon-tow

Description

🚀 The feature, motivation, and pitch

With additive (delta-style) parameter-efficient tuning methods such as LoRA, we should be able to make the hydra architecture slightly more memory-efficient by using a single block whose forward pass computes roughly `frozen_head + tunable_weights` for the learner/policy head and just `frozen_head` for the reference head, instead of maintaining two separate heads.
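
A minimal sketch of what this could look like, assuming a PyTorch-style module and a generic LoRA-style low-rank delta. The class name `SharedHydraHead`, its constructor arguments, and the `policy` flag are all hypothetical illustrations, not trlX's actual API: the frozen base projection is shared, and the additive delta is applied only when the head is run in policy mode.

```python
import torch
import torch.nn as nn


class SharedHydraHead(nn.Module):
    """Hypothetical single head serving both the policy and the reference pass.

    The frozen base projection is shared; a low-rank (LoRA-style) additive
    delta is applied only in "policy" mode. Names and shapes are illustrative.
    """

    def __init__(self, hidden_size: int, out_size: int, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        # Frozen base head, shared by the policy and reference forward passes.
        self.frozen_head = nn.Linear(hidden_size, out_size, bias=False)
        self.frozen_head.weight.requires_grad_(False)

        # Tunable low-rank delta: W + (alpha / rank) * B @ A
        self.lora_a = nn.Parameter(torch.randn(rank, hidden_size) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(out_size, rank))
        self.scaling = alpha / rank

    def forward(self, hidden_states: torch.Tensor, policy: bool = True) -> torch.Tensor:
        # Reference pass: frozen weights only.
        out = self.frozen_head(hidden_states)
        if policy:
            # Policy pass: frozen weights plus the additive LoRA delta.
            delta = hidden_states @ self.lora_a.T @ self.lora_b.T
            out = out + self.scaling * delta
        return out


head = SharedHydraHead(hidden_size=768, out_size=50257)
x = torch.randn(2, 10, 768)
policy_logits = head(x, policy=True)       # frozen_head + tunable_weights
reference_logits = head(x, policy=False)   # frozen_head only
```

Because the delta is purely additive, running the same block with the delta disabled reproduces the frozen reference head exactly, so no second copy of the head's weights needs to be kept in memory.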

CC @LouisCastricato and @cat-state, who pointed this out.

Alternatives

No response

Additional context

No response
