Add LoRA support to TRLX #80

@ethankim00

Description

🚀 The feature, motivation, and pitch

LoRA and other parameter-efficient fine-tuning methods can provide a number of advantages when fine-tuning large language models. These methods update only a small fraction of the model's parameters. For example, LoRA trains low-rank reparameterizations of the weight matrices while keeping the pretrained weights frozen, reducing the number of trainable parameters by up to 10,000×.
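
For illustration, here is a minimal PyTorch sketch of the low-rank reparameterization idea (a hypothetical `LoRALinear`, not TRLX or LoRA-library code): the pretrained weight stays frozen and only the small `A` and `B` factors are trained.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Hypothetical sketch: y = x W^T + (alpha / r) * x A^T B^T, with W frozen."""
    def __init__(self, d_in, d_out, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)  # stands in for a pretrained layer
        for p in self.base.parameters():     # freeze the pretrained weights
            p.requires_grad_(False)
        self.lora_A = nn.Parameter(torch.randn(r, d_in) * 0.01)  # trainable, r x d_in
        self.lora_B = nn.Parameter(torch.zeros(d_out, r))        # trainable, d_out x r
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling
```

With `d_in = d_out = 4096` and `r = 8`, the frozen weight has ~16.8M entries while the trainable factors have only `r * (d_in + d_out) = 65,536`, which is where the large parameter savings come from.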

Key Advantages:

  • Reduced storage cost and the ability to swap in different adapters for different tasks
  • Increased training efficiency and reduced memory requirements, since fewer optimizer states need to be tracked

Alternatives

No response

Additional context

The OpenDelta library provides support for LoRA and other "delta" methods:

from transformers import AutoModelForCausalLM
from opendelta import LoraModel

model = AutoModelForCausalLM.from_pretrained(model_base)
# wrap the backbone with LoRA modules on the selected submodules
delta_model = LoraModel(backbone_model=model, modified_modules=['fc2'])
# freeze everything except the delta parameters (and layernorm embeddings)
delta_model.freeze_module(exclude=["deltas", "layernorm_embedding"], set_state_dict=True)
# save only the trained parameters
delta_model.save_finetuned(save_path)
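
As a quick sanity check (a hedged sketch, reusing the `model` object from above), one can count how many parameters remain trainable after freezing, which is what drives the reduced optimizer-state memory:

```python
# Count trainable vs. total parameters after delta_model.freeze_module(...)
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable: {trainable:,} / {total:,} ({100 * trainable / total:.2f}%)")
```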
