Skip to content

Commit a2b033e

Browse files
authored
Fix OpenLLaMA model names (#209)
* Update PR template. (#207) * Update OpenLLaMA model names.
1 parent 85176ff commit a2b033e

File tree

4 files changed

+15
-12
lines changed

4 files changed

+15
-12
lines changed

.github/PULL_REQUEST_TEMPLATE.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,9 @@ ran. If your test fixes a bug reported in an issue, don't forget to include the
77
issue number. If your PR is still a work in progress, that's totally fine – just
88
include a note to let us know. -->
99

10+
### Corresponding documentation PR
11+
<!--- Add the link to the corresponding documentation PR here, if applicable. -->
12+
1013
### Types of change
1114
<!-- What type of change does your PR cover? Is it a bug fix, an enhancement
1215
or new feature, or a change to the documentation? -->

README.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1475,14 +1475,14 @@ Example config block:
14751475
```ini
14761476
[components.llm.model]
14771477
@llm_models = "spacy.OpenLLaMA.v1"
1478-
name = "open_llama_3b_350bt_preview"
1478+
name = "open_llama_3b"
14791479
```
14801480

1481-
| Argument | Type | Default | Description |
1482-
| ------------- | ------------------------------------------------------------------------------------------------------------------------------------- | ------- | ---------------------------------------------------------------------------------------------------------------------------- |
1483-
| `name` | `Literal["open_llama_3b_350bt_preview", "open_llama_3b_600bt_preview", "open_llama_7b_400bt_preview", "open_llama_7b_600bt_preview"]` | | The name of a OpenLLaMA model that is supported (e. g. "open_llama_3b_350bt_preview"). |
1484-
| `config_init` | `Dict[str, Any]` | `{}` | Further configuration passed on to the construction of the model with `transformers.AutoModelForCausalLM.from_pretrained()`. |
1485-
| `config_run` | `Dict[str, Any]` | `{}` | Further configuration used during model inference. |
1481+
| Argument | Type | Default | Description |
1482+
| ------------- |-------------------------------------------------------------------------------------------------| ------- | ---------------------------------------------------------------------------------------------------------------------------- |
1483+
| `name` | `Literal["open_llama_3b", "open_llama_7b", "open_llama_7b_v2", "open_llama_13b"]` | | The name of a OpenLLaMA model that is supported. |
1484+
| `config_init` | `Dict[str, Any]` | `{}` | Further configuration passed on to the construction of the model with `transformers.AutoModelForCausalLM.from_pretrained()`. |
1485+
| `config_run` | `Dict[str, Any]` | `{}` | Further configuration used during model inference. |
14861486

14871487
See the [OpenLM Research OpenLLaMA GitHub repo](https://github.com/openlm-research/open_llama) for details.
14881488

spacy_llm/models/hf/openllama.py

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -9,10 +9,10 @@
99

1010
class OpenLLaMA(HuggingFace):
1111
MODEL_NAMES = Literal[
12-
"open_llama_3b_350bt_preview", # noqa: F722
13-
"open_llama_3b_600bt_preview", # noqa: F722
14-
"open_llama_7b_400bt_preview", # noqa: F722
15-
"open_llama_7b_600bt_preview", # noqa: F722
12+
"open_llama_3b", # noqa: F722
13+
"open_llama_7b", # noqa: F722
14+
"open_llama_7b_v2", # noqa: F722
15+
"open_llama_13b", # noqa: F722
1616
]
1717

1818
def __init__(

spacy_llm/tests/models/test_openllama.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88
_PIPE_CFG = {
99
"model": {
1010
"@llm_models": "spacy.OpenLLaMA.v1",
11-
"name": "open_llama_3b_350bt_preview",
11+
"name": "open_llama_3b",
1212
},
1313
"task": {"@llm_tasks": "spacy.NoOp.v1"},
1414
}
@@ -29,7 +29,7 @@
2929
3030
[components.llm.model]
3131
@llm_models = spacy.OpenLLaMA.v1
32-
name = open_llama_3b_350bt_preview
32+
name = open_llama_3b
3333
"""
3434

3535

0 commit comments

Comments
 (0)