transformers==4.37, yi & yuan2 & vicuna #11805

ada-jt1725 · 2024-08-15T02:38:48Z

added sample output for yi & vicuna
updated vicuna example model to 7b-v1.5 & 13b-v1.5
added yi-6b-chat model to yi

rnwang04 · 2024-08-15T02:48:47Z

python/llm/example/GPU/HuggingFace/LLM/vicuna/README.md

+#### [lmsys/vicuna-13b-v1.5](https://huggingface.co/lmsys/vicuna-13b-v1.5)
 ```log
-Inference time: xxxx s
+Inference time: 1.0269405841827393 s


Don't show real performance data in our example, just keep Inference time: xxxx s.

rnwang04 · 2024-08-15T02:49:08Z

python/llm/example/GPU/HuggingFace/LLM/vicuna/README.md

+#### [eachadea/vicuna-7b-v1.5](https://huggingface.co/lmsys/vicuna-7b-v1.5)
 ```log
-Inference time: xxxx s
+Inference time: 0.7162051200866699 s


just keep Inference time: xxxx s.

rnwang04 · 2024-08-15T02:49:52Z

python/llm/example/GPU/HuggingFace/LLM/yi/README.md

 In the example, several arguments can be passed to satisfy your requirements:

- `--repo-id-or-model-path REPO_ID_OR_MODEL_PATH`: argument defining the huggingface repo id for the Yi model (e.g. `01-ai/Yi-6B`) to be downloaded, or the path to the huggingface checkpoint folder. It is default to be `'01-ai/Yi-6B'`.
+- `--repo-id-or-model-path REPO_ID_OR_MODEL_PATH`: argument defining the huggingface repo id for the Yi model (e.g. `01-ai/Yi-6B`) to be downloaded, or the path to the huggingface checkpoint folder. It is default to be `'01-ai/Yi-6B-Chat'`.


(e.g. 01-ai/Yi-6B and 01-ai/Yi-6B-Chat)

rnwang04 · 2024-08-15T02:50:26Z

python/llm/example/GPU/HuggingFace/LLM/yi/README.md


 ```log
-Inference time: xxxx s
+Inference time: 1.1255202293395996 s


just keep Inference time: xxxx s.

rnwang04 · 2024-08-15T02:50:36Z

python/llm/example/GPU/HuggingFace/LLM/yi/README.md

+
+#### [01-ai/Yi-6B-Chat](https://huggingface.co/01-ai/Yi-6B-Chat)
+```log
+Inference time: 0.5318927764892578 s


just keep Inference time: xxxx s.

rnwang04 · 2024-08-15T02:51:10Z

python/llm/example/GPU/HuggingFace/LLM/yi/generate.py

 if __name__ == '__main__':
    parser = argparse.ArgumentParser(description='Predict Tokens using `generate()` API for Yi model')
-    parser.add_argument('--repo-id-or-model-path', type=str, default="01-ai/Yi-6B",
+    parser.add_argument('--repo-id-or-model-path', type=str, default="/home/arda/jinhe/Yi-6B-Chat",


default="01-ai/Yi-6B-Chat"

rnwang04 · 2024-08-15T03:20:00Z

python/llm/example/GPU/HuggingFace/LLM/yi/generate.py

 if __name__ == '__main__':
    parser = argparse.ArgumentParser(description='Predict Tokens using `generate()` API for Yi model')
-    parser.add_argument('--repo-id-or-model-path', type=str, default="01-ai/Yi-6B",
+    parser.add_argument('--repo-id-or-model-path', type=str, default="/01-ai/Yi-6B-Chat",


default="01-ai/Yi-6B-Chat"

rnwang04 · 2024-08-15T05:28:06Z

python/llm/example/GPU/HuggingFace/LLM/yi/generate.py

 from ipex_llm.transformers import AutoModelForCausalLM
 from transformers import AutoTokenizer

 # Refer to https://huggingface.co/01-ai/Yi-6B-Chat#31-use-the-chat-model


also delete this line

rnwang04

LGTM

ada-jt1725 added 3 commits August 15, 2024 09:51

transformers==4.37

6493736

added yi model

8a5f338

added yi model

b06c51f

ada-jt1725 changed the title ~~transformers==4.37, yuan2 & vicuna~~ transformers==4.37, yi & yuan2 & vicuna Aug 15, 2024

rnwang04 reviewed Aug 15, 2024

View reviewed changes

xxxx

9aca692

rnwang04 reviewed Aug 15, 2024

View reviewed changes

delete prompt template

504f243

rnwang04 reviewed Aug 15, 2024

View reviewed changes

/ and delete

876eb20

rnwang04 approved these changes Aug 15, 2024

View reviewed changes

rnwang04 merged commit 2fbbb51 into intel:main Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

transformers==4.37, yi & yuan2 & vicuna #11805

transformers==4.37, yi & yuan2 & vicuna #11805

Uh oh!

ada-jt1725 commented Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 Aug 15, 2024

Uh oh!

rnwang04 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

transformers==4.37, yi & yuan2 & vicuna #11805

transformers==4.37, yi & yuan2 & vicuna #11805

Uh oh!

Conversation

ada-jt1725 commented Aug 15, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rnwang04 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants