How can I quickly bind a specific GPU to run a model?
#1125
Replies: 3 comments 1 reply
-
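Perhaps you could write a YAML file to specify this: https://kvcache-ai.github.io/ktransformers/en/multi-gpu-tutorial.html . In theory, you only need to redirect every V3 layer that gets loaded onto a GPU to GPU A, and every R1 layer to GPU B.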
1 reply
-
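I'm looking for something simple and blunt that directly specifies which GPU to run on, either by modifying the source code or through a simple configuration setting.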
0 replies
-
I found a simple way, and it solved the problem perfectly~
0 replies
-
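Help wanted: I have two GPUs and want to bind GPU A to run the V3 model and GPU B to run the R1 model. Is there a good way to specify this directly, for example by setting an environment variable?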
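
The thread does not say which method ended up being used. For reference, one common environment-variable approach with CUDA applications is `CUDA_VISIBLE_DEVICES`, which limits the GPUs a process can see. Below is a minimal Python sketch of that idea, assuming each model runs as its own process. The helper names `env_for_gpu` and `launch_on_gpu` are hypothetical, and the actual server command for each model is not given in the thread, so it has to be supplied by the caller.

```python
import os
import subprocess
import sys


def env_for_gpu(gpu_index: int) -> dict:
    """Return a copy of the current environment that exposes only one GPU.

    Assumption: the launched program is a CUDA application, so it honors
    CUDA_VISIBLE_DEVICES and sees the chosen card as device 0.
    """
    env = os.environ.copy()
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_index)
    return env


def launch_on_gpu(command: list, gpu_index: int) -> subprocess.Popen:
    """Start `command` as a child process restricted to a single GPU."""
    return subprocess.Popen(command, env=env_for_gpu(gpu_index))
```

As a usage sketch, calling `launch_on_gpu(v3_command, 0)` and `launch_on_gpu(r1_command, 1)` would start two processes, each seeing a different card, where `v3_command` and `r1_command` are placeholders for whatever command starts each model. The same effect can be had from a shell by prefixing each command with `CUDA_VISIBLE_DEVICES=0` or `CUDA_VISIBLE_DEVICES=1`.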