Support Llama MoE model #2
Conversation
        s += f", reduce_results={self.reduce_results}"
        return s


class TriteiaLinear(LinearBase):
Let's call it SparseQuantizedLinear instead of TriteiaLinear.
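For illustration, the suggested rename might look like the sketch below. Only the two repr-building lines appear in the diff; the `LinearBase` stand-in, the constructor signature, and the `extra_repr` method name are assumptions made to keep the example self-contained, not the actual model code.

```python
class LinearBase:
    """Minimal stand-in for the real base class, which is not shown in the diff."""

    def __init__(self, input_size: int, output_size: int):
        self.input_size = input_size
        self.output_size = output_size


class SparseQuantizedLinear(LinearBase):
    """Hypothetical shape of the layer after the suggested rename."""

    def __init__(self, input_size: int, output_size: int, reduce_results: bool = True):
        super().__init__(input_size, output_size)
        self.reduce_results = reduce_results

    def extra_repr(self) -> str:
        # Mirrors the repr-building pattern visible in the diff context.
        s = f"input_size={self.input_size}, output_size={self.output_size}"
        s += f", reduce_results={self.reduce_results}"
        return s
```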
Actually, do you really need this class? It seems it is not used anywhere in the model code.
Fixed
Force-pushed 9a74856 to 7c327f4
Marking the PR as draft, as the code for the unquantised MoE may be incorrect.
No description provided.