Skip to content

Conversation

@Nexesenex
Copy link
Owner

No description provided.

@Nexesenex Nexesenex merged this pull request into Nexesenex:lcpp_pr_cuda_rope_fix_back Jan 15, 2025
3 of 4 checks passed
Nexesenex pushed a commit that referenced this pull request May 24, 2025
* imatrix: collect layer influence statistics

* imatrix: collect layer influence statiscs also for the last layer

For the last layer we need to use the input for the output.weight
tensor. Last layer(s) tend(s) to be important, so it is useful to also
have its influence metric.

* imatrix: separate metric for attention and ffn importance

* Use stripped tensor name, not src0->name

---------

Co-authored-by: Iwan Kawrakow <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant