Equation used to compute relative affinity for new sequences using curated MotifCentral models

Hi

Congratulations on impressive and hugely useful work - both the ProBound model and MotifCentral database!

I am trying to understand how exactly the relative affinities for new sequences are computed using curated MotifCentral models - e.i. what `bindingModeScores` computes in this line:

```
proBoundTools -c 'loadMotifCentralModel(15412).addNScoring().inputTXT(seq.txt).bindingModeScores(/dev/stdout)'
```

I struggle to understand which equation is used to compute relative affinity as a functions of A) PSAM (presumably stored in [MotifCentral.v1.0.0.json](https://motifcentral-resources.s3.us-west-2.amazonaws.com/MotifCentral.v1.0.0.json)) `w_{motif length, 4 nucleotides}`, and B) new one-hot encoded sequence `s_{total length, 4 nucleotides}`. Specifically, what is the function/equation that's used to compute one relative affinity for one offset?
```
affinity =  function(w_{motif length, 4 nucleotides}`, `s_{motif length, 4 nucleotides}`)
```

I see that this computation is done in 
[slidePN](https://github.com/BussemakerLab/ProBoundTools/blob/a79b75f0c08cac461fb438186b81882cdff599d0/ProBoundTools/src/main/java/sequenceTools/SlidingWindow.java#L255) and that it is related to Eq 5 in the paper [methods section](https://www.nature.com/articles/s41587-022-01307-0#Sec10): 
<img width="769" alt="Screenshot 2022-12-11 at 00 38 15" src="https://user-images.githubusercontent.com/22567383/206881009-927b37a7-4fa3-4963-907e-0306490ad466.png">

However, I don't understand these 2 terms below are related to PSAM and the new sequence - is beta_a = PSAM and X(S) = the new sequence? 
<img width="76" alt="Screenshot 2022-12-11 at 00 39 20" src="https://user-images.githubusercontent.com/22567383/206881034-ec1c72cd-7d1c-41b5-a394-a2f89d9529f0.png">

Could you please explain this in a bit more detail, ideally writing pseudocode for `affinity =  function(w_{motif length, 4 nucleotides}`, `s_{motif length, 4 nucleotides}`)`?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Equation used to compute relative affinity for new sequences using curated MotifCentral models #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Equation used to compute relative affinity for new sequences using curated MotifCentral models #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions