Hi there,
Thanks for developing RGI.
Regarding the perfect, strict, and loose hits, Jia et al. 2017 mentions:
The RGI currently supports two detection model types (Protein Homolog and Protein Variant) and analyzes sequences under three paradigms—Perfect, Strict, and Loose (a.k.a. Discovery). The Perfect algorithm is most often applied to clinical surveillance as it detects perfect matches to the curated reference sequences and mutations in CARD.
Could you provide a more technical definition of these terms? For instance, for 'perfect', I understand from #101 that the self-mapping bit-scores determine the 'perfect' hits. Could you provide more detail on how the 'strict' and 'loose' thresholds are chosen?
Thanks in advance,
Ali