will attention sequence length has restrictions？

Hi，
We noticed in the TK implementation that a warpgroup must have 4 warps, and each warp processes a seq_len of 16. Does this imply that there are some restrictions on the seq_len of q when performing attention calculations?
Thank you！