Token length of sentence pairs in data-clipped

Hi, I'm trying to fine-tune on the training data in generated_data/data-clipped. I'm using RoBERTa, but I found that many sentence pairs still exceed the 512-token limit. I currently format each sentence pair as <s>text</s></s>claim</s>. Am I doing something wrong?
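To make the question concrete, here is a minimal sketch of how the pair layout above adds up against the limit. It works on already-tokenized id lists, so it does not depend on any particular tokenizer. The special token ids (0 for <s>, 2 for </s>) are the usual roberta-base values but are an assumption here, `pair_length` and `build_pair` are hypothetical helper names, and the longest-first trimming is just one possible strategy, not something the dataset prescribes.

```python
from typing import List

# Assumed values: roberta-base uses 0 for <s> and 2 for </s>; limit is 512.
BOS_ID, EOS_ID, MAX_LEN = 0, 2, 512
NUM_SPECIAL = 4  # <s>, </s>, </s>, </s>


def pair_length(text_ids: List[int], claim_ids: List[int]) -> int:
    """Total length of <s>text</s></s>claim</s>, special tokens included."""
    return len(text_ids) + len(claim_ids) + NUM_SPECIAL


def build_pair(text_ids: List[int], claim_ids: List[int],
               max_len: int = MAX_LEN) -> List[int]:
    """Build the pair, trimming the longer segment first until it fits."""
    text_ids, claim_ids = list(text_ids), list(claim_ids)  # do not mutate inputs
    budget = max_len - NUM_SPECIAL
    while len(text_ids) + len(claim_ids) > budget:
        # Drop one token from the end of whichever segment is longer.
        if len(text_ids) >= len(claim_ids):
            text_ids.pop()
        else:
            claim_ids.pop()
    return [BOS_ID] + text_ids + [EOS_ID, EOS_ID] + claim_ids + [EOS_ID]
```

With this layout the four special tokens count toward the 512, so the text and claim together can use at most 508 tokens. A pair with 500 text tokens and 20 claim tokens has `pair_length` 524 and would be trimmed down to 512 by `build_pair`.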
Thank you in advance!