1. During the training process, do you retain the history of previous turns for the same scene? Is each turn treated independently during training? 2. If the history is retained, how is it implemented in the model? Are there any specific mechanisms or architectures that enable this? 3. Also, is there a plan to release the training code for this model? It would be helpful for further understanding and experimentation.