-
Notifications
You must be signed in to change notification settings - Fork 14
Open
Description
Great work! However, I noticed that there are some missing details regarding the model training in the paper (or perhaps I overlooked them; were they disclosed elsewhere?). I would greatly appreciate it if you could clarify them:
- Which version of the data was used to train the MiraDiT model (the results in Table 3)? Was it the 330K, 93K, 42K, or 9K version?
- Which version of the captions was used during training? Was it the short, dense, or structural captions?
- What is the total amount of training data?
Thank you in advance for your response!
Metadata
Metadata
Assignees
Labels
No labels