-
Notifications
You must be signed in to change notification settings - Fork 554
Batch Mode + Maintain Aspect Ratio + Multi-GPU Random Seed + Fixed Multi-GPU CuSolver Error + Fixed 20-min Load Time + Video Input #31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
python3 generate_video_df.py --model_id ${model_id} --resolution 540P --ar_step 0 --base_num_frames 97 --num_frames 177 --overlap_history 17 --addnoise_condition 20 --offload --prompt 'A woman in a leather jacket and sunglasses riding a vintage motorcycle through a desert highway at sunset, her hair blowing wildly in the wind as the motorcycle kicks up dust, with the golden sun casting long shadows across the barren landscape.' 'A woman flies into space'
Added batch mode, added option to keep original aspect ratio, synchronized seeds on multi-gpu.
…nized randomized seeds on multi-gpu, exposed negative_prompt option.
… resolving merge conflicts.
… chaojie and fixed merged conflicts.
… chaojie and fixed merged conflicts.
… chaojie and fixed merged conflicts.
… chaojie and fixed merged conflicts.
… chaojie and fixed merged conflicts.
… chaojie and fixed merged conflicts.
I have the latest code however i still cant generate video wth multiple prompt in sequence any extra params that i need to pass ? |
What is your commandline prompt and what is the error when you try to run? |
no error just that video is not rendering second part hmm I am runnning via gradio 'A woman in a leather jacket and sunglasses riding a vintage motorcycle through a desert highway at sunset, her hair blowing wildly in the wind as the motorcycle kicks up dust, with the golden sun casting long shadows across the barren landscape.' 'A woman flies into space' let me run that generate_video_df.py direclty and see rather than my custum gradio to wrap around |
I'm not familiar with gradio - have you tried running it via command line like the example on the readme? A lot of the work for this fork's changes is in the generate py file, so you'd have to replicate those into your custom code. What is your num_frames and base_num_frames? It'd be good to know your full parameters list. Each additional prompt is assigned to a chunk, with # of chunks = num_frames/base_num_frames. So you need to make sure you have enough chunks to make it to the next prompt. If you're running the fork directly via command line you can also see the debug saying what prompt/chunk it is currently on to see. |
alright going to use exact as ur setting , my video length was 5 sec so only less frame i guess i will set the video to 10 sec .. thanks for your help will keep messing :) |
Thanks - it's hard to be sure on that issue - I didn't directly set out to solve it. I just know my Runpod instance was crashing with 4xA40s previously and afterwards it did not. I wonder if it is partly because you still have RAM to spare so maybe the system is not as stringent on clearing the memory. |
@pftq thanks for your fast reply. I guess the existing difference of RAM manager strategy between runpod and autodl where i run my code caused it. |
That's not an error message from anywhere in this code repo - if you are embedding this in a custom script or environment, you would need to look there for the issue. Additionally that is the multi-gpu code, which is quite complex so I don't recommend embedding that in another wrapper. |
Either (1) merge chaojie's PR first and then this one or (2) merge this one only, which includes chaojie already as a PR-merge to this PR. What won't work is merging this one first and then chaojie's after.
I updated the generate_video files to support the following:
This also includes and cleanly integrates chaojie's fork (#12):
Let me know if there is anything you guys want changed for the PR. I still think you guys have the best open-source model so far, just that it's really hard for an average user to get good results without a lot of debugging, so I'm happy to help out.
Multi-GPU with video input and prompt travel, batch of 10, preserving aspect ratio.
Change --video "video.mp4" to --image "image.jpg" if you want to load a starting image instead.
Single GPU with video input and prompt travel, batch of 10, preserving aspect ratio.
Change --video "video.mp4" to --image "image.jpg" if you want to load a starting image instead.