Skip to content

Actions: Lightning-AI/pytorch-lightning

Actions

Probot

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,748 workflow runs
1,748 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Support get optimizer and lr_schedulers from deepspeed config
Probot #30661: Issue #19860 labeled by SkafteNicki
September 5, 2025 11:46 15s
September 5, 2025 11:46 15s
Lightning stalls with 2 GPUs on 1 node with SLURM (and apptainer)
Probot #30660: Issue #19883 labeled by SkafteNicki
September 5, 2025 11:46 15s
September 5, 2025 11:46 15s
Lightning stalls with 2 GPUs on 1 node with SLURM (and apptainer)
Probot #30659: Issue #19883 labeled by SkafteNicki
September 5, 2025 11:46 16s
September 5, 2025 11:46 16s
EarlyStopping override disrupts wandb logging frequency
Probot #30658: Issue #19990 labeled by SkafteNicki
September 5, 2025 11:44 14s
September 5, 2025 11:44 14s
EarlyStopping override disrupts wandb logging frequency
Probot #30657: Issue #19990 labeled by SkafteNicki
September 5, 2025 11:44 16s
September 5, 2025 11:44 16s
September 5, 2025 11:41 12s
September 5, 2025 11:41 15s
Make it easier to setup a multi-line progress bar
Probot #30654: Issue #20608 labeled by SkafteNicki
September 5, 2025 11:41 14s
September 5, 2025 11:41 14s
Make it easier to setup a multi-line progress bar
Probot #30653: Issue #20608 labeled by SkafteNicki
September 5, 2025 11:41 11s
September 5, 2025 11:41 11s
16 bit precision in Trainer leading to NaN
Probot #30652: Issue #20619 labeled by SkafteNicki
September 5, 2025 11:40 13s
September 5, 2025 11:40 13s
16 bit precision in Trainer leading to NaN
Probot #30651: Issue #20619 labeled by SkafteNicki
September 5, 2025 11:40 13s
September 5, 2025 11:40 13s
Fabric run CLI cannot launch python module
Probot #30650: Issue #20654 labeled by SkafteNicki
September 5, 2025 11:38 11s
September 5, 2025 11:38 11s
ModelCheckpoint not saving best model
Probot #30649: Issue #20657 labeled by SkafteNicki
September 5, 2025 11:38 16s
September 5, 2025 11:38 16s
ModelCheckpoint not saving best model
Probot #30648: Issue #20657 labeled by SkafteNicki
September 5, 2025 11:38 17s
September 5, 2025 11:38 17s
Non-reproducible results with num_workers=0
Probot #30647: Issue #20679 labeled by SkafteNicki
September 5, 2025 11:36 12s
September 5, 2025 11:36 12s
self.manual_backward() makes all gradients gone
Probot #30646: Issue #20685 labeled by SkafteNicki
September 5, 2025 11:36 13s
September 5, 2025 11:36 13s
self.manual_backward() makes all gradients gone
Probot #30645: Issue #20685 labeled by SkafteNicki
September 5, 2025 11:36 13s
September 5, 2025 11:36 13s
Abnormally slow both single-gpu & DDP training, what is the problem here?
Probot #30644: Issue #20702 labeled by SkafteNicki
September 5, 2025 11:32 14s
September 5, 2025 11:32 14s
multi-node training runs crash because ddp_weakref is None during backward
Probot #30643: Issue #20706 labeled by SkafteNicki
September 5, 2025 11:30 17s
September 5, 2025 11:30 17s
multi-node training runs crash because ddp_weakref is None during backward
Probot #30642: Issue #20706 labeled by SkafteNicki
September 5, 2025 11:30 18s
September 5, 2025 11:30 18s
diff-svc(winerror3 when the training starts)
Probot #30641: Issue #20849 labeled by SkafteNicki
September 5, 2025 11:29 18s
September 5, 2025 11:29 18s
Can we have an LLM.txt?
Probot #30640: Issue #20758 labeled by SkafteNicki
September 5, 2025 11:27 16s
September 5, 2025 11:27 16s
Can we have an LLM.txt?
Probot #30639: Issue #20758 labeled by SkafteNicki
September 5, 2025 11:27 15s
September 5, 2025 11:27 15s
mark_forward_method does not work with ModelParallelStrategy
Probot #30638: Issue #20710 labeled by SkafteNicki
September 5, 2025 11:27 17s
September 5, 2025 11:27 17s
mark_forward_method does not work with ModelParallelStrategy
Probot #30637: Issue #20710 labeled by SkafteNicki
September 5, 2025 11:27 20s
September 5, 2025 11:27 20s