-
Yes, MLX definitely supports using multiple optimizers. The simplest way is probably with `MultiOptimizer`. Here's a pretty bare-bones example which gets at how to do this:

```python
import mlx.core as mx
import mlx.nn as nn
import mlx.optimizers as optim

# Two SGD optimizers with different learning rates.
o1 = optim.SGD(learning_rate=1)
o2 = optim.SGD(learning_rate=2)

# Parameters whose path matches the filter go to o1;
# everything else falls through to o2.
opt = optim.MultiOptimizer([o1, o2], [lambda p, _: "model1" in p])

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.model1 = mx.array(1.0)
        self.model2 = mx.array(2.0)

model = Model()
grads = {"model1": mx.array(1.0), "model2": mx.array(2.0)}
opt.update(model, grads)
```
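To make the routing above concrete, here is a minimal pure-Python sketch of how filter-based gradient routing can work, assuming the first-match-wins convention with the last optimizer acting as a fallback for unmatched parameters. The helpers `route_grads` and `apply_sgd` are illustrative names, not MLX API:

```python
# Illustrative sketch only: route_grads and apply_sgd are hypothetical
# helpers showing the routing idea, not part of mlx.optimizers.

def route_grads(grads, filters):
    """Partition a flat {path: grad} dict into one group per optimizer.

    A parameter goes to the first filter it matches; anything left
    over lands in the final (fallback) group.
    """
    groups = [{} for _ in range(len(filters) + 1)]
    for path, g in grads.items():
        for i, f in enumerate(filters):
            if f(path, g):
                groups[i][path] = g
                break
        else:
            groups[-1][path] = g
    return groups

def apply_sgd(params, grads, lr):
    """Plain SGD step for one group: w <- w - lr * grad."""
    for path, g in grads.items():
        params[path] = params[path] - lr * g

params = {"model1": 1.0, "model2": 2.0}
grads = {"model1": 1.0, "model2": 2.0}

# One filter, two groups: "model1" matches, "model2" falls through.
g1, g2 = route_grads(grads, [lambda p, _: "model1" in p])
apply_sgd(params, g1, lr=1)
apply_sgd(params, g2, lr=2)
print(params)  # {'model1': 0.0, 'model2': -2.0}
```

Each parameter ends up stepped by exactly one optimizer, which is the behavior the `MultiOptimizer` example above relies on.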
-
Hi,
I have a training scenario where multiple models share a single loss. For performance reasons, I've implemented a custom algorithm that computes the gradients for all model parameters in a single pass, and I'd like to update each model using its own optimizer.
I’d like to ask:
Does MLX support this kind of training setup?
What is the recommended way to implement this in MLX?
Are there any plans to support higher-level abstractions for multi-model training with separate optimizers?
Thank you for the great work—MLX is a pleasure to use!