Add MNIST training provenance collection example #3

sandlbn · 2025-05-15T23:24:16Z

This PR adds example demonstrating how to collect provenance data from an ML workflow using Atlas CLI. The example tracks a full MNIST training pipeline including dataset download, model training, and evaluation.

marcelamelara

thanks @sandlbn ! LGTM, I'll approve once I run the example

examples/mnist/README.md

marcelamelara · 2025-05-21T00:28:35Z

examples/mnist/README.md

+    --storage-url=http://localhost:8080
+```
+
+## Troubleshooting


this is awesome, I wish more docs had such a section :)

marcelamelara · 2025-05-21T00:32:36Z

examples/mnist/README.md

+1. Extend the Pipeline: Add data preprocessing steps, hyperparameter tuning, or model optimization
+2. Track Experiments: Create manifests for different training runs with varying parameters
+3. Build CI/CD Integration: Automatically collect provenance in your ML pipeline
+4. Create Visualizations: Use the provenance graph to create visual representations of your ML workflow
+5. Implement Governance: Use provenance data for model approval and deployment decisions


Have you tested this example in TDX? If so, we should add a line about running inside of TDX either down here, or higher up in the doc to explain the use of the with-tdx feature

It would be cool to add it in the second iteration. No, I don’t test it with TDX. I think the best way to enable TDX is to add additional parameters to the script. But I think we can add this as an issue at the moment.

tracking this in #7

examples/mnist/pyproject.toml

Co-authored-by: Marcela Melara <[email protected]>

marcelamelara

Thanks for the fixes, LGTM

Add MNIST training provenance collection example

f1e5a76

sandlbn assigned sebszyller and marcelamelara May 15, 2025

Updating outdated torch dependencies

2205220

marcelamelara reviewed May 21, 2025

View reviewed changes

sandlbn and others added 2 commits May 21, 2025 08:07

Update examples/mnist/README.md

fefeeb5

Co-authored-by: Marcela Melara <[email protected]>

Add Running the Workflow section

4f3ed15

This was referenced May 21, 2025

Create Terms & Definitions doc #6

Open

Add ability to run examples on TDX #7

Closed

Update examples/mnist/pyproject.toml

8d5a321

Co-authored-by: Marcela Melara <[email protected]>

marcelamelara approved these changes May 27, 2025

View reviewed changes

marcelamelara merged commit c4a1a47 into main May 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add MNIST training provenance collection example #3

Add MNIST training provenance collection example #3

Uh oh!

sandlbn commented May 15, 2025

Uh oh!

marcelamelara left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

marcelamelara May 21, 2025

Uh oh!

marcelamelara May 21, 2025

Uh oh!

sandlbn May 21, 2025

Uh oh!

marcelamelara May 21, 2025

Uh oh!

Uh oh!

marcelamelara left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add MNIST training provenance collection example #3

Add MNIST training provenance collection example #3

Uh oh!

Conversation

sandlbn commented May 15, 2025

Uh oh!

marcelamelara left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

marcelamelara May 21, 2025

Choose a reason for hiding this comment

Uh oh!

marcelamelara May 21, 2025

Choose a reason for hiding this comment

Uh oh!

sandlbn May 21, 2025

Choose a reason for hiding this comment

Uh oh!

marcelamelara May 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

marcelamelara left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants