Skip to content

Conversation

@sandlbn
Copy link
Contributor

@sandlbn sandlbn commented May 15, 2025

This PR adds example demonstrating how to collect provenance data from an ML workflow using Atlas CLI. The example tracks a full MNIST training pipeline including dataset download, model training, and evaluation.

Copy link
Contributor

@marcelamelara marcelamelara left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks @sandlbn ! LGTM, I'll approve once I run the example

--storage-url=http://localhost:8080
```

## Troubleshooting
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is awesome, I wish more docs had such a section :)

Comment on lines +379 to +383
1. Extend the Pipeline: Add data preprocessing steps, hyperparameter tuning, or model optimization
2. Track Experiments: Create manifests for different training runs with varying parameters
3. Build CI/CD Integration: Automatically collect provenance in your ML pipeline
4. Create Visualizations: Use the provenance graph to create visual representations of your ML workflow
5. Implement Governance: Use provenance data for model approval and deployment decisions
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Have you tested this example in TDX? If so, we should add a line about running inside of TDX either down here, or higher up in the doc to explain the use of the with-tdx feature

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be cool to add it in the second iteration. No, I don’t test it with TDX. I think the best way to enable TDX is to add additional parameters to the script. But I think we can add this as an issue at the moment.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tracking this in #7

Copy link
Contributor

@marcelamelara marcelamelara left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the fixes, LGTM

@marcelamelara marcelamelara merged commit c4a1a47 into main May 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants