@sahusiddharth (Contributor) commented Oct 12, 2025

Enhancements

Resolves: #8384

This enhancement introduces support for adding a version description when:

  • Creating a new dataset version
  • Adding new examples to an existing dataset

This is now available via the Python SDK.

Example Usage

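from phoenix.client import Client

# Assumes a reachable Phoenix server and default client configuration
px_client = Client()

# Append examples and attach a description to the resulting dataset version
px_client.datasets.add_examples_to_dataset(
    dataset="dataset name",
    inputs=[...],
    outputs=[...],
    dataset_version_description="version description",
)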

Minimal Implementation

Use this Colab notebook

Result Preview

Below is a screenshot of the dataset version UI after executing the example above:

[Image: Screenshot 2025-10-12 at 11 35 48 AM]

Note

Adds an optional version description when appending examples via Python SDK (sync/async) and persists it in dataset versions server-side.

  • Python SDK (client):
    • Datasets.add_examples_to_dataset(...): new dataset_version_description param; forwarded as dataset_description to both _upload_tabular_dataset and _upload_json_dataset.
    • AsyncDatasets.add_examples_to_dataset(...): same addition and forwarding.
  • Server/DB:
    • add_dataset_examples(...): pass description to insert_dataset_version(...) so new versions store the description.
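
The forwarding path summarized above can be sketched with in-memory stand-ins. This is only an illustration, not the Phoenix implementation: `FakeServer`, `FakeDatasets`, and `DatasetVersion` are hypothetical classes, and the method signatures are simplified assumptions that mirror only the names mentioned in this PR (`dataset_version_description`, `dataset_description`, `add_dataset_examples`, `insert_dataset_version`).

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class DatasetVersion:
    """Hypothetical stand-in for a stored dataset version row."""
    version_id: int
    description: Optional[str]
    num_examples: int


@dataclass
class FakeServer:
    """Hypothetical in-memory stand-in for the server/DB layer."""
    versions: List[DatasetVersion] = field(default_factory=list)

    def insert_dataset_version(
        self, description: Optional[str], num_examples: int
    ) -> DatasetVersion:
        version = DatasetVersion(len(self.versions) + 1, description, num_examples)
        self.versions.append(version)
        return version

    def add_dataset_examples(
        self, examples: List[Dict[str, Any]], description: Optional[str] = None
    ) -> DatasetVersion:
        # Pass the description through so the new version stores it.
        return self.insert_dataset_version(description, len(examples))


class FakeDatasets:
    """Hypothetical stand-in for the client-side datasets resource."""

    def __init__(self, server: FakeServer) -> None:
        self._server = server

    def _upload_json_dataset(
        self,
        inputs: List[Dict[str, Any]],
        outputs: List[Dict[str, Any]],
        dataset_description: Optional[str] = None,
    ) -> DatasetVersion:
        examples = [{"input": i, "output": o} for i, o in zip(inputs, outputs)]
        return self._server.add_dataset_examples(
            examples, description=dataset_description
        )

    def add_examples_to_dataset(
        self,
        *,
        dataset: str,
        inputs: List[Dict[str, Any]],
        outputs: List[Dict[str, Any]],
        dataset_version_description: Optional[str] = None,
    ) -> DatasetVersion:
        # The new optional parameter is forwarded under the helper's own name.
        return self._upload_json_dataset(
            inputs, outputs, dataset_description=dataset_version_description
        )
```

Because the parameter defaults to `None`, callers that omit it behave as before in this sketch: a version is still created, just without a description.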

Written by Cursor Bugbot for commit 0c444e2.

@sahusiddharth sahusiddharth requested a review from a team as a code owner October 12, 2025 06:20
@github-project-automation github-project-automation bot moved this to 📘 Todo in phoenix Oct 12, 2025
@dosubot dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Oct 12, 2025
@dosubot dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Oct 12, 2025
@sahusiddharth (Contributor, Author) commented

Hi @axiomofjoy and @anticorrelator,

At your convenience, please review this PR and let me know if any revisions are necessary.

Development

Successfully merging this pull request may close these issues.

[datasets][client][enhancement] add optional description when creating a dataset version via python sdk