feat(ui): SAM2 Node & Integration #8526
Merged
Conversation
There was a really confusing aspect of the SAM pipeline classes: they accepted deeply nested lists of different dimensions (bbox, points, and labels). The lengths of the lists are related; each point must have a corresponding label, and if bboxes are provided alongside points, they must be the same length. I've refactored the backend API to take a single list of SAMInput objects. This class has a bbox and/or a list of points, making it much simpler to provide the right shape of inputs. Internally, the pipeline classes rejigger these input objects into the correct nesting. The Nodes still have an awkward API where you can provide bboxes and points of different lengths, so I added a pydantic validator that enforces correct lengths.
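To illustrate the shape of the refactored API described above, here is a minimal Python sketch. It is not the actual InvokeAI implementation (the real class is a pydantic model, and field names may differ); it just shows why a flat list of `SAMInput` objects is easier to get right than parallel nested lists — each point carries its own label, so points and labels can never go out of sync, and the nesting the pipeline needs is derived internally.

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class SAMPoint:
    x: int
    y: int
    label: int  # 1 = include, 0 = exclude; the label travels with its point

@dataclass
class SAMInput:
    """One segmentation prompt: a bbox and/or a list of labeled points."""
    bbox: Optional[tuple[int, int, int, int]] = None  # (x1, y1, x2, y2)
    points: list[SAMPoint] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Mirrors the validation the real pydantic model would enforce.
        if self.bbox is None and not self.points:
            raise ValueError("SAMInput needs a bbox, points, or both")

def to_nested_lists(inputs: list[SAMInput]):
    """Rejigger flat SAMInput objects into the nested lists a SAM
    pipeline expects: boxes, point coordinates, and point labels."""
    boxes = [list(i.bbox) for i in inputs if i.bbox is not None]
    pts = [[[p.x, p.y] for p in i.points] for i in inputs if i.points]
    labels = [[p.label for p in i.points] for i in inputs if i.points]
    return boxes or None, pts or None, labels or None
```

Because the label is a field of the point, the "each point must have a corresponding label" invariant holds by construction rather than by cross-checking list lengths.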
Revised the Select Object feature to support two input modes:
- Visual mode: combined points and bounding box input for paired SAM inputs
- Prompt mode: text-based object selection (unchanged)

Key changes:
- Replaced three input types (points, prompt, bbox) with two (visual, prompt)
- Visual mode supports both point and bbox inputs simultaneously
- Click to add include points, Shift+click for exclude points
- Click and drag to draw a bounding box
- Fixed bbox visibility issues when adding points
- Fixed coordinate-system issues for proper bbox positioning
- Added proper event handling and interaction controls

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
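The visual-mode interaction rules above can be sketched as two tiny helpers. This is an illustrative Python sketch, not the actual frontend code (which lives in TypeScript); the function names and the drag threshold are assumptions.

```python
INCLUDE, EXCLUDE = 1, 0  # SAM point labels

def click_label(shift_pressed: bool) -> int:
    """Plain click adds an include point; Shift+click adds an exclude point."""
    return EXCLUDE if shift_pressed else INCLUDE

def is_bbox_drag(start: tuple[int, int], end: tuple[int, int], threshold: int = 4) -> bool:
    """A press-move-release beyond a small threshold is treated as drawing a
    bounding box; anything shorter is treated as a point click."""
    dx, dy = end[0] - start[0], end[1] - start[1]
    return dx * dx + dy * dy > threshold * threshold
```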
Fixed an issue where bounding boxes could grow exponentially when created at small sizes. The problem occurred because the Konva Transformer modifies scaleX/scaleY rather than width/height directly, and the scale values weren't consistently reset after being applied to the dimensions.

Changes:
- Ensure scale values are always reset to 1 after applying them to the dimensions
- Add minimum size constraints to prevent zero/negative dimensions
- Fix scale handling in transformend, dragend, and initial bbox creation

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
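The scale-reset idea behind this fix can be shown in a few lines. This is a language-neutral sketch in Python (the real handler is Konva/TypeScript): bake the transformer's scale into width/height, clamp to a minimum size, and return the scale to 1 so the next transform starts from identity instead of compounding.

```python
def apply_transform(width: float, height: float,
                    scale_x: float, scale_y: float,
                    min_size: float = 1.0):
    """Bake scale into dimensions, then reset scale to 1.
    Without the reset, the next transform multiplies the already-scaled
    size again, so a small bbox grows exponentially."""
    new_w = max(width * scale_x, min_size)   # clamp prevents zero/negative sizes
    new_h = max(height * scale_y, min_size)
    return new_w, new_h, 1.0, 1.0            # scale always returns to identity
```

Applying the function twice with the reset scale is a no-op, which is exactly the invariant the bug violated.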
When the middle mouse button is used for canvas panning, the pointerup event was still creating points in the segmentation module. Added a button check to the onBboxDragEnd handler so it only processes left clicks.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
Added button checks to the bbox rect and transformer mousedown/touchstart handlers so they only process left clicks. Also added a stage-dragging check in onBboxDragMove to clear the bbox drag state when middle-mouse panning is active.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
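The guard described in these two commits reduces to a single predicate. A hedged Python sketch (the real code is a TypeScript event handler; the function name is hypothetical), using the DOM `MouseEvent.button` convention:

```python
LEFT_BUTTON = 0  # DOM MouseEvent.button: 0 = left, 1 = middle, 2 = right

def should_create_point(button: int, stage_is_dragging: bool) -> bool:
    """Only create segmentation points for left-clicks, and never while
    the stage is being panned (e.g. with the middle mouse button)."""
    return button == LEFT_BUTTON and not stage_is_dragging
```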
Need to add translation strings for the new functionality.
Force-pushed from e2fafc2 to a97ee7a.
This actually works fine for SAM.
psychedelicious approved these changes on Sep 11, 2025.
Labels
backend
PRs that change backend files
frontend
PRs that change frontend files
invocations
PRs that change invocations
python
PRs that change python files
python-deps
PRs that change python dependencies
Root
Summary
This pull request introduces support for the Segment Anything 2 (SAM2) model in the backend, enabling advanced image segmentation capabilities. It adds a new invocation for SAM2, a pipeline wrapper for model management, updates the frontend integration to use the new model, and bumps the transformers dependency to ensure SAM2 availability.

Backend: Segment Anything 2 (SAM2) integration & Node
Frontend: Canvas module update
segment-anything-large) and changed mask filtering to include all detected masks (this seemed to produce better results in the testing I did).

Dependency update
Updated the transformers library version to >=4.56.0 in pyproject.toml.

Related Issues / Discussions
Requests.
QA Instructions
Test out the SAM2 Node; ensure both bounding box inputs and points work correctly.
Merge Plan
Merge when ready. Ensure any/all necessary model changes are in place downstream.
Checklist
What's New copy (if doing a release after this PR)