Refactor convex collision code #752

kbayes · 2025-10-07T10:55:33Z

This refactor aligns the convex collision code to be in parity with C Mujoco code:

Merge primitive collision functions into convex collision detection (ccd)
Move collision table to collision_driver.py
Merge rest of hfield logic into collision_convex.py

adenzler-nvidia · 2025-10-08T11:19:42Z

@kbayes could you provide some context as to why this PR is necessary/your decision making? I fear this might make it even harder to have a separate collision library.

kbayes · 2025-10-08T11:36:09Z

@adenzler-nvidia yes, please take a closer look in case that I'm misunderstanding something. I was hoping that this PR would make things easier actually. How we're currently doing things is by having two endpoints for calling narrowphase: primitive and general. I think this simplifies things by having one endpoint: convex_narrowphase. And have its logic be to use a primitive test otherwise fallback to the general test. This removes a lot of redundant code, and the core functions aren't altered so I didn't think it would impact Newton.

adenzler-nvidia · 2025-10-08T13:47:05Z

understood - yes looking closer at this it doesn't change any of the core functions.

My separate feedback is that I'm unsure whether it's the right thing to launch a kernel per pair type for all the possible pair types - leading to n x naconmax threads being launched over the course of this, where before it was just naconmax threads for the entirety of the primitive collisions. There is a downside to this as well due to the branching involved, so the solution is probably somewhere in the middle?

kbayes · 2025-10-08T14:47:46Z

Yes the difference is that the primitive kernel runs a batch of naconmax and branches on each geom type pair while the convex kernel runs a batch of n x naconmax where n is statically the number of unique geom type pairs. I think we should do one or the other but not both.

erikfrey · 2025-10-09T21:28:15Z

@kbayes I'm a bit confused by the organization scheme here... it looks like you're putting all the narrowphase functions into collision_convex.py - but then should we call it narrowphase.py?

Stepping back, one of the nice properties of the various implementations of MuJoCo is that we (kind of, sort of) have parity in function and file names. I can look in engine_collision_driver.c and see more or less the same functionality as here in collision_driver.py - it would be nice to maintain that property.

Is there a way to maintain that property and still achieve what you're going for here? Feel free to grab me in person.

kbayes · 2025-10-20T13:51:45Z

@adenzler-nvidia @erikfrey did some benchmarking but found no difference in timing (that includes the geom diverse mixed.xml model).

In terms of where I'm going with this as this is a first step. This PR better aligns with the table logic in engine_collision_driver.c. The next steps will be to delete the legacy GJK logic, and move the kernel setup out of collision_convex and into collision_driver.py, and keep collision_convex.py has a shared util for the primitive and gjk code.

mujoco_warp/_src/collision_convex.py

mujoco_warp/_src/collision_driver.py

mujoco_warp/_src/collision_sdf.py

erikfrey

OK I think I see what's going on here, I think this is directionally good but still has a few architectural issues.

Probably the best starting point for discussion is: I think it might still be better to have distinct narrowphase helpers for functionally distinct parts of the code, e.g. primitive vs. convex vs sdf... not sure where hfield lives in that hierarchy, if it should be grouped with convex or not.

My main argument for this is that, apparently, kernel launches are somewhat sensitive to the number of parameters, and I've been warned by folks on the Warp team that some of our kernel launches are starting to break older CUDA versions that only supported smaller stack sizes.

So in general, prefer fewer inputs/outputs where possible/practical.

Besides that, any big change like this, please do run the benchmarks before and after and let us know if you're seeing changes eitehr to step time or JIT time. You can compare head to yours by doing:

to test your branch:

asv run ccd_refactor^!

against main:

asv run

Thanks for your patience - this is a big refactor, please grab me in chat if you think it'll save more thrashing.

mujoco_warp/_src/collision_driver.py

kbayes · 2025-10-23T13:25:28Z

From my perspective the user shouldn't care if a convex pair was handled by a primitive function or ccd (as long as it was handled in a performant and accurate manner). The current way duplicates logic to segregate geom pairs into two categories which doesn't necessary align with anything (i.e. box-box pairs can use ccd or the box-box primitive code). And it makes more sense to unify these into one collision function table just like in mujoco c. We could separate out the primitives into their own kernel launch but that doesn't change the size of input / outputs nor give us any improvements in performance (actually, there might be a slight edge to unify them), so why not just merge them? SDFs are fundamentally different and should be handled separately.

kbayes added 3 commits October 7, 2025 11:10

refactor

4bd5304

fix issue

a182e83

remove unused code

b8bb76b

kbayes changed the title ~~Merge primitive collision functions into convex collision detection (ccd).~~ Merge primitive collision functions into convex collision detection (ccd) Oct 7, 2025

kbayes added 2 commits October 7, 2025 11:58

update

176de1d

fix issues from sync

0108be2

kbayes requested a review from erikfrey October 7, 2025 11:34

kbayes added 5 commits October 15, 2025 18:22

sync

1301a19

fix

a64563d

merge

c7caa35

fix memory

37948bb

revert

1998d3c

kbayes added 4 commits October 21, 2025 11:24

move contact_params

0ab1fca

move convex_narrowphase to collision_driver

31d1c1d

delete hfield file

17003e4

lint

9b7522a

kbayes changed the title ~~Merge primitive collision functions into convex collision detection (ccd)~~ Refactor convex collision code Oct 21, 2025

thowell reviewed Oct 22, 2025

View reviewed changes

mujoco_warp/_src/collision_convex.py Show resolved Hide resolved

thowell reviewed Oct 22, 2025

View reviewed changes

mujoco_warp/_src/collision_driver.py Outdated Show resolved Hide resolved

thowell reviewed Oct 22, 2025

View reviewed changes

mujoco_warp/_src/collision_driver.py Outdated Show resolved Hide resolved

thowell reviewed Oct 22, 2025

View reviewed changes

mujoco_warp/_src/collision_sdf.py Show resolved Hide resolved

erikfrey requested changes Oct 22, 2025

View reviewed changes

mujoco_warp/_src/collision_driver.py Outdated Show resolved Hide resolved

kbayes added 2 commits October 23, 2025 12:18

Update collision table

a7e9314

rename and rework doc

8748ae6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor convex collision code #752

Refactor convex collision code #752

Uh oh!

kbayes commented Oct 7, 2025 •

edited

Loading

Uh oh!

adenzler-nvidia commented Oct 8, 2025

Uh oh!

kbayes commented Oct 8, 2025

Uh oh!

adenzler-nvidia commented Oct 8, 2025

Uh oh!

kbayes commented Oct 8, 2025

Uh oh!

erikfrey commented Oct 9, 2025

Uh oh!

kbayes commented Oct 20, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

erikfrey left a comment •

edited

Loading

Uh oh!

Uh oh!

kbayes commented Oct 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Refactor convex collision code #752

Are you sure you want to change the base?

Refactor convex collision code #752

Uh oh!

Conversation

kbayes commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adenzler-nvidia commented Oct 8, 2025

Uh oh!

kbayes commented Oct 8, 2025

Uh oh!

adenzler-nvidia commented Oct 8, 2025

Uh oh!

kbayes commented Oct 8, 2025

Uh oh!

erikfrey commented Oct 9, 2025

Uh oh!

kbayes commented Oct 20, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

erikfrey left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kbayes commented Oct 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kbayes commented Oct 7, 2025 •

edited

Loading

erikfrey left a comment •

edited

Loading