ENH: add function to get broadcast shape from a given set of shapes. #17535

madhulikajc · 2020-10-11T02:48:23Z

Add new function numpy.broadcast_shape which takes tuples
for the shapes to be broadcast against each other.
Return the broadcasted shape as a tuple.
See #17217

tillahoffmann · 2020-10-11T14:44:40Z

I was incidentally having a look at the same thing and ended up with the following implementation. It doesn't use the builtin functionality of numpy, but it also doesn't create any temporary arrays. Might be another option?

import itertools as it

def broadcast_shapes(*shapes):
    """
    Evaluate the shape due to broadcasting any number of shapes against each other.

    Parameters
    ----------
    `*shapes` : tuples
        The shapes to broadcast.

    Returns
    -------
    broadcasted_shape : tuple
        Shape of the array that would be obtained when broadcasting the shapes against each other.

    Examples
    --------
    >>> broadcast_shapes((2, 1), (1, 3))
    (2, 3)
    """
    shapes = [shape if isinstance(shape, tuple) else (shape,) for shape in shapes
              if shape is not None]
    maxdim = max(map(len, shapes))
    broadcast_shape = []
    for i, sizes in enumerate(it.zip_longest(*map(reversed, shapes), fillvalue=1)):
        sizes = set(sizes) | {1}
        if len(sizes) > 2:
            raise ValueError(f'unmatched dimensions at dimension {maxdim - i}')
        broadcast_shape.append(max(sizes))
    return tuple(reversed(broadcast_shape))

eric-wieser · 2020-10-11T15:31:30Z

@tillahoffmann, with #17535 (comment) the temporary arrays all have 0 bytes of data, so don't really cost anything

tillahoffmann · 2020-10-11T16:43:01Z

Excellent, let's avoid the duplication of code in that case. Looking forward to having this functionality available natively in numpy. @madhulikajc, thank you.

madhulikajc · 2020-10-11T16:52:41Z

Thank you @tillahoffmann and @eric-wieser. One thing I notice from @tillahoffmann's code is that an integer is accepted as an array shape. My code will also do this as np.empty() accepts integers and converts them into rank one arrays. But I do not explicitly test for this in my tests. I will add tests to check for this functionality.
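As a small illustration of the integer-input behaviour described above, the following sketch checks it against the function as it was eventually merged. It assumes NumPy 1.20 or later, where the function is exposed under the later name `np.broadcast_shapes`; the variable names are only for illustration.

```python
import numpy as np

# np.empty accepts a bare int and produces a 1-D array of that length.
int_shape = np.empty(3).shape

# Assumes NumPy >= 1.20, where the function is available as np.broadcast_shapes.
# An int is treated the same way as a one-element shape tuple.
mixed = np.broadcast_shapes(6, (5, 1))
all_tuples = np.broadcast_shapes((6,), (5, 1))

print(int_shape, mixed, all_tuples)
```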

eric-wieser · 2020-10-11T21:15:31Z

My only concern is that it might make sense to expose this more directly via a C API rather than with the strange workaround we've settled on here. That's always something that could be explored more in a future PR though.

madhulikajc · 2020-10-12T05:53:30Z

@eric-wieser, happy to explore exposing this functionality via a C API instead (i.e. the logic in PyArray_Broadcast). I had originally thought I would do this via a C API as well, but this ended up seeming simpler and cleaner (even though I agree it's a bit of an odd workaround). Let me know if you think we should proceed with this PR as-implemented and explore other options in future. If so, I can put together release notes and any other final housekeeping items. Thanks again for all your guidance here.

seberg · 2020-10-12T16:08:43Z

I like the plan to explore C (although I agree it is not absolutely necessary). Especially if we can consolidate a bit of code in PyArray_Broadcast to use the same in the end.

It is a bit unfortunate that I guess np.nditer is too complicated to consolidate here (broadcasting arrays can also be written as np.nditer(arrays, flags=["multi_index"]).shape). But that one supports too many other things we probably do not need (e.g. not broadcasting certain arrays, and nowadays, even broadcasting only specified axes).
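For readers unfamiliar with the `np.nditer` route mentioned above, here is a minimal sketch comparing it with `np.broadcast`. The example arrays are arbitrary and chosen only to make the shapes easy to follow.

```python
import numpy as np

# Two arrays whose shapes broadcast against each other.
arrays = [np.ones((2, 1)), np.ones((1, 3))]

# Tracking a multi-index is what lets the iterator report a shape.
nditer_shape = np.nditer(arrays, flags=["multi_index"]).shape

# np.broadcast reports the same shape with far less machinery.
broadcast_shape = np.broadcast(*arrays).shape

print(nditer_shape, broadcast_shape)
```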

charris · 2020-10-12T17:06:18Z

Needs a release note. Look in doc/release/upcoming_changes to see how it is done.

mhvk · 2020-10-12T20:03:38Z

On C vs python, I think it makes sense to merge this as is, since it is a helpful addition and any C implementation can just be slotted in anyway, and would then have the tests all ready.

seberg

We should probably just put this in before it gets stalled for no good reason. Since it is an API addition, might want to ping the mailing list, but honestly, it is small enough to do that after the fact.

Unfortunately, I can't help but bikeshed the name: Do we have a preference for broadcast_shapes vs. broadcast_shape? For some reason I feel the first may be better, so just in case everyone leans that way. If not, either name is completely fine to be honest.

madhulikajc · 2020-10-14T19:37:39Z

I prefer broadcast_shapes too actually. Curious how others feel. I guess it's a question of whether it's an implicit "get" as in "get_broadcast_shape" or "broadcast these input shapes". Using the plural feels consistent with other numpy functions like expand_dims etc.

mhvk · 2020-10-14T19:43:30Z

👍 to broadcast_shapes. After all, it is broadcast_arrays too.

seberg · 2020-10-14T19:51:53Z

I suppose the question of plural vs. singular is whether we are referring to the return value or the input value(s). Btw. noticed one possible addition to the test: Making sure that no input also returns () as a shape.
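The no-input case suggested above can be checked as follows. This is a sketch that assumes NumPy 1.20 or later, where the merged function is available as `np.broadcast_shapes`.

```python
import numpy as np

# Assumes NumPy >= 1.20. With no shapes at all, the result is the 0-d shape.
no_input = np.broadcast_shapes()

# Broadcasting against () leaves the other shape unchanged.
with_scalar_shape = np.broadcast_shapes((), (4, 1))

print(no_input, with_scalar_shape)
```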

mattip

LGTM. One tiny nit, but can be ignored.

As for moving to C: definitely should go to another PR. The chain goes

- add to numpy/core/multiarray/__all__
- add a C function to array_module_methods. There are lots of examples there on different arg parsing and signatures. If you are new to this, you might want to look at the Python docs together with the code examples.
- rummage around the code base looking at things like PyArray_Broadcast to see if it can be refactored

Maybe the correct way to reason about this is in the opposite order: there is no reason to move things into C just for fun if no real advantage can be realized.

eric-wieser · 2020-10-15T09:16:23Z

> Maybe the correct way to reason about this is in the opposite order: there is no reason to move things into C just for fun if no real advantage can be realized.

My thinking is we probably already have the code in C somewhere, and if we don't we might be able to refactor it out and use it elsewhere. It's certainly not worth a standalone C API if all it's used for is this function.

asmeurer · 2020-10-15T21:26:53Z

Thanks for adding this. This looks good to me. The dtype=[] trick looks good (taking Eric's word that it uses no memory), as it will keep things like error messages the same. Though perhaps if the logic in C is ever factored out, it would make more sense for this to call the C function directly.

mattip · 2020-10-16T10:36:59Z

One last thing. This should be indexed in the reference documents somewhere so it shows up when searching for broadcast_shapes in the rendered documentation (available by looking at the last CI run -> ci/circleci: build artifact -> details). The documentation team might have an idea where the best place for this would be, maybe under utility functions, the source of which is routines.other.rst?

eric-wieser · 2020-10-16T10:42:33Z

> taking Eric's word that it uses no memory

In case you want more than my word:

>>> np.zeros(1000000, dtype=[]).nbytes
0
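The approach discussed in this thread (allocating zero-byte temporaries and letting NumPy's existing broadcasting do the work) can be sketched roughly as below. This is an illustration under stated assumptions, not the merged code: `broadcast_shapes_sketch` is a hypothetical name, and it calls `np.broadcast` directly, which accepts only a limited number of arguments, so a complete version would need to handle longer argument lists.

```python
import numpy as np


def broadcast_shapes_sketch(*shapes):
    """Hypothetical helper: broadcast shapes via zero-byte temporary arrays."""
    # dtype=[] is a structured dtype with no fields, so each element takes
    # 0 bytes and the temporaries carry a shape but no data.
    arrays = [np.empty(shape, dtype=[]) for shape in shapes]
    # np.broadcast raises ValueError if the shapes are incompatible.
    return np.broadcast(*arrays).shape


result = broadcast_shapes_sketch((2, 1), (1, 3))
scratch_bytes = np.empty((1000, 1000), dtype=[]).nbytes
print(result, scratch_bytes)
```

Because the error comes from NumPy's own broadcasting machinery, a mismatch such as `broadcast_shapes_sketch((2, 3), (4, 3))` raises the same kind of `ValueError` that broadcasting real arrays would.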

rgommers · 2020-10-16T11:19:27Z

> The documentation team might have an idea where the best place for this would be, maybe under utility functions, the source of which is routines.other.rst?

Good point. That seems like a decent place. To make it more discoverable, adding an entry to the See Also sections in the docstrings of broadcast_to, broadcast_arrays and perhaps one or two more of the most related functions would be good. And vice versa, the docstring of this new function could also use a See Also.

rgommers

Docs also look good now. This is very nice to have, thanks @madhulikajc. Merging.

ENH: add function to get broadcast shape from a given set of shapes.

b340258

Add new function numpy.broadcast_shape which takes tuples for the shapes to be broadcast against each other. Return the broadcasted shape as a tuple. See numpy#17217

github-actions bot added the 01 - Enhancement label Oct 11, 2020

eric-wieser reviewed Oct 11, 2020

Perform array allocations of size 0 for provided shape tuples

12f4597

Co-authored-by: Eric Wieser

Test for int as input shape

bc5c7fb

Also update docstring to include both ints and tuples of ints as input

mhvk reviewed Oct 11, 2020

Remove unnecessary array_function_dispatch

b23f75e

Add missing set_module

98586b2

charris added the component: numpy.lib label Oct 12, 2020

charris reviewed Oct 12, 2020

charris added the 56 - Needs Release Note. Needs an entry in doc/release/upcoming_changes label Oct 12, 2020

Add release notes. Add versionadded to docstring.

f763cc7

Also fix up docstring details.

madhulikajc force-pushed the broadcast-shape branch from 7ae4588 to f763cc7 Compare October 13, 2020 05:37

seberg added the 62 - Python API Changes or additions to the Python API. Mailing list should usually be notified. label Oct 13, 2020

seberg approved these changes Oct 14, 2020

follow convention for trailing comma

6485ca3

Co-authored-by: Sebastian Berg

Change name to broadcast_shapes. Also add test case, and type hint.

7aeb9ef

mattip approved these changes Oct 15, 2020

eric-wieser reviewed Oct 15, 2020

eric-wieser reviewed Oct 15, 2020

madhulikajc and others added 3 commits October 15, 2020 09:15

follow convention

8b9c7de

Co-authored-by: Eric Wieser

Update docstring

8329cb2

Co-authored-by: Eric Wieser

Add reference to numpy docs on broadcasting to docstring

01cdbd5

Also move versionadded

WarrenWeckesser reviewed Oct 15, 2020

Fix spelling

07747ea

Co-authored-by: Warren Weckesser

Add broadcast_shapes to reference docs and add See Also sections

900bbdb

rgommers approved these changes Oct 17, 2020

rgommers merged commit 7b0a764 into numpy:master Oct 17, 2020

rgommers added this to the 1.20.0 release milestone Oct 17, 2020

rgommers removed the 56 - Needs Release Note. Needs an entry in doc/release/upcoming_changes label Oct 17, 2020

rgommers mentioned this pull request Oct 17, 2020

Function to get broadcast shape from a list of shapes #17217

Closed

kgryte mentioned this pull request Oct 29, 2020

List of APIs currently in (or proposed to be included in) the specification data-apis/array-api#43

Closed

madhulikajc deleted the broadcast-shape branch December 28, 2020 07:08

kgryte mentioned this pull request Feb 6, 2025

RFC: add broadcast_shapes to the specification data-apis/array-api#893

Open
