HUH: ``squeeze``ing scalars

Question

HUH: ``squeeze``ing scalars

Opened this issue 9 days ago · 7 comments

I wasn't sure if this should be an "ENH", "BUG", "TYP", "DEP", or "MAINT", so I opted for the sound that came out of my mouth when I first encountered this. Feel free to change it to one of those if you feel like it.

This just made me "huh" out loud:

>>> np.squeeze(1)
array(1)
>>> np.squeeze(np.array(1))
array(1)
>>> np.squeeze(np.int_(1))
np.int64(1)

So "scalar-likes" are not treated equally; some return numpy scalars, others return 0d arrays.

Besides this increasing the ever so important "huh/sec" rate ¹, it's also pretty annoying to express in the stubs. But before you panick and/or call the press, the squeeze stubs aren't incorrect ² ³, so it's not that big of a disaster.

Anyway, given the array-api's lack of scalar-types, and them being infuriatingly annoying for static typing, I think that I'm slightly leaning towards choosing 0d-arrays over scalars here. I.e., changing squeeze so that it always returns an instance of numpy.ndarray (or a _subtype thereof), even if you pass it a np.generic scalar-type thingy.

This would technically be a backwards-incompatible breaking change. But, considering that 0d arrays are mostly duck-type compatible, and that I don't see why anyone would want to squeeze their scalars in the first place, I doubt that many will be bothered by this change.

TLDR; Let's have numpy.squeeze always return an array

Is that a thing? Well, if it isn't; then I think it should. ↩
which, given the lack of comment about it, might just have been a "happy little accident" (🎨) ↩
they're technically not correct either, seeing as it can return Any ↩

Answer 1 · 2025-11-04T20:57:19.000Z

squeeze is one of those annoying functions that forwards to the method via obj.squeeze and if that fails convert to an array.

So the problem with changing this isn't changing scalars, it's that np.squeeze(dataframe) might work by calling dataframe.squeeze()...

Answer 2 · 2025-11-04T22:05:54.000Z

So the problem with changing this isn't changing scalars, it's that np.squeeze(dataframe) might work by calling dataframe.squeeze()...

But there's no builtins.int.squeeze() method, so why does it become an array?

Either way, this could then be "fixed" by changing np.generic.squeeze() to return a 0d-array, no?

Answer 3 · 2025-11-05T07:26:15.000Z

Hello! Can I work over this problem?

Answer 4 · 2025-11-05T08:11:38.000Z

We don't assign issues. Just be sure to link back to this issue in the PR so others will know about the PR. I think it would be prudent to wait a little more before diving in, to make sure the suggested fix will be acceptable.

Answer 5 · 2025-11-05T09:26:44.000Z

to return a 0d-array, no?

I suppose we can do that, yes. NumPy tries to use the method and if it doesn't exist converts to array first.

Answer 6 · 2025-11-06T11:48:21.000Z

@jorenham I was looking into the implementation of np.squeeze, and here’s how I thought we could address this issue:

def squeeze(a, axis=None):
 """
 Docstring
 """
+   if isinstance(a, np.generic):
+       a = np.asanyarray(a)

 try:
     squeeze = a.squeeze
 except AttributeError:
     return _wrapit(a, 'squeeze', axis=axis)
 if axis is None:
     return squeeze()
 else:
     return squeeze(axis=axis)

Is this the correct way to approach this problem or do you suggest to look even deeper?

Answer 7 · 2025-11-06T12:56:52.000Z

@jorenham I was looking into the implementation of np.squeeze, and here’s how I thought we could address this issue:
def squeeze(a, axis=None):
 """
 Docstring
 """
+   if isinstance(a, np.generic):
+       a = np.asanyarray(a)

 try:
     squeeze = a.squeeze
 except AttributeError:
     return _wrapit(a, 'squeeze', axis=axis)
 if axis is None:
     return squeeze()
 else:
     return squeeze(axis=axis)
Is this the correct way to approach this problem or do you suggest to look even deeper?

Assuming that we decide to indeed have squeeze always return ndarray, then something like this would probably be the way to implement it, yes. There might be slightly more efficient ways to convert a scalar to a 0d array, but I might be wrong.

Footnotes