Issue 29841: errors raised by bytes() and bytearray() for invalid size argument (original) (raw)
currently (on my Windows 10):
bytes(-1 << 1000) Traceback (most recent call last): File "", line 1, in OverflowError: cannot fit 'int' into an index-sized integer bytes(-1) Traceback (most recent call last): File "", line 1, in ValueError: negative count bytes(sys.maxsize + 1) Traceback (most recent call last): File "", line 1, in OverflowError: cannot fit 'int' into an index-sized integer
for the same size arguments, bytearray raises the same errors.
thus, in accordance with #29833 (this is a sub-issue of #29833) for each of the constructors of bytes and bytearray: 1. ValueErrors with the same error message should be raised for any negative size argument (big negative as well as small negative). 2. MemoryError should be raised for any size argument bigger than sys.maxsize.
Moreover, currently:
bytes(sys.maxsize - 25) Traceback (most recent call last): File "", line 1, in MemoryError bytes(sys.maxsize - 24) Traceback (most recent call last): File "", line 1, in OverflowError: byte string is too large bytes(sys.maxsize) Traceback (most recent call last): File "", line 1, in OverflowError: byte string is too large
for each of these size arguments, bytearray raises a MemoryError.
IMHO, to make the error messages more consistent, the constructor of bytes should raise a MemoryError for any too large size argument, as the constructor of bytearray already does.
I worked on this issue. The simplest solution is calling PyNumber_AsSsize_t() with NULL rather than PyExc_OverflowError in bytes and bytearray constructors. Then both constructors will raise ValueError for large negative size and bytearray() will raise MemoryError for large positive size. For raising MemoryError in bytes() we should change OverflowError to MemoryError in other place.
But this is not the only difference between bytes and bytearray.
bytearray(b'abcd') * sys.maxsize Traceback (most recent call last): File "", line 1, in MemoryError b'abcd' * sys.maxsize Traceback (most recent call last): File "", line 1, in OverflowError: repeated bytes are too long
This looks related and I think that it is worth to change OverflowError to MemoryError in the repetition operation. But 'abcd' * sys.maxsize raises OverflowError too, therefore we should change exception types in str.
Concatenation also can raise OverflowError. If change OverflowError to MemoryError in above operations, it should be changed for concatenation too.