cpython: 294b8a7957e9 (original) (raw)

--- a/Doc/reference/expressions.rst +++ b/Doc/reference/expressions.rst @@ -1036,10 +1036,6 @@ must be integers. .. _comparisons: -.. _is: -.. _is not: -.. _in: -.. _not in: Comparisons =========== @@ -1075,66 +1071,183 @@ Note that a op1 b op2 c doesn't impl c, so that, e.g., x < y > z is perfectly legal (though perhaps not pretty). +Value comparisons +----------------- + The operators <, >, ==, >=, <=, and != compare the -values of two objects. The objects need not have the same type. If both are -numbers, they are converted to a common type. Otherwise, the == and != -operators always* consider objects of different types to be unequal, while the -<, >, >= and <= operators raise a :exc:TypeError when -comparing objects of different types that do not implement these operators for -the given pair of types. You can control comparison behavior of objects of -non-built-in types by defining rich comparison methods like :meth:__gt__, -described in section :ref:customization. +values of two objects. The objects do not need to have the same type. + +Chapter :ref:objects states that objects have a value (in addition to type +and identity). The value of an object is a rather abstract notion in Python: +For example, there is no canonical access method for an object's value. Also, +there is no requirement that the value of an object should be constructed in a +particular way, e.g. comprised of all its data attributes. Comparison operators +implement a particular notion of what the value of an object is. One can think +of them as defining the value of an object indirectly, by means of their +comparison implementation. + +Because all types are (direct or indirect) subtypes of :class:object, they +inherit the default comparison behavior from :class:object. Types can +customize their comparison behavior by implementing +:dfn:rich comparison methods like :meth:__lt__, described in +:ref:customization. + +The default behavior for equality comparison (== and !=) is based on +the identity of the objects. Hence, equality comparison of instances with the +same identity results in equality, and equality comparison of instances with +different identities results in inequality. A motivation for this default +behavior is the desire that all objects should be reflexive (i.e. x is y +implies x == y). -Comparison of objects of the same type depends on the type: +A default order comparison (<, >, <=, and >=) is not provided; +an attempt raises :exc:TypeError. A motivation for this default behavior is +the lack of a similar invariant as for equality. - Numbers are compared arithmetically. +The behavior of the default equality comparison, that instances with different +identities are always unequal, may be in contrast to what types will need that +have a sensible definition of object value and value-based equality. Such +types will need to customize their comparison behavior, and in fact, a number +of built-in types have done that. -* The values :const:float('NaN') and :const:Decimal('NaN') are special.

They are identical to themselves, x is x but are not equal to themselves,
x != x. Additionally, comparing any value to a not-a-number value +The following list describes the comparison behavior of the most important +built-in types. + +* Numbers of built-in numeric types (:ref:typesnumeric) and of the standard

library types :class:fractions.Fraction and :class:decimal.Decimal can be
compared within and across their types, with the restriction that complex
numbers do not support order comparison. Within the limits of the types
involved, they compare mathematically (algorithmically) correct without loss
of precision. +
The not-a-number values :const:float('NaN') and :const:Decimal('NaN')
are special. They are identical to themselves (x is x is true) but
are not equal to themselves (x == x is false). Additionally,
comparing any number to a not-a-number value will return False. For example, both 3 < float('NaN') and float('NaN') < 3 will return False. -* Bytes objects are compared lexicographically using the numeric values of their

elements. +* Binary sequences (instances of :class:bytes or :class:bytearray) can be

compared within and across their types. They compare lexicographically using
the numeric values of their elements. + +* Strings (instances of :class:str) compare lexicographically using the
numerical Unicode code points (the result of the built-in function
:func:ord) of their characters. [#]_ +
Strings and binary sequences cannot be directly compared. -* Strings are compared lexicographically using the numeric equivalents (the

result of the built-in function :func:ord) of their characters. [#]_ String
and bytes object can't be compared +* Sequences (instances of :class:tuple, :class:list, or :class:range) can

be compared only within each of their types, with the restriction that ranges
do not support order comparison. Equality comparison across these types
results in unequality, and ordering comparison across these types raises
:exc:TypeError. +
Sequences compare lexicographically using comparison of corresponding
elements, whereby reflexivity of the elements is enforced. +
In enforcing reflexivity of elements, the comparison of collections assumes
that for a collection element x, x == x is always true. Based on
that assumption, element identity is compared first, and element comparison
is performed only for distinct elements. This approach yields the same
result as a strict element comparison would, if the compared elements are
reflexive. For non-reflexive elements, the result is different than for
strict element comparison, and may be surprising: The non-reflexive
not-a-number values for example result in the following comparison behavior
when used in a list:: -* Tuples and lists are compared lexicographically using comparison of

corresponding elements. This means that to compare equal, each element must
compare equal and the two sequences must be of the same type and have the same
length.

nan = float('NaN')
nan is nan
True
nan == nan
False <-- the defined non-reflexive behavior of NaN
[nan] == [nan]
True <-- list enforces reflexivity and tests identity first

Lexicographical comparison between built-in collections works as follows: +
- For two collections to compare equal, they must be of the same type, have
the same length, and each pair of corresponding elements must compare
equal (for example, [1,2] == (1,2) is false because the type is not the
same).

If not equal, the sequences are ordered the same as their first differing
elements. For example, [1,2,x] <= [1,2,y] has the same value as
x <= y. If the corresponding element does not exist, the shorter
sequence is ordered first (for example, [1,2] < [1,2,3]).

- Collections that support order comparison are ordered the same as their
first unequal elements (for example, [1,2,x] <= [1,2,y] has the same
value as x <= y). If a corresponding element does not exist, the
shorter collection is ordered first (for example, [1,2] < [1,2,3] is
true).

+ +* Mappings (instances of :class:dict) compare equal if and only if they have

equal (key, value) pairs. Equality comparison of the keys and elements
enforces reflexivity. +
Order comparisons (<, >, <=, and >=) raise :exc:TypeError. + +* Sets (instances of :class:set or :class:frozenset) can be compared within
and across their types. -* Mappings (dictionaries) compare equal if and only if they have the same

(key, value) pairs. Order comparisons ('<', '<=', '>=', '>')
raise :exc:TypeError.

They define order
comparison operators to mean subset and superset tests. Those relations do
not define total orderings (for example, the two sets {1,2} and {2,3}
are not equal, nor subsets of one another, nor supersets of one
another). Accordingly, sets are not appropriate arguments for functions
which depend on total ordering (for example, :func:min, :func:max, and
:func:sorted produce undefined results given a list of sets as inputs). +
Comparison of sets enforces reflexivity of its elements. + +* Most other built-in types have no comparison methods implemented, so they
inherit the default comparison behavior. -* Sets and frozensets define comparison operators to mean subset and superset

tests. Those relations do not define total orderings (the two sets {1,2}
and {2,3} are not equal, nor subsets of one another, nor supersets of one
another). Accordingly, sets are not appropriate arguments for functions
which depend on total ordering. For example, :func:min, :func:max, and
:func:sorted produce undefined results given a list of sets as inputs. +User-defined classes that customize their comparison behavior should follow +some consistency rules, if possible: + +* Equality comparison should be reflexive.

In other words, identical objects should compare equal: +
x is y implies x == y

+ +* Comparison should be symmetric.

In other words, the following expressions should have the same result: +
x == y and y == x

x != y and y != x

x < y and y > x

-* Most other objects of built-in types compare unequal unless they are the same

object; the choice whether one object is considered smaller or larger than
another one is made arbitrarily but consistently within one execution of a
program.

x <= y and y >= x

+ +* Comparison should be transitive.

The following (non-exhaustive) examples illustrate that: +
x > y and y > z implies x > z

x < y and y <= z implies x < z

+ +* Inverse comparison should result in the boolean negation.

In other words, the following expressions should have the same result: +
x == y and not x != y

x < y and not x >= y (for total ordering)

-Comparison of objects of differing types depends on whether either of the -types provide explicit support for the comparison. Most numeric types can be -compared with one another. When cross-type comparison is not supported, the -comparison method returns NotImplemented.

x > y and not x <= y (for total ordering)

The last two expressions apply to totally ordered collections (e.g. to
sequences, but not to sets or mappings). See also the
:func:~functools.total_ordering decorator. +Python does not enforce these consistency rules. In fact, the not-a-number +values are an example for not following these rules. + + +.. _in: +.. _not in: .. _membership-test-details: +Membership test operations +-------------------------- + The operators :keyword:in and :keyword:not in test for membership. x in[](#l1.238) s evaluates to true if x is a member of s, and false otherwise. x not[](#l1.239) in s returns the negation of x in s. All built-in sequences and set types @@ -1176,6 +1289,13 @@ The operator :keyword:not in is define operator: is not pair: identity; test + +.. _is: +.. is not: + +Identity comparisons +-------------------- + The operators :keyword:is and :keyword:is not test for object identity: x[](#l1.252) is y is true if and only if x and y are the same object. x is not y yields the inverse truth value. [#] @@ -1405,12 +1525,24 @@ precedence and have a left-to-right chai cases, Python returns the latter result, in order to preserve that divmod(x,y)[0] * y + x % y be very close to x. -.. [#] While comparisons between strings make sense at the byte level, they may

be counter-intuitive to users. For example, the strings "\u00C7" and
"\u0043\u0327" compare differently, even though they both represent the
same unicode character (LATIN CAPITAL LETTER C WITH CEDILLA). To compare
strings in a human recognizable way, compare using
:func:unicodedata.normalize. +.. [#] The Unicode standard distinguishes between :dfn:code points

(e.g. U+0041) and :dfn:abstract characters (e.g. "LATIN CAPITAL LETTER A").
While most abstract characters in Unicode are only represented using one
code point, there is a number of abstract characters that can in addition be
represented using a sequence of more than one code point. For example, the
abstract character "LATIN CAPITAL LETTER C WITH CEDILLA" can be represented
as a single :dfn:precomposed character at code position U+00C7, or as a
sequence of a :dfn:base character at code position U+0043 (LATIN CAPITAL
LETTER C), followed by a :dfn:combining character at code position U+0327
(COMBINING CEDILLA). +
The comparison operators on strings compare at the level of Unicode code
points. This may be counter-intuitive to humans. For example,
"\u00C7" == "\u0043\u0327" is False, even though both strings
represent the same abstract character "LATIN CAPITAL LETTER C WITH CEDILLA". +
To compare strings at the level of abstract characters (that is, in a way
intuitive to humans), use :func:unicodedata.normalize.

.. [#] Due to automatic garbage-collection, free lists, and the dynamic nature of descriptors, you may notice seemingly unusual behaviour in certain uses of

--- a/Misc/NEWS +++ b/Misc/NEWS @@ -224,6 +224,13 @@ Library Documentation ------------- +- Issue #12067: Rewrite Comparisons section in the Expressions chapter of the

language reference. Some of the details of comparing mixed types were
incorrect or ambiguous. NotImplemented is only relevant at a lower level
than the Expressions chapter. Added details of comparing range() objects,
and default behaviour and consistency suggestions for user-defined classes.
Patch from Andy Maier. +

Issue #24952: Clarify the default size argument of stack_size() in the "threading" and "_thread" modules. Patch from Mattip.