Protocols and structural subtyping - mypy 1.16.0+dev.bd1f51ac4de637ab1cbe9f633cdd401d0b520112 documentation

The Python type system supports two ways of deciding whether two objects are compatible as types: nominal subtyping and structural subtyping.

Nominal subtyping is strictly based on the class hierarchy. If class Dog inherits class Animal, it’s a subtype of Animal. Instances of Dog can be used when Animal instances are expected. This form of subtyping is what Python’s type system predominantly uses: it’s easy to understand, produces clear and concise error messages, and matches how the native isinstance check works – based on class hierarchy.

Structural subtyping is based on the operations that can be performed with an object. Class Dog is a structural subtype of class Animal if the former has all attributes and methods of the latter, and with compatible types.

Structural subtyping can be seen as a static equivalent of duck typing, which is well known to Python programmers. See PEP 544 for the detailed specification of protocols and structural subtyping in Python.
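As a rough illustration of the two kinds of subtyping, the sketch below uses hypothetical Animal, Dog, and Robot classes and a hypothetical CanSpeak protocol. None of these names come from mypy; they only show which calls a type checker would be expected to accept.

```python
from typing import Protocol


class Animal:
    def speak(self) -> str:
        return "..."


class Dog(Animal):  # Nominal subtype: Dog inherits from Animal
    def speak(self) -> str:
        return "Woof"


class CanSpeak(Protocol):  # Hypothetical protocol used for structural checks
    def speak(self) -> str: ...


class Robot:  # No base class, but it has a compatible speak() method
    def speak(self) -> str:
        return "Beep"


def greet_animal(animal: Animal) -> str:
    return animal.speak()


def greet_speaker(speaker: CanSpeak) -> str:
    return speaker.speak()


nominal = greet_animal(Dog())        # OK: Dog is a subclass of Animal
structural = greet_speaker(Robot())  # OK: Robot has a matching speak()
also_ok = greet_speaker(Dog())       # OK: Dog also matches structurally
# greet_animal(Robot())              # Expected to be rejected: Robot is not an Animal
```

The commented-out call runs fine at runtime thanks to duck typing; it is only the static check that is expected to reject it.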

Predefined protocols

The collections.abc, typing and other stdlib modules define various protocol classes that correspond to common Python protocols, such as Iterable[T]. If a class defines a suitable __iter__ method, mypy understands that it implements the iterable protocol and is compatible with Iterable[T]. For example, IntList below is iterable, over int values:

from __future__ import annotations

from collections.abc import Iterator, Iterable

class IntList:
    def __init__(self, value: int, next: IntList | None) -> None:
        self.value = value
        self.next = next

    def __iter__(self) -> Iterator[int]:
        # Annotated explicitly, since the loop later assigns an optional value
        current: IntList | None = self
        while current:
            yield current.value
            current = current.next

def print_numbered(items: Iterable[int]) -> None:
    for n, x in enumerate(items):
        print(n + 1, x)

x = IntList(3, IntList(5, None))
print_numbered(x)  # OK
print_numbered([4, 5])  # Also OK

Predefined protocol reference lists various protocols defined in collections.abc and typing and the signatures of the corresponding methods you need to define to implement each protocol.

Note

typing also contains deprecated aliases to protocols and ABCs defined in collections.abc, such as Iterable[T]. These are only necessary in Python 3.8 and earlier, since the protocols in collections.abc didn’t yet support subscripting ([]) in Python 3.8, but the aliases in typing have always supported subscripting. In Python 3.9 and later, the aliases in typing don’t provide any extra functionality.

Simple user-defined protocols

You can define your own protocol class by inheriting the special Protocol class:

from collections.abc import Iterable
from typing import Protocol

class SupportsClose(Protocol):
    # Empty method body (explicit '...')
    def close(self) -> None: ...

class Resource:  # No SupportsClose base class!

    def close(self) -> None:
        self.resource.release()

    # ... other methods ...

def close_all(items: Iterable[SupportsClose]) -> None:
    for item in items:
        item.close()

close_all([Resource(), open('some/file')])  # OK

Resource is a subtype of the SupportsClose protocol since it defines a compatible close method. Regular file objects returned by open() are similarly compatible with the protocol, as they support close().
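The Resource class above leaves its internals elided, so it cannot be run as written. A self-contained variant might look like the sketch below, where Recorder is a hypothetical stand-in for Resource and io.StringIO plays the role of the file object.

```python
import io
from collections.abc import Iterable
from typing import Protocol


class SupportsClose(Protocol):
    def close(self) -> None: ...


class Recorder:  # Hypothetical class; note there is no SupportsClose base class
    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


def close_all(items: Iterable[SupportsClose]) -> None:
    for item in items:
        item.close()


recorder = Recorder()
buffer = io.StringIO("in-memory file")  # File-like object that also has close()
close_all([recorder, buffer])
```

Both objects end up closed, even though neither class mentions SupportsClose anywhere in its definition.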

Defining subprotocols and subclassing protocols

You can also define subprotocols. Existing protocols can be extended and merged using multiple inheritance. Example:

# ... continuing from the previous example

class SupportsRead(Protocol):
    def read(self, amount: int) -> bytes: ...

class TaggedReadableResource(SupportsClose, SupportsRead, Protocol):
    label: str

class AdvancedResource(Resource):
    def __init__(self, label: str) -> None:
        self.label = label

    def read(self, amount: int) -> bytes:
        # some implementation
        ...

resource: TaggedReadableResource
resource = AdvancedResource('handle with care')  # OK

Note that inheriting from an existing protocol does not automatically turn the subclass into a protocol – it just creates a regular (non-protocol) class or ABC that implements the given protocol (or protocols). The Protocol base class must always be explicitly present if you are defining a protocol:

class NotAProtocol(SupportsClose):  # This is NOT a protocol
    new_attr: int

class Concrete:
    new_attr: int = 0

    def close(self) -> None:
        ...

# Error: nominal subtyping used by default
x: NotAProtocol = Concrete()  # Error!

You can also include default implementations of methods in protocols. If you explicitly subclass these protocols you can inherit these default implementations.
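A minimal sketch of a default implementation, assuming a hypothetical Greeter protocol: the explicit subclass inherits greet(), while the class that only matches structurally has to provide its own.

```python
from typing import Protocol


class Greeter(Protocol):  # Hypothetical protocol
    name: str

    def greet(self) -> str:
        # Default implementation, inherited only by explicit subclasses
        return f"Hello, {self.name}!"


class Explicit(Greeter):  # Explicit subclass: inherits greet()
    def __init__(self, name: str) -> None:
        self.name = name


class Implicit:  # Structural match only: must define greet() itself
    def __init__(self, name: str) -> None:
        self.name = name

    def greet(self) -> str:
        return f"Hi, {self.name}."


def welcome(greeter: Greeter) -> str:
    return greeter.greet()


explicit_result = welcome(Explicit("Ada"))
implicit_result = welcome(Implicit("Grace"))
```

Both classes are accepted wherever a Greeter is expected; only the source of the greet() implementation differs.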

Explicitly including a protocol as a base class is also a way of documenting that your class implements a particular protocol, and it forces mypy to verify that your class implementation is actually compatible with the protocol. In particular, omitting a value for an attribute or a method body will make it implicitly abstract:

class SomeProto(Protocol):
    attr: int  # Note, no right hand side
    def method(self) -> str: ...  # Literally just ... here

class ExplicitSubclass(SomeProto):
    pass

ExplicitSubclass()  # error: Cannot instantiate abstract class 'ExplicitSubclass'
                    # with abstract attributes 'attr' and 'method'

Similarly, explicitly assigning a value of your class's type to a variable annotated with the protocol type can be a way to ask the type checker to verify that your class implements a protocol:

from typing import cast

_proto: SomeProto = cast(ExplicitSubclass, None)

Invariance of protocol attributes

A common issue with protocols is that protocol attributes are invariant. For example:

class Box(Protocol):
    content: object

class IntBox:
    content: int

def takes_box(box: Box) -> None: ...

takes_box(IntBox())  # error: Argument 1 to "takes_box" has incompatible type "IntBox"; expected "Box"
                     # note:  Following member(s) of "IntBox" have conflicts:
                     # note:      content: expected "object", got "int"

This is because Box defines content as a mutable attribute. Here’s why this is problematic:

def takes_box_evil(box: Box) -> None:
    box.content = "asdf"  # This is bad, since box.content is supposed to be an object

my_int_box = IntBox()
takes_box_evil(my_int_box)
my_int_box.content + 1  # Oops, TypeError!

This can be fixed by declaring content to be read-only in the Box protocol using @property:

class Box(Protocol):
    @property
    def content(self) -> object: ...

class IntBox:
    content: int

def takes_box(box: Box) -> None: ...

takes_box(IntBox())  # OK
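A runnable variant of the read-only approach could look like the following sketch. It assumes a dataclass-based IntBox so that the box can be constructed with a value, and it renames the protocol to ReadOnlyBox (a hypothetical name) to keep it distinct from the mutable Box.

```python
from dataclasses import dataclass
from typing import Protocol


class ReadOnlyBox(Protocol):  # Hypothetical name for the read-only protocol
    @property
    def content(self) -> object: ...


@dataclass
class IntBox:
    content: int  # A plain mutable attribute satisfies a read-only property


def describe(box: ReadOnlyBox) -> str:
    # Reading is fine; assigning to box.content here should be rejected statically
    return f"box holding {box.content!r}"


description = describe(IntBox(42))  # OK: the int is only ever read as an object
```

Because the protocol never writes to content, accepting a box whose content is a narrower type cannot break the box's own invariants.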

Recursive protocols

Protocols can be recursive (self-referential) and mutually recursive. This is useful for declaring abstract recursive collections such as trees and linked lists:

from __future__ import annotations

from typing import Protocol

class TreeLike(Protocol):
    value: int

    @property
    def left(self) -> TreeLike | None: ...

    @property
    def right(self) -> TreeLike | None: ...

class SimpleTree:
    def __init__(self, value: int) -> None:
        self.value = value
        self.left: SimpleTree | None = None
        self.right: SimpleTree | None = None

root: TreeLike = SimpleTree(0)  # OK
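To show what a recursive protocol makes possible, the sketch below adds a hypothetical tree_sum helper that works on anything tree-shaped. It uses Optional with quoted forward references instead of the __future__ import so that the block stands alone; that choice is an assumption of this sketch, not a requirement.

```python
from typing import Optional, Protocol


class TreeLike(Protocol):
    value: int

    @property
    def left(self) -> Optional["TreeLike"]: ...

    @property
    def right(self) -> Optional["TreeLike"]: ...


class SimpleTree:
    def __init__(self, value: int) -> None:
        self.value = value
        self.left: Optional[SimpleTree] = None
        self.right: Optional[SimpleTree] = None


def tree_sum(node: Optional[TreeLike]) -> int:
    """Hypothetical helper: add up all values in a TreeLike structure."""
    if node is None:
        return 0
    return node.value + tree_sum(node.left) + tree_sum(node.right)


root = SimpleTree(1)
root.left = SimpleTree(2)
root.right = SimpleTree(3)
root.left.left = SimpleTree(4)
total = tree_sum(root)
```

tree_sum never mentions SimpleTree, so any other class with value, left, and right members of compatible types could be passed in as well.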

Using isinstance() with protocols

You can use a protocol class with isinstance() if you decorate it with the @runtime_checkable class decorator. The decorator adds rudimentary support for runtime structural checks:

from typing import Protocol, runtime_checkable

@runtime_checkable
class Portable(Protocol):
    handles: int

class Mug:
    def __init__(self) -> None:
        self.handles = 1

def use(handles: int) -> None: ...

mug = Mug()
if isinstance(mug, Portable):  # Works at runtime!
    use(mug.handles)

isinstance() also works with the predefined protocols in typing such as Iterable.

Warning

isinstance() with protocols is not completely safe at runtime. For example, signatures of methods are not checked. The runtime implementation only checks that all protocol members exist, not that they have the correct type. issubclass() with protocols will only check for the existence of methods.

Note

isinstance() with protocols can also be surprisingly slow. In many cases, you’re better served by using hasattr() to check for the presence of attributes.
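The limitation described in the warning can be seen in a small sketch. WrongSignature and NoClose are hypothetical classes; the first has a close() method whose signature does not match the protocol, yet it still passes the runtime check.

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class SupportsClose(Protocol):
    def close(self) -> None: ...


class WrongSignature:
    # Statically incompatible with SupportsClose: close() needs an extra argument
    def close(self, force: bool) -> None:
        pass


class NoClose:
    pass


# The runtime check only looks for a member named "close"
passes_runtime_check = isinstance(WrongSignature(), SupportsClose)
fails_runtime_check = isinstance(NoClose(), SupportsClose)

# A plain hasattr() test answers the same "is the member there?" question
has_close = hasattr(WrongSignature(), "close")
```

Since both checks only establish that the member exists, the static check remains the place where signature mismatches are caught.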

Callback protocols

Protocols can be used to define flexible callback types that are hard (or even impossible) to express using the Callable[...] syntax, such as variadic, overloaded, and complex generic callbacks. They are defined with a special __call__ member:

from collections.abc import Iterable
from typing import Optional, Protocol

class Combiner(Protocol):
    def __call__(self, *vals: bytes, maxlen: int | None = None) -> list[bytes]: ...

def batch_proc(data: Iterable[bytes], cb_results: Combiner) -> bytes:
    for item in data:
        ...

def good_cb(*vals: bytes, maxlen: int | None = None) -> list[bytes]:
    ...
def bad_cb(*vals: bytes, maxitems: int | None) -> list[bytes]:
    ...

batch_proc([], good_cb)  # OK
batch_proc([], bad_cb)   # Error! Argument 2 has incompatible type because of
                         # different name and kind in the callback
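The bodies in the example above are elided. A runnable sketch of the same idea might look like this, where the body of batch_proc and the truncate_each callback are hypothetical and chosen only so that the result can be checked.

```python
from collections.abc import Iterable
from typing import Optional, Protocol


class Combiner(Protocol):
    def __call__(self, *vals: bytes, maxlen: Optional[int] = None) -> list[bytes]: ...


def batch_proc(data: Iterable[bytes], cb_results: Combiner) -> bytes:
    # Hypothetical body: pass every item to the callback and join what it returns
    return b"".join(cb_results(*data, maxlen=2))


def truncate_each(*vals: bytes, maxlen: Optional[int] = None) -> list[bytes]:
    # Matches Combiner: same parameter names, kinds, and defaults
    return [val[:maxlen] for val in vals]


result = batch_proc([b"abc", b"defg"], truncate_each)
```

A plain Callable[..., list[bytes]] annotation would accept the callback too, but it would not record that the callback takes variadic bytes arguments and a keyword-only maxlen.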

Callback protocols and Callable types can be used mostly interchangeably. Parameter names in __call__ methods must be identical, unless the parameters are positional-only. Example (using the legacy syntax for generic functions):

from collections.abc import Callable
from typing import Protocol, TypeVar

T = TypeVar('T')

class Copy(Protocol):
    # '/' marks the end of positional-only parameters
    def __call__(self, origin: T, /) -> T: ...

copy_a: Callable[[T], T]
copy_b: Copy

copy_a = copy_b  # OK
copy_b = copy_a  # Also OK

Predefined protocol reference

Iteration protocols

The iteration protocols are useful in many contexts. For example, they allow iteration of objects in for loops.

collections.abc.Iterable[T]

The example above has a simple implementation of an __iter__ method.

def __iter__(self) -> Iterator[T]

See also Iterable.

collections.abc.Iterator[T]

def __next__(self) -> T
def __iter__(self) -> Iterator[T]

See also Iterator.
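As a sketch of the Iterator protocol, the hypothetical Countdown class below defines both methods and can therefore be passed where an Iterator[int] is expected.

```python
from collections.abc import Iterator


class Countdown:
    """Hypothetical iterator that counts down from start to 1."""

    def __init__(self, start: int) -> None:
        self.current = start

    def __iter__(self) -> "Countdown":
        # An iterator returns itself from __iter__
        return self

    def __next__(self) -> int:
        if self.current <= 0:
            raise StopIteration
        value = self.current
        self.current -= 1
        return value


def first(items: Iterator[int]) -> int:
    return next(items)


countdown = Countdown(3)
first_value = first(countdown)  # Consumes the first item
remaining = list(countdown)     # Iteration continues from where it stopped
```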

Collection protocols

Many of these are implemented by built-in container types such as list and dict, and these are also useful for user-defined collection objects.

collections.abc.Sized

This is a type for objects that support len(x).

def __len__(self) -> int

See also Sized.

collections.abc.Container[T]

This is a type for objects that support the in operator.

def __contains__(self, x: object) -> bool

See also Container.

collections.abc.Collection[T]

def __len__(self) -> int
def __iter__(self) -> Iterator[T]
def __contains__(self, x: object) -> bool

See also Collection.
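A minimal sketch of a user-defined collection, assuming a hypothetical EvenNumbers class: defining the three methods listed above is enough for it to be accepted as a Collection[int].

```python
from collections.abc import Collection, Iterator


class EvenNumbers:
    """Hypothetical collection of the non-negative even numbers below a limit."""

    def __init__(self, limit: int) -> None:
        self.limit = limit

    def __len__(self) -> int:
        return (self.limit + 1) // 2

    def __iter__(self) -> Iterator[int]:
        return iter(range(0, self.limit, 2))

    def __contains__(self, x: object) -> bool:
        return isinstance(x, int) and 0 <= x < self.limit and x % 2 == 0


def summarize(items: Collection[int]) -> str:
    return f"{len(items)} items, contains 4: {4 in items}"


evens = EvenNumbers(10)
summary = summarize(evens)
```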

One-off protocols

These protocols are typically only useful with a single standard library function or class.

collections.abc.Reversible[T]

This is a type for objects that support reversed(x).

def __reversed__(self) -> Iterator[T]

See also Reversible.
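The hypothetical Playlist class below sketches a Reversible implementation. It also defines __iter__, on the assumption that Reversible extends Iterable and therefore expects both methods.

```python
from collections.abc import Iterator, Reversible


class Playlist:
    """Hypothetical class that supports reversed() via __reversed__."""

    def __init__(self, tracks: list[str]) -> None:
        self.tracks = tracks

    def __iter__(self) -> Iterator[str]:
        return iter(self.tracks)

    def __reversed__(self) -> Iterator[str]:
        return reversed(self.tracks)


def last_first(items: Reversible[str]) -> list[str]:
    return list(reversed(items))


reversed_tracks = last_first(Playlist(["a", "b", "c"]))
```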

typing.SupportsAbs[T]

This is a type for objects that support abs(x). T is the type of value returned by abs(x).

def __abs__(self) -> T

See also SupportsAbs.

typing.SupportsBytes

This is a type for objects that support bytes(x).

def __bytes__(self) -> bytes

See also SupportsBytes.

typing.SupportsComplex

This is a type for objects that support complex(x). Note that no arithmetic operations are supported.

def __complex__(self) -> complex

See also SupportsComplex.

typing.SupportsFloat

This is a type for objects that support float(x). Note that no arithmetic operations are supported.

def __float__(self) -> float

See also SupportsFloat.

typing.SupportsInt

This is a type for objects that support int(x). Note that no arithmetic operations are supported.

def __int__(self) -> int

See also SupportsInt.

typing.SupportsRound[T]

This is a type for objects that support round(x).

def __round__(self) -> T

See also SupportsRound.
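The conversion protocols can be sketched with a hypothetical Celsius class that implements __float__ and __int__. The functions accept it through SupportsFloat and SupportsInt without knowing the concrete type.

```python
import math
from typing import SupportsFloat, SupportsInt


class Celsius:
    """Hypothetical temperature type convertible with float() and int()."""

    def __init__(self, degrees: float) -> None:
        self.degrees = degrees

    def __float__(self) -> float:
        return self.degrees

    def __int__(self) -> int:
        return math.floor(self.degrees)


def to_float(value: SupportsFloat) -> float:
    return float(value)


def to_int(value: SupportsInt) -> int:
    return int(value)


as_float = to_float(Celsius(21.5))
as_int = to_int(Celsius(21.5))
# Celsius(1.0) + Celsius(2.0) would fail: the protocols promise no arithmetic
```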

Async protocols

These protocols can be useful in async code. See Typing async/await for more information.

collections.abc.Awaitable[T]

def __await__(self) -> Generator[Any, None, T]

See also Awaitable.

collections.abc.AsyncIterable[T]

def __aiter__(self) -> AsyncIterator[T]

See also AsyncIterable.

collections.abc.AsyncIterator[T]

def __anext__(self) -> Awaitable[T]
def __aiter__(self) -> AsyncIterator[T]

See also AsyncIterator.
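A minimal sketch of the async iteration protocols, assuming a hypothetical Ticker class: it defines __aiter__ and __anext__ and is consumed with an async comprehension.

```python
import asyncio
from collections.abc import AsyncIterable


class Ticker:
    """Hypothetical async iterator yielding 0, 1, ..., count - 1."""

    def __init__(self, count: int) -> None:
        self.count = count
        self.current = 0

    def __aiter__(self) -> "Ticker":
        return self

    async def __anext__(self) -> int:
        if self.current >= self.count:
            raise StopAsyncIteration
        value = self.current
        self.current += 1
        await asyncio.sleep(0)  # Yield control to the event loop
        return value


async def collect(items: AsyncIterable[int]) -> list[int]:
    return [item async for item in items]


ticks = asyncio.run(collect(Ticker(3)))
```

Declaring __anext__ with async def makes it return an awaitable, which is what the Awaitable[T] return type in the signature above describes.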

Context manager protocols

There are two protocols for context managers – one for regular context managers and one for async ones. These allow defining objects that can be used in with and async with statements.

contextlib.AbstractContextManager[T]

def __enter__(self) -> T
def __exit__(self,
             exc_type: type[BaseException] | None,
             exc_value: BaseException | None,
             traceback: TracebackType | None) -> bool | None

See also AbstractContextManager.

contextlib.AbstractAsyncContextManager[T]

def __aenter__(self) -> Awaitable[T]
def __aexit__(self,
              exc_type: type[BaseException] | None,
              exc_value: BaseException | None,
              traceback: TracebackType | None) -> Awaitable[bool | None]

See also AbstractAsyncContextManager.
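The regular context manager protocol can be sketched as follows. Tracker is a hypothetical class that records the order of calls; it does not inherit from AbstractContextManager and is expected to match it structurally. The async variant follows the same shape with __aenter__ and __aexit__.

```python
from contextlib import AbstractContextManager
from types import TracebackType
from typing import Optional


class Tracker:
    """Hypothetical context manager that records the order of calls."""

    def __init__(self) -> None:
        self.events: list[str] = []

    def __enter__(self) -> "Tracker":
        self.events.append("enter")
        return self

    def __exit__(
        self,
        exc_type: Optional[type[BaseException]],
        exc_value: Optional[BaseException],
        traceback: Optional[TracebackType],
    ) -> Optional[bool]:
        self.events.append("exit")
        return None  # Do not suppress exceptions


def run(manager: AbstractContextManager["Tracker"]) -> list[str]:
    with manager as tracker:
        tracker.events.append("body")
    return tracker.events


events = run(Tracker())
```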