KittiFlow — Torchvision 0.22 documentation (original) (raw)

class torchvision.datasets.KittiFlow(root: ~typing.Union[str, ~pathlib.Path], split: str = 'train', transforms: ~typing.Optional[~typing.Callable] = None, loader: ~typing.Callable[[str], ~typing.Any] = <function default_loader>)[source]

KITTI dataset for optical flow (2015).

The dataset is expected to have the following structure:

root KittiFlow testing image_2 training image_2 flow_occ

Parameters:

Special-members:

__getitem__(index: int) → Union[Tuple[Image, Image, Optional[ndarray], Optional[ndarray]], Tuple[Image, Image, Optional[ndarray]]][source]

Return example at given index.

Parameters:

index (int) – The index of the example to retrieve

Returns:

A 4-tuple with (img1, img2, flow, valid_flow_mask)where valid_flow_mask is a numpy boolean mask of shape (H, W) indicating which flow values are valid. The flow is a numpy array of shape (2, H, W) and the images are PIL images. flow and valid_flow_mask are None ifsplit="test".

Return type:

tuple