Computer Vision - MATLAB & Simulink (original) (raw)

Extend deep learning workflows with computer vision applications

Apply deep learning to computer vision applications by using Deep Learning Toolbox™ together with the Computer Vision Toolbox™.

Apps

Image Labeler Label images for computer vision applications
Video Labeler Label video for computer vision applications

Functions

expand all

boxLabelDatastore Datastore for bounding box label data
pixelLabelDatastore Datastore for pixel label data
visionTransformer Pretrained vision transformer (ViT) neural network (Since R2023b)
patchEmbeddingLayer Patch embedding layer (Since R2023b)
unet Create U-Net convolutional neural network for semantic segmentation (Since R2024a)
unet3d Create 3-D U-Net convolutional neural network for semantic segmentation of volumetric images (Since R2024a)
deeplabv3plus Create DeepLab v3+ convolutional neural network for semantic image segmentation (Since R2024a)
rtmdetObjectDetector Detect objects using RTMDet object detector (Since R2024b)
yolov4ObjectDetector Detect objects using YOLO v4 object detector (Since R2022a)
yolov2ObjectDetector Detect objects using YOLO v2 object detector
yolov3ObjectDetector Detect objects using YOLO v3 object detector (Since R2021a)
ssdObjectDetector Detect objects using SSD deep learning detector
solov2 Segment objects using SOLOv2 instance segmentation network (Since R2023b)
maskrcnn Detect objects using Mask R-CNN instance segmentation (Since R2021b)
posemaskrcnn Predict object pose using Pose Mask R-CNN pose estimation (Since R2024a)
reidentificationNetwork Re-identification deep learning network for re-identifying and tracking objects (Since R2024a)
yoloxObjectDetector Detect objects using YOLOX object detector (Since R2023b)
efficientADAnomalyDetector Detect anomalies using EfficientAD network (Since R2024b)
patchCoreAnomalyDetector Detect anomalies using PatchCore network (Since R2023a)
fcddAnomalyDetector Detect anomalies using fully convolutional data description (FCDD) network for anomaly detection (Since R2022b)
fastFlowAnomalyDetector Detect anomalies using FastFlow network (Since R2023a)
detectTextCRAFT Detect texts in images by using CRAFT deep learning model (Since R2022a)

Topics

Image Classification

Object Detection and Instance Segmentation

Automated Visual Inspection

Semantic Segmentation

Video Classification

New

Identify Defects in Air Compressors Using Spectrogram Images

Identify Defects in Air Compressors Using Spectrogram Images

Detect and localize defects in acoustic recordings of air compressors using Mel spectrogram images and an EfficientAD anomaly detector.

(Computer Vision Toolbox)

Detect Small Objects Using Tiled Training of YOLOX Network

Detect Small Objects Using Tiled Training of YOLOX Network

Detect small objects in full-resolution images using tiled training of a you only look once version X (YOLOX) deep learning network.

(Computer Vision Toolbox)

Automatically Label Ground Truth Using Segment Anything Model

Automatically Label Ground Truth Using Segment Anything Model

Produce pixel labels for semantic segmentation using the Segment Anything Model (SAM) in the Image Labeler app. The SAM is an automatic segmentation technique that you can use to segment object regions to label with just a few clicks, or automatically segment the entire image and instantaneously create labels for selected regions. In this example, you interactively label pixels for semantic segmentation in two ways.

(Computer Vision Toolbox)

Detect Defects Using Tiled Training of EfficientAD Anomaly Detector

Detect Defects Using Tiled Training of EfficientAD Anomaly Detector

Detect and localize defects on anomalous chewing gum images by training an EfficientAD anomaly detection network on tiled normal images.

(Computer Vision Toolbox)

Localize Industrial Defects Using PatchCore Anomaly Detector

Localize Industrial Defects Using PatchCore Anomaly Detector

Perform localization of anomalous defects in printed circuit boards (PCBs) using anomaly heat maps generated with the PatchCore anomaly detector.

(Computer Vision Toolbox)

Detect Defects on Printed Circuit Boards Using YOLOX Network

Detect Defects on Printed Circuit Boards Using YOLOX Network

Detect, localize, and classify defects in printed circuit boards (PCBs) using a you only look once version X (YOLOX) deep learning network.

(Computer Vision Toolbox)

Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning

Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning

Perform six degrees-of-freedom (6-DoF) pose estimation by estimating the 3-D position and orientation of machine parts in a bin using RGB-D images and a deep learning network.

Reidentify People Throughout a Video Sequence Using ReID Network

Reidentify People Throughout a Video Sequence Using ReID Network

Track people throughout a video sequence using re-identification with a residual network.

Perform Instance Segmentation Using SOLOv2

Perform Instance Segmentation Using SOLOv2

Segment object instances of randomly rotated machine parts in a bin using a deep learning SOLOv2 network.

(Computer Vision Toolbox)

Object Detection Using YOLO v2 Deep Learning

Object Detection Using YOLO v2 Deep Learning

Train a you only look once (YOLO) v2 object detector.

Object Detection Using SSD Deep Learning

Object Detection Using SSD Deep Learning

Train a Single Shot Detector (SSD).

Object Detection Using YOLO v4 Deep Learning

Object Detection Using YOLO v4 Deep Learning

Detect objects in images using you only look once version 4 (YOLO v4) deep learning network. In this example, you will

Perform Instance Segmentation Using Mask R-CNN

Perform Instance Segmentation Using Mask R-CNN

Segment individual instances of people and cars using a multiclass mask region-based convolutional neural network (R-CNN).

Semantic Segmentation Using Deep Learning

Semantic Segmentation Using Deep Learning

Segment an image using a semantic segmentation network.

Generate Image from Segmentation Map Using Deep Learning

Generate Image from Segmentation Map Using Deep Learning

Generate a synthetic image of a scene from a semantic segmentation map.

Estimate Body Pose Using Deep Learning

Estimate Body Pose Using Deep Learning

Estimate the body pose of one or more people using the OpenPose algorithm.

Activity Recognition from Video and Optical Flow Data Using Deep Learning

Activity Recognition from Video and Optical Flow Data Using Deep Learning

First shows how to perform activity recognition using a pretrained Inflated 3-D (I3D) two-stream convolutional neural network based video classifier and then shows how to use transfer learning to train such a video classifier using RGB and optical flow data from videos [1].

Gesture Recognition using Videos and Deep Learning

Gesture Recognition using Videos and Deep Learning

Perform gesture recognition using a pretrained SlowFast video classifier.