Geometric Transformation in Image Processing (original) (raw)

Last Updated : 8 Jun, 2026

Geometric transformation is an image processing technique that changes the position of pixels in an image by mapping their coordinates from one location to another. It consists of two main steps:

Mathematical Representation

A pixel at coordinate (x,y) in the input image is mapped to a new coordinate (x', y') in the output image. ) in the output image. This transformation can be represented as:

\begin{bmatrix}\\x'\\y'\end{bmatrix}= T\begin{bmatrix}\\x\\y \end{bmatrix}= \begin{bmatrix}\\a &b \\c& d\end{bmatrix}\begin{bmatrix}\\x\\y \end{bmatrix}

where:

**Types of Geometric Transformations

Geometric transformations modify the position, orientation, size or shape of an image by changing the coordinates of its pixels.

Geometric Transformation in Image Processing

1. Translation

Translation shifts an image from one location to another without changing its size, shape, or orientation. Every pixel is moved by the same distance along the x-axis and y-axis.

**Applications: Image alignment, object tracking and image registration.

\begin{bmatrix}x' \\y'\end{bmatrix}=\begin{bmatrix}x + \Delta x \\y + \Delta y\end{bmatrix}

**2. Rotation

Rotation turns an image around a fixed point, usually the image center, by a specified angle. Positive angles rotate the image counterclockwise, while negative angles rotate it clockwise.

**Applications: Image correction, object recognition and computer vision systems.

\begin{bmatrix}x' \\y'\end{bmatrix}=\begin{bmatrix}\cos\theta & -\sin\theta \\\sin\theta & \cos\theta\end{bmatrix}\begin{bmatrix}x \\y\end{bmatrix}

**3. Scaling

Scaling changes the size of an image by enlarging or shrinking it. Uniform scaling changes both dimensions equally, while non-uniform scaling changes them independently.

**Applications: Image resizing, zooming and display optimization.

\begin{bmatrix}x' \\y'\end{bmatrix}=\begin{bmatrix}s_x & 0 \\0 & s_y\end{bmatrix}\begin{bmatrix}x \\y\end{bmatrix}

**4. Shearing (Skewing)

Shearing shifts pixels in one direction while keeping the other direction fixed, causing the image to appear slanted or tilted. Unlike rotation, it changes the angles between lines in the image.

**Applications: Perspective correction, image distortion and graphics design.

\begin{bmatrix}x' \\y'\end{bmatrix}=\begin{bmatrix}1 & k_y \\k_x & 1\end{bmatrix}\begin{bmatrix}x \\y\end{bmatrix}

**5. Affine Transformation

Affine transformation combines multiple transformations such as translation, rotation, scaling, and shearing into a single operation. It preserves straight lines and parallel lines, making it one of the most widely used geometric transformations.

**Applications: Image registration, object tracking, and image warping.

\begin{bmatrix}\\x'\\y'\\1 \end{bmatrix}=A\begin{bmatrix}\\x\\ y\\1\end{bmatrix}= \begin{bmatrix}a &b &c \\ d&e &f \\ 0& 0 &1 \end{bmatrix}\begin{bmatrix}\\x\\y\\1\end{bmatrix}

where:

**6. Perspective Transformation

Perspective transformation changes the viewpoint of an image by mapping points from one plane to another. It creates the effect of depth and distance, making objects appear closer or farther away.

**Applications: Image stitching, camera calibration, document scanning, and 3D reconstruction.

m \begin{bmatrix}\\x'\\ y'\\w'\end{bmatrix}=H\begin{bmatrix}\\x \\y\\1\end{bmatrix}=\begin{bmatrix}a &b &c \\ d &e &f \\ g &h &i \end{bmatrix}\begin{bmatrix}\\x\\y\\1\end{bmatrix}

Where:

To find x' and y' we need to normalize them with respect to w'. Thus the coordinate equations for perspective transformation are:

x'=\frac{ax+by+c}{gx+hy+i_{}}y'=\frac{dx+ey+f}{gx+hy+i}

**Applications