shadan khattak | COMSATS Institute of Information Technology (original) (raw)

Papers by shadan khattak

Research paper thumbnail of Bayesian Early Mode Decision Technique for View Synthesis Prediction-Enhanced Multiview Video Coding

IEEE Signal Processing Letters, 2013

View synthesis prediction (VSP) is a coding mode that predicts video blocks from synthesised fram... more View synthesis prediction (VSP) is a coding mode that predicts video blocks from synthesised frames. It is particularly useful in a multi-camera setup with large inter-camera distances. Adding a VSP-based SKIP mode to a standard Multiview Video Coding (MVC) framework improves the rate-distortion (RD) performance but increases the time complexity of the encoder. This letter proposes an early mode decision technique for VSP SKIP-enhanced MVC. Our method uses the correlation between the RD costs of the VSP SKIP mode in neighbouring views and Bayesian decision theory to reduce the number of candidate coding modes for a given macroblock. Simulation results showed that our technique can save up to 36.20% of the encoding time without any significant loss in RD performance.

Research paper thumbnail of Multimedia Privacy and Security Landscape in the Wake of AI/ML

CRC Press eBooks, Jun 10, 2022

Research paper thumbnail of Sensor Fusion for Identification of Freezing of Gait Episodes Using Wi-Fi and Radar Imaging

IEEE Sensors Journal, 2020

Research paper thumbnail of A Robust Method to Locate License Plates under Diverse Conditions

2019 International Conference on Applied and Engineering Mathematics (ICAEM), 2019

Automatic License Plate Detection (ALPD) is a crucial step, which significantly affects the recog... more Automatic License Plate Detection (ALPD) is a crucial step, which significantly affects the recognition rate and speed of the Intelligent Transport System (ITS). This paper presents a robust license plate detection method using an intelligent combination of Faster R-CNN and image processing operations. In the proposed method, initially, a vehicle is detected in the input colored RGB images using the Faster R-CNN. Next, the image with detected vehicle is fed to our developed License Plate Localization Module (LPLM) to search the possible existence of the license plate. The LPLM converts the detected vehicle image from RGB to the HSV domain and applies color segmentation along with morphological operations, and finally uses the dimensions analysis to locate the license plate. Simulations on the challenging PKU dataset reveal that the proposed technique outperforms recent state-of-the-art methods in terms of detection accuracy, precision, recall, and execution time.

Research paper thumbnail of A Robust Face Recognition Method for Occluded and Low-Resolution Images

Face images that appear in multimedia applications, such as digital entertainments usually exhibi... more Face images that appear in multimedia applications, such as digital entertainments usually exhibit dramatic nonuniform illumination, occlusions, low-resolution, and pose/expression variations that result in substantial performance degradation for traditional face recognition algorithms. Recent research is focused to develop robust face recognition algorithms to solve the aforementioned issues with maximum effort to mimic the human vision system. This paper presents a near real-time and novel face recognition method to recognize the occluded and low-resolution face images. Proposed face recognition algorithm initially uses 68 points to locate a face in the input image. Meanwhile, the adaptive boosting and Linear Discriminant Analysis (LDA) are used to extract face features. In the final stage, classic nearest centre classifier is used for face classification. Detailed experiments are performed on two publicly available LFW and the AR databases. Simulation results reveal that the prop...

Research paper thumbnail of Early decision of CU splitting, using base view information, for low complexity MV-HEVC

2017 International Multi-topic Conference (INMIC), 2017

Research paper thumbnail of Early CU Depth Decision and Reference Picture Selection for Low Complexity MV-HEVC

Symmetry, 2019

The Multi-View extension of High Efficiency Video Coding (MV-HEVC) has improved the coding effici... more The Multi-View extension of High Efficiency Video Coding (MV-HEVC) has improved the coding efficiency of multi-view videos, but this comes at the cost of the extra coding complexity of the MV-HEVC encoder. This coding complexity can be reduced by efficiently reducing time-consuming encoding operations. In this work, we propose two methods to reduce the encoder complexity. The first one is Early Coding unit Splitting (ECS), and the second is the Efficient Reference Picture Selection (ERPS) method. In the ECS method, the decision of Coding Unit (CU) splitting for dependent views is made on the CU splitting information obtained from the base view, while the ERPS method for dependent views is based on selecting reference pictures on the basis of the temporal location of the picture being encoded. Simulation results reveal that our proposed methods approximately reduce the encoding time by 58% when compared with HTM (16.2), the reference encoder for MV-HEVC.

Research paper thumbnail of Temporal and Inter-view Consistent Error Concealment Technique for Multiview plus Depth Video

IEEE Transactions on Circuits and Systems for Video Technology, 2016

Multiview plus depth (MVD) is an emerging video format with many applications, including 3D telev... more Multiview plus depth (MVD) is an emerging video format with many applications, including 3D television and free viewpoint television. During broadcast of compressed MVD video, transmission errors may cause the loss of whole frames, resulting in significant degradation of video quality. Error concealment techniques have been widely used to deal with transmission errors in video communication. However, the existing solutions do not address the requirement that the reconstructed frames be consistent with neighbouring frames, i.e., corresponding pixels have consistent color information. We propose a new consistency model for error concealment of MVD video that allows to maintain a high level of consistency between frames of the same view (temporal consistency) and those of neighbouring views (inter-view consistency). We then propose an algorithm that uses our model to implement concealment in a consistent way. Simulations with the reference software for the Multiview Video Coding project of the Joint Video Team (JVT) of the ISO/IEC MPEG and ITU-T VCEG show that our method outperforms benchmark techniques, including a baseline approach based on the Boundary Matching Algorithm, with respect to both reconstruction quality and view consistency.

Research paper thumbnail of Low-complexity multiview video coding

2012 Picture Coding Symposium, 2012

Research paper thumbnail of Low-Complexity Disparity Estimation for Multiview Video Coding

Keywords: LTS4 Reference EPFL-CONF-174803 Record created on 2012-02-09, modified on 2017-05-10

Research paper thumbnail of S. Khattak, R. Hamzaoui, S. Ahmad, P. Frossard

Research paper thumbnail of Fast encoding techniques for Multiview Video Coding

Signal Processing: Image Communication, 2013

Multiview Video Coding (MVC) is a technique that permits efficient compression of multiview video... more Multiview Video Coding (MVC) is a technique that permits efficient compression of multiview video. MVC uses variable block size motion and disparity estimation for block matching. This requires an exhaustive search process that involves all possible macroblock partition sizes. We analyze the time complexity of MVC and the methods that have been proposed to speed up motion and disparity estimation. We then propose two new methods: Previous Disparity Vector Disparity Estimation (PDV-DE) and Stereo-Motion Consistency Constraint Motion and Disparity Estimation (SMCC-MDE). PDV-DE exploits the correlation between temporal levels and disparity vectors to speed up the disparity estimation process while SMCC-MDE exploits the geometrical relationship of consecutive frame pairs to speed up motion and disparity estimation. We build a complete low complexity MVC encoding solution that combines our two methods with complementary previous methods to speed up motion and disparity search. We evaluate the complexity of our solution in terms of encoding time and number of search points. Our experimental results show that our solution can reduce the encoding time and number of search points of the standard MVC implementation (JMVM 6.0) using the fast TZ search mode by up to 93.7% and 96.9%, respectively, with negligible degradation in the rate-distortion performance. Compared to the best published results, this is an improvement of up to 11% and 7%, respectively.

Research paper thumbnail of Isolation Design Flow Effectiveness Evaluation Methodology for Zynq SoCs

Electronics

Static Random-Access Memory (SRAM)-based Field Programmable Gate Arrays (FPGAs) are increasingly ... more Static Random-Access Memory (SRAM)-based Field Programmable Gate Arrays (FPGAs) are increasingly being used in many application domains due to their higher logic density and reconfiguration capabilities. However, with state-of-the-art FPGAs being manufactured in the latest technology nodes, reliability is becoming an important issue, particularly for safety-critical avionics, automotive, aerospace, industrial robotics, medical, and financial systems. Therefore, fault tolerant system design methodologies have become essential in the aforementioned application domains. The Isolation Design Flow (IDF) is one such design methodology that has promising prospects due to its ability to isolate logic design modules at the physical level for fault containment purposes. This paper proposes a methodology to evaluate the effectiveness of the IDF. To do so, reverse engineering is used to enable fault injection on the IDF designs with minimal changes in the bit-stream. This reduces the time neede...

Research paper thumbnail of Framework for low-complexity multiview video coding

Research paper thumbnail of FracTCAM: Fracturable LUTRAM-Based TCAM Emulation on Xilinx FPGAs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Research paper thumbnail of Bayesian Early Mode Decision Technique for View Synthesis Prediction-Enhanced Multiview Video Coding

IEEE Signal Processing Letters, 2013

View synthesis prediction (VSP) is a coding mode that predicts video blocks from synthesised fram... more View synthesis prediction (VSP) is a coding mode that predicts video blocks from synthesised frames. It is particularly useful in a multi-camera setup with large inter-camera distances. Adding a VSP-based SKIP mode to a standard Multiview Video Coding (MVC) framework improves the rate-distortion (RD) performance but increases the time complexity of the encoder. This letter proposes an early mode decision technique for VSP SKIP-enhanced MVC. Our method uses the correlation between the RD costs of the VSP SKIP mode in neighbouring views and Bayesian decision theory to reduce the number of candidate coding modes for a given macroblock. Simulation results showed that our technique can save up to 36.20% of the encoding time without any significant loss in RD performance.

Research paper thumbnail of Multimedia Privacy and Security Landscape in the Wake of AI/ML

CRC Press eBooks, Jun 10, 2022

Research paper thumbnail of Sensor Fusion for Identification of Freezing of Gait Episodes Using Wi-Fi and Radar Imaging

IEEE Sensors Journal, 2020

Research paper thumbnail of A Robust Method to Locate License Plates under Diverse Conditions

2019 International Conference on Applied and Engineering Mathematics (ICAEM), 2019

Automatic License Plate Detection (ALPD) is a crucial step, which significantly affects the recog... more Automatic License Plate Detection (ALPD) is a crucial step, which significantly affects the recognition rate and speed of the Intelligent Transport System (ITS). This paper presents a robust license plate detection method using an intelligent combination of Faster R-CNN and image processing operations. In the proposed method, initially, a vehicle is detected in the input colored RGB images using the Faster R-CNN. Next, the image with detected vehicle is fed to our developed License Plate Localization Module (LPLM) to search the possible existence of the license plate. The LPLM converts the detected vehicle image from RGB to the HSV domain and applies color segmentation along with morphological operations, and finally uses the dimensions analysis to locate the license plate. Simulations on the challenging PKU dataset reveal that the proposed technique outperforms recent state-of-the-art methods in terms of detection accuracy, precision, recall, and execution time.

Research paper thumbnail of A Robust Face Recognition Method for Occluded and Low-Resolution Images

Face images that appear in multimedia applications, such as digital entertainments usually exhibi... more Face images that appear in multimedia applications, such as digital entertainments usually exhibit dramatic nonuniform illumination, occlusions, low-resolution, and pose/expression variations that result in substantial performance degradation for traditional face recognition algorithms. Recent research is focused to develop robust face recognition algorithms to solve the aforementioned issues with maximum effort to mimic the human vision system. This paper presents a near real-time and novel face recognition method to recognize the occluded and low-resolution face images. Proposed face recognition algorithm initially uses 68 points to locate a face in the input image. Meanwhile, the adaptive boosting and Linear Discriminant Analysis (LDA) are used to extract face features. In the final stage, classic nearest centre classifier is used for face classification. Detailed experiments are performed on two publicly available LFW and the AR databases. Simulation results reveal that the prop...

Research paper thumbnail of Early decision of CU splitting, using base view information, for low complexity MV-HEVC

2017 International Multi-topic Conference (INMIC), 2017

Research paper thumbnail of Early CU Depth Decision and Reference Picture Selection for Low Complexity MV-HEVC

Symmetry, 2019

The Multi-View extension of High Efficiency Video Coding (MV-HEVC) has improved the coding effici... more The Multi-View extension of High Efficiency Video Coding (MV-HEVC) has improved the coding efficiency of multi-view videos, but this comes at the cost of the extra coding complexity of the MV-HEVC encoder. This coding complexity can be reduced by efficiently reducing time-consuming encoding operations. In this work, we propose two methods to reduce the encoder complexity. The first one is Early Coding unit Splitting (ECS), and the second is the Efficient Reference Picture Selection (ERPS) method. In the ECS method, the decision of Coding Unit (CU) splitting for dependent views is made on the CU splitting information obtained from the base view, while the ERPS method for dependent views is based on selecting reference pictures on the basis of the temporal location of the picture being encoded. Simulation results reveal that our proposed methods approximately reduce the encoding time by 58% when compared with HTM (16.2), the reference encoder for MV-HEVC.

Research paper thumbnail of Temporal and Inter-view Consistent Error Concealment Technique for Multiview plus Depth Video

IEEE Transactions on Circuits and Systems for Video Technology, 2016

Multiview plus depth (MVD) is an emerging video format with many applications, including 3D telev... more Multiview plus depth (MVD) is an emerging video format with many applications, including 3D television and free viewpoint television. During broadcast of compressed MVD video, transmission errors may cause the loss of whole frames, resulting in significant degradation of video quality. Error concealment techniques have been widely used to deal with transmission errors in video communication. However, the existing solutions do not address the requirement that the reconstructed frames be consistent with neighbouring frames, i.e., corresponding pixels have consistent color information. We propose a new consistency model for error concealment of MVD video that allows to maintain a high level of consistency between frames of the same view (temporal consistency) and those of neighbouring views (inter-view consistency). We then propose an algorithm that uses our model to implement concealment in a consistent way. Simulations with the reference software for the Multiview Video Coding project of the Joint Video Team (JVT) of the ISO/IEC MPEG and ITU-T VCEG show that our method outperforms benchmark techniques, including a baseline approach based on the Boundary Matching Algorithm, with respect to both reconstruction quality and view consistency.

Research paper thumbnail of Low-complexity multiview video coding

2012 Picture Coding Symposium, 2012

Research paper thumbnail of Low-Complexity Disparity Estimation for Multiview Video Coding

Keywords: LTS4 Reference EPFL-CONF-174803 Record created on 2012-02-09, modified on 2017-05-10

Research paper thumbnail of S. Khattak, R. Hamzaoui, S. Ahmad, P. Frossard

Research paper thumbnail of Fast encoding techniques for Multiview Video Coding

Signal Processing: Image Communication, 2013

Multiview Video Coding (MVC) is a technique that permits efficient compression of multiview video... more Multiview Video Coding (MVC) is a technique that permits efficient compression of multiview video. MVC uses variable block size motion and disparity estimation for block matching. This requires an exhaustive search process that involves all possible macroblock partition sizes. We analyze the time complexity of MVC and the methods that have been proposed to speed up motion and disparity estimation. We then propose two new methods: Previous Disparity Vector Disparity Estimation (PDV-DE) and Stereo-Motion Consistency Constraint Motion and Disparity Estimation (SMCC-MDE). PDV-DE exploits the correlation between temporal levels and disparity vectors to speed up the disparity estimation process while SMCC-MDE exploits the geometrical relationship of consecutive frame pairs to speed up motion and disparity estimation. We build a complete low complexity MVC encoding solution that combines our two methods with complementary previous methods to speed up motion and disparity search. We evaluate the complexity of our solution in terms of encoding time and number of search points. Our experimental results show that our solution can reduce the encoding time and number of search points of the standard MVC implementation (JMVM 6.0) using the fast TZ search mode by up to 93.7% and 96.9%, respectively, with negligible degradation in the rate-distortion performance. Compared to the best published results, this is an improvement of up to 11% and 7%, respectively.

Research paper thumbnail of Isolation Design Flow Effectiveness Evaluation Methodology for Zynq SoCs

Electronics

Static Random-Access Memory (SRAM)-based Field Programmable Gate Arrays (FPGAs) are increasingly ... more Static Random-Access Memory (SRAM)-based Field Programmable Gate Arrays (FPGAs) are increasingly being used in many application domains due to their higher logic density and reconfiguration capabilities. However, with state-of-the-art FPGAs being manufactured in the latest technology nodes, reliability is becoming an important issue, particularly for safety-critical avionics, automotive, aerospace, industrial robotics, medical, and financial systems. Therefore, fault tolerant system design methodologies have become essential in the aforementioned application domains. The Isolation Design Flow (IDF) is one such design methodology that has promising prospects due to its ability to isolate logic design modules at the physical level for fault containment purposes. This paper proposes a methodology to evaluate the effectiveness of the IDF. To do so, reverse engineering is used to enable fault injection on the IDF designs with minimal changes in the bit-stream. This reduces the time neede...

Research paper thumbnail of Framework for low-complexity multiview video coding

Research paper thumbnail of FracTCAM: Fracturable LUTRAM-Based TCAM Emulation on Xilinx FPGAs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems