State of the Art Techniques to Advance Deep Networks for Semantic Segmentation (original) (raw)
Related papers
Deep-Learning-Based Approaches for Semantic Segmentation of Natural Scene Images: A Review
Electronics
The task of semantic segmentation holds a fundamental position in the field of computer vision. Assigning a semantic label to each pixel in an image is a challenging task. In recent times, significant advancements have been achieved in the field of semantic segmentation through the application of Convolutional Neural Networks (CNN) techniques based on deep learning. This paper presents a comprehensive and structured analysis of approximately 150 methods of semantic segmentation based on CNN within the last decade. Moreover, it examines 15 well-known datasets in the semantic segmentation field. These datasets consist of 2D and 3D image and video frames, including general, indoor, outdoor, and street scenes. Furthermore, this paper mentions several recent techniques, such as SAM, UDA, and common post-processing algorithms, such as CRF and MRF. Additionally, this paper analyzes the performance evaluation of reviewed state-of-the-art methods, pioneering methods, common backbone networks...
IJERT-A Survey on Semantic Segmentation using Deep Learning Techniques
International Journal of Engineering Research and Technology (IJERT), 2021
https://www.ijert.org/a-survey-on-semantic-segmentation-using-deep-learning-techniques https://www.ijert.org/research/a-survey-on-semantic-segmentation-using-deep-learning-techniques-IJERTCONV9IS05011.pdf Semantic segmentation is a challenging task in the field of computer vision. It is process of classifying each pixel belonging to a particular label. It has many challenging applications such as autonomous vehicles, human-computer interaction, robot navigation, medical research and so on, which motivates us to survey the different semantic segmentation architectures. Most of these methods have been built using the deep learning techniques. In this paper we made a review of some state-of-the-art Convolutional Neural Network(CNN) architectures such as AlexNet, GoogleNet, VGGNet, ResNet which form the basis for Semantic Segmentation. Further, we presenteddifferent semanticsegmentation architectures such as Fully Convolutional Network (FCN), ParseNet, Deconvolution Network, U-Net, Feature Pyramid Network(FPN), Mask R-CNN. Finally, we compared the performances of all these architectures.
IEEE Access
Machine learning and deep learning algorithms are widely used in computer science domains. These algorithms are mostly used for classification and regression problems in almost every field of life. Semantic segmentation is an instantly growing research topic in the last few decades that refers to the association of each pixel in the image to the class it belongs. This paper illustrates the systematic survey of advanced research in the field of semantic segmentation till date. This study provides the brief knowledge about the latest proposed methods in the domain of semantic segmentation. The proposed study comprehends the concepts, techniques, tool, and results of different research frameworks proposed in the context of semantic segmentation. This study discusses the latest research papers in which machine learning and deep learning techniques are exploited for semantic segmentation and published between 2016 and 2021. The systematic literature review collected from seven different article libraries including ACM digital Library, Google Scholar, IEEE Xplore, Science Direct, Google Books, Refseek and Worldwide Science. For assuring the quality of the paper those papers are selected which have several citations on standardized platforms. Most of the studies used COCO, PASCAL, Cityscapes and CamVid dataset for training and validation of the machine learning and deep learning models. The results of the selected research articles are collected in the form of accuracy, mIoU value, F1 score, precision, and recall. In this study, we also conclude that most of the semantic segmentation studies use ResNet as the backbone of the architecture and none of the researchers used ensemble learning methods for semantic segmentation that is the loophole of the selected studies.
How deep learning is empowering semantic segmentation
Multimedia Tools and Applications
Semantic segmentation involves extracting meaningful information from images or input from a video or recording frames. It is the way to perform the extraction by checking pixels by pixel using a classification approach. It gives us more accurate and fine details from the data we need for further evaluation. Formerly, we had a few techniques based on some unsupervised learning perspectives or some conventional ways to do some image processing tasks. With the advent of time, techniques are improving, and we now have more improved and efficient methods for segmentation. Image segmentation is slightly simpler than semantic segmentation because of the technical perspective as semantic segmentation is pixels based. After that, the detected part based on the label will be masked and refer to the masked objects based on the classes we have defined with a relevant class name and the designated color. In this paper, we have reviewed almost all the supervised and unsupervised learning algorithms from scratch to advanced and more efficient algorithms that have been done for semantic segmentation. As far as deep learning is concerned, we have many techniques already developed until now. We have studied around 120 papers in this research area. We have concluded how deep learning is helping in solving the critical issues of semantic segmentation and gives us more efficient results. We have reviewed and comprehensively studied different surveys on semantic segmentation, specifically using deep learning.
Real-Time Semantic Image Segmentation with Deep Learning for Autonomous Driving: A Survey
Applied Sciences
Semantic image segmentation for autonomous driving is a challenging task due to its requirement for both effectiveness and efficiency. Recent developments in deep learning have demonstrated important performance boosting in terms of accuracy. In this paper, we present a comprehensive overview of the state-of-the-art semantic image segmentation methods using deep-learning techniques aiming to operate in real time so that can efficiently support an autonomous driving scenario. To this end, the presented overview puts a particular emphasis on the presentation of all those approaches which permit inference time reduction, while an analysis of the existing methods is addressed by taking into account their end-to-end functionality, as well as a comparative study that relies upon a consistent evaluation framework. Finally, a fruitful discussion is presented that provides key insights for the current trend and future research directions in real-time semantic image segmentation with deep lea...
Semantic Segmentation using Fully Convolutional Net: A Review
Semantic segmentation has paved its way in predicting the models using dense pixel-wise prediction method apart from classification. The presented models for semantic partitions the image into semantically meaningful chunks and classifies each chunk into one of the predetermined classes. The presented model reduces the parameters to be trained and helps in up-sampling; it describes quality and accuracy and efficient mechanism. Deeper the layers are, helps in capturing the high-level semantic features from the previous convolutional layers.
Nashriyyahʼi farhangʼi Khurāsān, 2023
Semantic segmentation is a branch of computer vision, used extensively in image search engines, automated driving, intelligent agriculture, disaster management, and other machine-human interactions. Semantic segmentation aims to predict a label for each pixel from a given label set, according to semantic information. Among the proposed methods and architectures, researchers have focused on deep learning algorithms due to their good feature learning results. Thus, many studies have explored the structure of deep neural networks, especially convolutional neural networks. Most of the modern semantic segmentation models are based on fully convolutional networks (FCN), which first replace the fully connected layers in common classification networks with convolutional layers, getting pixel-level prediction results. After that, a lot of methods are proposed to improve the basic FCN methods results. With the increasing complexity and variety of existing data structures, more powerful neural networks and the development of existing networks are needed. This study aims to segment a high-resolution (HR) image dataset into six separate classes. Here, an overview of some important deep learning architectures will be presented with a focus on methods producing remarkable scores in segmentation metrics such as accuracy and F1-score. Finally, their segmentation results will be discussed and we would see that the methods, which are superior in the overall accuracy and overall F1-score, are not necessarily the best in all classes. Therefore, the results of this paper lead to the point to choose the segmentation algorithm according to the application of segmentation and the importance degree of each class.
Applying Deep Learning for Image Segmentation: A Survey
World Congress on Electrical Engineering and Computer Systems and Science
Image segmentation is one of the most important branches of image processing. But it comes with various challenges and problems to be solved. Researchers are always working on improving the accuracy, quality and performance of image segmentation techniques. As in modern days, deep learning being involved in almost all problem solving, it is being used in image segmentation too. In this paper, we discussed few image segmentation techniques developed using deep learning, some implementation of these techniques to applications. And lastly, we addressed some limitations, challenges and research scopes for future.
A Deep Learning-Based Semantic Segmentation Architecture for Autonomous Driving Applications
Wireless Communications and Mobile Computing
In recent years, the development of smart transportation has accelerated research on semantic segmentation as it is one of the most important problems in this area. A large receptive field has always been the center of focus when designing convolutional neural networks for semantic segmentation. A majority of recent techniques have used maxpooling to increase the receptive field of a network at an expense of decreasing its spatial resolution. Although this idea has shown improved results in object detection applications, however, when it comes to semantic segmentation, a high spatial resolution also needs to be considered. To address this issue, a new deep learning model, the M-Net is proposed in this paper which satisfies both high spatial resolution and a large enough receptive field while keeping the size of the model to a minimum. The proposed network is based on an encoder-decoder architecture. The encoder uses atrous convolution to encode the features at full resolution, and i...
A Brief Survey and an Application of Semantic Image Segmentation for Autonomous Driving
Handbook of Deep Learning Applications, 2019
Deep learning is a fast-growing machine learning approach to perceive and understand large amounts of data. In this paper, general information about the deep learning approach which is attracted much attention in the field of machine learning is given in recent years and an application about semantic image segmentation is carried out in order to help autonomous driving of autonomous vehicles. This application is implemented with Fully Convolutional Network (FCN) architectures obtained by modifying the Convolutional Neural Network (CNN) architectures based on deep learning. Experimental studies for the application are utilized 4 different FCN architectures named FCN-AlexNet, FCN-8s, FCN-16s and FCN-32s. For the experimental studies, FCNs are first trained separately and validation accuracies of these trained network models on the used dataset is compared. In addition, image segmentation inferences are visualized to take account of how precisely FCN architectures can segment objects.