Local Label Point Correction for Edge Detection of Overlapping Cervical Cells

Abstract

Accurate labeling is essential for supervised deep learning methods. However, it is almost impossible to manually annotate thousands of images accurately, so most datasets contain many labeling errors. We propose a local label point correction (LLPC) method to improve annotation quality for edge detection and image segmentation tasks. Our algorithm contains three steps: gradient-guided point correction, point interpolation, and local point smoothing. We correct the labels of object contours by moving the annotated points to the pixel gradient peaks. This improves edge localization accuracy, but it also produces unsmooth contours due to interference from image noise. Therefore, we design a point smoothing method based on local linear fitting to smooth the corrected edges. To verify the effectiveness of our LLPC, we construct the largest overlapping cervical cell edge detection dataset (CCEDD), with higher-precision labels corrected by our label correction method. Our LLPC requires setting only three parameters, yet yields a 30–40% average precision improvement on multiple networks. The qualitative and quantitative experimental results show that our LLPC can improve both the quality of manual labels and the accuracy of overlapping cell edge detection. We hope that our study will give a strong boost to the development of label correction for edge detection and image segmentation. We will release the dataset and code at: https://github.com/nachifur/LLPC.

1. Introduction

Medical image datasets are generally annotated by professional physicians (Demner-Fushman et al., 2016; Almazroa et al., 2017; Johnson et al., 2019; Zhang et al., 2019; Lin et al., 2021; Ma et al., 2021; Wei et al., 2021). To construct an annotated dataset for edge detection or image segmentation tasks, annotators often need to annotate points and connect them into an object outline. In the manual labeling process, it is difficult to control label accuracy due to human error. Northcutt et al. (2021) found that label errors are numerous and universal: the average error rate in 10 datasets is 3.4%. These wrong labels seriously affect the accuracy of model evaluation and destabilize benchmarks, which ultimately spills over into model selection and deployment. For example, the deployed model in learning-based computer-aided diagnosis (Saha et al., 2019; Song et al., 2019, 2020; Wan et al., 2019; Zhang et al., 2020) is selected from many candidate models based on evaluation accuracy, which means that inaccurate annotations may ultimately affect accurate diagnosis. To mitigate labeling errors, an image is often annotated by multiple annotators (Arbelaez et al., 2010; Almazroa et al., 2017; Zhang et al., 2019), which generates multiple labels for one image. However, even if the annotation standard is unified, differences between annotators are inevitable. Another way is to correct the labels manually (Ma et al., 2021). In fact, multi-person annotation and manual label correction are both time-consuming and labor-intensive. Therefore, it is of great value to develop label correction methods that build on manual annotation for supervised deep learning.

Most label correction works are focused on weak supervision (Zheng et al., 2021), semi-supervision (Li et al., 2020), crowdsourced labeling (Bhadra and Hein, 2015; Nicholson et al., 2016), classification (Nicholson et al., 2015; Kremer et al., 2018; Guo et al., 2019; Liu et al., 2020; Wang et al., 2021; Li et al., 2022), and natural language processing (Zhu et al., 2019). However, label correction in these tasks is completely different from correcting object contours. To automatically correct edge labels, we propose a local label point correction method for edge detection and image segmentation. Our method contains three steps: gradient-guided point correction, point interpolation, and local point smoothing. We correct the annotation of the object contours by moving label points to the pixel gradient peaks and smoothing the edges formed by these points. To verify the effectiveness of our label correction method, we construct a cervical cell edge detection dataset. Experiments with multiple state-of-the-art deep learning models on the CCEDD show that our LLPC can greatly improve the quality of manual annotation and the accuracy of overlapping cell edge detection, as shown in Figure 1. Our unique contributions are summarized as follows:

2.1. Label Correction

Deep learning is developing rapidly with the help of big computing (Jouppi et al., 2017) and big data (Deng et al., 2009; Sun et al., 2017; Zhou et al., 2017). Some works (Radford et al., 2019; Brown et al., 2020; Raffel et al., 2020) focus on feeding larger models with more data for better performance and generalization, while others design task-specific model structures and loss functions (Hu et al., 2019; Huang et al., 2021; Zhao et al., 2022) to improve performance on a fixed dataset. Recently, data itself has received a lot of attention. Ng et al. (2021) led the data revolution of deep learning and successfully organized the first “Data-Centric AI” competition. The competition aims to improve data quality and develop data optimization pipelines, such as label correction, data synthesis, and data augmentation (Motamedi et al., 2021). Competitors mine data potential instead of optimizing model structure to improve performance. Northcutt et al. (2021) found that if the error rate of test labels increases by only 6%, ResNet-18 outperforms ResNet-50 on ImageNet (Deng et al., 2009). To improve data quality and accurately evaluate models, there is an urgent need to develop label correction algorithms. In weak supervision and semi-supervision (Li et al., 2020; Zheng et al., 2021), pseudo-label correction is usually adopted due to the lack of supervision from real labels. Zheng et al. (2021) correct noisy labels by using a meta network for image recognition and text classification. For supervised learning, bad data can be discarded by preprocessing, but bad labels seem inevitable in large-scale datasets. In crowdsourcing (Bhadra and Hein, 2015; Nicholson et al., 2016), an image is annotated by multiple people to improve the accuracy of classification tasks (Nicholson et al., 2015; Kremer et al., 2018; Guo et al., 2019). Guo et al. (2019) trained a model using a small amount of data and designed a label completion method to generate labels (negative or positive) for the mostly unlabeled data. However, label correction in these tasks is significantly different from correcting object contours. In this paper, to eliminate edge location errors and inter-annotator differences in manual annotation, we propose a label correction method based on annotation points for edge detection and image segmentation. Besides, we compare our LLPC with conditional random fields (CRF) (Sutton et al., 2012), which are popular as post-processing for other segmentation methods (Chen et al., 2017; Sun et al., 2020; Fan et al., 2021a; Lu et al., 2021; Ma et al., 2022; Zhang et al., 2022). Dense CRF (Krähenbühl and Koltun, 2011) improves labeling accuracy by optimizing an energy function on coarse segmentation images, while our LLPC is a label correction method based on annotation points; these are two different technical routes of label correction for image segmentation. More discussion is provided in Section 5.3.

2.2. Cervical Cell Dataset

Currently, cervical cell datasets include the ISBI 2015 challenge dataset (Lu et al., 2015), the Shenzhen University dataset (Song et al., 2016), and the Beihang University dataset (Wan et al., 2019). Supervised deep learning based methods require large amounts of data with accurate annotations. However, the only public dataset, ISBI (Lu et al., 2015), contains a small amount of data and simple image types, which makes it difficult to train deep neural networks. In this paper, we construct the largest high-accuracy cervical cell edge detection dataset based on our label correction method. Our CCEDD contains overlapping cervical cell masses in a variety of complex backgrounds together with high-precision corrected labels, which are sufficient in quantity and richness to train various deep learning models.

3. Label Correction

Our LLPC contains three steps: gradient-guided point correction (GPC), point interpolation (PI), and local point smoothing (LPS). I(x, y) is a cervical cell image and g(x, y) is the gradient image of I(x, y) after Gaussian smoothing. p_i is an original label point of I(x, y). First, we correct each point to the nearest gradient peak on g(x, y), as shown in Figure 2A, i.e., p_i → p_i^*, i ∈ {1, 2, …, n_s}. Second, we insert more points in large gaps, as shown in Figure 2B, i.e., {p_i^*} → {q_j}, j ∈ {1, 2, …, n_I}. n_s and n_I are the number of points before and after interpolation, respectively. Third, we divide the point set {q_j} into n_c groups. Each group of points is expressed as Φ_k. We fit a curve C_k on Φ_k, k ∈ {1, 2, …, n_c}. All curves {C_k} are merged into a closed curve C_c, as shown in Figure 2C. Finally, we sample C_c to obtain a discrete edge C_d, as shown in Figure 2D. In fact, the closed discrete edges generated by fusing multiple curves are not smooth at the stitching nodes. Therefore, we propose a local point smoothing method without curve splicing and sampling in Section 3.3.

3.1. Gradient-Guided Point Correction

Although the annotations of cervical cell images are provided by professional cytologists, the label points usually deviate from the pixel gradient peaks due to human error. To solve this problem, we design a gradient-guided point correction (GPC) method. We correct label points only in strong gradient regions to eliminate human error, while preserving the original label points in weak gradient regions to retain the correct high-level semantics of the human annotations. Our point correction consists of three steps, as follows:

The processing object of our LLPC is a set of label points {p_i} corresponding to a closed contour. For an original label point p_i, we select candidate points along the normal direction of the label edge, as shown in Figure 2A. These points constitute a candidate point set N(p_i) with radius r, and p_i^max is the point with the largest weighted gradient in N(p_i). We move p_i to the position of p_i^max to obtain the corrected label point p_i^*:

p_i^* = p_i^max if Δ > 0, and p_i^* = p_i otherwise,

where each c_j is a candidate point in N(p_i), and Δ measures whether the weighted gradient at p_i^max exceeds the gradient at p_i by the threshold factor λ_t. We judge whether a point is in a strong gradient region through Δ. If Δ > 0, the point is corrected; otherwise, it is not moved. In this way, when the radius r of N(p_i) is larger, our method can correct larger annotation errors. However, a larger radius also increases the correction error of label points due to image noise and interference from adjacent edges. To balance this trade-off, the gradient value of each candidate point is weighted by ω_j, which allows setting a larger radius to correct larger annotation errors. We compute the weight as

ω_j = K(d_j, h_1) = κ(d_j / h_1) / h_1,

where d_j is the distance from the candidate point c_j to p_i, K(x, h) is a weighted kernel function with bandwidth h, and κ(x) is a Gaussian function with zero mean and unit variance. After point correction, {p_i} → {p_i^*}.
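As an illustrative sketch of this step (not the paper's exact implementation), the following corrects one point by searching along the contour normal for the Gaussian-weighted gradient peak. The Δ > 0 criterion is approximated here as the weighted peak exceeding λ_t times the gradient at the original point, and the function name and interface are hypothetical.

```python
import numpy as np

def correct_point(grad, p, normal, r=15, lambda_t=4.0, h=None):
    """Gradient-guided point correction (GPC) sketch: move label point `p`
    toward the Gaussian-weighted gradient peak along the contour normal,
    within radius r, but only in strong-gradient regions."""
    if h is None:
        h = r / 2.0                          # bandwidth h_1 = r/2, as in the paper
    H, W = grad.shape
    n = np.asarray(normal, float)
    n /= np.linalg.norm(n) + 1e-12
    p = np.asarray(p, float)
    x0, y0 = np.round(p).astype(int)
    best_val, best_pt = -np.inf, p
    for t in range(-r, r + 1):               # candidates along the normal
        q = np.round(p + t * n).astype(int)
        x, y = q
        if not (0 <= x < W and 0 <= y < H):
            continue
        w = np.exp(-0.5 * (t / h) ** 2)      # Gaussian weight: favor near candidates
        v = w * grad[y, x]
        if v > best_val:
            best_val, best_pt = v, q.astype(float)
    # assumed Delta > 0 criterion: weighted peak must clearly exceed the
    # gradient at the original point (strong-gradient region)
    if best_val > lambda_t * grad[y0, x0]:
        return best_pt
    return p                                  # weak gradient: keep original label
```

With a synthetic gradient image containing a single bright edge column, a point placed a few pixels off the edge is pulled onto it, while a point already on the edge (or far from any edge) is left unchanged.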

3.2. Piecewise Curve Fitting

The edge generated directly from the corrected point set is not smooth due to errors in the point correction process (see Section 5.4). To eliminate these errors, we fit multiple curve segments and stitch them together. When manually drawing cell contours, annotators place dense points near high-curvature regions and sparse points near low-curvature regions to outline cell contours accurately and quickly. Since large intervals between points are not conducive to curve fitting, we perform linear point interpolation (PI) on these intervals before curve fitting.

3.2.1. Point Interpolation

The sparse label point pairs can be represented as

(p_i^*, p_{i+1}^*), i = 0, 1, …, n_s − 1.

Then, we insert points between the sparse point pairs so that adjacent points satisfy

‖q_j − q_{j+1}‖ ≤ gap, j = 0, 1, …, n_I − 1,

as shown in Figure 2B. n_s and n_I are the number of points before and after interpolation, respectively. gap is the maximum interval between adjacent point pairs. After interpolation, {p_i^*} → {q_j}.
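The interpolation step can be sketched as follows, assuming a closed contour given as an ordered point list; the function name and the uniform-subdivision scheme are our own illustration.

```python
import numpy as np

def interpolate_points(pts, gap=1.0):
    """Point interpolation (PI) sketch: subdivide each segment of a closed
    contour so that adjacent points are at most `gap` apart."""
    pts = np.asarray(pts, float)
    out = []
    n = len(pts)
    for i in range(n):
        a, b = pts[i], pts[(i + 1) % n]     # closed contour: wrap around
        d = np.linalg.norm(b - a)
        m = max(1, int(np.ceil(d / gap)))   # sub-segments needed for this gap
        for t in range(m):
            out.append(a + (b - a) * t / m)
    return np.array(out)
```

For example, a 4×4 square annotated by its four corners becomes 16 evenly spaced points with gap = 1.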

3.2.2. Curve Fitting

We divide {q_j} into n_c groups. Each group is expressed as Φ_k, k = 0, 1, …, n_c − 1, with n_c = ⌈n_I/s⌉. As shown in Figure 3, s = 2(r_f − n_d) is the interval between the center points of each group; r_f = ⌊(n_g − 1)/2⌋ is the group radius; n_g is the number of points in a group. To reduce the fitting error at both ends of a curve, adjacent curves overlap; the overlapping length is 2n_d. To fit a curve on Φ_k, we create a new coordinate system, as shown in Figure 2C. The x-axis passes through the two endpoints of Φ_k. The point set in the new coordinate system is Φ'_k = {(x_j, y_j)}. We obtain a curve C_k by local linear fitting (McCrary, 2008) on Φ'_k. This is equivalent to solving the following problem at a target point x_t = (x, y) on the curve C_k:

min over β_0(x), β_1(x) of Σ_j ω_j(x) [y_j − β_0(x) − β_1(x)(x_j − x)]²,

where β_0(x) and β_1(x) are the curve parameters at the point x_t, and (x_j, y_j) denotes the coordinates of a point in Φ'_k. The weight function is

ω_j(x) = K(x_j − x, h_2).

If the distance between the point (x_j, y_j) and the target point x_t is larger, the weight ω_j(x) is smaller. The matrix representation of the above parameter solution is

β̂ = (XᵀωX)⁻¹ Xᵀω Y,

where β̂ = (β_0(x), β_1(x))ᵀ, X is the design matrix with rows (1, x_j − x), Y = (y_0, y_1, …)ᵀ, and ω is the diagonal weight matrix; ω is zero except for the diagonal. Each Φ_k corresponds to a curve C_k. We stitch the n_c curves into a closed curve C_c, as shown in Figures 2C, 3. Then, we sample on the fitting interval, as shown in Figure 2D. We convert the coordinates of these sampling points to the original image coordinate system. Finally, we obtain a discrete edge C_d, as shown in Figures 2E,F.
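The weighted least-squares solution can be sketched in a few lines; `local_linear_fit` is a hypothetical name, and a Gaussian kernel stands in for K(x, h_2).

```python
import numpy as np

def local_linear_fit(xs, ys, x_t, h):
    """Local linear fit at target x_t: solve the kernel-weighted least
    squares beta = (X^T W X)^{-1} X^T W Y and return the fitted value."""
    xs, ys = np.asarray(xs, float), np.asarray(ys, float)
    X = np.stack([np.ones_like(xs), xs - x_t], axis=1)  # rows (1, x_j - x_t)
    w = np.exp(-0.5 * ((xs - x_t) / h) ** 2)            # Gaussian kernel weights
    W = np.diag(w)
    beta = np.linalg.solve(X.T @ W @ X, X.T @ W @ ys)   # (beta_0, beta_1)
    return beta[0]                                      # beta_0 = value at x_t
```

On exactly linear data the fit reproduces the line regardless of the bandwidth, which is a quick sanity check of the implementation.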

3.3. Local Point Smoothing

In Section 3.2, we stitch multi-segment curves to obtain a closed cell curve, and then sample the curve to generate a discrete edge. In fact, the result is not smooth at the splice nodes. To generate a smooth closed discrete edge, we design a local point smoothing (LPS) method without curve splicing and sampling. As shown in Figure 4A, we insert more points in large intervals (gap = 1). As shown in Figure 4B, we correct only the center point of Φ_k by fitting a curve C_k. By shifting the local coordinate system by one step (s = 1), every point is corrected by fitting a curve. These corrected points constitute a discrete edge C_d. Because no curves are spliced, the generated edge is smooth at each point. The pipeline of our LLPC is shown in Algorithm 1.
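The sliding-window smoothing can be sketched as follows. This is an illustration under stated assumptions: a symmetric window of 2·r_f + 1 points stands in for the n_g-point group, and only the center point of each window is corrected by a weighted linear fit in a local frame whose x-axis joins the window endpoints.

```python
import numpy as np

def smooth_contour(pts, n_g=14, h=None):
    """Local point smoothing (LPS) sketch: for each point of a closed contour,
    rotate its neighborhood into a local frame, correct only the center point
    by a kernel-weighted linear fit, and rotate back. No curve splicing, so
    the resulting edge is smooth at every point."""
    pts = np.asarray(pts, float)
    n = len(pts)
    r_f = (n_g - 1) // 2
    if h is None:
        h = r_f / 2.0                                     # bandwidth h_2 = r_f/2
    out = np.empty_like(pts)
    for i in range(n):
        idx = [(i + k) % n for k in range(-r_f, r_f + 1)]  # wrap: closed contour
        win = pts[idx]
        a, b = win[0], win[-1]
        ax = b - a
        ax /= np.linalg.norm(ax) + 1e-12                  # local x-axis: endpoints
        ay = np.array([-ax[1], ax[0]])                    # local y-axis
        loc = (win - a) @ np.stack([ax, ay], axis=1)      # rotate into local frame
        xs, ys = loc[:, 0], loc[:, 1]
        x_t = xs[r_f]                                     # center point of window
        X = np.stack([np.ones_like(xs), xs - x_t], axis=1)
        W = np.diag(np.exp(-0.5 * ((xs - x_t) / h) ** 2))
        beta = np.linalg.solve(X.T @ W @ X, X.T @ W @ ys)
        out[i] = a + x_t * ax + beta[0] * ay              # back to image frame
    return out
```

Applied to a noisy circle, the smoothed radii scatter less than the noisy ones, i.e., the jitter along the contour is suppressed while the overall shape is kept.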


LLPC Label Correction Algorithm.

3.4. Parameter Setting

In Section 3.1, we set the parameters r = 15, λ_t = 4, and h_1 = r/2. In Section 3.2, we set n_g = 14, r_f = ⌊(n_g − 1)/2⌋, and h_2 = r_f/2. When gap = 1 and s = 1, Section 3.3 is a special case of Section 3.2. See Section 5.4 for more discussion of parameter selection.

4. Experimental Design

4.1. Data Acquisition and Processing

We compare our CCEDD with other cervical cytology datasets in Table 1. Our dataset was collected from Liaoning Cancer Hospital & Institute between 2016 and 2017. We captured digital images with a Nikon ECLIPSE Ci slide scanner, a SmartV350D lens, and a 3-megapixel digital camera. For patients with negative and positive cervical cancer diagnoses, the optical magnification is 100× and 400×, respectively. All cases are anonymized, and all processes of our research (image acquisition, processing, etc.) follow ethical principles. Our CCEDD dataset includes 686 cervical images with a size of 2,048×1,536 pixels (Table 2). Six expert cytologists outlined the closed contours of the cytoplasm and nucleus in the cervical cytological images using the annotation software labelme (Wada, 2016).

Dataset Image size Dataset size Dataset size (512×512) Open
ISBI (Lu et al., 2015) 1,024×1,024 17 68 ✓
SZU Dataset (Song et al., 2016) 1,360×1,024 21 84 ×
BHU Dataset (Wan et al., 2019) 512×512 580 580 ×
CCEDD 2,048×1,536 686 8,232 ✓

Comparison with other cervical cytology datasets.

For a fair comparison of the sizes of different datasets, we crop the images to 512×512, and our CCEDD is about ten times larger than other datasets. Best results are highlighted.

Our CCEDD Uncut CCEDD Cut CCEDD
Image size 2,048×1,536 512×384
Training set size 411 20,139
Validation set size 68 3,332
Test set size 207 10,143
Dataset size 686 33,614

The detailed description of CCEDD.

We randomly shuffle our dataset and split it into training, validation, and test sets. To ensure test reliability, we set this ratio to 6:1:3. To be able to train various complex neural networks on a single GPU, we crop each large image into small images. If an image is cut as shown in Figure 5A, the result is incomplete edges at the cut boundaries. To maximize data utilization efficiency, we shift the cutting grid, as shown in Figures 5B–D. After label correction, we cut each 2,048×1,536 image into 49 image patches of 512×384 pixels.
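The shifted-grid patch count can be checked with a short sketch (assuming half-patch shifts in x, in y, and in both directions, with patches kept fully inside the image; the function name is hypothetical):

```python
def count_patches(W, H, w, h):
    """Count crops produced by a base grid plus three half-patch-shifted
    grids (shift in x, in y, and in both), patches fully inside the image."""
    def grid(off_x, off_y):
        nx = (W - off_x) // w       # columns that fit after the x offset
        ny = (H - off_y) // h       # rows that fit after the y offset
        return nx * ny
    return (grid(0, 0) + grid(w // 2, 0) +
            grid(0, h // 2) + grid(w // 2, h // 2))
```

For a 2,048×1,536 image cut into 512×384 patches this gives 16 + 12 + 12 + 9 = 49 patches per image, matching the count above.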

4.2. Baseline Model and Evaluation Metrics

4.2.1. Baseline Model

Our baseline detectors are 10 state-of-the-art models. We evaluate multiple edge detectors, such as RCF (Liu et al., 2019), ENDE (Nazeri et al., 2019), DexiNed (Poma et al., 2020), FINED (Wibisono and Hang, 2020), and PiDiNet (Su et al., 2021b). Furthermore, we explore more network structures for edge detection by introducing segmentation networks, which usually requires only a simple modification of the last layer of each network. These segmentation networks include STDC (Fan et al., 2021b), UNet (Ronneberger et al., 2015), UNet++ (Zhou et al., 2019), CENet (Gu et al., 2019), and MSU-Net (Su et al., 2021a). To aggregate more shallow features for edge detection, we modify multiple layers of STDC, denoted STDC+. More details of these network structures can be found in our code implementation.

4.2.2. Evaluation Metrics

We quantitatively evaluate edge detection accuracy by calculating three standard measures (ODS, OIS, and AP) (Arbelaez et al., 2010). The average precision (AP) is the area under the precision-recall curve (Figure 1B). The F1-score is the harmonic mean of precision and recall. ODS is the best F1-score for a threshold fixed over the whole dataset, while OIS averages the F1-score at the best threshold for each image.
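These measures can be sketched from per-image precision-recall curves as follows. This is a simplification: real edge-detection evaluation first matches predicted and ground-truth edge pixels within a distance tolerance before computing precision and recall, which is omitted here.

```python
import numpy as np

def f1(p, r):
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) > 0 else 0.0

def edge_metrics(per_image_pr):
    """Sketch of ODS / OIS / AP. `per_image_pr` is a list of
    (precision[t], recall[t]) arrays over a shared set of thresholds t."""
    P = np.mean([p for p, _ in per_image_pr], axis=0)   # dataset-level P(t)
    R = np.mean([r for _, r in per_image_pr], axis=0)   # dataset-level R(t)
    ods = max(f1(p, r) for p, r in zip(P, R))           # best fixed threshold
    ois = np.mean([max(f1(p, r) for p, r in zip(pi, ri))
                   for pi, ri in per_image_pr])         # best per-image threshold
    order = np.argsort(R)                               # AP: area under P-R curve
    ap = np.trapz(np.asarray(P)[order], np.asarray(R)[order])
    return ods, ois, ap
```

On a toy input where every image has precision/recall pairs (1.0, 0.5) and (0.5, 1.0), both ODS and OIS equal 2/3 and AP is 0.375.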

4.3. Experimental Setup

4.3.1. Training Strategy

Data augmentation can improve model generalization and performance (Bloice et al., 2019). In training, we perform rotation and shearing operations, which require padding zero pixels around an image. In testing, there is no zero-pixel padding. This leads to different distributions for the training and testing sets and degrades model performance. Therefore, we perform data augmentation during pre-training and no augmentation during fine-tuning.

Due to the different structures and parameters of the baseline networks, a fixed number of training iterations may lead to overfitting or underfitting. For accurate evaluation, we adaptively adjust the number of iterations by evaluating the average precision (AP) on the validation set. The period of model evaluation is set to 1 epoch for pre-training and 0.1 epoch for fine-tuning. After the i-th model evaluation, we obtain Model_i and AP_i (i = 1, 2, ⋯, 50). If AP_i < min_j(AP_{i−j}), training ends and we take the optimal model Model_j with the maximum AP_j, where j = 1, 2, 3 in pre-training and j = 1, 2, ⋯, 10 in fine-tuning. The maximum number of iterations is 50 epochs for both pre-training and fine-tuning. Besides, we also dynamically adjust the learning rate to improve performance. The learning rate l decays from 10⁻⁴ to 10⁻⁵: if AP_i < AP_{i−1}, then l_i = l_{i−1}/2.
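The adaptive schedule can be sketched as follows; `evaluate` is a hypothetical callback standing in for one evaluation period of training plus validation, and `patience` corresponds to the j = 1, 2, 3 window described above.

```python
def adaptive_train(evaluate, max_evals=50, patience=3, lr0=1e-4, lr_min=1e-5):
    """Sketch of adaptive iteration + adaptive learning rate: evaluate AP
    periodically, halve the learning rate when AP drops, stop once AP falls
    below the minimum of the last `patience` evaluations, keep the best model.
    `evaluate(i, lr)` trains one period and returns the validation AP."""
    history, lr = [], lr0
    best_i, best_ap = -1, -1.0
    for i in range(max_evals):
        ap = evaluate(i, lr)
        if history and ap < history[-1]:
            lr = max(lr / 2.0, lr_min)                    # adaptive learning rate
        if len(history) >= patience and ap < min(history[-patience:]):
            break                                         # adaptive stopping
        history.append(ap)
        if ap > best_ap:
            best_i, best_ap = i, ap                       # track optimal model
    return best_i, best_ap
```

For a validation-AP sequence that rises, plateaus, then collapses, the loop stops at the collapse and returns the peak evaluation.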

4.3.2. Implementation Details

We use the Adam optimizer (Kingma and Ba, 2015) to optimize all baseline networks on PyTorch (β1 = 0, β2 = 0.9). We use random normal initialization to initialize these networks. To be able to train various complex neural networks on a single GPU, we resize the images to 256×192. The batch size is set to 4. We perform color adjustment, affine transformation, and elastic deformation for data augmentation (Bloice et al., 2019). All experiments are implemented on a workstation equipped with an Intel Xeon Silver 4110 CPU and an NVIDIA RTX 3090 GPU.

5. Experimental Results and Discussion

5.1. Edge Detection of Overlapping Cervical Cells

We show the visual comparison results on our CCEDD in Figure 6. The quantitative comparison is shown in Table 4 and Figure 1B. These results have important guiding implications for accurate edge detection of overlapping cervical cells. We analyze several factors affecting the performance of overlapping edge detection.

5.2. Effectiveness of Label Correction

In our LLPC, the positions of label points are locally corrected to the pixel gradient peaks. As shown in Figures 1A, 8B, our LLPC can generate more accurate edge labels. Besides, we can easily generate corrected masks from corrected points in the labelme software (Wada, 2016). Compared with the original mask in Figure 8C, our corrected mask has higher edge localization accuracy and smoother edges, as shown in Figure 8D.

We train multiple networks using the original labels and the corrected labels. The quantitative comparison results are shown in Table 3 and Figure 1B. Compared with the original labels, using the corrected labels to train multiple networks significantly improves AP (by 30–40%), which verifies the effectiveness of our label correction method. Table 4 shows that the performance improvement comes from two aspects. First, our corrected labels improve the evaluation accuracy in testing (0.541 → 0.588). Second, using our corrected labels to train a network improves the accuracy of overlapping edge detection (0.588 → 0.755), as shown in Figure 9.

Year/Model/Loss ΔAP(%) Label correction (AP / ODS / OIS) No label correction (AP / ODS / OIS) Params (M) MACs (G)
2019/RCF/RCFLoss 41.0 0.612 0.599 0.594 0.434 0.485 0.485 14.81 19.56
2019/RCF/BCELoss 41.9 0.667 0.638 0.645 0.470 0.507 0.512
2019/ENDE/BCELoss 37.0 0.733 0.682 0.691 0.535 0.548 0.555 6.06 32.51
2020/DexiNed/RCFLoss 30.3 0.649 0.633 0.635 0.498 0.528 0.533 35.08 27.72
2020/DexiNed/BCELoss 38.5 0.723 0.671 0.680 0.522 0.541 0.549
2020/FINED/RCFLoss 28.4 0.602 0.604 0.450 0.469 0.510 0.402 1.43 14.38
2020/FINED/BCELoss 41.4 0.703 0.660 0.621 0.497 0.528 0.530
2021/PiDiNet/RCFLoss 37.2 0.590 0.581 0.574 0.430 0.481 0.479 0.69 3.74
2021/PiDiNet/BCELoss 42.7 0.648 0.624 0.628 0.454 0.496 0.501
2021/STDC1/BCELoss 12.9 0.394 0.466 0.472 0.349 0.438 0.443 14.26 4.48
2021/STDC1(pretrain)/BCELoss 13.1 0.407 0.478 0.483 0.360 0.451 0.454
2021/STDC2/BCELoss 16.1 0.403 0.473 0.478 0.347 0.435 0.442 22.30 7.01
2021/STDC2(pretrain)/BCELoss 15.0 0.413 0.484 0.488 0.359 0.449 0.454
2021/STDC1+/BCELoss 41.3 0.701 0.652 0.659 0.496 0.518 0.524 13.76 39.28
2021/STDC2+/BCELoss 38.2 0.694 0.648 0.656 0.502 0.525 0.532 21.83 41.81
2015/UNet/BCELoss 38.9 0.729 0.679 0.689 0.525 0.539 0.546 31.03 41.96
2019/CE-Net(pretrain)/BCELoss 37.5 0.696 0.653 0.658 0.506 0.530 0.535 60.24 17.36
2019/CE-Net/BCELoss 36.4 0.712 0.668 0.675 0.522 0.540 0.547
2019/UNet++(DS)/BCELoss 37.6 0.739 0.687 0.696 0.537 0.548 0.555 9.16 26.76
2019/UNet++/BCELoss 39.6 0.755 0.691 0.701 0.541 0.550 0.557 26.75
2021/MSU-Net/BCELoss 39.7 0.749 0.689 0.699 0.536 0.550 0.556 47.09 59.93

Edge detection results on our CCEDD dataset.

Our baseline models comprise RCF (Liu et al., 2019), ENDE (Nazeri et al., 2019), DexiNed (Poma et al., 2020), FINED (Wibisono and Hang, 2020), PiDiNet (Su et al., 2021b), STDC (Fan et al., 2021b), UNet (Ronneberger et al., 2015), CE-Net (Gu et al., 2019), UNet++ (Zhou et al., 2019), and MSU-Net (Su et al., 2021a). “BCELoss” is the binary cross entropy loss function. “RCFLoss” is an annotator-robust loss function for edge detection (Liu et al., 2019). STDC2 (Fan et al., 2021b) has more parameters than STDC1 (Fan et al., 2021b). “UNet++(DS)” is UNet++ (Zhou et al., 2019) with deep supervision. “MACs” is multiply-accumulate operations. “Params” and “MACs” are calculated by THOP. Best and second best results are highlighted and underlined.

Training/Evaluation AP ODS OIS
Original label/Original label 0.541 0.550 0.557
Original label/Corrected label 0.588 0.592 0.598
Corrected label/Corrected label 0.755 0.691 0.701

Performance improvement analysis of label correction.

Use UNet++ (Zhou et al., 2019) for evaluation. Best results are highlighted.

5.3. Comparison With Other Label Correction Methods

In Figures 10, 11, we compare our LLPC with active contours (Chan and Vese, 2001) and dense CRF (Krähenbühl and Koltun, 2011). We observe that active contours fail to refine the nucleus contours in Figure 10F, and dense CRF fails on complex overlapping cell contours in Figure 11C. Active contours and dense CRF are global iterative optimization methods based on segmented images; they are uncontrollable for label correction of object contours, which ultimately leads to these failures. Our LLPC performs local label point correction without iterative optimization. Therefore, the correction error of our LLPC is controllable, and an error in one place does not spread to other places, which is crucial for robust label correction. Besides, dense CRF cannot refine overlapping instance segmentations, while our LLPC corrects labels based on annotation points and can handle overlapping label correction, as shown in Figure 11E.

5.4. Ablation Experiment

5.4.1. Ablation of Label Correction Method

Our LLPC contains three steps: gradient-guided point correction (GPC), point interpolation (PI), and local point smoothing (LPS). Although our GPC can correct label points to pixel gradient peaks, some error remains in the correction process. LPS can smooth the edges corrected by GPC, as shown in Figure 12A. Table 5 shows that GPC is the most important part of our LLPC (0.541 → 0.731), while PI and LPS further improve annotation quality by smoothing edges (0.731 → 0.755). Only smoothing the original labels (“w/o GPC”) is ineffective (0.541 → 0.533), because it may introduce larger annotation errors. Compared with the piecewise curve fitting in Section 3.2, LPS generates smoother edges, as shown in Figure 12B. These qualitative and quantitative results verify that all three components of our LLPC are essential.

Correction method AP ODS OIS
Original label 0.541 0.550 0.557
GPC (w/o PI, w/o LPS) 0.731 0.682 0.692
Our LLPC 0.755 0.691 0.701
w/o GPC 0.533 0.545 0.552
w/o PI 0.663 0.619 0.625
w/o LPS 0.742 0.689 0.699

Ablation of our LLPC. “GPC” is gradient-guided point correction.

w/o LPS” is using piecewise curve fitting instead of local point smoothing. Use UNet++ (Zhou et al., 2019) for evaluation. “PI” is point interpolation. Best results are highlighted.

5.4.2. Selection of Hyper-Parameters

To set the optimal parameters, we conduct parameter ablation experiments in Table 6. gap controls the point density in PI; for local curve fitting, gap = 1 is optimal. Therefore, for an unknown dataset, our LLPC only needs to set three parameters, i.e., r, λ_t, and n_g. A qualitative comparison of different settings of these parameters is shown in Figure 13. r controls the maximum error correction range in human annotations. If r is too small, large label errors cannot be corrected; if r is too large, the point correction error increases. r limits the correction range in space, while λ_t thresholds the variation of gradient values during the correction process. If λ_t is large, label points are corrected only when the gradient value changes sharply in the search direction. n_g controls the scale of the local smoothing. For our CCEDD, r = 15, λ_t = 4, and n_g = 14.

r λ_t_ gap n g AP ODS OIS
7 4 1 14 0.691 0.645 0.653
11 4 1 14 0.732 0.681 0.691
19 4 1 14 0.746 0.691 0.701
23 4 1 14 0.734 0.683 0.692
15 1 1 14 0.750 0.689 0.700
15 2 1 14 0.751 0.690 0.700
15 3 1 14 0.745 0.691 0.700
15 5 1 14 0.750 0.689 0.699
15 10 1 14 0.729 0.679 0.688
15 15 1 14 0.708 0.658 0.664
15 4 1.5 14 0.749 0.688 0.698
15 4 2 14 0.742 0.689 0.699
15 4 1 10 0.750 0.689 0.699
15 4 1 12 0.729 0.687 0.697
15 4 1 16 0.752 0.690 0.700
15 4 1 18 0.750 0.687 0.698
15 4 1 14 0.755 0.691 0.701

Parameters ablation of our label correction method.

Use UNet++ (Zhou et al., 2019) for evaluation. For our CCEDD, we set r = 15, λ_t = 4, and n_g = 14. Best results are highlighted.

5.4.3. Ablation of Training Strategy

Our training strategy eliminates the influence of the different training and test set distributions caused by data augmentation, improving AP by 3.6% in Table 7. To fairly evaluate multiple networks with different structures and parameters, we employ adaptive iteration and learning rate adjustment to avoid overfitting and underfitting. Table 8 and Figure 14A verify the effectiveness of our adaptive training strategy.

Training methods AP ODS OIS
w/o augmentation, w/o fine-tuning 0.729 0.672 0.683
w/ augmentation, w/o fine-tuning 0.732 0.674 0.682
w/ augmentation, w/ fine-tuning 0.755 0.691 0.701

Ablation of two-stage training strategy.

We perform data augmentation in pre-training and no augmentation during fine-tuning. Use UNet++ (Zhou et al., 2019) for evaluation. Best results are highlighted.

Training methods AP ODS OIS epoch
w/o AIT, w/o ALR 0.683 0.639 0.642 50
w/o AIT, w/o ALR 0.449 0.653 0.657 70
w/o AIT, w/o ALR 0.308 0.647 0.653 100
w/ AIT, w/o ALR 0.747 0.684 0.693 13
w/ AIT, w/ ALR 0.750 0.693 0.700 21

Ablation of adaptive training strategy.

We evaluate UNet++ (Zhou et al., 2019) on the validation set. “AIT” is adaptive iteration training. “ALR” is adaptive learning rate. Best results are highlighted.

5.5. Computational Complexity

5.5.1. Label Correction

Our LLPC takes 270 s to generate 100 corrected edge images with a size of 2,048×1,536 pixels on CPU. Because our label correction algorithm is offline and does not affect the inference time of a neural network, we have not further optimized it. If the algorithm runs on GPU, the speed can be further improved, which can save more time for label correction of large-scale datasets.

5.5.2. Model Evaluation

We rewrite the evaluation code (Arbelaez et al., 2010) on GPU for fast evaluation. The average FPS using UNet++ (Zhou et al., 2019) is 173 for 10,143 test images with a size of 256×192 pixels. In training, we need to calculate the AP on the validation set to adaptively control the learning rate and the number of iterations (see Section 4.3). Fast evaluation greatly accelerates our training process.

5.5.3. Neural Network Inference

We test the inference speed of UNet++ (Zhou et al., 2019). For 207 images with a resolution of 1,024×768, the average FPS is 9. For 207 images with a resolution of 512×512, the average FPS is 26. For 10,143 images with a resolution of 256×192, the average FPS is 295. Figure 14B shows a running efficiency comparison of multiple benchmark models. According to the report of Wan et al. (2019), the methods of Wan et al. (2019), Lu et al. (2015), and Lu et al. (2016) took 17.67, 35.69, and 213.62 s, respectively, for an image with a resolution of 512×512. Compared with these methods, UNet++ (Zhou et al., 2019) is significantly faster. Many cervical cell segmentation approaches (Phoulady et al., 2017; Tareef et al., 2017, 2018; Wan et al., 2019; Zhang et al., 2020) consist of three stages: nucleus candidate detection, cell localization, and cytoplasm segmentation. Fast edge detection of overlapping cervical cells means that the detected edges can be used as a prior input to these segmentation networks to improve performance at a small cost.

6. Discussion

6.1. Label Correction for Natural Images

Our label correction method can correct a closed contour by correcting the positions of its label points, which requires no additional prior assumptions (e.g., contour shape, object size). We annotated several images in the PASCAL VOC dataset (Everingham et al., 2010) with labelme (Wada, 2016) and corrected the labels (r = 7, λ_t = 4, and n_g = 9). As shown in Figure 15, our label correction method generates more accurate object contours, which demonstrates the feasibility of our label correction method for natural images.

6.2. Overlapping Edge Detection

Overlapping edge detection of cervical cells is a challenging task due to the presence of both strong- and weak-gradient edges. Detecting edges with strong gradients requires only low-level detail features. Detecting edges with weak gradients in overlapping regions may require high-level semantics to reason about contours and connect edges based on the context of strong-gradient regions. Although UNet++ (Zhou et al., 2019) achieves the best results on our CCEDD, it does not treat these two types of edges differently. Designing new network structures and loss functions for overlapping edge detection may be a way to further address this challenge.

7. Conclusions

We propose a local label point correction method for edge detection and image segmentation, which is the first benchmark for label correction based on annotation points. Our LLPC improves edge localization accuracy and mitigates labeling errors between different annotators in manual annotation. Only three parameters need to be set in our LLPC, yet training multiple networks with labels corrected by our LLPC yields a 30–40% AP improvement. In addition, we construct the largest overlapping cervical cell edge detection dataset based on our LLPC, which will greatly facilitate the development of overlapping cell edge detection. In future work, we plan to develop a label point correction method with locally adaptive parameter adjustment.

Funding

This work was supported by the National Natural Science Foundation of China (61873259, 62073205, and 61821005), the Key Research and Development Program of Liaoning (2018225037), and the Youth Innovation Promotion Association of Chinese Academy of Sciences (2019203).

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: https://github.com/nachifur/LLPC.

Author contributions

JL: conceptualization, methodology, software, validation, writing—original draft, and visualization. HF: investigation, resources, writing—review and editing, supervision, project administration, and funding acquisition. QW: writing—review and editing. WL: investigation. YT: writing—review and editing and supervision. DW: investigation, resources, and data curation. MZ and LC: investigation and resources. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Keywords

label correction, point correction, edge detection, segmentation, local point smoothing, cervical cell dataset

Citation

Liu J, Fan H, Wang Q, Li W, Tang Y, Wang D, Zhou M and Chen L (2022) Local Label Point Correction for Edge Detection of Overlapping Cervical Cells. Front. Neuroinform. 16:895290. doi: 10.3389/fninf.2022.895290

Received

13 March 2022

Accepted

20 April 2022

Published

12 May 2022

Volume

16 - 2022

Edited by

Zhenyu Tang, Beihang University, China

Reviewed by

Yunzhi Huang, Nanjing University of Information Science and Technology, China; Xuanang Xu, Rensselaer Polytechnic Institute, United States

Copyright

© 2022 Liu, Fan, Wang, Li, Tang, Wang, Zhou and Chen.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Huijie Fan, fanhuijie@sia.cn; Danbo Wang, wangdanbo@cancerhosp-ln-cmu.com
