Daniel Pizarro | Universidad de Alcala (original) (raw)
Papers by Daniel Pizarro
IEEE Transactions on Signal Processing, 2016
Fertility and sterility, Jan 12, 2017
To report the use of augmented reality (AR) in gynecology. AR is a surgical guidance technology t... more To report the use of augmented reality (AR) in gynecology. AR is a surgical guidance technology that enables important hidden surface structures to be visualized in endoscopic images. AR has been used for other organs, but never in gynecology and never with a very mobile organ like the uterus. We have developed a new AR approach specifically for uterine surgery and demonstrated its use for myomectomy. Tertiary university hospital. Three patients with one, two, and multiple myomas, respectively. AR was used during laparoscopy to localize the myomas. Three-dimensional (3D) models of the patient's uterus and myomas were constructed before surgery from T2-weighted magnetic resonance imaging. The intraoperative 3D shape of the uterus was determined. These models were automatically aligned and "fused" with the laparoscopic video in real time. The live fused video made the uterus appear semitransparent, and the surgeon can see the location of the myoma in real time while movi...
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
IEEE transactions on pattern analysis and machine intelligence, Jan 4, 2016
Shape-from-Template (SfT) reconstructs the shape of a deforming surface from a single image, a 3D... more Shape-from-Template (SfT) reconstructs the shape of a deforming surface from a single image, a 3D template and a deformation prior. For isometric deformations, this is a well-posed problem. However, previous methods which require no initialization break down when the perspective effects are small, which happens when the object is small or viewed from larger distances. That is, they do not handle all projection geometries. We propose stable SfT methods that accurately reconstruct the 3D shape for all projection geometries. We follow the existing approach of using first-order differential constraints and obtain local analytical solutions for depth and the first-order quantities: the depth-gradient or the surface normal. Previous methods use the depth solution directly to obtain the 3D shape. We prove that the depth solution is unstable when the projection geometry tends to affine, while the solution for the first-order quantities remain stable for all projection geometries. We therefo...
Terceras Jornadas De Jovenes Investigadores De La Universidad De Alcala 2011 Isbn 978 84 8138 925 8 Pags 151 162, 2011
Surgical Endoscopy, 2016
Augmented Reality (AR) is a technology that can allow a surgeon to see subsurface structures. Thi... more Augmented Reality (AR) is a technology that can allow a surgeon to see subsurface structures. This works by overlaying information from another modality, such as MRI and fusing it in real time with the endoscopic images. AR has never been developed for a very mobile organ like the uterus and has never been performed for gynecology. Myomas are not always easy to localize in laparoscopic surgery when they do not significantly change the surface of the uterus, or are at multiple locations. To study the accuracy of myoma localization using a new AR system compared to MRI-only localization. Ten residents were asked to localize six myomas (on a uterine model into a laparoscopic box) when either using AR or in conditions that simulate a standard method (only the MRI was available). Myomas were randomly divided in two groups: the control group (MRI only, AR not activated) and the AR group (AR activated). Software was used to automatically measure the distance between the point of contact on the uterine surface and the myoma. We compared these distances to the true shortest distance to obtain accuracy measures. The time taken to perform the task was measured, and an assessment of the complexity was performed. The mean accuracy in the control group was 16.80 mm [0.1-52.2] versus 0.64 mm [0.01-4.71] with AR. In the control group, the mean time to perform the task was 18.68 [6.4-47.1] s compared to 19.6 [3.9-77.5] s with AR. The mean score of difficulty (evaluated for each myoma) was 2.36 [1-4] versus 0.87 [0-4], respectively, for the control and the AR group. We developed an AR system for a very mobile organ. This is the first user study to quantitatively evaluate an AR system for improving a surgical task. In our model, AR improves localization accuracy.
We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHA... more We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHAT) for a pair of microphones in diffuse sound field. This formulation elucidates the links between the microphone distances and the GCC-PHAT output. Hence, it leads to a new model that enables estimation of the pairwise distances by optimizing over the distances best matching the GCC-PHAT observations. Furthermore, the relation of this model to the coherence function is elaborated along with the dependency on the signal bandwidth. The experiments conducted on real data recordings demonstrate the theories and support the effectiveness of the proposed method.
Terceras Jornadas De Jovenes Investigadores De La Universidad De Alcala 2011 Isbn 978 84 8138 925 8 Pags 163 174, 2011
2015 IEEE International Conference on Computer Vision (ICCV), 2015
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015
We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHA... more We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHAT) for a pair of microphones in diffuse sound field. This formulation elucidates the links between the microphone distances and the GCC-PHAT output. Hence, it leads to a new model that enables estimation of the pairwise distances by optimizing over the distances best matching the GCC-PHAT observations. Furthermore, the relation of this model to the coherence function is elaborated along with the dependency on the signal bandwidth. The experiments conducted on real data recordings demonstrate the theories and support the effectiveness of the proposed method.
Proceedings of the British Machine Vision Conference 2014, 2014
2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF)... more ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF) depth estimation. This paper proposes an algorithm that removes multipath distortion from a single depth map obtained by a ToF camera. Our approach does not require information about the scene, apart from ToF measurements. The method is based on fitting ToF measurements with a radiometric model. Model inputs are depth values free from multipath interference whereas model outputs consist of synthesized ToF measurements. We propose an iterative optimization algorithm that obtains model parameters that best reproduce ToF measurements, recovering the depth of the scene without distortion. We show results with both synthetic and real scenes captured by commercial ToF sensors. In all cases, our algorithm accurately corrects the multipath distortion, obtaining depth maps that are very close to ground truth data.
2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014
It has been recently shown that reconstructing an isometric surface from a single 2D input image ... more It has been recently shown that reconstructing an isometric surface from a single 2D input image matched to a 3D template was a well-posed problem. This however does not tell us how reconstruction algorithms will behave in practical conditions, where the amount of perspective is generally small and the projection thus behaves like weak-perspective or orthography. We here bring answers to what is theoretically recoverable in such imaging conditions, and explain why existing convex numerical solutions and analytical solutions to 3D reconstruction may be unstable. We then propose a new algorithm which works under all imaging conditions, from strong to loose perspective. We empirically show that the gain in stability is tremendous, bringing our results close to the iterative minimization of a statisticallyoptimal cost. Our algorithm has a low complexity, is simple and uses only one round of linear least-squares.
Image and Vision Computing, 2014
ABSTRACT One of the leading time of flight imaging technologies for depth sensing is based on Pho... more ABSTRACT One of the leading time of flight imaging technologies for depth sensing is based on Photonic Mixer Devices (PMD). In PMD sensors each pixel samples the correlation between emitted and received light signals. Current PMD cameras compute eight correlation samples per pixel in four sequential stages to obtain depth with invariance to signal amplitude and offset variations. With motion, PMD pixels capture different depths at each stage. As a result, correlation samples are not coherent with a single depth, producing artifacts. We propose to detect and remove motion artifacts from a single frame taken by a PMD camera. The algorithm we propose is very fast, simple and can be easily included in camera hardware. We recover depth of each pixel by exploiting consistency of the correlation samples and local neighbors of the pixel. In addition, our method obtains the motion flow of occluding contours in the image from a single frame. The system has been validated in real scenes using a commercial low-cost PMD camera and high speed dynamics. In all cases our method produces accurate results and it highly reduces motion artifacts.
2006 IEEE International Conference on Mechatronics, 2006
... Daniel Pizarro Departamento de Electr6nica, Escuela Politecnica ... [4] Gamo, J; Horche, P. R... more ... Daniel Pizarro Departamento de Electr6nica, Escuela Politecnica ... [4] Gamo, J; Horche, P. R.; Merchan, M.; Rodriguez, M. y Rosales ... [5] J. Gamo, 0. Demuynck, 0. Esteban, JL Lazaro, A. Cubillo, "Calibration of incoherent optical-fiber-bundle for image transmission purpose", Proc. ...
This paper describes a relative localization system used to achieve the navigation of a convoy of... more This paper describes a relative localization system used to achieve the navigation of a convoy of robotic units in indoor environments. This positioning system is carried out fusing two sensorial sources: (a) an odometric system and (b) a laser scanner together with artificial landmarks located on top of the units. The laser source allows one to compensate the cumulative error inherent to dead-reckoning; whereas the odometry source provides less pose uncertainty in short trajectories. A discrete Extended Kalman Filter, customized for this application, is used in order to accomplish this aim under real time constraints. Different experimental results with a convoy of Pioneer P3-DX units tracking non-linear trajectories are shown. The paper shows that a simple setup based on low cost laser range systems and robot built-in odometry sensors is able to give a high degree of robustness and accuracy to the relative localization problem of convoy units for indoor applications.
Image and Vision Computing, 2014
ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF)... more ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF) depth estimation. This paper proposes an algorithm that removes multipath distortion from a single depth map obtained by a ToF camera. Our approach does not require information about the scene, apart from ToF measurements. The method is based on fitting ToF measurements with a radiometric model. Model inputs are depth values free from multipath interference whereas model outputs consist of synthesized ToF measurements. We propose an iterative optimization algorithm that obtains model parameters that best reproduce ToF measurements, recovering the depth of the scene without distortion. We show results with both synthetic and real scenes captured by commercial ToF sensors. In all cases, our algorithm accurately corrects the multipath distortion, obtaining depth maps that are very close to ground truth data.
IEEE Transactions on Signal Processing, 2016
Fertility and sterility, Jan 12, 2017
To report the use of augmented reality (AR) in gynecology. AR is a surgical guidance technology t... more To report the use of augmented reality (AR) in gynecology. AR is a surgical guidance technology that enables important hidden surface structures to be visualized in endoscopic images. AR has been used for other organs, but never in gynecology and never with a very mobile organ like the uterus. We have developed a new AR approach specifically for uterine surgery and demonstrated its use for myomectomy. Tertiary university hospital. Three patients with one, two, and multiple myomas, respectively. AR was used during laparoscopy to localize the myomas. Three-dimensional (3D) models of the patient's uterus and myomas were constructed before surgery from T2-weighted magnetic resonance imaging. The intraoperative 3D shape of the uterus was determined. These models were automatically aligned and "fused" with the laparoscopic video in real time. The live fused video made the uterus appear semitransparent, and the surgeon can see the location of the myoma in real time while movi...
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
IEEE transactions on pattern analysis and machine intelligence, Jan 4, 2016
Shape-from-Template (SfT) reconstructs the shape of a deforming surface from a single image, a 3D... more Shape-from-Template (SfT) reconstructs the shape of a deforming surface from a single image, a 3D template and a deformation prior. For isometric deformations, this is a well-posed problem. However, previous methods which require no initialization break down when the perspective effects are small, which happens when the object is small or viewed from larger distances. That is, they do not handle all projection geometries. We propose stable SfT methods that accurately reconstruct the 3D shape for all projection geometries. We follow the existing approach of using first-order differential constraints and obtain local analytical solutions for depth and the first-order quantities: the depth-gradient or the surface normal. Previous methods use the depth solution directly to obtain the 3D shape. We prove that the depth solution is unstable when the projection geometry tends to affine, while the solution for the first-order quantities remain stable for all projection geometries. We therefo...
Terceras Jornadas De Jovenes Investigadores De La Universidad De Alcala 2011 Isbn 978 84 8138 925 8 Pags 151 162, 2011
Surgical Endoscopy, 2016
Augmented Reality (AR) is a technology that can allow a surgeon to see subsurface structures. Thi... more Augmented Reality (AR) is a technology that can allow a surgeon to see subsurface structures. This works by overlaying information from another modality, such as MRI and fusing it in real time with the endoscopic images. AR has never been developed for a very mobile organ like the uterus and has never been performed for gynecology. Myomas are not always easy to localize in laparoscopic surgery when they do not significantly change the surface of the uterus, or are at multiple locations. To study the accuracy of myoma localization using a new AR system compared to MRI-only localization. Ten residents were asked to localize six myomas (on a uterine model into a laparoscopic box) when either using AR or in conditions that simulate a standard method (only the MRI was available). Myomas were randomly divided in two groups: the control group (MRI only, AR not activated) and the AR group (AR activated). Software was used to automatically measure the distance between the point of contact on the uterine surface and the myoma. We compared these distances to the true shortest distance to obtain accuracy measures. The time taken to perform the task was measured, and an assessment of the complexity was performed. The mean accuracy in the control group was 16.80 mm [0.1-52.2] versus 0.64 mm [0.01-4.71] with AR. In the control group, the mean time to perform the task was 18.68 [6.4-47.1] s compared to 19.6 [3.9-77.5] s with AR. The mean score of difficulty (evaluated for each myoma) was 2.36 [1-4] versus 0.87 [0-4], respectively, for the control and the AR group. We developed an AR system for a very mobile organ. This is the first user study to quantitatively evaluate an AR system for improving a surgical task. In our model, AR improves localization accuracy.
We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHA... more We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHAT) for a pair of microphones in diffuse sound field. This formulation elucidates the links between the microphone distances and the GCC-PHAT output. Hence, it leads to a new model that enables estimation of the pairwise distances by optimizing over the distances best matching the GCC-PHAT observations. Furthermore, the relation of this model to the coherence function is elaborated along with the dependency on the signal bandwidth. The experiments conducted on real data recordings demonstrate the theories and support the effectiveness of the proposed method.
Terceras Jornadas De Jovenes Investigadores De La Universidad De Alcala 2011 Isbn 978 84 8138 925 8 Pags 163 174, 2011
2015 IEEE International Conference on Computer Vision (ICCV), 2015
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015
We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHA... more We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHAT) for a pair of microphones in diffuse sound field. This formulation elucidates the links between the microphone distances and the GCC-PHAT output. Hence, it leads to a new model that enables estimation of the pairwise distances by optimizing over the distances best matching the GCC-PHAT observations. Furthermore, the relation of this model to the coherence function is elaborated along with the dependency on the signal bandwidth. The experiments conducted on real data recordings demonstrate the theories and support the effectiveness of the proposed method.
Proceedings of the British Machine Vision Conference 2014, 2014
2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF)... more ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF) depth estimation. This paper proposes an algorithm that removes multipath distortion from a single depth map obtained by a ToF camera. Our approach does not require information about the scene, apart from ToF measurements. The method is based on fitting ToF measurements with a radiometric model. Model inputs are depth values free from multipath interference whereas model outputs consist of synthesized ToF measurements. We propose an iterative optimization algorithm that obtains model parameters that best reproduce ToF measurements, recovering the depth of the scene without distortion. We show results with both synthetic and real scenes captured by commercial ToF sensors. In all cases, our algorithm accurately corrects the multipath distortion, obtaining depth maps that are very close to ground truth data.
2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014
It has been recently shown that reconstructing an isometric surface from a single 2D input image ... more It has been recently shown that reconstructing an isometric surface from a single 2D input image matched to a 3D template was a well-posed problem. This however does not tell us how reconstruction algorithms will behave in practical conditions, where the amount of perspective is generally small and the projection thus behaves like weak-perspective or orthography. We here bring answers to what is theoretically recoverable in such imaging conditions, and explain why existing convex numerical solutions and analytical solutions to 3D reconstruction may be unstable. We then propose a new algorithm which works under all imaging conditions, from strong to loose perspective. We empirically show that the gain in stability is tremendous, bringing our results close to the iterative minimization of a statisticallyoptimal cost. Our algorithm has a low complexity, is simple and uses only one round of linear least-squares.
Image and Vision Computing, 2014
ABSTRACT One of the leading time of flight imaging technologies for depth sensing is based on Pho... more ABSTRACT One of the leading time of flight imaging technologies for depth sensing is based on Photonic Mixer Devices (PMD). In PMD sensors each pixel samples the correlation between emitted and received light signals. Current PMD cameras compute eight correlation samples per pixel in four sequential stages to obtain depth with invariance to signal amplitude and offset variations. With motion, PMD pixels capture different depths at each stage. As a result, correlation samples are not coherent with a single depth, producing artifacts. We propose to detect and remove motion artifacts from a single frame taken by a PMD camera. The algorithm we propose is very fast, simple and can be easily included in camera hardware. We recover depth of each pixel by exploiting consistency of the correlation samples and local neighbors of the pixel. In addition, our method obtains the motion flow of occluding contours in the image from a single frame. The system has been validated in real scenes using a commercial low-cost PMD camera and high speed dynamics. In all cases our method produces accurate results and it highly reduces motion artifacts.
2006 IEEE International Conference on Mechatronics, 2006
... Daniel Pizarro Departamento de Electr6nica, Escuela Politecnica ... [4] Gamo, J; Horche, P. R... more ... Daniel Pizarro Departamento de Electr6nica, Escuela Politecnica ... [4] Gamo, J; Horche, P. R.; Merchan, M.; Rodriguez, M. y Rosales ... [5] J. Gamo, 0. Demuynck, 0. Esteban, JL Lazaro, A. Cubillo, "Calibration of incoherent optical-fiber-bundle for image transmission purpose", Proc. ...
This paper describes a relative localization system used to achieve the navigation of a convoy of... more This paper describes a relative localization system used to achieve the navigation of a convoy of robotic units in indoor environments. This positioning system is carried out fusing two sensorial sources: (a) an odometric system and (b) a laser scanner together with artificial landmarks located on top of the units. The laser source allows one to compensate the cumulative error inherent to dead-reckoning; whereas the odometry source provides less pose uncertainty in short trajectories. A discrete Extended Kalman Filter, customized for this application, is used in order to accomplish this aim under real time constraints. Different experimental results with a convoy of Pioneer P3-DX units tracking non-linear trajectories are shown. The paper shows that a simple setup based on low cost laser range systems and robot built-in odometry sensors is able to give a high degree of robustness and accuracy to the relative localization problem of convoy units for indoor applications.
Image and Vision Computing, 2014
ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF)... more ABSTRACT Multipath interference of light is the cause of important errors in Time of Flight (ToF) depth estimation. This paper proposes an algorithm that removes multipath distortion from a single depth map obtained by a ToF camera. Our approach does not require information about the scene, apart from ToF measurements. The method is based on fitting ToF measurements with a radiometric model. Model inputs are depth values free from multipath interference whereas model outputs consist of synthesized ToF measurements. We propose an iterative optimization algorithm that obtains model parameters that best reproduce ToF measurements, recovering the depth of the scene without distortion. We show results with both synthetic and real scenes captured by commercial ToF sensors. In all cases, our algorithm accurately corrects the multipath distortion, obtaining depth maps that are very close to ground truth data.