Othon Gonzalez - Academia.edu (original) (raw)
Papers by Othon Gonzalez
arXiv (Cornell University), Apr 12, 2022
Video captioning is the process of describing the content of a sequence of images capturing its s... more Video captioning is the process of describing the content of a sequence of images capturing its semantic relationships and meanings. Dealing with this task with a single image is arduous, not to mention how difficult it is for a video (or images sequence). The amount and relevance of the applications of video captioning are vast, mainly to deal with a significant amount of video recordings in video surveillance, or assisting people visually impaired, to mention a few. To analyze where the efforts of our community to solve the video captioning task are, as well as what route could be better to follow, this manuscript presents an extensive review of more than 105 papers for the period of 2016 to 2021. As a result, the most-used datasets and metrics are identified. Also, the main approaches used and the best ones. We compute a set of rankings based on several performance metrics to obtain, according to its performance, the best method with the best result on the video captioning task. Finally, some insights are concluded about which could be the next steps or opportunity areas to improve dealing with this complex task.
arXiv (Cornell University), Jul 4, 2022
Image Captioning is a current research task to describe the image content using the objects and t... more Image Captioning is a current research task to describe the image content using the objects and their relationships in the scene. To tackle this task, two important research areas converge, artificial vision, and natural language processing. In Image Captioning, as in any computational intelligence task, the performance metrics are crucial for knowing how well (or bad) a method performs. In recent years, it has been observed that classical metrics based on
Computer Vision and Image Understanding, Jun 1, 2023
Video captioning is the process of describing the content of a sequence of images capturing its s... more Video captioning is the process of describing the content of a sequence of images capturing its semantic relationships and meanings. Dealing with this task with a single image is arduous, not to mention how difficult it is for a video (or images sequence). The amount and relevance of the applications of video captioning are vast, mainly to deal with a significant amount of video recordings in video surveillance, or assisting people visually impaired, to mention a few. To analyze where the efforts of our community to solve the video captioning task are, as well as what route could be better to follow, this manuscript presents an extensive review of more than 105 papers for the period of 2016 to 2021. As a result, the most-used datasets and metrics are identified. Also, the main approaches used and the best ones. We compute a set of rankings based on several performance metrics to obtain, according to its performance, the best method with the best result on the video captioning task. Finally, some insights are concluded about which could be the next steps or opportunity areas to improve dealing with this complex task.
Thermographies are a source of abundant and rapid information, valuable in precision agriculture ... more Thermographies are a source of abundant and rapid information, valuable in precision agriculture tasks such as crop stress assessment, plant disease analysis, and soil moisture evaluation. Traditionally, practitioners obtain soil temperature directly from the ground or using satellites and other airborne methods, which are costly and have a low spatial and temporal resolution. In this paper, we introduce a method for short term tracking of thermal radiance inertia with the use of an unmanned aerial system (UAS). In our approach, we retro-project the spatial reconstruction obtained with structure from motion (SfM) to estimate the thermal radiation corresponding to three-dimensional structures. Then, we register the resulting orthomosaics using a pyramidal scheme. We use the first cloud of points as the fixed reference as new orthomosaics become available. Finally, we estimate the dynamics of the thermal radiation using the difference of the registered orthomosaic radiation intensity ...
IEEE Transactions on Instrumentation and Measurement, 2019
Infrared thermal cameras have applications in a range of fields including security, agriculture, ... more Infrared thermal cameras have applications in a range of fields including security, agriculture, inspection, and health. In this paper, we are interested in their sensing properties, where we model and estimate the camera parameters required to transform digital counts to temperature values. Specifically, we propose a method to characterize the spatial deviation for the irradiance observed at the image plane for an uncooled focal-plane array camera deprived of an internal measurement of temperature. In our approach, we establish the relationship between the radiance emitted by a temperature varying blackbody chamber and the uncooled camera digital output. The model approximates this relationship as a quadratic polynomial whose variations are themselves approximated with another polynomial expression. Our results suggest a significant improvement over the commonly used Plancklike expression.
Lecture Notes in Computer Science, 2019
Thermographies are a source of abundant and rapid information, valuable in precision agriculture ... more Thermographies are a source of abundant and rapid information, valuable in precision agriculture tasks such as crop stress assessment, plant disease analysis, and soil moisture evaluation. Traditionally, practitioners obtain soil temperature directly from the ground or using satellites and other airborne methods, which are costly and have a low spatial and temporal resolution. In this paper, we introduce a method for short term tracking of thermal radiance inertia with the use of an unmanned aerial system (UAS). In our approach, we retro-project the spatial reconstruction obtained with structure from motion (SfM) to estimate the thermal radiation corresponding to three-dimensional structures. Then, we register the resulting orthomosaics using a pyramidal scheme. We use the first cloud of points as the fixed reference as new orthomosaics become available. Finally, we estimate the dynamics of the thermal radiation using the difference of the registered orthomosaic radiation intensity measurements.
IEEE Transactions on Instrumentation and Measurement, 2019
Infrared thermal cameras have applications in a range of fields including security, agriculture, ... more Infrared thermal cameras have applications in a range of fields including security, agriculture, inspection, and health. In this paper, we are interested in their sensing properties, where we model and estimate the camera parameters required to transform digital counts to temperature values. Specifically, we propose a method to characterize the spatial deviation for the irradiance observed at the image plane for an uncooled focal-plane array camera deprived of an internal measurement of temperature. In our approach, we establish the relationship between the radiance emitted by a temperature varying blackbody chamber and the uncooled camera digital output. The model approximates this relationship as a quadratic polynomial whose variations are themselves approximated with another polynomial expression. Our results suggest a significant improvement over the commonly used Plancklike expression.
arXiv (Cornell University), Apr 12, 2022
Video captioning is the process of describing the content of a sequence of images capturing its s... more Video captioning is the process of describing the content of a sequence of images capturing its semantic relationships and meanings. Dealing with this task with a single image is arduous, not to mention how difficult it is for a video (or images sequence). The amount and relevance of the applications of video captioning are vast, mainly to deal with a significant amount of video recordings in video surveillance, or assisting people visually impaired, to mention a few. To analyze where the efforts of our community to solve the video captioning task are, as well as what route could be better to follow, this manuscript presents an extensive review of more than 105 papers for the period of 2016 to 2021. As a result, the most-used datasets and metrics are identified. Also, the main approaches used and the best ones. We compute a set of rankings based on several performance metrics to obtain, according to its performance, the best method with the best result on the video captioning task. Finally, some insights are concluded about which could be the next steps or opportunity areas to improve dealing with this complex task.
arXiv (Cornell University), Jul 4, 2022
Image Captioning is a current research task to describe the image content using the objects and t... more Image Captioning is a current research task to describe the image content using the objects and their relationships in the scene. To tackle this task, two important research areas converge, artificial vision, and natural language processing. In Image Captioning, as in any computational intelligence task, the performance metrics are crucial for knowing how well (or bad) a method performs. In recent years, it has been observed that classical metrics based on
Computer Vision and Image Understanding, Jun 1, 2023
Video captioning is the process of describing the content of a sequence of images capturing its s... more Video captioning is the process of describing the content of a sequence of images capturing its semantic relationships and meanings. Dealing with this task with a single image is arduous, not to mention how difficult it is for a video (or images sequence). The amount and relevance of the applications of video captioning are vast, mainly to deal with a significant amount of video recordings in video surveillance, or assisting people visually impaired, to mention a few. To analyze where the efforts of our community to solve the video captioning task are, as well as what route could be better to follow, this manuscript presents an extensive review of more than 105 papers for the period of 2016 to 2021. As a result, the most-used datasets and metrics are identified. Also, the main approaches used and the best ones. We compute a set of rankings based on several performance metrics to obtain, according to its performance, the best method with the best result on the video captioning task. Finally, some insights are concluded about which could be the next steps or opportunity areas to improve dealing with this complex task.
Thermographies are a source of abundant and rapid information, valuable in precision agriculture ... more Thermographies are a source of abundant and rapid information, valuable in precision agriculture tasks such as crop stress assessment, plant disease analysis, and soil moisture evaluation. Traditionally, practitioners obtain soil temperature directly from the ground or using satellites and other airborne methods, which are costly and have a low spatial and temporal resolution. In this paper, we introduce a method for short term tracking of thermal radiance inertia with the use of an unmanned aerial system (UAS). In our approach, we retro-project the spatial reconstruction obtained with structure from motion (SfM) to estimate the thermal radiation corresponding to three-dimensional structures. Then, we register the resulting orthomosaics using a pyramidal scheme. We use the first cloud of points as the fixed reference as new orthomosaics become available. Finally, we estimate the dynamics of the thermal radiation using the difference of the registered orthomosaic radiation intensity ...
IEEE Transactions on Instrumentation and Measurement, 2019
Infrared thermal cameras have applications in a range of fields including security, agriculture, ... more Infrared thermal cameras have applications in a range of fields including security, agriculture, inspection, and health. In this paper, we are interested in their sensing properties, where we model and estimate the camera parameters required to transform digital counts to temperature values. Specifically, we propose a method to characterize the spatial deviation for the irradiance observed at the image plane for an uncooled focal-plane array camera deprived of an internal measurement of temperature. In our approach, we establish the relationship between the radiance emitted by a temperature varying blackbody chamber and the uncooled camera digital output. The model approximates this relationship as a quadratic polynomial whose variations are themselves approximated with another polynomial expression. Our results suggest a significant improvement over the commonly used Plancklike expression.
Lecture Notes in Computer Science, 2019
Thermographies are a source of abundant and rapid information, valuable in precision agriculture ... more Thermographies are a source of abundant and rapid information, valuable in precision agriculture tasks such as crop stress assessment, plant disease analysis, and soil moisture evaluation. Traditionally, practitioners obtain soil temperature directly from the ground or using satellites and other airborne methods, which are costly and have a low spatial and temporal resolution. In this paper, we introduce a method for short term tracking of thermal radiance inertia with the use of an unmanned aerial system (UAS). In our approach, we retro-project the spatial reconstruction obtained with structure from motion (SfM) to estimate the thermal radiation corresponding to three-dimensional structures. Then, we register the resulting orthomosaics using a pyramidal scheme. We use the first cloud of points as the fixed reference as new orthomosaics become available. Finally, we estimate the dynamics of the thermal radiation using the difference of the registered orthomosaic radiation intensity measurements.
IEEE Transactions on Instrumentation and Measurement, 2019
Infrared thermal cameras have applications in a range of fields including security, agriculture, ... more Infrared thermal cameras have applications in a range of fields including security, agriculture, inspection, and health. In this paper, we are interested in their sensing properties, where we model and estimate the camera parameters required to transform digital counts to temperature values. Specifically, we propose a method to characterize the spatial deviation for the irradiance observed at the image plane for an uncooled focal-plane array camera deprived of an internal measurement of temperature. In our approach, we establish the relationship between the radiance emitted by a temperature varying blackbody chamber and the uncooled camera digital output. The model approximates this relationship as a quadratic polynomial whose variations are themselves approximated with another polynomial expression. Our results suggest a significant improvement over the commonly used Plancklike expression.