Roberto Iacoviello - Academia.edu

Books by Roberto Iacoviello

Towards Pervasive and Trustworthy Artificial Intelligence: How standards can put a great technology at the service of humankind

Papers by Roberto Iacoviello

Semi-Automated Digital Human Production for Enhanced Media Broadcasting

A Learnable EVC Intra Predictor Using Masked Convolutions

Lecture Notes in Computer Science, Dec 31, 2022

AI-based End-to-End and AI-Enhanced Video Coding

A standards body for AI-based data coding: Moving Picture, Audio, and Data Coding by Artificial Intelligence (MPAI) is the Standards Developing Organisation whose mission is data coding by artificial intelligence. Founded in September 2020, it has developed and published 12 technical documents on audio enhancement, human-machine conversation, execution of AI applications, company performance prediction, neural network watermarking, and metaverse model. The talk on [MPAI](https://mpai.community) will briefly describe the mission, the organisation, and the scope of activities.

AI-Enhanced video coding: Video internet traffic has grown rapidly, leading to increased strain on infrastructure, slower speeds, and higher costs. Video compression can alleviate this problem; deep learning-based coding tools have shown great potential to reduce bitrate and improve quality. MPAI is working to standardize these tools. In particular, MPAI-EVC is working to improve an existing video codec by 25% in terms of e...

AI-Based Media Coding Standards

SMPTE Motion Imaging Journal

ARSSET: Augmented Reality Support on SET

Lecture Notes in Computer Science

Preparing a set for a television production is a complex task: several objects usually have to be placed manually in the environment, and the configuration may change many times before the final setup is found. This configuration phase can be expensive and time consuming when large and heavy objects have to be moved. To tackle this issue, virtual sets allow the director of production to create virtual scenes before placing real objects. This paper proposes an alternative approach based on augmented reality technologies: the objects of the scene are computer-generated assets that can be placed and manipulated in a real environment. Compared with virtual sets, the proposed solution allows the director to move in a real scene enriched by computer-generated objects to be placed in the environment. The user wears an AR headset and manipulates objects with a tablet. The proposed system was evaluated by a group of 9 testers, who had to create an augmented TV set. Subjective and objective parameters were used to assess the usability of the system.

A Proof of Concept Mixed Reality Application for Augmented City Tourism

Augmented Reality is gaining attention, primarily driven by the availability on the market of consumer devices such as Head Mounted Displays (HMDs). This paper focuses on a different flavour of Augmented Reality, called Mixed Reality, and describes the work carried out under the H2020 European project 5GCity. We built an application that runs on Microsoft HoloLens and is designed to provide enhanced information on a city scale. The application provides information about historical buildings, thus supporting cultural outdoor tourism. The user experience is enriched with content from the archives of the Italian public broadcaster RAI. A cloud application based on a visual search engine (conceived and designed to run on a 5G-ready infrastructure) receives the image flow captured by the user's HMD and identifies known objects. The user can freely look at the object for which augmented contents have to be displayed and interact with these contents through a set of pre-de...

A Mixed Reality application to support TV Studio Production

2.3 Augmented & Mixed Reality Application Fields
2.3.1 Medicine and surgery
2.3.2 Teaching
2.3.3 Cultural heritage
2.3.4 Tourism
2.3.5 Assembly and maintenance
2.3.6 Entertainment
2.3.7 Military applications
2.3.8 Architecture and Design
2.4 Previsualization techniques for TV and Cinema preproduction
2.4.1 Storyboarding
2.4.2 Animatics & CG Renderings
2.4.3 Real-time compositing

Design of a 3D Indoor Localization System Enabling Augmented Reality TV Applications

2021 30th Conference of Open Innovations Association FRUCT

This paper focuses on the design of a robust Real-Time Locating System (RTLS) based on Ultra-Wide Band (UWB) technology for Augmented Reality (AR) applications in TV studios that require artists and/or a presenter to be accurately localized. According to a UWB-based measurement campaign carried out in a TV studio environment, ranging measurements are heavily affected by human body interference. Indeed, many outliers are present because the UWB receiver may synchronize to reflected paths, which turn out to be much stronger than the direct one. As a consequence, range errors are very large. In this context, to improve the localization performance, we increased the redundancy of the RTLS by employing more than one tag to localize the artists on the TV scene. In particular, we applied the Extended Kalman Filter (EKF) algorithm to work with two and three tags. Moreover, an outlier detection and correction procedure has been defined and adopted for the ranging phase. The resulting localization performance, based on real range measurements, shows that the EKF with two tags outperforms the single-tag EKF by 83.5%.
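As a rough illustration of the ideas in the abstract above, here is a minimal sketch of an EKF measurement update driven by range readings, with a simple innovation gate that skips outliers. It is not the authors' implementation: the single-tag 2D state, the static motion model, the chi-square style gate, the anchor layout, and all names and parameter values (`ekf_range_update`, `locate`, `R`, `gate`, `q`, `p0`) are assumptions chosen for illustration. The abstract's outlier procedure also corrects readings, whereas this sketch only rejects them.

```python
import math


def ekf_range_update(x, P, anchor, z, R=0.01, gate=9.0):
    """One EKF update for a single range reading to a fixed anchor.

    x: [px, py] position estimate, P: 2x2 covariance (list of lists),
    anchor: (ax, ay), z: measured range. Returns (x, P, accepted).
    A reading whose normalised squared innovation exceeds `gate`
    is treated as an outlier and ignored (hypothetical rule).
    """
    dx, dy = x[0] - anchor[0], x[1] - anchor[1]
    pred = math.hypot(dx, dy)
    if pred == 0.0:
        return x, P, False  # Jacobian undefined on top of the anchor
    H = (dx / pred, dy / pred)  # Jacobian of the range w.r.t. position
    PHt = (P[0][0] * H[0] + P[0][1] * H[1],
           P[1][0] * H[0] + P[1][1] * H[1])
    S = H[0] * PHt[0] + H[1] * PHt[1] + R  # innovation variance (scalar)
    innov = z - pred
    if innov * innov / S > gate:
        return x, P, False  # outlier: keep the previous estimate
    K = (PHt[0] / S, PHt[1] / S)  # Kalman gain
    x_new = [x[0] + K[0] * innov, x[1] + K[1] * innov]
    P_new = [[P[i][j] - K[i] * PHt[j] for j in range(2)] for i in range(2)]
    return x_new, P_new, True


def locate(anchors, ranges, x0, passes=10, p0=10.0, q=0.01):
    """Repeatedly apply range updates for a static tag (assumed model)."""
    x = list(x0)
    P = [[p0, 0.0], [0.0, p0]]
    for _ in range(passes):
        # Small process noise keeps the filter responsive between passes.
        P[0][0] += q
        P[1][1] += q
        for anchor, z in zip(anchors, ranges):
            x, P, _ = ekf_range_update(x, P, anchor, z)
    return x, P


# Hypothetical layout: four anchors at the corners of a 5 m x 5 m area.
anchors = [(0.0, 0.0), (5.0, 0.0), (0.0, 5.0), (5.0, 5.0)]
truth = (2.0, 3.0)
ranges = [math.hypot(truth[0] - ax, truth[1] - ay) for ax, ay in anchors]
x, P = locate(anchors, ranges, x0=(1.0, 1.0))

# A reading inflated by 3 m (e.g. a reflected path) should be gated out.
x_after, P_after, accepted = ekf_range_update(x, P, anchors[0], ranges[0] + 3.0)
```

With noise-free ranges the estimate should settle close to the true position, and the inflated reading should leave it unchanged. Extending the state to several tags, as the paper does, would require a larger state vector and a constraint linking the tags, which this sketch does not attempt.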

Future video coding: new tools and algorithms

VLSI Architectures for Future Video Coding

In recent years, there has been a real revolution in the world of film and television, thanks to the advent of new digital formats that have involved the entire production chain of multimedia products. This revolution has impacted the multimedia industry, consumer electronics and communication networks, opening new opportunities for convergence. Video quality has grown dramatically, aiming to emulate the chromatic richness, dynamics and rendering of detail typical of human vision. This poses major challenges for bandwidth in transmission channels and for media storage, and makes new, better-performing video compression standards that leave quality unchanged necessary. Moreover, video content itself has changed significantly, both in the quality delivered to users and in the way they consume it. On the one hand, HD and beyond-HD resolutions have become increasingly popular; on the other hand, video-on-demand, mobile television services, and stereo and multiview capture and display are some examples of how video content is evolving. All these services demand efficient solutions to store huge amounts of data and to deliver the same video content at different resolutions. Although communication networks have also evolved to provide higher capacities, these new requirements mean the video signal must be compressed very efficiently in order to store it and stream it reliably.

HoloCities: A Shared Reality application for Collaborative Tourism

IOP Conference Series: Materials Science and Engineering

Communication is a key arena for shared-reality applications: our mixed reality application is developed on Microsoft HoloLens and has been designed to provide new, engaging ways to discover the city using augmented reality. Most augmented reality tourism applications isolate the user; this application has therefore been made multiplayer and collaborative to encourage shared experiences and socialization. Two scenarios are described in this project. In the first scenario, a real guide presents the peculiarities of the visited monument to a group whose members are each equipped with a Head Mounted Display. Each user shares the 3D models manipulated by the guide, who can be in the same room or can be a remote guide leveraging the low latency of the 5G network. Moreover, the guide can highlight and label objects to convey relevant information about them. In the second scenario there is no real guide, so the application automatically recognizes the framed monument thanks to a visual ...

Enhancing cultural tourism by a mixed reality application for outdoor navigation and information browsing using immersive devices

IOP Conference Series: Materials Science and Engineering

This paper introduces a mixed reality application that runs on Microsoft HoloLens and has been designed to provide information on a city scale. The application was developed to provide information about historical buildings, thus supporting cultural outdoor tourism. The huge amount of multimedia data stored in the archives of the Italian public broadcaster RAI is used to enrich the user experience. A remote image and video analysis application receives an image flow from the user and identifies known objects framed in the images. The user can select the object (monument, building or artwork) for which augmented contents (video, text, audio) have to be displayed, and can interact with these contents through a set of defined gestures. Moreover, if the object of interest is detected and tracked by the mixed reality application, 3D contents can also be overlaid and aligned with the real world.
