Conference on Multimedia Modeling 2020 (original) (raw)



default search action
- combined dblp search
- author search
- venue search
- publication search
Authors:
- no matches

Venues:
- no matches

Publications:
- no matches


26th MMM 2020: Daejeon, South Korea

jump to- Poster Session
- Special Session Papers // SS1: AI-Powered 3D Vision
- SS2: Multimedia Analytics: Perspectives, Tools and Applications
- SS3: Multimedia Datasets for Repeatable Experimentation (MDRE)
- SS4: MMAC: Multi-modal Affective Computing of Large-Scale Multimedia Data
- SS5: MULTIMED2020: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments
- SS6: Intelligent Multimedia Security
- DEMO Papers
- VBS Papers

- > Home > Conferences and Workshops > MMM
SPARQL queries 
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as

Yong Man Ro, Wen-Huang Cheng, Junmo Kim, Wei-Ta Chu, Peng Cui, Jung-Woo Choi, Min-Chun Hu, Wesley De Neve:
MultiMedia Modeling - 26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part II. Lecture Notes in Computer Science 11962, Springer 2020, ISBN 978-3-030-37733-5
Poster Session

Pengfei Chen, Minglei Yuan, Tong Lu:
Multi-scale Comparison Network for Few-Shot Learning. 3-13

Jiayu Song, Qinghua Xu
, Wei Liu, Yueran Zu, Mengdong Chen:
Semantic and Morphological Information Guided Chinese Text Classification. 14-26

Duc V. Nguyen
, Huyen T. T. Tran
, Truong Cong Thang
:
A Delay-Aware Adaptation Framework for Cloud Gaming Under the Computation Constraint of User Devices. 27-38

Dongbiao He, Jinlei Jiang, Cédric Westphal, Guangwen Yang:
Efficient Edge Caching for High-Quality 360-Degree Video Delivery. 39-51

Suping Zhou, Jia Jia, Long Zhang, Yanfeng Wang, Wei Chen, Fanbo Meng, Fei Yu, Jialie Shen:
Inferring Emphasis for Real Voice Data: An Attentive Multimodal Neural Network Approach. 52-62

Xi Yang, Yeo-Jin Kim, Michelle Taub, Roger Azevedo, Min Chi:
PRIME: Block-Wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems. 63-75

Yuwei Yang, Fanman Meng, Hongliang Li, Qingbo Wu, Xiaolong Xu, Shuai Chen:
A New Local Transformation Module for Few-Shot Segmentation. 76-87

Mingjie Wu, Yongfei Zhang
, Tianyu Zhang, Wenqi Zhang:
Background Segmentation for Vehicle Re-identification. 88-99

Joanna Hong, Hong Joo Lee, Yelin Kim, Yong Man Ro
:
Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units. 100-111

Yang Wang, Ye Qian, Jiahao Shi, Feng Su:
A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos. 112-124

Wei Hou, Dakui Wang, Xiaojun Chen:
Generate Images with Obfuscated Attributes for Private Image Classification. 125-135

Xiaozhong Ji, Yirui Wu, Tong Lu:
Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution. 136-147

Kai Huang, Jianjun Li, Shichao Cheng, Jie Yu, Wanyong Tian, Lulu Zhao, Junfeng Hu, Chin-Chen Chang:
An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network. 161-174

Yiming Li, Xiaoshan Yang, Changsheng Xu:
Structured Neural Motifs: Scene Graph Parsing via Enhanced Context. 175-188

Duanzheng Guan, Dengshi Li, Xuebei Cai, Xiaochen Wang, Ruimin Hu:
Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet. 189-200

Xiaoge Song, Yirui Wu, Wenhai Wang, Tong Lu:
TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation. 201-213

Hirotaka Kato, Takatsugu Hirayama, Ichiro Ide
, Keisuke Doman, Yasutomo Kawanishi
, Daisuke Deguchi
, Hiroshi Murase:
More-Natural Mimetic Words Generation for Fine-Grained Gait Description. 214-225

Ying Zhao
, Zhiwei Luo, Changqin Quan
, Dianchao Liu, Gang Wang:
Lite Hourglass Network for Multi-person Pose Estimation. 226-238
Special Session Papers // SS1: AI-Powered 3D Vision

Yunhan Sun
, Jinlong Shi, Suqin Bai, Qiang Qian, Zhengxing Sun:
Single View Depth Estimation via Dense Convolution Network with Self-supervision. 241-253

Menghan Zhang, Yunbo Rao, Jiansu Pu, Xun Luo, Qifei Wang:
Multi-data UAV Images for Large Scale Reconstruction of Buildings. 254-266

Sen Xiang, Qiong Liu, Huiping Deng, Jin Wu, Li Yu:
Deformed Phase Prediction Using SVM for Structured Light Depth Generation. 267-278

Liang Wang
, Biying Yan, Fuqing Duan, Ke Lu:
Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization. 279-290

Xiangyu Sun, Qiong Liu, You Yang:
Similarity Graph Convolutional Construction Network for Interactive Action Recognition. 291-303

Zihao Chen, Xu Wang, Yu Zhou
, Longhao Zou, Jianmin Jiang:
Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning. 304-315

Teng Wan, Shaoyi Du, Wenting Cui, Qixing Xie, Yuying Liu, Zuoyong Li:
Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance. 316-326

Hui Cao, Haikuan Du, Siyu Zhang, Shen Cai:
InSphereNet: A Concise Representation and Classification Method for 3D Object. 327-339

Wenting Cui, Shaoyi Du, Teng Wan, Yan Liu, Yuying Liu, Yang Yang, Qingnan Mou, Mengqi Han, Yu-Cheng Guo:
3-D Oral Shape Retrieval Using Registration Algorithm. 340-349

Yu Wang, Tao Lu
, Ruobo Xu, Yanduo Zhang:
Face Super-Resolution by Learning Multi-view Texture Compensation. 350-360

Junlin Zhang, Xu Wang:
Light Field Salient Object Detection via Hybrid Priors. 361-372
SS2: Multimedia Analytics: Perspectives, Tools and Applications

Werner Bailer, Maarten Wijnants, Hendrik Lievens
, Sandy Claes:
Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content. 375-387

Iva Gornishka
, Stevan Rudinac, Marcel Worring
:
Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings. 388-399

Xixun Wu, Binheng Song, Zhixiang Wang, Chun Yuan:
An Inverse Mapping with Manifold Alignment for Zero-Shot Learning. 400-411

Aaron Duane, Cathal Gurrin
:
Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System. 412-423

Aikaterini Katmada, George Kalpakis
, Theodora Tsikrika
, Stelios Andreadis
, Stefanos Vrochidis
, Ioannis Kompatsiaris:
An Extensible Framework for Interactive Real-Time Visualizations of Large-Scale Heterogeneous Multimedia Information from Online Sources. 424-435
SS3: Multimedia Datasets for Repeatable Experimentation (MDRE)

Andreas Leibetseder
, Sabrina Kletz
, Klaus Schoeffmann
, Simon Keckstein, Jörg Keckstein:
GLENDA: Gynecologic Laparoscopy Endometriosis Dataset. 439-450

Debesh Jha
, Pia H. Smedsrud, Michael A. Riegler, Pål Halvorsen, Thomas de Lange, Dag Johansen, Håvard D. Johansen:
Kvasir-SEG: A Segmented Polyp Dataset. 451-462

Frank Hopfgartner
, Cathal Gurrin
, Hideo Joho:
Rethinking the Test Collection Methodology for Personal Self-tracking Data. 463-474

Graham Healy
, Zhengwei Wang, Tomás Ward, Alan F. Smeaton, Cathal Gurrin
:
Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset. 475-486
SS4: MMAC: Multi-modal Affective Computing of Large-Scale Multimedia Data

Zhilei Liu
, Jiahui Dong, Cuicui Zhang
, Longbiao Wang, Jianwu Dang:
Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection. 489-501

Jian Guan
, Liming Yin, Jianguo Sun, Shuhan Qi, Xuan Wang, Qing Liao
:
Enhanced Gaze Following via Object Detection and Human Pose Estimation. 502-513

Zhilei Liu
, Diyi Liu, Yunpeng Wu:
Region Based Adversarial Synthesis of Facial Action Units. 514-526

Zhilei Liu
, Le Li, Yunpeng Wu, Cuicui Zhang
:
Facial Expression Restoration Based on Improved Graph Convolutional Networks. 527-539
SS5: MULTIMED2020: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments

Henning Müller
, Vincent Andrearczyk, Oscar Alfonso Jiménez del Toro, Anjani Dhrangadhariya, Roger Schaer, Manfredo Atzori:
Studying Public Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction. 553-564

Jun Wu, Yao Zhang, Jie Wang, Jianchun Zhao, Dayong Ding, Ningjiang Chen, Lingling Wang, Xuan Chen, Chunhui Jiang, Xuan Zou, Xing Liu, Hui Xiao, Yuan Tian
, Zongjiang Shang, Kaiwei Wang, Xirong Li, Gang Yang, Jianping Fan:
AttenNet: Deep Attention Based Retinal Disease Classification in OCT Images. 565-576

Tobias Baur, Sina Clausen, Alexander Heimerl, Florian Lingenfelser, Wolfgang Lutz
, Elisabeth André:
NOVA: A Tool for Explanatory Multimodal Behavior Analysis and Its Application to Psychotherapy. 577-588

Sabrina Kletz, Klaus Schoeffmann
, Andreas Leibetseder, Jenny Benois-Pineau, Heinrich Husslein:
Instrument Recognition in Laparoscopy for Technical Skill Assessment. 589-600

Panagiotis Giannakeris, Georgios Meditskos, Konstantinos Avgerinakis, Stefanos Vrochidis
, Ioannis Kompatsiaris:
Real-Time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding. 601-613

Athina Tsanousa, Angelos Chatzimichail, Georgios Meditskos, Stefanos Vrochidis
, Ioannis Kompatsiaris:
Model-Based and Class-Based Fusion of Multisensor Data. 614-625

Natalia Sokolova
, Klaus Schoeffmann
, Mario Taschwer, Doris Putzgruber-Adamitsch, Yosuf El-Shabrawi:
Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos. 626-636
SS6: Intelligent Multimedia Security

Yajun Xu, Zhendong Mao, Peng Zhang, Bin Wang:
Compact Position-Aware Attention Network for Image Semantic Segmentation. 639-650

Chuanbin Liu, Youliang Tian, Hongtao Xie:
Law Is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design. 651-668

Qiuxian Li, Youliang Tian:
Rational Delegation Computing Using Information Theory and Game Theory Approach. 669-680

Xuecheng Ning, Xiaoshan Yang, Changsheng Xu:
Multi-hop Interactive Cross-Modal Retrieval. 681-693
DEMO Papers

Marc A. Kastner
, Ichiro Ide
, Yasutomo Kawanishi
, Takatsugu Hirayama, Daisuke Deguchi
, Hiroshi Murase:
Browsing Visual Sentiment Datasets Using Psycholinguistic Groundings. 697-702

Chih-Yao Chang, Bo-I Chuang, Chi-Chun Hsia, Wen-Cheng Chen, Min-Chun Hu:
Framework Design for Multiplayer Motion Sensing Game in Mixture Reality. 703-708

Yi Yu, Florian Harscoët, Simon Canales, Gurunath Reddy M, Suhua Tang
, Junjun Jiang
:
Lyrics-Conditioned Neural Melody Generation. 709-714

Abdullah Alfarrarjeh, Zeyu Ma, Seon Ho Kim, Yeonsoo Park, Cyrus Shahabi:
A Web-Based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images. 715-721

Zhongbo Sun, Yannan Wang, Li Cao:
An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement. 722-728

Tony Zhao, Jaeyoung Choi, Gerald Friedland:
DIME: An Online Tool for the Visual Comparison of Cross-modal Retrieval Models. 729-733

Jung-Woo Choi
:
Real-Time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems. 734-738

Yongwoo Kim, Jae-Seok Choi, Jaehyup Lee, Munchurl Kim:
A CNN-Based Multi-scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications. 739-744

Abdul Muqeet
, Sung-Ho Bae:
Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution. 745-750
VBS Papers

Andreas Leibetseder, Bernd Münzer, Jürgen Primus, Sabrina Kletz, Klaus Schoeffmann
:
diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020. 753-759

Loris Sauter
, Mahnaz Amiri Parian
, Ralph Gasser
, Silvan Heller
, Luca Rossetto
, Heiko Schuldt
:
Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search. 760-765

Nguyen-Khang Le
, Dieu-Hien Nguyen
, Minh-Triet Tran
:
An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts. 766-771

Phuong Anh Nguyen
, Jiaxin Wu
, Chong-Wah Ngo, Danny Francis
, Benoit Huet:
VIREO @ Video Browser Showdown 2020. 772-777

Jakub Lokoc, Gregor Kovalcík, Tomás Soucek:
VIRET at Video Browser Showdown 2020. 784-789

Miroslav Kratochvíl
, Patrik Veselý, Frantisek Mejzlík, Jakub Lokoc:
SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop. 790-795

Björn Þór Jónsson, Omar Shahbaz Khan
, Dennis C. Koelma, Stevan Rudinac, Marcel Worring
, Jan Zahálka
:
Exquisitor at the Video Browser Showdown 2020. 796-802

Byoungjun Kim, Ji Yea Shim, Minho Park, Yong Man Ro
:
Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes. 803-808

Sungjune Park, Jaeyub Song, Minho Park, Yong Man Ro
:
IVIST: Interactive VIdeo Search Tool in VBS 2020. 809-814

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
load links from unpaywall.org
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
load content from archive.org
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
load data from openalex.org
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
dblp was originally created in 1993 at:
since 2018, dblp has been operated and maintained by:






