Conference on Multimedia Modeling 2020

26th MMM 2020: Daejeon, South Korea

- Yong Man Ro, Wen-Huang Cheng, Junmo Kim, Wei-Ta Chu, Peng Cui, Jung-Woo Choi, Min-Chun Hu, Wesley De Neve:
  MultiMedia Modeling - 26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part II. Lecture Notes in Computer Science 11962, Springer 2020, ISBN 978-3-030-37733-5

Poster Session

- Pengfei Chen, Minglei Yuan, Tong Lu:
  Multi-scale Comparison Network for Few-Shot Learning. 3-13
- Jiayu Song, Qinghua Xu, Wei Liu, Yueran Zu, Mengdong Chen:
  Semantic and Morphological Information Guided Chinese Text Classification. 14-26
- Duc V. Nguyen, Huyen T. T. Tran, Truong Cong Thang:
  A Delay-Aware Adaptation Framework for Cloud Gaming Under the Computation Constraint of User Devices. 27-38
- Dongbiao He, Jinlei Jiang, Cédric Westphal, Guangwen Yang:
  Efficient Edge Caching for High-Quality 360-Degree Video Delivery. 39-51
- Suping Zhou, Jia Jia, Long Zhang, Yanfeng Wang, Wei Chen, Fanbo Meng, Fei Yu, Jialie Shen:
  Inferring Emphasis for Real Voice Data: An Attentive Multimodal Neural Network Approach. 52-62
- Xi Yang, Yeo-Jin Kim, Michelle Taub, Roger Azevedo, Min Chi:
  PRIME: Block-Wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems. 63-75
- Yuwei Yang, Fanman Meng, Hongliang Li, Qingbo Wu, Xiaolong Xu, Shuai Chen:
  A New Local Transformation Module for Few-Shot Segmentation. 76-87
- Mingjie Wu, Yongfei Zhang, Tianyu Zhang, Wenqi Zhang:
  Background Segmentation for Vehicle Re-identification. 88-99
- Joanna Hong, Hong Joo Lee, Yelin Kim, Yong Man Ro:
  Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units. 100-111
- Yang Wang, Ye Qian, Jiahao Shi, Feng Su:
  A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos. 112-124
- Wei Hou, Dakui Wang, Xiaojun Chen:
  Generate Images with Obfuscated Attributes for Private Image Classification. 125-135
- Xiaozhong Ji, Yirui Wu, Tong Lu:
  Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution. 136-147
- Xiaoyu Xu, Jian Qian, Li Yu, Shengju Yu, Hao Tao, Ran Zhu:
  A Compact Deep Neural Network for Single Image Super-Resolution. 148-160
- Kai Huang, Jianjun Li, Shichao Cheng, Jie Yu, Wanyong Tian, Lulu Zhao, Junfeng Hu, Chin-Chen Chang:
  An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network. 161-174
- Yiming Li, Xiaoshan Yang, Changsheng Xu:
  Structured Neural Motifs: Scene Graph Parsing via Enhanced Context. 175-188
- Duanzheng Guan, Dengshi Li, Xuebei Cai, Xiaochen Wang, Ruimin Hu:
  Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet. 189-200
- Xiaoge Song, Yirui Wu, Wenhai Wang, Tong Lu:
  TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation. 201-213
- Hirotaka Kato, Takatsugu Hirayama, Ichiro Ide, Keisuke Doman, Yasutomo Kawanishi, Daisuke Deguchi, Hiroshi Murase:
  More-Natural Mimetic Words Generation for Fine-Grained Gait Description. 214-225
- Ying Zhao, Zhiwei Luo, Changqin Quan, Dianchao Liu, Gang Wang:
  Lite Hourglass Network for Multi-person Pose Estimation. 226-238

Special Session Papers

SS1: AI-Powered 3D Vision

- Yunhan Sun, Jinlong Shi, Suqin Bai, Qiang Qian, Zhengxing Sun:
  Single View Depth Estimation via Dense Convolution Network with Self-supervision. 241-253
- Menghan Zhang, Yunbo Rao, Jiansu Pu, Xun Luo, Qifei Wang:
  Multi-data UAV Images for Large Scale Reconstruction of Buildings. 254-266
- Sen Xiang, Qiong Liu, Huiping Deng, Jin Wu, Li Yu:
  Deformed Phase Prediction Using SVM for Structured Light Depth Generation. 267-278
- Liang Wang, Biying Yan, Fuqing Duan, Ke Lu:
  Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization. 279-290
- Xiangyu Sun, Qiong Liu, You Yang:
  Similarity Graph Convolutional Construction Network for Interactive Action Recognition. 291-303
- Zihao Chen, Xu Wang, Yu Zhou, Longhao Zou, Jianmin Jiang:
  Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning. 304-315
- Teng Wan, Shaoyi Du, Wenting Cui, Qixing Xie, Yuying Liu, Zuoyong Li:
  Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance. 316-326
- Hui Cao, Haikuan Du, Siyu Zhang, Shen Cai:
  InSphereNet: A Concise Representation and Classification Method for 3D Object. 327-339
- Wenting Cui, Shaoyi Du, Teng Wan, Yan Liu, Yuying Liu, Yang Yang, Qingnan Mou, Mengqi Han, Yu-Cheng Guo:
  3-D Oral Shape Retrieval Using Registration Algorithm. 340-349
- Yu Wang, Tao Lu, Ruobo Xu, Yanduo Zhang:
  Face Super-Resolution by Learning Multi-view Texture Compensation. 350-360
- Junlin Zhang, Xu Wang:
  Light Field Salient Object Detection via Hybrid Priors. 361-372

SS2: Multimedia Analytics: Perspectives, Tools and Applications

- Werner Bailer, Maarten Wijnants, Hendrik Lievens, Sandy Claes:
  Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content. 375-387
- Iva Gornishka, Stevan Rudinac, Marcel Worring:
  Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings. 388-399
- Xixun Wu, Binheng Song, Zhixiang Wang, Chun Yuan:
  An Inverse Mapping with Manifold Alignment for Zero-Shot Learning. 400-411
- Aaron Duane, Cathal Gurrin:
  Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System. 412-423
- Aikaterini Katmada, George Kalpakis, Theodora Tsikrika, Stelios Andreadis, Stefanos Vrochidis, Ioannis Kompatsiaris:
  An Extensible Framework for Interactive Real-Time Visualizations of Large-Scale Heterogeneous Multimedia Information from Online Sources. 424-435

SS3: Multimedia Datasets for Repeatable Experimentation (MDRE)

- Andreas Leibetseder, Sabrina Kletz, Klaus Schoeffmann, Simon Keckstein, Jörg Keckstein:
  GLENDA: Gynecologic Laparoscopy Endometriosis Dataset. 439-450
- Debesh Jha, Pia H. Smedsrud, Michael A. Riegler, Pål Halvorsen, Thomas de Lange, Dag Johansen, Håvard D. Johansen:
  Kvasir-SEG: A Segmented Polyp Dataset. 451-462
- Frank Hopfgartner, Cathal Gurrin, Hideo Joho:
  Rethinking the Test Collection Methodology for Personal Self-tracking Data. 463-474
- Graham Healy, Zhengwei Wang, Tomás Ward, Alan F. Smeaton, Cathal Gurrin:
  Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset. 475-486

- Zhilei Liu, Jiahui Dong, Cuicui Zhang, Longbiao Wang, Jianwu Dang:
  Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection. 489-501
- Jian Guan, Liming Yin, Jianguo Sun, Shuhan Qi, Xuan Wang, Qing Liao:
  Enhanced Gaze Following via Object Detection and Human Pose Estimation. 502-513
- Zhilei Liu, Diyi Liu, Yunpeng Wu:
  Region Based Adversarial Synthesis of Facial Action Units. 514-526
- Zhilei Liu, Le Li, Yunpeng Wu, Cuicui Zhang:
  Facial Expression Restoration Based on Improved Graph Convolutional Networks. 527-539
- Xiaona Guo, Wei Zhong, Long Ye, Li Fang, Yan Heng, Qin Zhang:
  Global Affective Video Content Regression Based on Complementary Audio-Visual Features. 540-550

SS5: MULTIMED2020: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments

- Henning Müller, Vincent Andrearczyk, Oscar Alfonso Jiménez del Toro, Anjani Dhrangadhariya, Roger Schaer, Manfredo Atzori:
  Studying Public Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction. 553-564
- Jun Wu, Yao Zhang, Jie Wang, Jianchun Zhao, Dayong Ding, Ningjiang Chen, Lingling Wang, Xuan Chen, Chunhui Jiang, Xuan Zou, Xing Liu, Hui Xiao, Yuan Tian, Zongjiang Shang, Kaiwei Wang, Xirong Li, Gang Yang, Jianping Fan:
  AttenNet: Deep Attention Based Retinal Disease Classification in OCT Images. 565-576
- Tobias Baur, Sina Clausen, Alexander Heimerl, Florian Lingenfelser, Wolfgang Lutz, Elisabeth André:
  NOVA: A Tool for Explanatory Multimodal Behavior Analysis and Its Application to Psychotherapy. 577-588
- Sabrina Kletz, Klaus Schoeffmann, Andreas Leibetseder, Jenny Benois-Pineau, Heinrich Husslein:
  Instrument Recognition in Laparoscopy for Technical Skill Assessment. 589-600
- Panagiotis Giannakeris, Georgios Meditskos, Konstantinos Avgerinakis, Stefanos Vrochidis, Ioannis Kompatsiaris:
  Real-Time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding. 601-613
- Athina Tsanousa, Angelos Chatzimichail, Georgios Meditskos, Stefanos Vrochidis, Ioannis Kompatsiaris:
  Model-Based and Class-Based Fusion of Multisensor Data. 614-625
- Natalia Sokolova, Klaus Schoeffmann, Mario Taschwer, Doris Putzgruber-Adamitsch, Yosuf El-Shabrawi:
  Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos. 626-636

SS6: Intelligent Multimedia Security

- Yajun Xu, Zhendong Mao, Peng Zhang, Bin Wang:
  Compact Position-Aware Attention Network for Image Semantic Segmentation. 639-650
- Chuanbin Liu, Youliang Tian, Hongtao Xie:
  Law Is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design. 651-668
- Qiuxian Li, Youliang Tian:
  Rational Delegation Computing Using Information Theory and Game Theory Approach. 669-680
- Xuecheng Ning, Xiaoshan Yang, Changsheng Xu:
  Multi-hop Interactive Cross-Modal Retrieval. 681-693

DEMO Papers

- Marc A. Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, Hiroshi Murase:
  Browsing Visual Sentiment Datasets Using Psycholinguistic Groundings. 697-702
- Chih-Yao Chang, Bo-I Chuang, Chi-Chun Hsia, Wen-Cheng Chen, Min-Chun Hu:
  Framework Design for Multiplayer Motion Sensing Game in Mixture Reality. 703-708
- Yi Yu, Florian Harscoët, Simon Canales, Gurunath Reddy M, Suhua Tang, Junjun Jiang:
  Lyrics-Conditioned Neural Melody Generation. 709-714
- Abdullah Alfarrarjeh, Zeyu Ma, Seon Ho Kim, Yeonsoo Park, Cyrus Shahabi:
  A Web-Based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images. 715-721
- Zhongbo Sun, Yannan Wang, Li Cao:
  An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement. 722-728
- Tony Zhao, Jaeyoung Choi, Gerald Friedland:
  DIME: An Online Tool for the Visual Comparison of Cross-modal Retrieval Models. 729-733
- Jung-Woo Choi:
  Real-Time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems. 734-738
- Yongwoo Kim, Jae-Seok Choi, Jaehyup Lee, Munchurl Kim:
  A CNN-Based Multi-scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications. 739-744
- Abdul Muqeet, Sung-Ho Bae:
  Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution. 745-750

VBS Papers

- Andreas Leibetseder, Bernd Münzer, Jürgen Primus, Sabrina Kletz, Klaus Schoeffmann:
  diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020. 753-759
- Loris Sauter, Mahnaz Amiri Parian, Ralph Gasser, Silvan Heller, Luca Rossetto, Heiko Schuldt:
  Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search. 760-765
- Nguyen-Khang Le, Dieu-Hien Nguyen, Minh-Triet Tran:
  An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts. 766-771
- Phuong Anh Nguyen, Jiaxin Wu, Chong-Wah Ngo, Danny Francis, Benoit Huet:
  VIREO @ Video Browser Showdown 2020. 772-777
- Stelios Andreadis, Anastasia Moumtzidou, Konstantinos Apostolidis, Konstantinos Gkountakos, Damianos Galanopoulos, Emmanouil Michail, Ilias Gialampoukidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris:
  VERGE in VBS 2020. 778-783
- Jakub Lokoc, Gregor Kovalcík, Tomás Soucek:
  VIRET at Video Browser Showdown 2020. 784-789
- Miroslav Kratochvíl, Patrik Veselý, Frantisek Mejzlík, Jakub Lokoc:
  SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop. 790-795
- Björn Þór Jónsson, Omar Shahbaz Khan, Dennis C. Koelma, Stevan Rudinac, Marcel Worring, Jan Zahálka:
  Exquisitor at the Video Browser Showdown 2020. 796-802
- Byoungjun Kim, Ji Yea Shim, Minho Park, Yong Man Ro:
  Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes. 803-808
- Sungjune Park, Jaeyub Song, Minho Park, Yong Man Ro:
  IVIST: Interactive VIdeo Search Tool in VBS 2020. 809-814
