Scientific reports



PhD thesis summaries

Sang-hyo Park

In video coding, motion estimation (ME), which predicts a block among temporally correlated frames, has had a crucial impact not only on compression efficiency but also on computational complexity. In particular, fast ME algorithms have been pivotal in much research that attempts to reduce the complexity of video encoder … Read more

Sucheta Ghosh

Parsing discourse is a challenging natural language processing task. In this research work, we first take a data-driven approach to identify arguments of explicit discourse connectives. In contrast to previous work, we do not make any assumptions on the span of arguments and consider parsing as a token-level sequence … Read more

Rufael Mekuria

The Internet is used for distributed shared experiences such as video conferencing, voice calls (possibly in a group), chatting, photo sharing, online gaming and virtual reality. These technologies are changing our daily lives and the way we interact with each other. The current rapid advances in 3D depth sensing and … Read more

Svetlana Kordumova

This thesis contributes to teaching machines what is in an image while avoiding direct manual annotation as training data. We either rely on tagged data from social media platforms to recognize concepts, or on object semantics and layout to recognize scenes. We focus our effort on image search. We first demonstrate … Read more

Amirhossein Habibian

This thesis studies the fundamental question: what vocabulary of concepts is suited for machines to describe video content? The answer to this question involves two annotation steps: first, to specify a list of concepts by which videos are described; second, to label a set of videos per concept as its … Read more

Masoud Mazloom

In this thesis we aim to represent an event in a video using semantic features. We start from a bank of concept detectors for representing events in video. At first we considered the relevance of concepts to the event inside the video representation. We address the problem of video event … Read more

Chien-nan Chen

3D Tele-immersion (3DTI) technology allows full-body, multimodal interaction among geographically dispersed users, which opens a variety of possibilities in cyber collaborative applications such as art performance, exergaming, and physical rehabilitation. However, with its great potential, the resource and quality demands of 3DTI rise inevitably, especially when some advanced applications target … Read more

Pengpeng Ni

Modern video coding techniques provide multidimensional adaptation options for adaptive video streaming over networks. For instance, a video server can adjust the frame-rate, frame-size or signal-to-noise ratio (SNR) of the video being requested to cope with the available bandwidth. However, these adaptation operations give rise to distinct visual artefacts, so … Read more

Hamed Ahmadi

Fulfilling cloud gaming’s (CG) ultimate goal, i.e., playing video games wherever, whenever and on every device, requires reducing its high bandwidth demand in a way that doesn’t adversely affect the players’ quality of experience. One way to do so is to reduce the bitrate of the regions in the … Read more

Britta Meixner

Modern Web technology makes the dream of fully interactive and enriched video come true. Nowadays it is possible to organize videos in a non-linear way playing in a sequence unknown in advance. Furthermore, additional information can be added to the video, ranging from short descriptions to animated images and further … Read more


Introduction to Conference and Social Media Reports

Awarding the Best Social Media Reporters

The SIGMM Records team has adopted a new strategy to encourage the publication of information, and thus increase the chances of reaching the community, increase knowledge and foster interaction. It consists of awarding the best Social Media reporters for each SIGMM conference, the award being a free registration to one of the SIGMM conferences within a period of one year. All SIGMM members are welcome to participate and contribute, and are candidates to receive the award.

The Social Media Editors will issue a new open Call for Reports (CfR) via the Social Media channels every time a new SIGMM conference takes place, so that the community is reminded or made aware of this initiative and of its requirements and criteria.

The CfR will encourage activity on Social Media channels, such as posting information and content related to the SIGMM conferences with the proper hashtags (see our Recommendations). The reporters will be encouraged to use mainly Twitter, but other channels and innovative forms or trends of dissemination will be very welcome!

The Social Media Editors will be the jury for deciding the best reports (i.e., collections of posts) on Social Media channels, and thus will not qualify for this award. The awarded reporters will additionally be asked to provide a post-summary of the conference. The number of awards for each SIGMM conference is indicated in the table below. Each awarded reporter will get a free registration to one of the SIGMM conferences (of his/her choice) within a period of one year.

Read more


Conference and Social Media Reports

Report from ACM MMSys 2017

A report from Christian Timmerer, AAU/Bitmovin, Austria. The ACM Multimedia Systems Conference (MMSys) provides a forum for researchers to present and share their latest research findings in multimedia systems. It is a unique event targeting “multimedia systems” from various angles and views across all domains instead of focusing on a … Read more

Report from ICMR 2017

ACM International Conference on Multimedia Retrieval (ICMR) 2017: ACM ICMR 2017 in “Little Paris”. ACM ICMR is the premier International Conference on Multimedia Retrieval, and since 2011 it “illuminates the state of the arts in multimedia retrieval”. This year, ICMR was in a wonderful location: Bucharest, Romania, also known as … Read more

Report from MMM 2017

MMM 2017: 23rd International Conference on MultiMedia Modeling. MMM is a leading international conference for researchers and industry practitioners for sharing new ideas, original research results and practical development experiences from all MMM related areas. The 23rd edition of MMM took place on January 4-6, 2017, on the … Read more

Report from ICACNI 2015

Report from the 3rd International Conference on Advanced Computing, Networking, and Informatics. The 3rd International Conference on Advanced Computing, Networking and Informatics (ICACNI-2015), organized by the School of Computer Engineering, KIIT University, Odisha, India, was held on 23-25 June 2015. The conference commenced with a keynote by Prof. Nikhil R. Pal … Read more

Summary of the 5th BAMMF

Bay Area Multimedia Forum (BAMMF) BAMMF is a Bay Area Multimedia Forum series. Experts from both academia and industry are invited to exchange ideas and information through talks, tutorials, posters, panel discussions and networking sessions. Topics of the forum will include emerging areas in vision, audio, touch, speech, text, various … Read more

Report from SLAM 2014

ISCA/IEEE Workshop on Speech, Language and Audio in Multimedia Following SLAM 2013 in Marseille, France, SLAM 2014 was the second edition of the workshop, held in Malaysia as a satellite of Interspeech 2014. The workshop was organized over two days, one for science and one for socializing and community building. … Read more

Report from ACM Multimedia 2013

Conference/Workshop Program Highlights. ACM Multimedia 2013 was held at the CCIB (Centre de Convencions Internacional de Barcelona) from October 21st to October 25th, 2013 in Barcelona. The Art Exhibition was held for the entire duration of the conference at the FAD (Foment de les Arts i del Disseny) in … Read more

Report from SLAM 2013

Intl. Workshop on Speech, Language and Audio in Multimedia. The International Workshop on Speech, Language and Audio in Multimedia (SLAM) is a yearly series of workshops that brings together researchers working in the broad field of speech, language and audio processing applied to the analysis, indexing and use of any … Read more

GameDays & Edutainment 2012

On behalf of the conference co-chairs, we wish to provide a report of the eighth GameDays, which were held from September 18th to 20th at Technische Universität Darmstadt and on the premises of Fraunhofer IGD. The GameDays are initiated and mainly organized by Dr. Stefan Göbel, the head of … Read more

Report from NOSSDAV 2012

Setting for NOSSDAV 2012. NOSSDAV 2012, the 22nd SIGMM Workshop on Network and Operating Systems Support for Digital Audio and Video, was held in Toronto, Canada, on June 7-8, 2012. As in previous years, the workshop will continue to focus on both established and emerging research topics, high-risk high-return … Read more


Journal TOC service


TOMM Volume 13, Issue 4
September 2017
Hong-Bo Zhang, Bineng Zhong, Qing Lei, Ji-Xiang Du, Jialin Peng, Duansheng Chen, Xiao Ke: Sparse Representation-Based Semi-Supervised Regression for People Counting
Shahid Akhtar, Andre Beck, Ivica Rimac: Caching Online Video: Analysis and Proposed Algorithm
Duc-Tien Dang-Nguyen, Luca Piras, Giorgio Giacinto, Giulia Boato, Francesco G. B. De Natale: Multimodal Retrieval with Diversification and Re

TOMM Volume 13, Issue 3
July 2017
Priyanka Singh, Balasubramanian Raman, Nishant Agarwal, Pradeep K. Atrey: Secure Cloud-Based Image Tampering Detection and Localization Using POB Number System
Ishwarya Thirunarayanan, Khimya Khetarpal, Sanjeev Koppal, Olivier Le Meur, John Shea, Eakta Jain: Creating Segments and Effects on Comics by Clustering Gaze Data
Michael E. Houle, Xiguo Ma, Vincent Oria, Jichao Sun: Query Expansion for Con

TOMM Volume 13, Issue 3s
July 2017
Kaoru Ota, Minh Son Dao, Vasileios Mezaris, Francesco G. B. De Natale: Introduction to Special Issue on Deep Learning for Mobile Multimedia
Kaoru Ota, Minh Son Dao, Vasileios Mezaris, Francesco G. B. De Natale: Deep Learning for Mobile Multimedia: A Survey
Lorenzo Seidenari, Claudio Baecchi, Tiberio Uricchio, Andrea Ferracani, Marco Bertini, Alberto Del Bimbo: Deep Artwork Detection and Retrieval f

TOMM Volume 13, Issue 2
May 2017
Giuseppe Lisanti, Svebor Karaman, Iacopo Masi: Multichannel-Kernel Canonical Correlation Analysis for Cross-View Person Reidentification
Jun Ye, Hao Hu, Guo-Jun Qi, Kien A. Hua: A Temporal Order Modeling Approach to Human Action Recognition from Multimodal Sensor Data
Shuai Wang, Yang Cong, Huijie Fan, Baojie Fan, Lianqing Liu, Yunsheng Yang, Yandong Tang, Huaici Zhao, Haibin Yu: Multi-Class Laten

TOMM Volume 13, Issue 1
January 2017
Zheng Yan: Table of Contents
Hanwang Zhang, Xindi Shang, Huanbo Luan, Meng Wang, Tat-Seng Chua: Learning from Collective Intelligence: Feature Learning Using Social Images and Tags
Ming Cheung, James She, Alvin Junus,


MMSJ Volume 23, Issue 5
October 2017
Britta Meixner: A pattern-based evaluation of download and cache management algorithms for annotated interactive non-linear videos
João Nogueira, Lucas Guardalben, Bernardo Cardoso, Susana Sargento: Catch-up TV analytics: statistical characterization and consumption patterns identification on a production service
Azin Semsar, Ali Asghar Nazari Shirehjini: Multimedia-supported virtual experiment for

MMSJ Volume 23, Issue 4
July 2017
R. L. Gomes, L. Bittencourt, E. Madeira, E. Cerqueira, M. Gerla: Management of virtual network resources for multimedia applications
Mohammad Hosseini, Gregorij Kurillo, Seyed Rasoul Etesami, Jiang Yu: Towards coordinated bandwidth adaptations for hundred-scale 3D tele-immersive systems
Daeyeoul Kim, Jinmo Kim: Procedural modeling and visualization of multiple leaves
Alok Kumar Singh Kushwaha, Subodh

MMSJ Volume 23, Issue 3
June 2017
Xiaofeng Zhu, Zhi Jin, Rongrong Ji: Learning high-dimensional multimedia data
Debo Cheng, Shichao Zhang, Xingyi Liu, Ke Sun, Ming Zong: Feature selection by combining subspace learning with sparse representation
Shaoyi Du, Juan Liu, Yuehu Liu, Xuetao Zhang, Jianru Xue: Precise glasses detection algorithm for face with in-plane rotation
Lianli Gao, Jingkuan Song, Xingyi Liu, Junming Shao, Jiajun Liu

MMSJ Volume 23, Issue 2
March 2017
Liefu Ai, Junqing Yu, Zebin Wu, Yunfeng He, Tao Guan: Optimized residual vector quantization for efficient approximate nearest neighbor search
Lu Lu, Zhan Yi-Ju, Jiang Qing, Cai Qing-ling: Recognizing human actions by two-level Beta process hidden Markov model
Fan Wu, Lili Xu, Saru Kumari, Xiong Li: An improved and anonymous two-factor authentication protocol for health-care applications with wirele

MMSJ Volume 23, Issue 1
February 2017
Luming Zhang, Yang Yang, Rongrong Ji, Roger Zimmermann: Special issue on “visual semantic analysis with weak supervision”
Lei Yu, Bing-Kun Bao, Changsheng Xu: A discriminative graph inferring framework towards weakly supervised image parsing
Biao Leng, Shuang Guo, Changchun Du, Jiabei Zeng, Zhang Xiong: 3D Object retrieval based on viewpoint segmentation
Xirong Li: Tag relevance fusion for social


MTAP Volume 76, Issue 20
October 2017
Jože Guna, Emilija Stojmenova, Andrej Kos…: The TV-WEB project - combining internet and television – lessons learnt from the user experience studies
Grega Jakus, Kristina Stojmenova, Sašo Tomažič…: A system for efficient motor learning using multimodal augmented feedback
Diego Q. Leite, Julio C. Duarte, Luiz P. Neves…: Hand gesture recognition from depth and infrared Kinect data for CAVE

MTAP Volume 76, Issue 19
October 2017
Editorial Note: Toward Smart World: Wireless Sensor Networks and Applications
Dai Yun-Zhong, Luo Ren-Ze: Research of energy efficient clustering algorithm for multilayer wireless heterogeneous sensor networks prediction research
Youngjoo Shin, Dongyoung Koo, Junbeom Hur, Joobeom Yun: Secure proof of storage with deduplication for cloud storage systems
Heehoon Shin, Joon-Sang Park: Optimizing random

MTAP Volume 76, Issue 17
September 2017
Editorial Note: Empirical Multimedia Service and its Applications for IoT
Li Gun, Li Cuihua, Zhu Yingpan, Huang Feijiang: An improved speckle-reduction algorithm for SAR images based on anisotropic diffusion
Xiuxia Ji, Gong Zhang: Image fusion method of SAR and infrared image based on Curvelet transform with adaptive weighting
Xian-Yong Meng, Lei Che, Zhi-Hui Liu, Ning Che…: Towards a partial dif

MTAP Volume 76, Issue 18
September 2017
Editorial Note: Interactive Immersive Multimedia Experiences
Wuhui Chen, Incheon Paik, Neil Y. Yen: Discovering internal social relationship for influence-aware service recommendation
Yan Li, Rui Gao, Xiaobin Kang, Chong Chen…: A watershed data management and visualization system using code-first approach
Yung-Hui Chen, Chun-Hsiung Tseng, Ching-Lien Huang…: Recommendation system based on rule-s

MTAP Volume 76, Issue 15
August 2017
Anu Pramila, Anja Keskinarkaus, Valtteri Takala…: Extracting watermarks from printouts captured with wide angles using computational photography
Khedija Arour, Taoufik Yeferny: Formal concept analysis based user model for distributed systems
Mahboubeh Nazari, Amir Sharif, Majid Mollaeefar: An improved method for digital image fragile watermarking based on chaotic maps
Jože Guna, Emilija Stojmenova


IJMIR Volume 6, Issue 3
September 2017
Michael S. Lew: ACM International Conference on Multimedia Retrieval (ICMR): current standing and impact
Yassmina Saadna, Ali Behloul: An overview of traffic sign detection and classification methods
Parul Sahare, Sanjay B. Dhok: Script identification algorithms: a survey
Maia Zaharieva, Christian Breiteneder…: Unsupervised group feature selection for media classification
Zied Guendil, Zied Lachiri,

IJMIR Volume 6, Issue 2
June 2017
Sanghoon Lee, Mohamed Masoud, Janani Balaji…: A survey of tag-based information retrieval
Amandeep Kaur, Renu Dhir…: A survey on camera-captured scene text detection and extraction: towards Gurmukhi script
D. Sejal, T. Ganeshsingh, K. R. Venugopal…: ACSIR: ANOVA Cosine Similarity Image Recommendation in vertical search
Nastaran Borjian: Query-by-example music information retrieval by score-base

IJMIR Volume 6, Issue 1
March 2017
George Awad, Wessel Kraaij, Paul Over…: Instance search retrospective with focus on TRECVID
Rashad Ahmed, Wasfi G. Al-Khatib…: A Survey on handwritten documents word spotting
Susanne Boll, Winston Hsu, Jiebo Luo: Editorial for the ICMR 2016 special issue
Nikolaos Pappas, Miriam Redi, Mercan Topkara…: Multilingual visual sentiment concept clustering and analysis
Markus Schedl: Investigating count

IJMIR Volume 5, Issue 4
November 2016
Erwin M. Bakker: Major events in multimedia information retrieval
Maaike H. T. de Boer, Klamer Schutte…: Blind late fusion in multimedia event retrieval
M. Radhika Mani, D. M. Potukuchi…: A novel approach for shape-based object recognition with curvelet transform
Zineb Elgarrai, Othmane El Meslouhi…: Robust facial expression recognition system based on hidden Markov models
Ahmad Alzu’bi, Abbes

IJMIR Volume 5, Issue 3
September 2016
Michael S. Lew: Top multimedia information retrieval papers
Erwin M. Bakker: Open and free datasets for multimedia retrieval
Na Zhao, Hanwang Zhang, Meng Wang…: Learning content–social influential features for influence analysis
Hamed Ghodrati, A. Ben Hamza: Deep shape-aware descriptor for nonrigid 3D object retrieval
R. Jarrar, M. Belkhatir: On the coupled use of signal and semantic concepts to b

MMTC R-Letter

MMTC R-Letter Volume 7, Issue 4
August 2016
Message from the Review Board Directors
IEEE ICME 2016 Best Paper Awards
Guest Editorial Introduction by Cha Zhang (IEEE ICME’16 General Co-Chair)
IEEE ICME’16 Best Paper: Phonetic Posteriorgrams for Many-to-One Voice Conversion without Parallel Data Training
A short review for “Phonetic Posteriorgrams for Many-to-One Voice Conversion without Parallel Data Training” (Edited by Christian Timm

MMTC R-Letter Volume 7, Issue 3
June 2016
Message from the Review Board Directors
Addressing All Senses QoE Optimized Multi-Sensorial Media Streaming
A short review for “Beyond Multimedia Adaptation: Quality of Experience-Aware Multi-Sensorial Media Delivery” (Edited by Frank Hartung)
Evaluating Impact of Panoramic Projections of 360-degree Videos on Coding Efficiency
A short review for “A Framework to Evaluate Omnidirectional Video Coding

MMTC R-Letter Volume 7, Issue 2
April 2016
Message from the Review Board Directors
Integrating Deep Learning and Multiple Instance Learning
A short review for “Deep Multiple Instance Learning for Image Classification and Auto-Annotation” (Edited by Jun Zhou)
Learning for HTTP-Based Adaptive Streaming
A short review for “Optimizing HTTP-Based Adaptive Streaming in Vehicular Environment Using Markov Decision Process” (Edited by Koichi Ad

MMTC R-Letter Volume 7, Issue 1
February 2016
Message from the Review Board Directors
General Multi-Modal Learning Framework for RGB-D Object Recognition
A short review for “Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition” (Edited by Carl James Debono)
Energy-efficient multimedia transmission in wireless heterogeneous networks
A short review for “Energy-efficient multimedia transmissions through base station cooperation

MMTC R-Letter Volume 6, Issue 5
October 2015
Message from the Review Board Directors
Video Playback Time Maximization for Smartphones
A short review for “EQ-Video: Energy and Quota-Aware Video Playback Time Maximization for Smartphones” (Edited by Koichi Adachi)
How to share GPU amongst multiple jobs in cloud-based multimedia services?
A short review for “VIGRIS: Virtualized GPU Resource Isolation and Scheduling in Cloud Gaming” (Edite

IEEE Multimedia

IEEE MultiMedia Volume 21, Issue 2
April–June 2014
Feature Articles
Fabien Danieau, Julien Fleureau, Philippe Guillotel, Nicolas Mollet, Marc Christie, and Anatole Lecuyer: Toward Haptic Cinematography: Enhancing Movie Experiences with Camera-Based Haptic Effects
Mei-Chen Yeh and Wen-Po Wu: Clustering Faces in Movies Using an Automatically Constructed Social Network
Tao Guan, Yunfeng He, Liya Duan, Jianzhong Yang, Juan Gao, and Junqing Yu: Efficient