Scientific reports



PhD thesis summaries

Title: Quality of Experience and Access Network Traffic Management of HTTP Adaptive Video Streaming

The thesis focuses on Quality of Experience (QoE) of HTTP adaptive video streaming (HAS) and traffic management in access networks to improve the QoE of HAS. First, the QoE impact of adaptation parameters and time on layer was investigated with subjective crowdsourcing studies. The results were used to compute a … Read more

Title: Objects for spatio-temporal activity recognition in videos

This thesis investigates the role of objects for the spatio-temporal recognition of activities in videos. We investigate what, when, and where specific activities occur in visual content by examining object representations, centered around the main question: what do objects tell about the extent of activities in visual space and time? … Read more

Title: An action recognition framework for uncontrolled video capture based on a spatio-temporal video graph

The task of automatic categorization and localization of human action in video sequences is valuable for a variety of applications such as detecting relevant activities in surveillance video, summarizing and indexing video sequences or organizing a digital video library according to the relevant actions. However it remains a challenging problem … Read more

Title: Deep image representations for instance search

We address the problem of visual instance search, which consists to retrieve all the images within an dataset that contain a particular visual example provided to the system. The traditional approach of processing the image content for this task relied on extracting local low-level information within images that was “manually … Read more

Title: Improving instance search performance in video collections

This thesis presents methods to improve instance search and enhance user performance while browsing unstructured video collections. Through the use of computer vision and information retrieval techniques, we propose novel solutions to analyse visual content and build a search algorithm to address the challenges of visual instance search, while considering … Read more

Title: An Investigation Into Machine Learning Solutions Involving Time Series Across Different Problem Domains

In this thesis we will examine architectures and models for machine learning in three problem domains each of which are based around the use of time series data in time series applications. We set out to examine whether the architecture and model solutions in different problem domains will converge when … Read more

Title: Behavioural biometric identification based on human computer interaction

As we become increasingly dependent on information systems, personal identification and profiling systems have received an increasing interest, either for reasons of personalisation or security. Biometric profiling is one means of identification which can be achieved by analysing something the user is or does (e.g., a fingerprint, signature, face, voice). … Read more

Title: Investigating multi-modal features for continuous affect recognition using visual sensing

Emotion plays an essential role in human cognition, perception and rational decisionmaking. In the information age, people spend more time then ever before interacting with computers, however current technologies such as Artificial Intelligence (AI) and Human-Computer Interaction (HCI) have largely ignored the implicit information of a user’s emotional state leading … Read more

Sang-hyo Park

In video coding, motion estimation (ME) that predicts a block among temporally correlated frames has had a crucial impact on not only the compression efficiency, but also the computational complexity. Particularly, fast ME algorithms has been a pivot in much research that attempts to reduce the complexity of video encoder … Read more

Shrinivas D Desai

Medical imaging has advanced tremendously over the decade right from the inception. Among many, X-ray Computed Tomography+ (CT) is recognized as an imperative medical imaging modality to reveal the interior details of human body for effective diagnosis, treatment, operation and complication management of various clinical cases. CT operates on the principle of reconstruction of … Read more


Introduction to Conference and Social Media Reports

Awarding the Best Social Media Reporters

The SIGMM Records team has adopted a new strategy to encourage the publication of information, and thus increase the chances to reach the community, increase knowledge and foster interaction. It consists of awarding the best Social Media reporters for each SIGMM conference, being the award a free registration to one of the SIGMM conference within a period of one year. All SIGMM members are welcome to participate and contribute, and are candidates to receive the award.

The Social Media Editors will issue a new open Call for Reports (CfR) via the Social Media channels every time a new SIGMM conference takes place, so the community can remember or be aware of this initiative, as well as can refresh its requirements and criteria.

The CfR will encourage activity on Social Media channels, posting information and contents related to the SIGMM conferences, with the proper hashtags (see our Recommendations). The reporters will be encouraged to mainly use Twitter, but other channels and innovative forms or trends of dissemination will be very welcome!

The Social Media Editors will be the jury for deciding the best reports (i.e., collection of posts) on Social Media channels, and thus will not qualify for this award. The awarded reporters will be additionally asked to provide a post-summary of the conference. The number of awards for each SIGMM conference is indicated in the table below. The awarded reporter will get a free registration to one of the SIGMM conferences (of his/her choice) within a period of one year.

Read more


Conference and Social Media Reports

Report from ACM Multimedia 2017 – by Benoit Huet

  Best #SIGMM Social Media Reporter Award! Me? Really?? This was my reaction after being informed by the SIGMM Social Media Editors that I was one of the two recipients following ACM Multimedia 2017! #ACMMM What a wonderful idea this is to encourage our community to communicate, both internally and … Read more

Report from ACM Multimedia 2017 – by Conor Keighrey

My name is Conor Keighrey, I’m a PhD. candidate at the Athlone Institute Technology in Athlone, Co. Westmeath, Ireland.  The focus of my research is to understand the key influencing factors that affect Quality of Experience (QoE) in emerging immersive multimedia experiences, with a specific focus on applications in the … Read more

The Deep Learning Indaba Report

Abstract Given the focus on deep learning and machine learning, there is a need to address this problem of low participation of Africans in data science and artificial intelligence. The Deep Learning Indaba was thus born to stimulate the participation of Africans within the research and innovation landscape surrounding deep … Read more

Report from ACM MMSys 2017

–A report from Christian Timmerer, AAU/Bitmovin Austria The ACM Multimedia Systems Conference (MMSys) provides a forum for researchers to present and share their latest research findings in multimedia systems. It is a unique event targeting “multimedia systems” from various angles and views across all domains instead of focusing on a … Read more

Report from ICMR 2017

ACM International Conference on Multimedia Retrieval (ICMR) 2017 ACM ICMR 2017 in “Little Paris” ACM ICMR is the premier International Conference on Multimedia Retrieval, and from 2011 it “illuminates the state of the arts in multimedia retrieval”. This year, ICMR was in an wonderful location: Bucharest, Romania also known as … Read more

Report from MMM 2017

MMM 2017 — 23rd International Conference on MultiMedia Modeling MMM is a leading international conference for researchers and industry practitioners for sharing new ideas, original research results and practical development experiences from all MMM related areas. The 23rd edition of MMM took place on January 4-6 of 2017, on the … Read more

Report from ICACNI 2015

Report from the 3rd International Conference on Advanced Computing, Networking, and Informatics The 3rd International Conference on Advanced Computing, Networking and Informatics (ICACNI-2015), organized by School of Computer Engineering, KIIT University, Odisha, India, was held during 23-25 June, 2015. The conference commenced with a keynote by Prof. Nikhil R. Pal … Read more

Summary of the 5th BAMMF

Bay Area Multimedia Forum (BAMMF) BAMMF is a Bay Area Multimedia Forum series. Experts from both academia and industry are invited to exchange ideas and information through talks, tutorials, posters, panel discussions and networking sessions. Topics of the forum will include emerging areas in vision, audio, touch, speech, text, various … Read more

Report from SLAM 2014

ISCA/IEEE Workshop on Speech, Language and Audio in Multimedia Following SLAM 2013 in Marseille, France, SLAM 2014 was the second edition of the workshop, held in Malaysia as a satellite of Interspeech 2014. The workshop was organized over two days, one for science and one for socializing and community building. … Read more

Report from ACM Multimedia 2013

Conference/Workshop Program Highlights ACM Multimedia 2013 was held at the CCIB (Centre de Conventions Internacional de Barcelona) from October 21st to October 25th, 2012 in Barcelona. The Art Exhibition has been held for the entire duration of the conference at the FAD (Forment de les Arts i del Disseny) in … Read more


Journal TOC service


TOMM Volume 14, Issue 3s

TOMM Volume 14, Issue 2s

TOMM Volume 14, Issue 1

TOMM Volume 14, Issue 2
May 2018

TOMM Volume 14, Issue 1
December 2017
Kai Li, Guo-Jun Qi, Kien A. Hua: Learning Label Preserving Binary Codes for Multimedia Retrieval: A General Approach Rodrigo Ceballos, Beatrice Ionascu, Wanjoo Park, Mohamad Eid: Implicit Emotion Communication: EEG Classification and Haptic Feedback Jiyan Wu, Bo Cheng, Yuan Yang, Ming Wang, Junliang Chen: Delay-Aware Quality Optimization in Cloud-Assisted Video Streaming System Peisong Wang, Qingh


MMSJ Volume 24, Issue 4
July 2018
Xinyu Yang, Yizhuo Dong, Juan Li: Review of data features-based music emotion recognition methodsIlaria Bartolini, Marco Patella: A general framework for real-time analysis of massive multimedia streamsTaeyoung Choi, Seokhyun Ghan, Seongah Chin: Biological property-based artificial scar synthesis using inverse lightingAhmed I. Sallam, El-Sayed M. El-Rabaie, Osama S. Faragallah: Efficient HEVC sele

MMSJ Volume 24, Issue 3
June 2018
Árpád Huszák, Balázs Háló: Optimized camera handover scheme in free-viewpoint video streamingZongyi Xu, Qianni Zhang, Shiyang Cheng: Multilevel active registration for kinect human body scans: from low quality to high qualityMithilesh Kumar Jha, Ravi Chaudhary, Sumantra Dutta Roy, Mona Mathur…: Restricted affine motion compensation and estimation in video coding with particle filtering and

MMSJ Volume 23, Issue 5
October 2017
Britta Meixner: A pattern-based evaluation of download and cache management algorithms for annotated interactive non-linear videosJoão Nogueira, Lucas Guardalben, Bernardo Cardoso, Susana Sargento: Catch-up TV analytics: statistical characterization and consumption patterns identification on a production serviceAzin Semsar, Ali Asghar Nazari Shirehjini: Multimedia-supported virtual experiment for

MMSJ Volume 23, Issue 4
July 2017
R. L. Gomes, L. Bittencourt, E. Madeira, E. Cerqueira, M. Gerla: Management of virtual network resources for multimedia applicationsMohammad Hosseini, Gregorij Kurillo, Seyed Rasoul Etesami, Jiang Yu: Towards coordinated bandwidth adaptations for hundred-scale 3D tele-immersive systemsDaeyeoul Kim, Jinmo Kim: Procedural modeling and visualization of multiple leavesAlok Kumar Singh Kushwaha, Subodh

MMSJ Volume 23, Issue 3
June 2017
Xiaofeng Zhu, Zhi Jin, Rongrong Ji: Learning high-dimensional multimedia dataDebo Cheng, Shichao Zhang, Xingyi Liu, Ke Sun, Ming Zong: Feature selection by combining subspace learning with sparse representationShaoyi Du, Juan Liu, Yuehu Liu, Xuetao Zhang, Jianru Xue: Precise glasses detection algorithm for face with in-plane rotationLianli Gao, Jingkuan Song, Xingyi Liu, Junming Shao, Jiajun Liu


MTAP Volume 77, Issue 12
June 2018
Jin-feng Li, Hong-Xia Wang, Tao Wu, Xing-ming Sun…: Norm ratio-based audio watermarking scheme in DWT domainAmalia Luque, M. Estela Peralta, Juan Ramón Lama…: Low cost multimedia sensor networks for obtaining lighting mapsJoão Nogueira, Lucas Guardalben, Bernardo Cardoso…: Catch-up TV forecasting: enabling next-generation over-the-top multimedia TV servicesYuyun Ye, Xiaohai He, Qizhi Teng,

MTAP Volume 77, Issue 11
June 2018
Kevin Rangel-Espinoza, Eduardo Fragoso-Navarro…: Adaptive removable visible watermarking technique using dual watermarking for digital color imagesSuman Kumar Choudhury, Pankaj Kumar Sa…: Improved pedestrian detection using motion segmentation and silhouette orientationXiuhui Wang, Ke Yan: Automatic color correction for multi-projector display systemsAref Miri, Karim Faez: An image steganograp

MTAP Volume 77, Issue 9
May 2018
: Editorial Note: Big Multimedia Data in Robotics ApplicationsShui-Hua Wang, Yin Zhang, Yu-Jie Li, Wen-Juan Jia…: Single slice based detection for Alzheimer’s disease via wavelet entropy and multilayer perceptron trained by biogeography-based optimizationChunjuan Bo, Huchuan Lu, Dong Wang: Spectral-spatial K-Nearest Neighbor approach for hyperspectral image classificationSongtao Wu, Shenghua Z

MTAP Volume 77, Issue 10
May 2018
Zhiguo Hu, Qiqiang Zhang: A new approach for packet loss measurement of video streaming and its applicationChi-Man Pun, Cai-Ping Yan, Xiao-Chen Yuan: Robust image hashing using progressive feature selection for tampering detectionYoones A. Sekhavat, Jeffrey Parsons: The effect of tracking technique on the quality of user experience for augmented reality mobile navigationBalakrishnan Ramalingam, Dh

MTAP Volume 77, Issue 8
April 2018
: Editorial Note: Multimedia Systems for Critical EngineeringVassilios Vonikakis, Rigas Kouskouridas…: On the evaluation of illumination compensation algorithmsEnkhtogtokh Togootogtokh, Timothy K. Shih…: 3D finger tracking and recognition image processing for real-time music playing with depth sensorsKanchan Lata Kashyap, Manish Kumar Bajpai…: An efficient algorithm for mass detection and sh


IJMIR Volume 7, Issue 2
June 2018
Yanming Guo, Yu Liu, Theodoros Georgiou…: A review of semantic segmentation using deep neural networksMarkus Schedl, Hamed Zamani, Ching-Wei Chen…: Current challenges and visions in music recommender systems researchMatthias Dorfer, Jan Schlüter, Andreu Vall…: End-to-end cross-modality retrieval with CCA projections and pairwise ranking lossSrinivasa Perumal Ramalingam…: Robustness of DR-

IJMIR Volume 6, Issue 3
September 2017
Michael S. Lew: ACM International Conference on Multimedia Retrieval (ICMR): current standing and impactYassmina Saadna, Ali Behloul: An overview of traffic sign detection and classification methodsParul Sahare, Sanjay B. Dhok: Script identification algorithms: a surveyMaia Zaharieva, Christian Breiteneder…: Unsupervised group feature selection for media classificationZied Guendil, Zied Lachiri,

IJMIR Volume 6, Issue 2
June 2017
Sanghoon Lee, Mohamed Masoud, Janani Balaji…: A survey of tag-based information retrievalAmandeep Kaur, Renu Dhir…: A survey on camera-captured scene text detection and extraction: towards Gurmukhi scriptD. Sejal, T. Ganeshsingh, K. R. Venugopal…: ACSIR: ANOVA Cosine Similarity Image Recommendation in vertical searchNastaran Borjian: Query-by-example music information retrieval by score-base

IJMIR Volume 6, Issue 1
March 2017
George Awad, Wessel Kraaij, Paul Over…: Instance search retrospective with focus on TRECVIDRashad Ahmed, Wasfi G. Al-Khatib…: A Survey on handwritten documents word spottingSusanne Boll, Winston Hsu, Jiebo Luo: Editorial for the ICMR 2016 special issueNikolaos Pappas, Miriam Redi, Mercan Topkara…: Multilingual visual sentiment concept clustering and analysisMarkus Schedl: Investigating count

IJMIR Volume 5, Issue 4
November 2016
Erwin M. Bakker: Major events in multimedia information retrievalMaaike H. T. de Boer, Klamer Schutte…: Blind late fusion in multimedia event retrievalM. Radhika Mani, D. M. Potukuchi…: A novel approach for shape-based object recognition with curvelet transformZineb Elgarrai, Othmane El Meslouhi…: Robust facial expression recognition system based on hidden Markov modelsAhmad Alzu’bi, Abbes

MMTC R-Letter

MMTC R-Letter Volume 7, Issue 4
August 2016
Message from the Review Board DirectorsIEEE ICME 2016 Bester Paper AwardsGuest Editorial Introduction by Cha Zhang (IEEE ICME’16 General Co-Chair)IEEE ICME’16 Best Paper: Phonetic Posteriorgrams for Many-to-One Voice Conversion without Parallel Data TrainingA short review for “Phonetic Posteriorgrams for Many-to-One Voice Conversion without Parallel Data Training” (Edited by Christian Timm

MMTC R-Letter Volume 7, Issue 3
June 2016
Message from the Review Board DirectorsAddressing All Senses QoE Optimized Multi-Sensorial Media StreamingA short review for “Beyond Multimedia Adaptation: Quality of Experience-Aware Multi-Sensorial Media Delivery (Edited by Frank Hartung)Evaluating Impact of Panoramic Projections of 360-degree Videos on Coding EfficiencyA short review for “A Framework to Evaluate Omnidirectional Video Coding

MMTC R-Letter Volume 7, Issue 2
April 2016
Message from the Review Board DirectorsIntegrating Deep Learning and Multiple Instance LearningA short review for “Deep Multiple Instance Learning for Image Classification and Auto-Annotation” (Edited by Jun Zhou)Learning for HTTP-Based Adaptive StreamingA short review for “Optimizing HTTP-Based Adaptive Streaming in Vehicular Environment Using Markov Decision Process” (Edited by Koichi Ad

MMTC R-Letter Volume 7, Issue 1
February 2016
Message from the Review Board DirectorsGeneral Multi-Modal Learning Framework for RGB-D Object RecognitionA short review for “Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition” (Edited by Carl James Debono)Energy-efficient multimedia transmission in wireless heterogeneous networksA short review for “Energy-efficient multimedia transmissions through base station cooperation

MMTC R-Letter Volume 6, Issue 5
October 2015
Message from the Review Board DirectorsVideo Playback Time Maximization for SmartphonesA short review for “EQ-Video: Energy and Quota-Aware Video Playback Time Maximization for Smartphones” (Edited by Koichi Adachi)How to share GPU amongst multiple jobs in cloud-based multimedia services?A short review for “VIGRIS: Virtualized GPU Resource Isolation and Scheduling in Cloud Gaming” (Edite

IEEE Multimedia

IEEE MultiMedia Volume 21, Issue 2
April–June 2014
Feature ArticlesFabien Danieau, Julien Fleureau, Philippe Guillotel, Nicolas Mollet, Marc Christie, and Anatole Lecuyer: Toward Haptic Cinematography: Enhancing Movie Experiences with Camera-Based Haptic EffectsMei-Chen Yeh and Wen-Po Wu: Clustering Faces in Movies Using an Automatically Constructed Social NetworkTao Guan, Yunfeng He, Liya Duan, Jianzhong Yang, Juan Gao, and Junqing Yu: Efficient