Machine Learning with Multimedia Data

Special session at the 11th IEEE International Conference on Machine Learning and Applications (ICMLA 2012)
Boca Raton Marriott Hotel, Boca Raton, Florida, USA (Dec 12-15, 2012)

Vast amounts of new data in a large variety of formats and media modalities are made available worldwide on a daily basis. Much of this information is not easily reachable; for instance, access to rich multimedia information (e.g. audio, images, video) is an extremely challenging problem. Machine learning is used with varying success for diverse tasks dealing with multimedia data. Many problems remain, including effective data representation schemes, semantic-enabled feature representations, algorithms capable of dealing with high-dimension spatio-temporal data, fusion of multi-modal content or techniques for enabling cross-modal access to information (e.g. textual queries of video recordings).

This session aims at bringing together researchers working on applying machine learning to different types of data, including music, video, speech, or images. We welcome papers describing work in progress and encourage submissions that make datasets available to the community.

Topics covered by this special session include, but are not limited, to the following:

  • Feature extraction from multimedia data
  • Semantic content analysis, classification & annotation
  • Semi-supervised learning in multimedia data analysis
  • Learning techniques for cross-media enabled scenarios
  • Learning from user generated content in Social Media
  • Multimedia personalization and recommender systems
  • Multimedia machine learning in biometrics
  • Optimal Learning from multimodal features
  • Semantic-aware interfaces for multimedia and cross-media navigation
  • Multimedia information retrieval
  • Spoken document retrieval
  • Speech recognition
  • Speaker recognition & diarization

Accepted papers will be published in the proceedings of the 11th IEEE ICMLA 2012 conference.


Paper Submission Deadline
Acceptance Notification
Camera-ready Submission and Registration    
ICMLA Conference
August 6th 2012 August 20th 2012
September 7th 2012 September 17th 2012
October 1st 2012
December 12th -15th 2012


Dr. Jens Grivolla
Barcelona Media Innovation Centre
Dr. Jose San Pedro
Telefonica Research


Hanna Lukashevich (Fraunhofer IDMT)
Maurizio Montagnuolo (RAI, Italy)
Roberto Basili (University of Roma, Tor Vergata)
Ching-Wei Chen (Media Technology Lab, Gracenote, Inc.)
Adrian Weller
Yiqun Hu
Lin Yang (Google)
Joan Codina (Universitat Pompeu Fabra, Web Research Group, Spain)
Chengcui Zhang (Department of Computer and Information Sciences, The University of Alabama at Birmingham, AL, USA)
Christian Raymond (INSA/IRISA)
Rafael Banchs (Barcelona Media)
Sam Davies (BBC R&D)
Jialie Shen (School of Information Systems, Singapore Management University)
Teva Merlin (Laboratoire d'Informatique de l'Université du Maine, France)
Sergio Dominguez (Universidad Politécnica de Madrid, Spain)
Stefan Siersdorfer (L3S Research Center, Germany)
Jonathon Hare (University of Southampton, UK)
Xavier Sevillano (Universidad La Salle, Spain)
Xavier Anguera (Telefonica Research, Spain)
Daniel Gärtner (Fraunhofer IDMT, Semantic Music Technologies)