Dr. Iván González Díaz's personal page

Welcome to my personal web page. I'm a member of the Multimedia Processing Group of the Universidad Carlos III de Madrid.

Teaching Resources

Tutorials for Digital Audio Processing (Kindly provided by Sergio Sanz Escalona).

Research Resources


Keyframe ground-truth segmentation database: Download

QR Codes Database: Download

Louvre and Madrid Video Annotation Datasets: Download

Grasping in the Wild Dataset: Access


Windows and Linux executables of the Motion Classification based Search (MCS): Download

Matlab Code for Geometric Image retrieval and ROI segmentation: Download

Code for DermaKNet (skin lesion diagnosis) at GitHub

Code for Density-based clustering in scenarios with variable density at GitHub

Code for License Plate Detection (LPD) in unconstrained environments at GitHub


Multimedia Processing Group

Signal Theory and Communications Department

Universidad Carlos III de Madrid


Journal papers

  1. M.A. Fernández-Torres, I. González-Díaz, F. Díaz-de-María, "Probabilistic Topic Model for Context-Driven Visual Attention Understanding". Accepted for publication in IEEE Transactions on Circuits and Systems for Video Technology, 2019. doi
  2. M. Molina-Moreno, I. González-Díaz and F. Díaz-de-María. "Efficient Scale-Adaptive License Plate Detection System". In IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 6, pp. 2109-2121, June 2019. doi[Software]
  3. Iván González-Díaz, Jenny Benois-Pineau, Jean-Philippe Domenger, Daniel Cattaert, Aymar de Rugy , Perceptually-guided deep neural networks for ego-action prediction: Object grasping. Special issue on Bio/neuroscience Pattern Recognition, Edited by Farzin Deravi, Zhongfei Zhang, Chaabane Djeraba, Pattern Recognition, Volume 88, April 2019, Pages 223-235, doi[Grasping in the Wild Dataset]
  4. Iván González-Díaz., "DermaKNet: Incorporating the knowledge of dermatologists to Convolutional Neural Networks for skin lesion diagnosis". In IEEE Journal of Biomedical and Health Informatics, vol. 23, no. 2, pp. 547-559, March 2019. doi [Software]
  5. E. Pla-Sacristán, I. González-Díaz, T. Martínez-Cortés, F. Díaz-de-María. "Finding landmarks within settled areas using hierarchical density-based clustering and meta-data from publicly available images". In Expert Systems with Applications, Volume 123, 2019, Pages 315-327, ISSN 0957-4174, doi[Software]
  6. F Fernández-Martínez, A Hernández-García, MA Fernández-Torres, I González-Díaz, Á García-Faura, F Díaz de María. Exploiting visual saliency for assessing the impact of car commercials upon viewers. Multimed Tools Appl (2018) 77: 18903. doi
  7. López-Labraca, J., Fernández-Torres, M.Á., González-Díaz, I., Díaz-de-María, F. and Pizarro, A.. Enriched dermoscopic-structure-based cad system for melanoma diagnosis. Multimed Tools Appl (2018) 77: 12171. doi
  8. Iván González-Díaz, Murat Birinci, Fernando Díaz-de-María, Edward J. Delp. . Neighborhood Matching for Image Retrieval. IEEE Trans. Multimedia 19(3): 544-558 , 2017 . doi
  9. Iván González Díaz, Vincent Buso, Jenny Benois-Pineau. Perceptual modeling in the problem of active object recognition in visual scenes. Pattern Recognition 56: 129-141, 2016 . doi
  10. Vincent Buso, Iván González Díaz, Jenny Benois-Pineau. Goal-oriented top-down probabilistic visual attention model for recognition of manipulated objects in egocentric videos. In Signal Processing: Image Communication Volume 39, Part B, November 2015, Pages 418–431. doi
  11. Iván González-Díaz, Tomás Martínez-Cortés, Ascensión Gallardo-Antolín, Fernando Díaz-de-María. Temporal segmentation and keyframe selection methods for user-generated video search-based annotation. In Expert Systems with Applications, 42(1): 488-502 (2015) doi[Datasets][Online Demo]
  12. Iván González-Díaz, Carlos E. Baz-Hormigos, Fernando Díaz-de-María. A Generative Model for Concurrent Image Retrieval and ROI Segmentation. In IEEE Transactions on Multimedia, 16(1): 169-183 (2014) doi[software][Online Demo]
  13. Iván González-Díaz, Fernando Díaz-de-María. A region-centered topic model for object discovery and category-based image segmentation. In Pattern Recognition, P 46(9): 2437-2449 (2013) doi
  14. David Munoz-Mejias, Iván González-Díaz, Fernando Díaz-de-María.A low-complexity pre-processing system for restoring low-quality QR code images. In IEEE Trans. Consumer Electronics, 57(3): 1320-1328 (2011) doi
  15. Iván González-Díaz, Fernando Díaz-de-María.Adaptive Multipattern Fast Block-Matching Algorithm Based on Motion Classification Techniques. In IEEE Transactions on Circuits and Systems for Video Technology, vol.18, no.10, pp.1369-1382, Oct. 2008 doi[software]

Book Chapters

  1. Iván González Díaz, Vincent Buso, Jenny Benois-Pineau, Guillaume Bormaud, Gaelle Usseglio, Rémi Mégret,Yann Gaestel, Jean-François Dartigues. Recognition of instrumental activities of daily living in egocentric video for activity monitoring of patients with dementia. In Health monitoring and personlized feedback using multimedia data. pp. 161 - 178. Springer, 07/2015. ISBN 978-3-319-17962-9
  2. Iván González-Díaz, Jenny Benois-Pineau, Vincent Buso, Hugo Boujut. Fusion of Multiple Visual Cues for Object Recognition in Video. In Fusion in Computer Vision - Understanding Complex Visual Content. Chapter 4, - ISBN 978-3-319-05695,pp. 79 - 108. Springer Advances in Computer Vision and Pattern Recognition series, 04/2014.

International Conferences

  1. Tomás Martínez-Cortés, Iván González-Díaz and Fernando Díaz-de-María. Automatic Learning of Image Representations combining Content and Meta-data. In 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, 2018, pp. 1972-1976. doi
  2. Iván González Díaz, Jenny Benois-Pineau, Jean-Philippe Domenger and Aymar de Rugy. Perceptually-guided understanding of egocentric video content: recognition of objects to grasp . In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR '18). ACM, New York, NY, USA, 434-441. .
  3. Miguel-Ángel Fernández-Torres, Iván González-Díaz and Fernando Díaz-De-María.A Probabilistic Topic Approach for Context-Aware Visual Attention Modeling.. In Content–Based Multimedia Indexing, 2016
  4. Vincent Buso, Iván González Díaz, Jenny Benois-Pineau. Object Recognition with top-down visual attention modeling for behavioral studies. In IEEE International Conference on Image Processing, 2015. Selected as Top 10% ICIP 2015 papers.
  5. Tomás Martínez Cortés, Miguel Ángel Fernández Torres, Amaya Jiménez Moreno, Iván González Díaz,Fernando Díaz de María, Juan Adán Guzmán de Villoria, Pilar Fernández. A bayesian model for brain tumor classification using clinical-based features. In IEEE International Conference on Image Processing, 2014
  6. Iván González Díaz, Vincent Buso, Jenny Benois-Pineau, Guillaume Bourmaud, Rèmi Megret. Modeling Instrumental Activities of Daily Living in Egocentric Vision as Sequences of Active Objects and Context for Alzheimer Disease Research Healthcare. In 1st ACM MM Workshop on Multimedia Indexing and information Retrieval for ACMMM'13. doi
  7. Fernando de-la-Calle-Silos, Iván González-Díaz, Fernando Díaz-de-María. Mid-level feature set for specific event and anomaly detection in crowded scenes. In IEEE International Conference on Image Processing, 2013, pages 4001-4005 doi
  8. Iván González-Díaz, Carlos E. Baz-Hormigos, Moises Berdonces, Fernando Díaz-de-María. A generative model for concurrent image retrieval and ROI segmentation. In Content–Based Multimedia Indexing, 2012 pages 1-6 doi
  9. Iván González-Díaz, Vanessa Gómez-Verdejo, Fernando Díaz-de-María, Jerónimo Arenas-García.UC3M AT TRECVID 2010 Semantic Indexing Task. In 2010 TRECVID Workshop.
  10. Iván González-Díaz, Dario García-García, Fernando Díaz-de María.A Spatially-Aware Generative Model for Image Classification. In IEEE International Conference on Image Processing, 2009. ICIP 2009. doi
  11. Iván González-Díaz, Vanessa Gómez-Verdejo, Manel Martínez-Ramon, Fernando Díaz-de-María, Jerónimo Arenas-García.UC3M AT TRECVID 2009. In 2009 TRECVID Workshop.
  12. Iván González-Díaz, Dario García-García, Rubén Solera-Ureña, Jaisiel Madrid-Sánchez, Vanessa Gómez-Verdejo, Manel Martínez-Ramón, Fernando Díaz-de-María, Jerónimo Arenas-García.UC3M High Level Feature Extraction at TRECVID 2008. In 2008 TRECVID Workshop.
  13. Iván González-Díaz, Kevin McGuinness, Tomasz Adamek, Noel E. O'Connor, Fernando Díaz-de-María.Incorporating spatio-temporal mid level features in a region segmentation algorithm for video sequences. In IEEE International Conference on Image Processing, 2008. ICIP 2008. , Pages 1-4, 2008 doi
  14. Iván González-Díaz, Fernando Díaz-de-María. Improved Motion Classification Techniques for Adaptive Multi-Pattern Fast Block-Matching Algorithm. In IEEE International Conference on Image Processing, 2007. ICIP 2007., Volume 2, Pages 0-485, 2007. details doi
  15. Iván González-Díaz, Manuel de-Frutos-López, Sergio Sanz-Rodríguez, Fernando Díaz-de-María. Adaptive Multi-Pattern Fast Block-Matching Algorithm Based on Motion Classification Techniques. In IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007., Volume 1, Pages 0-1177, 2007. details doi pdf
  16. Sergio Sanz-Rodríguez, Manuel de-Frutos-López, Iván González-Díaz, Jesús Cid-Sueiro. A Rate Control Algorithm for Low-Delay H.264 Video Coding with Stored-B Pictures. In IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007., Volume 1, Pages 0-1153, 2007. details doi pdf

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Contact Info




  Phone: +34-916246262




  Iván González Díaz

  Grupo de Procesado Multimedia

  Dpt. Teoría de la Señal y Comunicaciones

  Universidad Carlos III de Madrid

  Avda Universidad, 30 , 28911, Leganés- Madrid, Spain




  Fax: +34-916248749