Riccardo Gibello
I am a software engineer and PhD student in Natural Language Processing at D-HygeaLab. My research focuses on medical device classification under European standards and clinical text coding (discharge summaries) using multi-label, hierarchical models. I have hands-on experience in the full data pipeline: dataset curation, data augmentation, and model development for challenging, real-world problems. My work includes the use of LLMs with zero-shot learning and the implementation of custom hierarchical models. I thrive in collaborative, multidisciplinary teams and am driven by curiosity and scientific rigor.
Education
Sept 2023 -- Sept 2026
PhD in Data Analytics and Decision Sciences
Politecnico di Milano, Italy
Thesis: Developing generative models for multi-label hierarchical classification of biomedical texts, focusing on zero-shot learning, data augmentation, and task-specific model adaptation for medical device categorization and clinical coding.
Sept 2020 -- May 2023
MSc in Computer Science & Engineering
Politecnico di Milano, Italy
Thesis: Developed a distributed application for automated mapping of Global Medical Device Nomenclature codes to European equivalents, leveraging a web-scraped dataset and machine learning for text classification.
Sept 2017 -- Sept 2020
BSc in Computer Science & Engineering
Politecnico di Milano, Italy
Thesis: Co-developed a distributed Java application for a two-player version of the Santorini board game.
Research Experience
Sept 2025
Mediterranean Machine Learning Summer School
Split, Croatia
Attended this highly selective summer school (<18% acceptance rate) organized by Google DeepMind, covering topics in Computer Vision, Natural Language Processing, and Reinforcement Learning, with a focus on applications of large language models, and presented a poster on recent work.
Aug 2025
Oxford MLx: Representation Learning & Generative AI
Oxford, England
Attended the Oxford MLx summer school on representation learning and generative AI, covering topics such as model-free statistical methods for uncertainty estimation and generative models for text, images, and graphs.
Apr 2025 -- Present
Myocardial Scar Detection Project
Politecnico di Milano, Italy
Collaborating in a multidisciplinary team to develop an advanced system for multi-label classification of myocardial scars across 17 cardiac segments, integrating uncertainty quantification techniques and multi-modality data analysis.
March 2025 -- Present
Rheumatology Report Summarization System
Politecnico di Milano, Italy
Development of an abstractive summarization system for rheumatology clinical reports at Niguarda Hospital, addressing the lack of structure in free-text reports through automatic segmentation and generation of concise, structured summaries of key clinical information for physicians.
Sep 2023 -- Apr 2024
CTO, MONIMEDS
Politecnico di Milano, Italy
Led the design and development of MONIMEDS, a distributed dashboard platform aggregating web-scraped medical device safety notices across Europe. Achievements include winning the StartCup Lombardia award, reaching the finals of Switch2Product (Politecnico di Milano), and gaining experience in business planning, investor relations, and technical communication.
Publications
Nov 2023
Development of an AI-based IT tool to support medical device nomenclature standardization for post-market surveillance by automated mapping from GMDN to EMDN standards
R Gibello, Y Ren, E G Caiani
European Heart Journal, Volume 44, Issue Supplement_2
10.1093/eurheartj/ehad655.3024