Explainable Deep Learning Classification of Respiratory Sound for Telemedicine Applications

Chapter

Publication Date:

2022

Short description:

Explainable Deep Learning Classification of Respiratory Sound for Telemedicine Applications / Lo Giudice, M.; Mammone, N.; Ieracitano, C.; Aguglia, U.; Mandic, D.; Morabito, F. C.. - 1724:(2022), pp. 391-403. [10.1007/978-3-031-24801-6_28]

abstract:

The recent pandemic crisis combined with the explosive growth of Artificial Intellignence (AI) algorithms has highlighted the potential benefits of telemedicine for decentralised, accurate and automated clinical diagnoses. One of the most popular and essential diagnoses is the auscultation; it is non-invasive, real-time and very informative diagnoses for knowing the state of the respiratory system. To implement a possible automated auscultation analysis, the decision-making explanation of complex models (such as Deep Learning models) is crucial for trusted application in the clinical domain. In this context, we will analyse the behaviour of a Convolutional Neural Network (CNN) in classifying the largest publicly available database of respiratory sounds, originally compiled to support the scientific challenge organized at Int. Conf. on Biomedical Health Informatics (ICBHI17). It contains respiratory sounds (recorded with auscultation) of normal respiratory cycles, crackles, wheezes and both. To capture the phonetically important features of breath sounds, the Mel-Frequency Cepstrum (MFC) for short-term power spectrum representation was applied. The MFC allowed us to identify latent features without losing the temporal information so that we could easily identify the correspondence of the features to the starting sound. The MFCs were used as input to the proposed CNN who was able to classify the four above-mentioned respiratory classes with an accuracy of 72.8%. Despite interesting results, the main focus of the present study was to investigate how the CNN achieved this classification. The explainable Artificial Intelligence (xAI) technique of Gradient-weighted Class Activation Mapping (Grad-CAM) was applied. xAI made it possible to visually identify the most relevant areas, especially for the recognition of abnormal sounds, which is crucial for inspecting the correct learning of the CNN.

Iris type:

2.1 Contributo in volume (Capitolo o Saggio)

Keywords:

Convolutional neural network; Deep learning; Explainable artificial intelligence; Grad-CAM; Mel-frequency cepstrum; Respiratory sound; Telemedicine

List of contributors:

Lo Giudice, M.; Mammone, N.; Ieracitano, C.; Aguglia, U.; Mandic, D.; Morabito, F. C.

Authors of the University:

IERACITANO Cosimo

MORABITO Francesco Carlo

Mammone Nadia

Handle:

https://iris.unirc.it/handle/20.500.12318/137386

Book title:

Communications in Computer and Information Science

Published in:

COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE

Series

Overview

URL

https://link.springer.com/chapter/10.1007/978-3-031-24801-6_28

Explainable Deep Learning Classification of Respiratory Sound for Telemedicine Applications

Lo Giudice, M.; Mammone, N.; Ieracitano, C.; Aguglia, U.; Mandic, D.; Morabito, F. C.

COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE

Overview

URL