Skip to Main Content (Press Enter)

Logo UNIRC
  • ×
  • Home
  • Degrees
  • Courses
  • Jobs
  • People
  • Outputs
  • Organizations
  • Projects
  • Expertise & Skills

UNI-FIND
Logo UNIRC

|

UNI-FIND

unirc.it
  • ×
  • Home
  • Degrees
  • Courses
  • Jobs
  • People
  • Outputs
  • Organizations
  • Projects
  • Expertise & Skills
  1. Outputs

Explainable Deep Learning Classification of Respiratory Sound for Telemedicine Applications

Chapter
Publication Date:
2022
Short description:
Explainable Deep Learning Classification of Respiratory Sound for Telemedicine Applications / Lo Giudice, M.; Mammone, N.; Ieracitano, C.; Aguglia, U.; Mandic, D.; Morabito, F. C.. - 1724:(2022), pp. 391-403. [10.1007/978-3-031-24801-6_28]
abstract:
The recent pandemic crisis combined with the explosive growth of Artificial Intellignence (AI) algorithms has highlighted the potential benefits of telemedicine for decentralised, accurate and automated clinical diagnoses. One of the most popular and essential diagnoses is the auscultation; it is non-invasive, real-time and very informative diagnoses for knowing the state of the respiratory system. To implement a possible automated auscultation analysis, the decision-making explanation of complex models (such as Deep Learning models) is crucial for trusted application in the clinical domain. In this context, we will analyse the behaviour of a Convolutional Neural Network (CNN) in classifying the largest publicly available database of respiratory sounds, originally compiled to support the scientific challenge organized at Int. Conf. on Biomedical Health Informatics (ICBHI17). It contains respiratory sounds (recorded with auscultation) of normal respiratory cycles, crackles, wheezes and both. To capture the phonetically important features of breath sounds, the Mel-Frequency Cepstrum (MFC) for short-term power spectrum representation was applied. The MFC allowed us to identify latent features without losing the temporal information so that we could easily identify the correspondence of the features to the starting sound. The MFCs were used as input to the proposed CNN who was able to classify the four above-mentioned respiratory classes with an accuracy of 72.8%. Despite interesting results, the main focus of the present study was to investigate how the CNN achieved this classification. The explainable Artificial Intelligence (xAI) technique of Gradient-weighted Class Activation Mapping (Grad-CAM) was applied. xAI made it possible to visually identify the most relevant areas, especially for the recognition of abnormal sounds, which is crucial for inspecting the correct learning of the CNN.
Iris type:
2.1 Contributo in volume (Capitolo o Saggio)
Keywords:
Convolutional neural network; Deep learning; Explainable artificial intelligence; Grad-CAM; Mel-frequency cepstrum; Respiratory sound; Telemedicine
List of contributors:
Lo Giudice, M.; Mammone, N.; Ieracitano, C.; Aguglia, U.; Mandic, D.; Morabito, F. C.
Authors of the University:
IERACITANO Cosimo
MORABITO Francesco Carlo
Mammone Nadia
Handle:
https://iris.unirc.it/handle/20.500.12318/137386
Book title:
Communications in Computer and Information Science
Published in:
COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE
Series
  • Overview

Overview

URL

https://link.springer.com/chapter/10.1007/978-3-031-24801-6_28
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.1.0