Examples of feature attribution-based XAI methods on audio models. The positively relevant features toward the predicted class are marked in green. © 2022 IEEE. Reprinted, with permission, from Wullenweber A, Akman A, Schuller BW. CoughLIME: Sonified Explanations for the Predictions of COVID-19 Cough Classifiers. Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:1342-1345. doi: 10.1109/EMBC48229.2022.9871291. PMID: 36086189. Credit: Intelligent Computing (2023). DOI: 10.34133/icomputing.0074

AI decision-making is now common in self-driving cars, patient diagnosis and legal consultation, and it needs to be safe and trustworthy. Researchers have been trying to demystify complex AI models by developing interpretable and transparent models, collectively known as explainable AI (XAI) methods. A research team offered their insight specifically into audio XAI models in a review article published in Intelligent Computing.

Although audio tasks are less researched than visual tasks, their expressive power is no less important. Audio signals are easy to understand and communicate, as they typically depend less on expert explanations than visual signals do. Moreover, tasks such as speech recognition and environmental sound classification are inherently audio-specific.

The review categorizes existing audio XAI methods into two groups: general methods applicable to audio models and audio-specific methods.

Using general methods means choosing an appropriate generic model originally built for non-audio tasks and adjusting it to suit a certain audio task. These methods explain audio models through various input representations like spectrograms and waveforms and different output formats like features, examples, and concepts.
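To make the two input representations concrete, here is a minimal sketch of turning a waveform into a magnitude spectrogram with a short-time Fourier transform. It is an illustration only and is not taken from the review; the function name `magnitude_spectrogram`, the frame and hop sizes, and the synthetic 1 kHz test tone are all assumptions chosen for the example.

```python
import numpy as np

def magnitude_spectrogram(waveform, frame_length=256, hop_length=128):
    """Return a (frequency bins, frames) magnitude spectrogram of a 1-D waveform."""
    window = np.hanning(frame_length)
    n_frames = 1 + (len(waveform) - frame_length) // hop_length
    # Slice the waveform into overlapping, windowed frames
    frames = np.stack([
        waveform[i * hop_length : i * hop_length + frame_length] * window
        for i in range(n_frames)
    ])
    # Real FFT of each frame; transpose so rows are frequency bins
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Assumed test signal: one second of a 1 kHz sine sampled at 8 kHz
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
waveform = np.sin(2 * np.pi * 1000 * t)

spec = magnitude_spectrogram(waveform)
# Frequency bins are spaced sample_rate / frame_length Hz apart
peak_bin = int(np.argmax(spec.mean(axis=1)))
peak_hz = peak_bin * sample_rate / 256
```

An explanation computed on the waveform can be played back directly, while one computed on the spectrogram is usually shown as a highlighted image, which is one reason the choice of representation matters for audio explanations.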

Popular general methods include guided backpropagation, which enhances the standard backpropagation process by highlighting the most relevant parts of the input data; LIME, which approximates a complex model with a simpler model; and network dissection, which analyzes the internal representations learned by a neural network.
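The idea behind LIME, approximating a complex model locally with a simpler one, can be sketched in a few lines. The code below is a simplified illustration and not the LIME library or the method used in any of the cited papers: it splits a signal into equal time segments, silences random subsets, queries a black-box model, and fits a weighted linear model whose coefficients serve as segment importances. The function `lime_style_attribution`, the kernel width, and the toy `toy_model` are all hypothetical.

```python
import numpy as np

def lime_style_attribution(predict, x, n_segments=8, n_samples=500, seed=0):
    """Estimate per-segment importance for a 1-D signal with a local linear surrogate.

    predict is a hypothetical black-box function mapping a 1-D array to a score.
    """
    rng = np.random.default_rng(seed)
    segments = np.array_split(np.arange(len(x)), n_segments)
    # Binary masks: 1 keeps a segment, 0 silences it
    masks = rng.integers(0, 2, size=(n_samples, n_segments))
    masks[0] = 1  # always include the unperturbed input
    scores = np.empty(n_samples)
    for i, mask in enumerate(masks):
        perturbed = x.copy()
        for keep, idx in zip(mask, segments):
            if not keep:
                perturbed[idx] = 0.0
        scores[i] = predict(perturbed)
    # Weight samples by closeness to the original (assumed exponential kernel)
    distance = 1.0 - masks.mean(axis=1)
    weights = np.exp(-(distance ** 2) / 0.25)
    # Weighted least squares with an intercept column
    design = np.hstack([masks, np.ones((n_samples, 1))])
    sw = np.sqrt(weights)
    coef, *_ = np.linalg.lstsq(design * sw[:, None], scores * sw, rcond=None)
    return coef[:-1]  # drop the intercept, keep one weight per segment

# Toy black box (assumption): the score depends only on samples 200..299
def toy_model(signal):
    return float(signal[200:300].sum())

signal = np.ones(800)
importances = lime_style_attribution(toy_model, signal)
top_segment = int(np.argmax(importances))
```

In this toy setup the third segment (index 2) is the only one the model looks at, so it receives the largest weight. Audio-specific variants replace the equal time segments with more meaningful components, such as separated sources.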

Audio-specific methods, on the other hand, are specially designed for audio tasks. They aim to decompose audio inputs into meaningful components, focusing on the auditory nature of audio data. Some examples are CoughLIME, which provides sonified explanations for cough sounds in COVID-19 detection, and audioLIME, which uses source separation to explain music tagging models by attributing importance to audio components.

XAI methods can also be categorized by their stage, scope, input data type, and output format. Stage refers to when the explanations are generated: before, during, or after the training process. Scope determines whether the explanation targets the entire model or a specific input.

XAI usually involves different strategies, such as explaining with predefined rules or specific input examples, highlighting the most important features, focus areas, or input changes, and using simpler models to explain complex ones locally.

The research team identifies several ways audio models could be made more interpretable, such as using raw waveforms or spectrograms to provide listenable explanations and defining higher-level concepts in audio data, similar to how superpixels are used in image data. They also believe the expressive power of audio explanations could be extended to non-audio models, for example by offering a complementary communication channel for vision-based user interactions.

More information:
Alican Akman et al, Audio Explainable Artificial Intelligence: A Review, Intelligent Computing (2023). DOI: 10.34133/icomputing.0074

Provided by
Intelligent Computing

Audio explainable artificial intelligence: Demystifying ‘black box’ models
