MODE: automated neural network model debugging via state differential analysis and input selection

Shiqing Ma; Yingqi Liu; Wen-Chuan Lee; Xiangyu Zhang; Ananth Grama

doi:10.1145/3236024.3236082

Conference paper

ESEC/FSE'18: Proceedings of the 2018 26th ACM Joint Meeting On European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp.175-186

Assoc Computing Machinery

ESEC/FSE 2018: ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 26 (Lake Buena Vista, FL, 11/04/2018–11/09/2018)

01/01/2018

DOI: https://doi.org/10.1145/3236024.3236082

Abstract

Computer Science, Software Engineering

Science & Technology

Computer Science

Technology

Artificial intelligence models are becoming an integral part of modern computing systems. Just like software inevitably has bugs, models have bugs too, leading to poor classification/prediction accuracy. Unlike software bugs, model bugs cannot be easily fixed by directly modifying models. Existing solutions work by providing additional training inputs. However, they have limited effectiveness due to the lack of understanding of model misbehaviors and hence the incapability of selecting proper inputs. Inspired by software debugging, we propose a novel model debugging technique that works by first conducting model state differential analysis to identify the internal features of the model that are responsible for model bugs and then performing training input selection that is similar to program input selection in regression testing. Our evaluation results on 29 different models for 6 different applications show that our technique can fix model bugs effectively and efficiently without introducing new bugs. For simple applications (e.g., digit recognition), MODE improves the test accuracy from 75% to 93% on average whereas the state-of-the-art can only improve to 85% with 11 times more training time. For complex applications and models (e.g., object recognition), MODE is able to improve the accuracy from 75% to over 91% in minutes to a few hours, whereas state-of-the-art fails to fix the bug or even degrades the test accuracy.

Files and links

pdf

FSE18 MODE automated neural 2018 (2.32 MB)

Version of Record (VoR). Restricted access; to request access, contact soarhelp@libraries.rutgers.edu.

url

https://doi.org/10.1145/3236024.3236082

Version of Record (VoR) ACM digital library

Details

Title: MODE: automated neural network model debugging via state differential analysis and input selection
Creators: Shiqing Ma - Rutgers University, Computer Science (SAS)
Yingqi Liu - Purdue Univ, W Lafayette, IN 47907 USA
Wen-Chuan Lee - Purdue University System
Xiangyu Zhang - Rutgers University, Chemistry (SASN)
Ananth Grama - Purdue University System
Publication Details: ESEC/FSE'18: Proceedings of the 2018 26th ACM Joint Meeting On European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp.175-186
Conference: ESEC/FSE 2018: ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 26 (Lake Buena Vista, FL, 11/04/2018–11/09/2018)
Date published: 01/01/2018
Publisher: Assoc Computing Machinery
Number of pages: 12
Grant note: FA8650-15-C-7562 / DARPA; United States Department of Defense; Defense Advanced Research Projects Agency (DARPA) N000141410468; N000141712947 / ONR; Office of Naval Research 1701331 / Sandia National Lab; United States Department of Energy (DOE) United States Air Force; United States Department of Defense 1748764; 1409668; 1320444 / NSF; National Science Foundation (NSF)
Academic Unit: Computer Science (SAS)
Language: English
Resource Type: Conference paper
Identifiers: 991031794683204646