Logo image
MODE: automated neural network model debugging via state differential analysis and input selection
Conference paper

MODE: automated neural network model debugging via state differential analysis and input selection

Shiqing Ma, Yingqi Liu, Wen-Chuan Lee, Xiangyu Zhang and Ananth Grama
ESEC/FSE'18: Proceedings of the 2018 26th ACM Joint Meeting On European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp.175-186
Assoc Computing Machinery
ESEC/FSE 2018: ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 26 (Lake Buena Vista, FL, 11/04/2018–11/09/2018)
01/01/2018

Abstract

Computer Science, Software Engineering Science & Technology Computer Science Technology
Artificial intelligence models are becoming an integral part of modern computing systems. Just like software inevitably has bugs, models have bugs too, leading to poor classification/prediction accuracy. Unlike software bugs, model bugs cannot be easily fixed by directly modifying models. Existing solutions work by providing additional training inputs. However, they have limited effectiveness due to the lack of understanding of model misbehaviors and hence the incapability of selecting proper inputs. Inspired by software debugging, we propose a novel model debugging technique that works by first conducting model state differential analysis to identify the internal features of the model that are responsible for model bugs and then performing training input selection that is similar to program input selection in regression testing. Our evaluation results on 29 different models for 6 different applications show that our technique can fix model bugs effectively and efficiently without introducing new bugs. For simple applications (e.g., digit recognition), MODE improves the test accuracy from 75% to 93% on average whereas the state-of-the-art can only improve to 85% with 11 times more training time. For complex applications and models (e.g., object recognition), MODE is able to improve the accuracy from 75% to over 91% in minutes to a few hours, whereas state-of-the-art fails to fix the bug or even degrades the test accuracy.
pdf
FSE18 MODE automated neural 20182.32 MB
Version of Record (VoR) Restricted Access, To request access, contact soarhelp@libraries.rutgers.edu.
url
https://doi.org/10.1145/3236024.3236082View
Version of Record (VoR) ACM digital library
url
Report an accessibility issueView
Please complete a content remediation request to report an accessibility issue with a library electronic resource, website, or service.

Metrics

86 Record Views

Details

Logo image