Logo image
The problem with noise and small disjuncts
Technical documentation   Open access

The problem with noise and small disjuncts

Gary M. Weiss
Rutgers University
1994
DOI:
https://doi.org/10.7282/t3-ea9m-ev91

Abstract

Systems that learn from examples often create a disjunctive concept definition. The disjuncts in the concept definition which cover only a few training examples are referred to as small disjuncts. The problem with small disjuncts is that they are more error prone than large disjuncts, but may be necessary to achieve a high level of predictive accuracy [Holte, Acker, and Porter, 1989]. This paper extends previous work done on the problem of small disjuncts by taking noise into account. It investigates the assertion that it is hard to learn from noisy data because it is difficult to distinguish between noise and true exceptions. In the process of evaluating this assertion, insights are gained into the mechanisms by which noise affects learning. Two domains are investigated. The experimental results in this paper suggest that for both Shapiro’s chess endgame domain [Shapiro, 1987] and for the Wisconsin breast cancer domain [Wolberg, 1990], the assertion is true, at least for low levels (5- 10%) of class noise.
pdf
ml-tr-38280.21 kBDownloadView
Author's Original (AO) Open Access
url
Report an accessibility issueView
Please complete a content remediation request to report an accessibility issue with a library electronic resource, website, or service.

Metrics

104 File downloads
168 Record Views

Details

Logo image