LAGO: a computationally efficient approach for statistical detection

Author
Su, Wanhua
Zhu, M.
Chipman, H. A.
Faculty Advisor
Date
2006
Keywords
adaptive bandwidth kernel density estimator , average precision , drug discovery , nearest neighbor , order statistic , radial basis function network, , support vector machine
Abstract (summary)
We study a general class of statistical detection problems where the underlying objective is to detect items belonging to a rare class from a very large database. We propose a computationally efficient method to achieve this goal. Our method consists of two steps. In the first step we estimate the density function of the rare class alone with an adaptive bandwidth kernel density estimator. The adaptive choice of the bandwidth is inspired by the ancient Chinese board game known today as Go. In the second step we adjust this density locally depending on the density of the background class nearby. We show that the amount of adjustment needed in the second step is approximately equal to the adaptive bandwidth from the first step, which gives us additional computational savings. We name the resulting method LAGO, for “locally adjusted Go-kernel density estimator.” We then apply LAGO to a real drug discovery dataset and compare its performance with a number of existing and popular methods.
Publication Information
Zhu, M., Su, W., & Chipman, H. A. (2006). Lago: a computationally efficient approach for statistical detection. Technometrics, 48(2), 193-205.
DOI
Notes
Item Type
Article
Language
English
Rights
All Rights Reserved