Repository logo
 

LAGO: a computationally efficient approach for statistical detection

Faculty Advisor

Date

2006

Keywords

adaptive bandwidth kernel density estimator, average precision, drug discovery, nearest neighbor, order statistic, radial basis function network,, support vector machine

Abstract (summary)

We study a general class of statistical detection problems where the underlying objective is to detect items belonging to a rare class from a very large database. We propose a computationally efficient method to achieve this goal. Our method consists of two steps. In the first step we estimate the density function of the rare class alone with an adaptive bandwidth kernel density estimator. The adaptive choice of the bandwidth is inspired by the ancient Chinese board game known today as Go. In the second step we adjust this density locally depending on the density of the background class nearby. We show that the amount of adjustment needed in the second step is approximately equal to the adaptive bandwidth from the first step, which gives us additional computational savings. We name the resulting method LAGO, for “locally adjusted Go-kernel density estimator.” We then apply LAGO to a real drug discovery dataset and compare its performance with a number of existing and popular methods.

Publication Information

Zhu, M., Su, W., & Chipman, H. A. (2006). Lago: a computationally efficient approach for statistical detection. Technometrics, 48(2), 193-205.

Notes

Item Type

Article

Language

English

Rights

All Rights Reserved