|
Data Smoothing: An Application of CHAID
Bruce Ratner, Ph.D. Smoothing is a method of removing the rough (the error or noise component in data) and retaining the smooth (the predictable component in data) by averaging within “neighborhoods” of similar data values. Its utility is self-evident: No data analyst wants to model noise, producing a model that yields unreliable (large error variance) and inaccurate (large prediction bias) results. The purpose of this article is to discuss the concept behind smoothing, and then present CHAID as a data smoother. The illustration discussed is also found in - Ratner, B., Statistical Modeling and Analysis for Database Marketing: Effective Techniques for Mining Big Data, pages 74 – 84, CRC Press, Boca Raton, 2006. Related Articles: Dr. Bruce Ratner has explicated many novel and effective uses of CHAID ranging from statistical modeling and analysis to data mining. 1 800 DM STAT-1, or e-mail at br@dmstat1.com. |
|