|
A Comparison of Two Popular Machine Learning Methods: Common Pitfalls
Bruce Ratner, Ph.D. Machine learning, a computer-based approach for solving problems, has recently been the subject of comparative studies because it represents a potentially viable alternative to traditional statistical methodology. The purpose of this article is not to debate the comparative merits of either type of model; suffice it to say that much about machine learning has value. Given their expertise with quantitative methods, analysts - whether statisticians or computer scientists - have been not been especially mindful of how they were comparing methods, and thus neglected the essential trinity of contingencies: • proper implementation of the method • the method’s explicit measure of performance, • the data However, by introducing a new approach such as machine learning, traditionalists should be cautioned that without a strict adherence to proper comparison techniques, their findings would be flawed. In providing a technical review of the two most popular machine learning methods - genetic programming and neural networks – I have utilized the “holy” trinity of contingencies, demonstrating for analysts how to conduct their own balanced evaluations of these popular methods. I offer a few definitions of machine learning, after which I provide a motivational theme of machine learning. Then, I provide a technical review of genetic programming and neural networks. 1 800 DM STAT-1, or e-mail at br@dmstat1.com. |
|