BenchmarkingOverview
In statistical learning benchmarking is the methodology of comparing learners or algorithms with respect to a certain performance measure. New benchmark experiments are published on almost a daily basis. Especially in the machine learning community benchmarking is the primary method of choice to evaluate new learning algorithms. The benchmarking process abstractly consists of three levels: Setup, Execution and Analysis. In each level different statistical and computational aspects play a role. The aim of our work is to investigate each level in detail and provide a statistically correct way of comparing learning methods with smallest possible computational effort. In addition to theoretical investigations, we are developing an open-source reference implementation within R -- an environment for statistical computing and graphics. The implementation will allow reproducible comparisons of learning algorithms (existing or newly developed ones) in an easy manner. The ultimate goal is to set a quasi-standard for the comparison of learning methods with this toolbox. LMU Project MembersResourcesSelected publications
|