The problem of cross-validation averaging and bias, repetition and significance - Details

Author：

Powers, David M.W. (Powers, David M.W..) | Atyabi, Adham (Atyabi, Adham.)

Indexed by：

EI Scopus

Abstract：

Cross-Validation　(CV)　is　the　primary　mechanism　used　in　Machine　Learning　to　control　generalization　error　in　the　absence　of　sufficiently　large　quantities　of　marked　up　(tagged　or　labelled)　data　to　undertake　independent　testing,　training　and　validation　(including　early　stopping,　feature　selection,　parameter　tuning,　boosting　and/or　fusion).　Repeated　Cross-Validation　(RCV)　is　used　to　try　to　further　improve　the　accuracy　of　our　performance　estimates,　including　compensating　for　outliers.　Typically　a　Machine　Learning　researcher　will　the　compare　a　new　target　algorithm　against　a　wide　range　of　competing　algorithms　on　a　wide　range　of　standard　datasets.　The　combination　of　many　training　folds,　many　CV　repetitions,　many　algorithms　and　parameterizations,　and　many　training　sets,　adds　up　to　a　very　large　number　of　data　points　to　compare,　and　a　massive　multiple　testing　problem　quadratic　in　the　number　of　individual　test　combinations.　Research　in　Machine　Learning　sometimes　involves　basic　significance　testing,　or　provides　confidence　intervals,　but　seldom　addresses　the　multiple　testing　problem　whereby　the　assumption　of　p

Keyword：

Correlation methods Adaptive boosting Errors Machine learning

Author Community：

[ 1 ] [Powers, David M.W.]Beijing Municipal Lab for Multimedia and Intelligent Software, Beijing University of Technology, Beijing, China
[ 2 ] [Powers, David M.W.]KIT Centre, School of Comp.Sci., Eng. and Maths, Flinders University, Adelaide, Australia
[ 3 ] [Atyabi, Adham]KIT Centre, School of Comp.Sci., Eng. and Maths, Flinders University, Adelaide, Australia

Reprint Author's Address：

Email：

powers@ieee.org

Show more details

Related Keywords：

Adabook & multibook: Adaptive boosting with chance correction
2013，10th International Conference on Informatics in Control, Automation and Robotics, ICINCO 2013
User Identification by Keystroke Dynamics Based on Feature Correlation Analysis and Feature optimization
2019，5th IEEE International Conference on Computer and Communications, ICCC 2019
To identify key geometric errors of 3-axis NC machine tool by machining accuracy failure mode analysis
2019，ASME 2019 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC-CIE 2019
A smoothing algorithm for strain measurement by digital image correlation method
2007，2007 8th International Conference on Electronic Packaging Technology, ICEPT

Source ：

Year： 2012

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 10

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 5

Affiliated Colleges：

城市建设学部建筑工程学院

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to