Validation accuracy vs Testing accuracy

I am trying to get my head straight on terminology that appears confusing. I know there are three 'splits' of data used in machine learning models:

Training Data - Train the model

Validation Data - Cross validation for model selection

Testing Data - Test the generalisation error.

Now, as far as I am aware, a separate validation set is not always used, since one can use k-fold cross-validation instead, which reduces the need to split one's dataset further. The results of this are known as the validation accuracy. Then, once the best model is selected, it is tested on a 33% split of the initial data set (which has not been used for training). The results of this would be the testing accuracy?

Is this the right way around, or is it vice versa? I am finding conflicting terminology used online! I am trying to find some explanation of why my validation error is larger than my testing error, but before I look for a solution, I would like to get my terminology correct.

Thanks.

asked 4 hours ago

BillyJo_rambler



machine-learning


2 Answers

There isn't a standard terminology in this context (and I have seen long discussions and debates regarding this topic), so I completely understand your confusion, but you should get used to differing terminology (and assume that terminology might not be consistent, or might change, across sources).

I would like to point out a few things:

I have never seen people use the expression "validation accuracy" (or dataset) to refer to the test accuracy (or dataset), but I have seen people use the term "test accuracy" (or dataset) to refer to the validation accuracy (or dataset). In other words, the test (or testing) accuracy often refers to the validation accuracy, that is, the accuracy you calculate on the data set that you do not use for training but do use (during the training process) for validating (or "testing") the generalisation ability of your model or for "early stopping".

In k-fold cross-validation, people usually only mention two datasets: training and testing (or validation).

k-fold cross-validation is just a way of validating the model on different subsets of the data. This can be done for several reasons. For example, if you have a small amount of data, your validation (and training) dataset is quite small, so you can get a better understanding of the model's generalisation ability by validating it on several subsets of the whole dataset.

You should probably have a dataset for testing that is separate from the validation dataset, because the validation dataset can be used for early stopping, so, in a certain way, it is dependent on the training process.
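To make the k-fold point above concrete, here is a minimal sketch in plain Python. The helper names `k_fold_indices` and `cross_val_accuracy`, and the `fit`/`predict` callables, are illustrative assumptions and do not come from any particular library.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at i,
    # so fold sizes differ by at most one.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        val_idx = folds[i]
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, val_idx


def cross_val_accuracy(fit, predict, X, y, k=5, seed=0):
    """Average validation accuracy over the k folds.

    `fit(X_train, y_train)` returns a model and `predict(model, x)` returns
    a label; both are hypothetical callables supplied by the caller.
    """
    scores = []
    for train_idx, val_idx in k_fold_indices(len(X), k, seed):
        model = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        correct = sum(predict(model, X[i]) == y[i] for i in val_idx)
        scores.append(correct / len(val_idx))
    return sum(scores) / len(scores)


# Each sample is used for validation exactly once across the k folds.
for train_idx, val_idx in k_fold_indices(10, k=3):
    print(len(train_idx), len(val_idx))
```

The number this produces is what I would call the (cross-)validation accuracy; no separate test data is involved in it.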

I would suggest using the following terminology:

Training dataset: the data used to fit the model.

Validation dataset: the data used to validate the generalisation ability of the model or for early stopping, during the training process.

Testing dataset: the data used for purposes other than training and validating.

Note that some of these datasets might overlap. Whether that is a "good" thing or not is another question.
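As an illustration of the three roles above, a minimal sketch of one way to cut a dataset into training, validation and testing parts. It assumes the parts should be disjoint (which is only one of the possibilities just mentioned); the function name `three_way_split` and the default fractions are assumptions made for this example.

```python
import random


def three_way_split(samples, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle once, then cut into disjoint train / validation / test lists."""
    if not 0 < val_frac + test_frac < 1:
        raise ValueError("val_frac + test_frac must be strictly between 0 and 1")
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_frac)
    n_val = int(len(shuffled) * val_frac)
    test = shuffled[:n_test]                 # set aside for the final report
    val = shuffled[n_test:n_test + n_val]    # used for validation / early stopping
    train = shuffled[n_test + n_val:]        # used to fit the model
    return train, val, test


train, val, test = three_way_split(range(100))
print(len(train), len(val), len(test))
```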

edited 4 hours ago

answered 4 hours ago

nbro


@nbro's answer is complete; I will just add a couple of explanations to supplement it. In more traditional textbooks, data is often partitioned into two sets: training and test. In recent years, with more complex models and an increasing need for model selection, development sets or validation sets are also considered. The development/validation set should have no overlap with the test set; otherwise the reported accuracy or error evaluation is not valid.

In the modern setting, the model is trained on the training set and tested on the validation set to see if it is a good fit; the model is then possibly tweaked, trained again, and validated again, multiple times. When the final model is selected, the test set is used to calculate the accuracy and error reports. The important thing is that the test set is only touched once.
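The workflow described above can be sketched roughly as follows. This is only a toy, under stated assumptions: the model is a 1-D k-nearest-neighbours classifier, the "tweaking" is the choice of k, and the data are synthetic. All function names are made up for the example.

```python
import random
from collections import Counter


def knn_predict(train, x, k):
    """Majority label among the k training points closest to x (1-D features)."""
    nearest = sorted(train, key=lambda pair: abs(pair[0] - x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(train, data, k):
    """Fraction of (x, label) pairs in `data` that k-NN on `train` gets right."""
    return sum(knn_predict(train, x, k) == label for x, label in data) / len(data)


def select_then_test(train, val, test, candidate_ks):
    # Model selection: the validation set may be consulted as often as needed.
    best_k = max(candidate_ks, key=lambda k: accuracy(train, val, k))
    val_acc = accuracy(train, val, best_k)
    # Final report: the test set is scored exactly once, after selection is done.
    test_acc = accuracy(train, test, best_k)
    return best_k, val_acc, test_acc


# Synthetic data (an assumption): label is 1 when x > 0, with roughly 10% label noise.
rng = random.Random(0)
points = []
for _ in range(300):
    x = rng.uniform(-1, 1)
    label = int(x > 0)
    if rng.random() < 0.1:
        label = 1 - label
    points.append((x, label))

train, val, test = points[:180], points[180:240], points[240:]
best_k, val_acc, test_acc = select_then_test(train, val, test, [1, 3, 5, 7])
print(best_k, val_acc, test_acc)
```

Because `best_k` is chosen to maximise the validation accuracy, that number tends to be a little optimistic, while the test accuracy is not affected by the selection step. On any single split the two can still differ in either direction by chance.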

answered 1 hour ago

user3089485

