Is a model fitted to data or is data fitted to a model?
$begingroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
$endgroup$
add a comment |
$begingroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
$endgroup$
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
2 days ago
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
17 hours ago
add a comment |
$begingroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
$endgroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
terminology
edited yesterday
Nick Cox
39k587131
39k587131
asked 2 days ago
enjayesenjayes
1037
1037
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
2 days ago
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
17 hours ago
add a comment |
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
2 days ago
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
17 hours ago
7
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
2 days ago
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
2 days ago
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
17 hours ago
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
17 hours ago
add a comment |
4 Answers
4
active
oldest
votes
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "65"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f399094%2fis-a-model-fitted-to-data-or-is-data-fitted-to-a-model%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
4 Answers
4
active
oldest
votes
4 Answers
4
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
add a comment |
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
add a comment |
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
edited 3 hours ago
answered yesterday
Matthew DruryMatthew Drury
26.7k267107
26.7k267107
add a comment |
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
answered yesterday
mdeweymdewey
12.5k72344
12.5k72344
add a comment |
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
answered yesterday
Glen_b♦Glen_b
214k23416768
214k23416768
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
add a comment |
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
5
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
yesterday
1
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
yesterday
1
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
yesterday
1
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
11 hours ago
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
answered yesterday
qwrqwr
18011
18011
add a comment |
add a comment |
Thanks for contributing an answer to Cross Validated!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f399094%2fis-a-model-fitted-to-data-or-is-data-fitted-to-a-model%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
2 days ago
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
17 hours ago