Why use OLS when it is assumed there is heteroscedasticity?





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty{ margin-bottom:0;
}






up vote
2
down vote

favorite












So I'm slowly going through the Stock and Watson book and I'm a bit confused on how to deal with the issue of homoscedacity/heteroscedacity. Specifically, it is mentioned that economic theory tells us that there's no reason for us to assume that errors will be homoscedastic, so their advice is that we assume heteroscedasticity and always use the heteroscedastic robust standard errors when performing our regression analysis. The way I'm being taught this material, in STATA for example, is that we just run the reg command, always sure to include r for robust standard error.



My question(s) is this: if our default position is to assume heteroscedacticity, then is it also correct that OLS is no longer the best unbiased linear estimator as one of the Gauss-Markov assumptions is violated? And if this is the case, is it also correct that GLS would be the BLUE estimator? Lastly, if both of these assumptions are correct, why would we not just run GLS regressions as our default and not OLS?



Thanks.










share|cite|improve this question







New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • 3




    It would be good to get some clarification about your meaning of "GLS." My understanding of GLS is that you have to provide specific information about the error variances. What do you have in mind doing in the general (and by far most common) case when there is no such information directly available?
    – whuber
    12 hours ago












  • Hi, I'm not sure what to clarify. I'm not very familiar with the Generalized Least Squares method, but in your comment, is that the answer? Namely, that even assuming that our errors are heteroscedastic, we do an OLS regression anyways, because to do a GLS we need information on the error terms that we don't have? Sorry, this is all quite new to me, and I'm sure I've not phrased the question well. Thanks to all for their comments.
    – anguyen1210
    12 hours ago

















up vote
2
down vote

favorite












So I'm slowly going through the Stock and Watson book and I'm a bit confused on how to deal with the issue of homoscedacity/heteroscedacity. Specifically, it is mentioned that economic theory tells us that there's no reason for us to assume that errors will be homoscedastic, so their advice is that we assume heteroscedasticity and always use the heteroscedastic robust standard errors when performing our regression analysis. The way I'm being taught this material, in STATA for example, is that we just run the reg command, always sure to include r for robust standard error.



My question(s) is this: if our default position is to assume heteroscedacticity, then is it also correct that OLS is no longer the best unbiased linear estimator as one of the Gauss-Markov assumptions is violated? And if this is the case, is it also correct that GLS would be the BLUE estimator? Lastly, if both of these assumptions are correct, why would we not just run GLS regressions as our default and not OLS?



Thanks.










share|cite|improve this question







New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • 3




    It would be good to get some clarification about your meaning of "GLS." My understanding of GLS is that you have to provide specific information about the error variances. What do you have in mind doing in the general (and by far most common) case when there is no such information directly available?
    – whuber
    12 hours ago












  • Hi, I'm not sure what to clarify. I'm not very familiar with the Generalized Least Squares method, but in your comment, is that the answer? Namely, that even assuming that our errors are heteroscedastic, we do an OLS regression anyways, because to do a GLS we need information on the error terms that we don't have? Sorry, this is all quite new to me, and I'm sure I've not phrased the question well. Thanks to all for their comments.
    – anguyen1210
    12 hours ago













up vote
2
down vote

favorite









up vote
2
down vote

favorite











So I'm slowly going through the Stock and Watson book and I'm a bit confused on how to deal with the issue of homoscedacity/heteroscedacity. Specifically, it is mentioned that economic theory tells us that there's no reason for us to assume that errors will be homoscedastic, so their advice is that we assume heteroscedasticity and always use the heteroscedastic robust standard errors when performing our regression analysis. The way I'm being taught this material, in STATA for example, is that we just run the reg command, always sure to include r for robust standard error.



My question(s) is this: if our default position is to assume heteroscedacticity, then is it also correct that OLS is no longer the best unbiased linear estimator as one of the Gauss-Markov assumptions is violated? And if this is the case, is it also correct that GLS would be the BLUE estimator? Lastly, if both of these assumptions are correct, why would we not just run GLS regressions as our default and not OLS?



Thanks.










share|cite|improve this question







New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











So I'm slowly going through the Stock and Watson book and I'm a bit confused on how to deal with the issue of homoscedacity/heteroscedacity. Specifically, it is mentioned that economic theory tells us that there's no reason for us to assume that errors will be homoscedastic, so their advice is that we assume heteroscedasticity and always use the heteroscedastic robust standard errors when performing our regression analysis. The way I'm being taught this material, in STATA for example, is that we just run the reg command, always sure to include r for robust standard error.



My question(s) is this: if our default position is to assume heteroscedacticity, then is it also correct that OLS is no longer the best unbiased linear estimator as one of the Gauss-Markov assumptions is violated? And if this is the case, is it also correct that GLS would be the BLUE estimator? Lastly, if both of these assumptions are correct, why would we not just run GLS regressions as our default and not OLS?



Thanks.







least-squares heteroscedasticity generalized-least-squares blue






share|cite|improve this question







New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|cite|improve this question







New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|cite|improve this question




share|cite|improve this question






New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked 12 hours ago









anguyen1210

133




133




New contributor




anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






anguyen1210 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








  • 3




    It would be good to get some clarification about your meaning of "GLS." My understanding of GLS is that you have to provide specific information about the error variances. What do you have in mind doing in the general (and by far most common) case when there is no such information directly available?
    – whuber
    12 hours ago












  • Hi, I'm not sure what to clarify. I'm not very familiar with the Generalized Least Squares method, but in your comment, is that the answer? Namely, that even assuming that our errors are heteroscedastic, we do an OLS regression anyways, because to do a GLS we need information on the error terms that we don't have? Sorry, this is all quite new to me, and I'm sure I've not phrased the question well. Thanks to all for their comments.
    – anguyen1210
    12 hours ago














  • 3




    It would be good to get some clarification about your meaning of "GLS." My understanding of GLS is that you have to provide specific information about the error variances. What do you have in mind doing in the general (and by far most common) case when there is no such information directly available?
    – whuber
    12 hours ago












  • Hi, I'm not sure what to clarify. I'm not very familiar with the Generalized Least Squares method, but in your comment, is that the answer? Namely, that even assuming that our errors are heteroscedastic, we do an OLS regression anyways, because to do a GLS we need information on the error terms that we don't have? Sorry, this is all quite new to me, and I'm sure I've not phrased the question well. Thanks to all for their comments.
    – anguyen1210
    12 hours ago








3




3




It would be good to get some clarification about your meaning of "GLS." My understanding of GLS is that you have to provide specific information about the error variances. What do you have in mind doing in the general (and by far most common) case when there is no such information directly available?
– whuber
12 hours ago






It would be good to get some clarification about your meaning of "GLS." My understanding of GLS is that you have to provide specific information about the error variances. What do you have in mind doing in the general (and by far most common) case when there is no such information directly available?
– whuber
12 hours ago














Hi, I'm not sure what to clarify. I'm not very familiar with the Generalized Least Squares method, but in your comment, is that the answer? Namely, that even assuming that our errors are heteroscedastic, we do an OLS regression anyways, because to do a GLS we need information on the error terms that we don't have? Sorry, this is all quite new to me, and I'm sure I've not phrased the question well. Thanks to all for their comments.
– anguyen1210
12 hours ago




Hi, I'm not sure what to clarify. I'm not very familiar with the Generalized Least Squares method, but in your comment, is that the answer? Namely, that even assuming that our errors are heteroscedastic, we do an OLS regression anyways, because to do a GLS we need information on the error terms that we don't have? Sorry, this is all quite new to me, and I'm sure I've not phrased the question well. Thanks to all for their comments.
– anguyen1210
12 hours ago










2 Answers
2






active

oldest

votes

















up vote
5
down vote



accepted










Because GLS is BLUE if you know the form of heteroskedasticity (and correlated errors). If you misspecify the form of heteroscedasticity, GLS estimates will lose their nice properties.



Under heteroscedasticity, OLS remains unbiased and consistent, but you lose efficiency.



So unless you're certain of the form of heteroscedasticity, it makes sense to stick with unbiased and consistent estimates from OLS. Then adjust inference for heteroskedasticity using robust standard errors which are valid asymptotically if you don't know the form of heteroscedasticity.



A hybrid approach is to do your best at specifying the form of heteroskedasticity but still apply robust standard errors for inference. See Resurrecting weighted least squares (PDF).



Modeling is all about tradeoffs and resources. If you are convinced there is nothing to be learned from modeling the form of heteroscedasticity, then specifying its form is a waste of time. I would argue that there is usually something to be learned in empirical applications. But tradition/convention nudges us away from studying heteroscedasticity, looking at the variances, since all they are is "error".






share|cite|improve this answer






























    up vote
    0
    down vote













    OLS is still unbiased when the data are correlated (provided the mean model is true). The net effect of heteroscedasticity is that it offsets the errors, so that the 95% CI for the regression is, at times, too tight and at other times too wide. Even still, you can correct the standard errors by using the sandwich or heteroscedasticity consistent standard error (HC) estimator. Technically, this is not "ordinary least squares" but it results in the same effect summary measures: a slope, interecept, and 95% CIs for their values, just no global test, or F-tests, and no validity to prediction intervals.






    share|cite|improve this answer





















      Your Answer





      StackExchange.ifUsing("editor", function () {
      return StackExchange.using("mathjaxEditing", function () {
      StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
      StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
      });
      });
      }, "mathjax-editing");

      StackExchange.ready(function() {
      var channelOptions = {
      tags: "".split(" "),
      id: "65"
      };
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function() {
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled) {
      StackExchange.using("snippets", function() {
      createEditor();
      });
      }
      else {
      createEditor();
      }
      });

      function createEditor() {
      StackExchange.prepareEditor({
      heartbeatType: 'answer',
      convertImagesToLinks: false,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: null,
      bindNavPrevention: true,
      postfix: "",
      imageUploader: {
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      },
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      });


      }
      });






      anguyen1210 is a new contributor. Be nice, and check out our Code of Conduct.










       

      draft saved


      draft discarded


















      StackExchange.ready(
      function () {
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f378851%2fwhy-use-ols-when-it-is-assumed-there-is-heteroscedasticity%23new-answer', 'question_page');
      }
      );

      Post as a guest















      Required, but never shown

























      2 Answers
      2






      active

      oldest

      votes








      2 Answers
      2






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes








      up vote
      5
      down vote



      accepted










      Because GLS is BLUE if you know the form of heteroskedasticity (and correlated errors). If you misspecify the form of heteroscedasticity, GLS estimates will lose their nice properties.



      Under heteroscedasticity, OLS remains unbiased and consistent, but you lose efficiency.



      So unless you're certain of the form of heteroscedasticity, it makes sense to stick with unbiased and consistent estimates from OLS. Then adjust inference for heteroskedasticity using robust standard errors which are valid asymptotically if you don't know the form of heteroscedasticity.



      A hybrid approach is to do your best at specifying the form of heteroskedasticity but still apply robust standard errors for inference. See Resurrecting weighted least squares (PDF).



      Modeling is all about tradeoffs and resources. If you are convinced there is nothing to be learned from modeling the form of heteroscedasticity, then specifying its form is a waste of time. I would argue that there is usually something to be learned in empirical applications. But tradition/convention nudges us away from studying heteroscedasticity, looking at the variances, since all they are is "error".






      share|cite|improve this answer



























        up vote
        5
        down vote



        accepted










        Because GLS is BLUE if you know the form of heteroskedasticity (and correlated errors). If you misspecify the form of heteroscedasticity, GLS estimates will lose their nice properties.



        Under heteroscedasticity, OLS remains unbiased and consistent, but you lose efficiency.



        So unless you're certain of the form of heteroscedasticity, it makes sense to stick with unbiased and consistent estimates from OLS. Then adjust inference for heteroskedasticity using robust standard errors which are valid asymptotically if you don't know the form of heteroscedasticity.



        A hybrid approach is to do your best at specifying the form of heteroskedasticity but still apply robust standard errors for inference. See Resurrecting weighted least squares (PDF).



        Modeling is all about tradeoffs and resources. If you are convinced there is nothing to be learned from modeling the form of heteroscedasticity, then specifying its form is a waste of time. I would argue that there is usually something to be learned in empirical applications. But tradition/convention nudges us away from studying heteroscedasticity, looking at the variances, since all they are is "error".






        share|cite|improve this answer

























          up vote
          5
          down vote



          accepted







          up vote
          5
          down vote



          accepted






          Because GLS is BLUE if you know the form of heteroskedasticity (and correlated errors). If you misspecify the form of heteroscedasticity, GLS estimates will lose their nice properties.



          Under heteroscedasticity, OLS remains unbiased and consistent, but you lose efficiency.



          So unless you're certain of the form of heteroscedasticity, it makes sense to stick with unbiased and consistent estimates from OLS. Then adjust inference for heteroskedasticity using robust standard errors which are valid asymptotically if you don't know the form of heteroscedasticity.



          A hybrid approach is to do your best at specifying the form of heteroskedasticity but still apply robust standard errors for inference. See Resurrecting weighted least squares (PDF).



          Modeling is all about tradeoffs and resources. If you are convinced there is nothing to be learned from modeling the form of heteroscedasticity, then specifying its form is a waste of time. I would argue that there is usually something to be learned in empirical applications. But tradition/convention nudges us away from studying heteroscedasticity, looking at the variances, since all they are is "error".






          share|cite|improve this answer














          Because GLS is BLUE if you know the form of heteroskedasticity (and correlated errors). If you misspecify the form of heteroscedasticity, GLS estimates will lose their nice properties.



          Under heteroscedasticity, OLS remains unbiased and consistent, but you lose efficiency.



          So unless you're certain of the form of heteroscedasticity, it makes sense to stick with unbiased and consistent estimates from OLS. Then adjust inference for heteroskedasticity using robust standard errors which are valid asymptotically if you don't know the form of heteroscedasticity.



          A hybrid approach is to do your best at specifying the form of heteroskedasticity but still apply robust standard errors for inference. See Resurrecting weighted least squares (PDF).



          Modeling is all about tradeoffs and resources. If you are convinced there is nothing to be learned from modeling the form of heteroscedasticity, then specifying its form is a waste of time. I would argue that there is usually something to be learned in empirical applications. But tradition/convention nudges us away from studying heteroscedasticity, looking at the variances, since all they are is "error".







          share|cite|improve this answer














          share|cite|improve this answer



          share|cite|improve this answer








          edited 11 hours ago

























          answered 12 hours ago









          Heteroskedastic Jim

          2,631520




          2,631520
























              up vote
              0
              down vote













              OLS is still unbiased when the data are correlated (provided the mean model is true). The net effect of heteroscedasticity is that it offsets the errors, so that the 95% CI for the regression is, at times, too tight and at other times too wide. Even still, you can correct the standard errors by using the sandwich or heteroscedasticity consistent standard error (HC) estimator. Technically, this is not "ordinary least squares" but it results in the same effect summary measures: a slope, interecept, and 95% CIs for their values, just no global test, or F-tests, and no validity to prediction intervals.






              share|cite|improve this answer

























                up vote
                0
                down vote













                OLS is still unbiased when the data are correlated (provided the mean model is true). The net effect of heteroscedasticity is that it offsets the errors, so that the 95% CI for the regression is, at times, too tight and at other times too wide. Even still, you can correct the standard errors by using the sandwich or heteroscedasticity consistent standard error (HC) estimator. Technically, this is not "ordinary least squares" but it results in the same effect summary measures: a slope, interecept, and 95% CIs for their values, just no global test, or F-tests, and no validity to prediction intervals.






                share|cite|improve this answer























                  up vote
                  0
                  down vote










                  up vote
                  0
                  down vote









                  OLS is still unbiased when the data are correlated (provided the mean model is true). The net effect of heteroscedasticity is that it offsets the errors, so that the 95% CI for the regression is, at times, too tight and at other times too wide. Even still, you can correct the standard errors by using the sandwich or heteroscedasticity consistent standard error (HC) estimator. Technically, this is not "ordinary least squares" but it results in the same effect summary measures: a slope, interecept, and 95% CIs for their values, just no global test, or F-tests, and no validity to prediction intervals.






                  share|cite|improve this answer












                  OLS is still unbiased when the data are correlated (provided the mean model is true). The net effect of heteroscedasticity is that it offsets the errors, so that the 95% CI for the regression is, at times, too tight and at other times too wide. Even still, you can correct the standard errors by using the sandwich or heteroscedasticity consistent standard error (HC) estimator. Technically, this is not "ordinary least squares" but it results in the same effect summary measures: a slope, interecept, and 95% CIs for their values, just no global test, or F-tests, and no validity to prediction intervals.







                  share|cite|improve this answer












                  share|cite|improve this answer



                  share|cite|improve this answer










                  answered 11 hours ago









                  AdamO

                  31.8k257135




                  31.8k257135






















                      anguyen1210 is a new contributor. Be nice, and check out our Code of Conduct.










                       

                      draft saved


                      draft discarded


















                      anguyen1210 is a new contributor. Be nice, and check out our Code of Conduct.













                      anguyen1210 is a new contributor. Be nice, and check out our Code of Conduct.












                      anguyen1210 is a new contributor. Be nice, and check out our Code of Conduct.















                       


                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function () {
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f378851%2fwhy-use-ols-when-it-is-assumed-there-is-heteroscedasticity%23new-answer', 'question_page');
                      }
                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      How did Captain America manage to do this?

                      迪纳利

                      南乌拉尔铁路局