Is the sample correlation always positively correlated with the sample variance? The Next CEO of Stack OverflowGiven known bivariate normal means and variances, update correlation estimate, $P(rho)$, with new data?Where does the correlation come from in the regression coefficient equation for simple regressionCDF of the ratio of two correlated $chi^2$ random variablesIs there a version of the correlation coefficient that is less-sensitive to outliers?Correlation in Distances of Points Within a Circle from Centre and One Other PointHow do I reproduce this distribution (with observed means, sd, kurtosis, skewness and correlation)?Is the formula of covariance right?Is my Correlation reasoning correct?Variance of $Y|x$ from regression lineIn a bivariate normal sample, why is the squared sample correlation Beta distributed?

Are British MPs missing the point, with these 'Indicative Votes'?

pgfplots: How to draw a tangent graph below two others?

How to pronounce fünf in 45

A hang glider, sudden unexpected lift to 25,000 feet altitude, what could do this?

How can a day be of 24 hours?

The sum of any ten consecutive numbers from a fibonacci sequence is divisible by 11

Why does freezing point matter when picking cooler ice packs?

Traveling with my 5 year old daughter (as the father) without the mother from Germany to Mexico

Why do we say “un seul M” and not “une seule M” even though M is a “consonne”?

Create custom note boxes

Why did early computer designers eschew integers?

Gauss' Posthumous Publications?

Is the sample correlation always positively correlated with the sample variance?

Can I cast Thunderwave and be at the center of its bottom face, but not be affected by it?

That's an odd coin - I wonder why

Would a grinding machine be a simple and workable propulsion system for an interplanetary spacecraft?

How does a dynamic QR code work?

Find the majority element, which appears more than half the time

What happens if you break a law in another country outside of that country?

Direct Implications Between USA and UK in Event of No-Deal Brexit

My ex-girlfriend uses my Apple ID to login to her iPad, do I have to give her my Apple ID password to reset it?

How to implement Comparable so it is consistent with identity-equality

What difference does it make matching a word with/without a trailing whitespace?

Is the offspring between a demon and a celestial possible? If so what is it called and is it in a book somewhere?



Is the sample correlation always positively correlated with the sample variance?



The Next CEO of Stack OverflowGiven known bivariate normal means and variances, update correlation estimate, $P(rho)$, with new data?Where does the correlation come from in the regression coefficient equation for simple regressionCDF of the ratio of two correlated $chi^2$ random variablesIs there a version of the correlation coefficient that is less-sensitive to outliers?Correlation in Distances of Points Within a Circle from Centre and One Other PointHow do I reproduce this distribution (with observed means, sd, kurtosis, skewness and correlation)?Is the formula of covariance right?Is my Correlation reasoning correct?Variance of $Y|x$ from regression lineIn a bivariate normal sample, why is the squared sample correlation Beta distributed?










3












$begingroup$


The sample correlation $r$ and the sample standard deviation of $X$ (call it $s_X$) seem to be positively correlated if I simulate bivariate normal $X$, $Y$ with a positive true correlation (and seem to be negatively correlated if the true correlation between $X$ and $Y$ is negative). I found this somewhat counterintuitive. Very heuristically, I suppose it reflects the fact that $r$ represents the expected increase in Y (in units of SD(Y)) for a one-SD increase in X, and if we estimate a larger $s_X$, then $r$ reflects the change in Y associated with a larger change in X.



However, I would like to know if $Cov(r, s_x) >0$ for $r>0$ holds in general (at least for the case in which X and Y are bivariate normal and with large n). Letting $sigma$ denote a true SD, we have:



$$Cov(r, s_X) = E [ r s_X] - rho sigma_x$$



$$ approx E Bigg[ fracwidehatCov(X,Y)s_Y Bigg] - fracCov(X,Y)sigma_Y $$



I tried using a Taylor expansion on the first term, but it depends on $Cov(widehatCov(X,Y), s_Y)$, so that’s a dead end. Any ideas?



EDIT



Maybe a better direction would be to try to show that $Cov(widehatbeta, s_X)=0$, where $widehatbeta$ is the OLS coefficient of Y on X. Then we could argue that since $widehatbeta = r fracs_Ys_X$, this implies the desired result. Since $widehatbeta$ is almost like a difference of sample means, maybe we could get the former result using something like the known independence of the sample mean and variance for a normal RV?










share|cite|improve this question











$endgroup$











  • $begingroup$
    It would be unchanged. Hmm. I'm afraid I don't yet see the relevance, though.
    $endgroup$
    – half-pass
    5 hours ago










  • $begingroup$
    I should probably also note that while I wish this were a homework question, it's not... :)
    $endgroup$
    – half-pass
    5 hours ago






  • 1




    $begingroup$
    Ah, I didn't read the question carefully enough. My apologies.
    $endgroup$
    – jbowman
    5 hours ago










  • $begingroup$
    The first equality in your calculation is not correct. $s_x = sqrts^2_x$ is consistent for the standard deviation, but is not unbiased: en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
    $endgroup$
    – Andrew M
    4 hours ago











  • $begingroup$
    It's extremely close to unbiased for large n, though -- the rule-of-thumb correction factor for a normal RV is (n - 1.5) vs. (n-1).
    $endgroup$
    – half-pass
    4 hours ago















3












$begingroup$


The sample correlation $r$ and the sample standard deviation of $X$ (call it $s_X$) seem to be positively correlated if I simulate bivariate normal $X$, $Y$ with a positive true correlation (and seem to be negatively correlated if the true correlation between $X$ and $Y$ is negative). I found this somewhat counterintuitive. Very heuristically, I suppose it reflects the fact that $r$ represents the expected increase in Y (in units of SD(Y)) for a one-SD increase in X, and if we estimate a larger $s_X$, then $r$ reflects the change in Y associated with a larger change in X.



However, I would like to know if $Cov(r, s_x) >0$ for $r>0$ holds in general (at least for the case in which X and Y are bivariate normal and with large n). Letting $sigma$ denote a true SD, we have:



$$Cov(r, s_X) = E [ r s_X] - rho sigma_x$$



$$ approx E Bigg[ fracwidehatCov(X,Y)s_Y Bigg] - fracCov(X,Y)sigma_Y $$



I tried using a Taylor expansion on the first term, but it depends on $Cov(widehatCov(X,Y), s_Y)$, so that’s a dead end. Any ideas?



EDIT



Maybe a better direction would be to try to show that $Cov(widehatbeta, s_X)=0$, where $widehatbeta$ is the OLS coefficient of Y on X. Then we could argue that since $widehatbeta = r fracs_Ys_X$, this implies the desired result. Since $widehatbeta$ is almost like a difference of sample means, maybe we could get the former result using something like the known independence of the sample mean and variance for a normal RV?










share|cite|improve this question











$endgroup$











  • $begingroup$
    It would be unchanged. Hmm. I'm afraid I don't yet see the relevance, though.
    $endgroup$
    – half-pass
    5 hours ago










  • $begingroup$
    I should probably also note that while I wish this were a homework question, it's not... :)
    $endgroup$
    – half-pass
    5 hours ago






  • 1




    $begingroup$
    Ah, I didn't read the question carefully enough. My apologies.
    $endgroup$
    – jbowman
    5 hours ago










  • $begingroup$
    The first equality in your calculation is not correct. $s_x = sqrts^2_x$ is consistent for the standard deviation, but is not unbiased: en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
    $endgroup$
    – Andrew M
    4 hours ago











  • $begingroup$
    It's extremely close to unbiased for large n, though -- the rule-of-thumb correction factor for a normal RV is (n - 1.5) vs. (n-1).
    $endgroup$
    – half-pass
    4 hours ago













3












3








3





$begingroup$


The sample correlation $r$ and the sample standard deviation of $X$ (call it $s_X$) seem to be positively correlated if I simulate bivariate normal $X$, $Y$ with a positive true correlation (and seem to be negatively correlated if the true correlation between $X$ and $Y$ is negative). I found this somewhat counterintuitive. Very heuristically, I suppose it reflects the fact that $r$ represents the expected increase in Y (in units of SD(Y)) for a one-SD increase in X, and if we estimate a larger $s_X$, then $r$ reflects the change in Y associated with a larger change in X.



However, I would like to know if $Cov(r, s_x) >0$ for $r>0$ holds in general (at least for the case in which X and Y are bivariate normal and with large n). Letting $sigma$ denote a true SD, we have:



$$Cov(r, s_X) = E [ r s_X] - rho sigma_x$$



$$ approx E Bigg[ fracwidehatCov(X,Y)s_Y Bigg] - fracCov(X,Y)sigma_Y $$



I tried using a Taylor expansion on the first term, but it depends on $Cov(widehatCov(X,Y), s_Y)$, so that’s a dead end. Any ideas?



EDIT



Maybe a better direction would be to try to show that $Cov(widehatbeta, s_X)=0$, where $widehatbeta$ is the OLS coefficient of Y on X. Then we could argue that since $widehatbeta = r fracs_Ys_X$, this implies the desired result. Since $widehatbeta$ is almost like a difference of sample means, maybe we could get the former result using something like the known independence of the sample mean and variance for a normal RV?










share|cite|improve this question











$endgroup$




The sample correlation $r$ and the sample standard deviation of $X$ (call it $s_X$) seem to be positively correlated if I simulate bivariate normal $X$, $Y$ with a positive true correlation (and seem to be negatively correlated if the true correlation between $X$ and $Y$ is negative). I found this somewhat counterintuitive. Very heuristically, I suppose it reflects the fact that $r$ represents the expected increase in Y (in units of SD(Y)) for a one-SD increase in X, and if we estimate a larger $s_X$, then $r$ reflects the change in Y associated with a larger change in X.



However, I would like to know if $Cov(r, s_x) >0$ for $r>0$ holds in general (at least for the case in which X and Y are bivariate normal and with large n). Letting $sigma$ denote a true SD, we have:



$$Cov(r, s_X) = E [ r s_X] - rho sigma_x$$



$$ approx E Bigg[ fracwidehatCov(X,Y)s_Y Bigg] - fracCov(X,Y)sigma_Y $$



I tried using a Taylor expansion on the first term, but it depends on $Cov(widehatCov(X,Y), s_Y)$, so that’s a dead end. Any ideas?



EDIT



Maybe a better direction would be to try to show that $Cov(widehatbeta, s_X)=0$, where $widehatbeta$ is the OLS coefficient of Y on X. Then we could argue that since $widehatbeta = r fracs_Ys_X$, this implies the desired result. Since $widehatbeta$ is almost like a difference of sample means, maybe we could get the former result using something like the known independence of the sample mean and variance for a normal RV?







correlation covariance independence






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited 3 hours ago







half-pass

















asked 5 hours ago









half-passhalf-pass

1,43441931




1,43441931











  • $begingroup$
    It would be unchanged. Hmm. I'm afraid I don't yet see the relevance, though.
    $endgroup$
    – half-pass
    5 hours ago










  • $begingroup$
    I should probably also note that while I wish this were a homework question, it's not... :)
    $endgroup$
    – half-pass
    5 hours ago






  • 1




    $begingroup$
    Ah, I didn't read the question carefully enough. My apologies.
    $endgroup$
    – jbowman
    5 hours ago










  • $begingroup$
    The first equality in your calculation is not correct. $s_x = sqrts^2_x$ is consistent for the standard deviation, but is not unbiased: en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
    $endgroup$
    – Andrew M
    4 hours ago











  • $begingroup$
    It's extremely close to unbiased for large n, though -- the rule-of-thumb correction factor for a normal RV is (n - 1.5) vs. (n-1).
    $endgroup$
    – half-pass
    4 hours ago
















  • $begingroup$
    It would be unchanged. Hmm. I'm afraid I don't yet see the relevance, though.
    $endgroup$
    – half-pass
    5 hours ago










  • $begingroup$
    I should probably also note that while I wish this were a homework question, it's not... :)
    $endgroup$
    – half-pass
    5 hours ago






  • 1




    $begingroup$
    Ah, I didn't read the question carefully enough. My apologies.
    $endgroup$
    – jbowman
    5 hours ago










  • $begingroup$
    The first equality in your calculation is not correct. $s_x = sqrts^2_x$ is consistent for the standard deviation, but is not unbiased: en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
    $endgroup$
    – Andrew M
    4 hours ago











  • $begingroup$
    It's extremely close to unbiased for large n, though -- the rule-of-thumb correction factor for a normal RV is (n - 1.5) vs. (n-1).
    $endgroup$
    – half-pass
    4 hours ago















$begingroup$
It would be unchanged. Hmm. I'm afraid I don't yet see the relevance, though.
$endgroup$
– half-pass
5 hours ago




$begingroup$
It would be unchanged. Hmm. I'm afraid I don't yet see the relevance, though.
$endgroup$
– half-pass
5 hours ago












$begingroup$
I should probably also note that while I wish this were a homework question, it's not... :)
$endgroup$
– half-pass
5 hours ago




$begingroup$
I should probably also note that while I wish this were a homework question, it's not... :)
$endgroup$
– half-pass
5 hours ago




1




1




$begingroup$
Ah, I didn't read the question carefully enough. My apologies.
$endgroup$
– jbowman
5 hours ago




$begingroup$
Ah, I didn't read the question carefully enough. My apologies.
$endgroup$
– jbowman
5 hours ago












$begingroup$
The first equality in your calculation is not correct. $s_x = sqrts^2_x$ is consistent for the standard deviation, but is not unbiased: en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
$endgroup$
– Andrew M
4 hours ago





$begingroup$
The first equality in your calculation is not correct. $s_x = sqrts^2_x$ is consistent for the standard deviation, but is not unbiased: en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation
$endgroup$
– Andrew M
4 hours ago













$begingroup$
It's extremely close to unbiased for large n, though -- the rule-of-thumb correction factor for a normal RV is (n - 1.5) vs. (n-1).
$endgroup$
– half-pass
4 hours ago




$begingroup$
It's extremely close to unbiased for large n, though -- the rule-of-thumb correction factor for a normal RV is (n - 1.5) vs. (n-1).
$endgroup$
– half-pass
4 hours ago










2 Answers
2






active

oldest

votes


















1












$begingroup$

It will depend on the joint distribution. For the example you mention, the bivariate (zero-mean) Normal distribution is characterized by the $rho, sigma_x, sigma_y$. It follows that one can have all possible combinations of values of these three parameters, implying that no relation between $rho$ and the standard deviations can be established.



For other bivariate distributions, the correlation coefficient may be fundamentally a function of the standard deviations (essentially both will be functions of more primitive parameters), in which case one can examine whether a monotonic relation exists.






share|cite|improve this answer









$endgroup$












  • $begingroup$
    I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
    $endgroup$
    – half-pass
    3 hours ago


















1












$begingroup$

Yes, it does hold asymptotically regardless of the distribution of X and Y. I was on the right track with the Taylor expansion; I just needed to make a symmetry argument:



enter image description here






share|cite|improve this answer









$endgroup$













    Your Answer





    StackExchange.ifUsing("editor", function ()
    return StackExchange.using("mathjaxEditing", function ()
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    );
    );
    , "mathjax-editing");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "65"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f400643%2fis-the-sample-correlation-always-positively-correlated-with-the-sample-variance%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    1












    $begingroup$

    It will depend on the joint distribution. For the example you mention, the bivariate (zero-mean) Normal distribution is characterized by the $rho, sigma_x, sigma_y$. It follows that one can have all possible combinations of values of these three parameters, implying that no relation between $rho$ and the standard deviations can be established.



    For other bivariate distributions, the correlation coefficient may be fundamentally a function of the standard deviations (essentially both will be functions of more primitive parameters), in which case one can examine whether a monotonic relation exists.






    share|cite|improve this answer









    $endgroup$












    • $begingroup$
      I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
      $endgroup$
      – half-pass
      3 hours ago















    1












    $begingroup$

    It will depend on the joint distribution. For the example you mention, the bivariate (zero-mean) Normal distribution is characterized by the $rho, sigma_x, sigma_y$. It follows that one can have all possible combinations of values of these three parameters, implying that no relation between $rho$ and the standard deviations can be established.



    For other bivariate distributions, the correlation coefficient may be fundamentally a function of the standard deviations (essentially both will be functions of more primitive parameters), in which case one can examine whether a monotonic relation exists.






    share|cite|improve this answer









    $endgroup$












    • $begingroup$
      I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
      $endgroup$
      – half-pass
      3 hours ago













    1












    1








    1





    $begingroup$

    It will depend on the joint distribution. For the example you mention, the bivariate (zero-mean) Normal distribution is characterized by the $rho, sigma_x, sigma_y$. It follows that one can have all possible combinations of values of these three parameters, implying that no relation between $rho$ and the standard deviations can be established.



    For other bivariate distributions, the correlation coefficient may be fundamentally a function of the standard deviations (essentially both will be functions of more primitive parameters), in which case one can examine whether a monotonic relation exists.






    share|cite|improve this answer









    $endgroup$



    It will depend on the joint distribution. For the example you mention, the bivariate (zero-mean) Normal distribution is characterized by the $rho, sigma_x, sigma_y$. It follows that one can have all possible combinations of values of these three parameters, implying that no relation between $rho$ and the standard deviations can be established.



    For other bivariate distributions, the correlation coefficient may be fundamentally a function of the standard deviations (essentially both will be functions of more primitive parameters), in which case one can examine whether a monotonic relation exists.







    share|cite|improve this answer












    share|cite|improve this answer



    share|cite|improve this answer










    answered 3 hours ago









    Alecos PapadopoulosAlecos Papadopoulos

    42.8k297197




    42.8k297197











    • $begingroup$
      I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
      $endgroup$
      – half-pass
      3 hours ago
















    • $begingroup$
      I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
      $endgroup$
      – half-pass
      3 hours ago















    $begingroup$
    I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
    $endgroup$
    – half-pass
    3 hours ago




    $begingroup$
    I understand that the three parameters can have arbitrary relationships for the BVN distribution, but I don't think it follows that the sample estimates of these are asymptotically independent.
    $endgroup$
    – half-pass
    3 hours ago













    1












    $begingroup$

    Yes, it does hold asymptotically regardless of the distribution of X and Y. I was on the right track with the Taylor expansion; I just needed to make a symmetry argument:



    enter image description here






    share|cite|improve this answer









    $endgroup$

















      1












      $begingroup$

      Yes, it does hold asymptotically regardless of the distribution of X and Y. I was on the right track with the Taylor expansion; I just needed to make a symmetry argument:



      enter image description here






      share|cite|improve this answer









      $endgroup$















        1












        1








        1





        $begingroup$

        Yes, it does hold asymptotically regardless of the distribution of X and Y. I was on the right track with the Taylor expansion; I just needed to make a symmetry argument:



        enter image description here






        share|cite|improve this answer









        $endgroup$



        Yes, it does hold asymptotically regardless of the distribution of X and Y. I was on the right track with the Taylor expansion; I just needed to make a symmetry argument:



        enter image description here







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered 2 hours ago









        half-passhalf-pass

        1,43441931




        1,43441931



























            draft saved

            draft discarded
















































            Thanks for contributing an answer to Cross Validated!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f400643%2fis-the-sample-correlation-always-positively-correlated-with-the-sample-variance%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Are there any AGPL-style licences that require source code modifications to be public? Planned maintenance scheduled April 23, 2019 at 23:30 UTC (7:30pm US/Eastern) Announcing the arrival of Valued Associate #679: Cesar Manara Unicorn Meta Zoo #1: Why another podcast?Force derivative works to be publicAre there any GPL like licenses for Apple App Store?Do you violate the GPL if you provide source code that cannot be compiled?GPL - is it distribution to use libraries in an appliance loaned to customers?Distributing App for free which uses GPL'ed codeModifications of server software under GPL, with web/CLI interfaceDoes using an AGPLv3-licensed library prevent me from dual-licensing my own source code?Can I publish only select code under GPLv3 from a private project?Is there published precedent regarding the scope of covered work that uses AGPL software?If MIT licensed code links to GPL licensed code what should be the license of the resulting binary program?If I use a public API endpoint that has its source code licensed under AGPL in my app, do I need to disclose my source?

            2013 GY136 Descoberta | Órbita | Referências Menu de navegação«List Of Centaurs and Scattered-Disk Objects»«List of Known Trans-Neptunian Objects»

            Button changing it's text & action. Good or terrible? The 2019 Stack Overflow Developer Survey Results Are Inchanging text on user mouseoverShould certain functions be “hard to find” for powerusers to discover?Custom liking function - do I need user login?Using different checkbox style for different checkbox behaviorBest Practices: Save and Exit in Software UIInteraction with remote validated formMore efficient UI to progress the user through a complicated process?Designing a popup notice for a gameShould bulk-editing functions be hidden until a table row is selected, or is there a better solution?Is it bad practice to disable (replace) the context menu?