Interpretation of result of covariance of two functions of two random variables The Next CEO of Stack OverflowExpectation of Max and Min of Two Uniform Random VariablesBounding probability that sum of random variables deviates from expected valueCorrelation between variablesWilcoxon rank sum test critical values?Difference between $E[X^2]$ and $E[X^3]$Tail bound on the sum of independent identical geometric random variablesGetting the smallest range contains x% of a setLower bound on CDF of a sum of exponential random variablesBound for largest eigenvalue of symmetric matrices of uniform random variables over $[0,1]$ and fixed $1$s along diagonal and scattered $1$sConditional Sample from Gaussian CopulaLowerbounding the sum of binomial random variables

Ising model simulation

Can I board the first leg of the flight without having final country's visa?

My ex-girlfriend uses my Apple ID to login to her iPad, do I have to give her my Apple ID password to reset it?

Why don't programming languages automatically manage the synchronous/asynchronous problem?

In the "Harry Potter and the Order of the Phoenix" video game, what potion is used to sabotage Umbridge's speakers?

"Eavesdropping" vs "Listen in on"

What is the difference between "hamstring tendon" and "common hamstring tendon"?

Cannot shrink btrfs filesystem although there is still data and metadata space left : ERROR: unable to resize '/home': No space left on device

Why am I getting "Static method cannot be referenced from a non static context: String String.valueOf(Object)"?

Strange use of "whether ... than ..." in official text

Airplane gently rocking its wings during whole flight

Where do students learn to solve polynomial equations these days?

Can this note be analyzed as a non-chord tone?

Why is the US ranked as #45 in Press Freedom ratings, despite its extremely permissive free speech laws?

Calculate the Mean mean of two numbers

Does Germany produce more waste than the US?

Reshaping json / reparing json inside shell script (remove trailing comma)

Is French Guiana a (hard) EU border?

Is there an equivalent of cd - for cp or mv

Audio Conversion With ADS1243

From jafe to El-Guest

Yu-Gi-Oh cards in Python 3

What connection does MS Office have to Netscape Navigator?

IC has pull-down resistors on SMBus lines?



Interpretation of result of covariance of two functions of two random variables



The Next CEO of Stack OverflowExpectation of Max and Min of Two Uniform Random VariablesBounding probability that sum of random variables deviates from expected valueCorrelation between variablesWilcoxon rank sum test critical values?Difference between $E[X^2]$ and $E[X^3]$Tail bound on the sum of independent identical geometric random variablesGetting the smallest range contains x% of a setLower bound on CDF of a sum of exponential random variablesBound for largest eigenvalue of symmetric matrices of uniform random variables over $[0,1]$ and fixed $1$s along diagonal and scattered $1$sConditional Sample from Gaussian CopulaLowerbounding the sum of binomial random variables










0












$begingroup$


This question is slightly unusual because I've obtained the answer but I am looking for an interpretation of it as it's not intuitive. If we have $X,Y : Uniform(0,1)$ and independent, let $Z=max(X,Y) , W=min(X,Y)$. Find $Cov(Z,W)$. The answer can be found here (or here), and it is $Cov(Z,W)=frac136$. I went ahead and computed the correlation coefficient and it's $rho(Z,W)=0.5$.



If we take a step back, this result is not intuitive at all: if we think of sampling intervals from $[0,1]$ then $(W,Z)$ represent the lower bound and upper bound, respectively. As we get more and more data, the overall lower bound will keep moving to 0, while the overall upper bound will keep moving towards 1, and there's no reason to assume that high values of $W$ will be associated with high values of $Z$. Therefore, my sampling argument is wrong (since the correlation is positive), but I don't see how.



Any help or clarification would be greatly appreciated, especially a geometric one.










share|cite|improve this question











$endgroup$
















    0












    $begingroup$


    This question is slightly unusual because I've obtained the answer but I am looking for an interpretation of it as it's not intuitive. If we have $X,Y : Uniform(0,1)$ and independent, let $Z=max(X,Y) , W=min(X,Y)$. Find $Cov(Z,W)$. The answer can be found here (or here), and it is $Cov(Z,W)=frac136$. I went ahead and computed the correlation coefficient and it's $rho(Z,W)=0.5$.



    If we take a step back, this result is not intuitive at all: if we think of sampling intervals from $[0,1]$ then $(W,Z)$ represent the lower bound and upper bound, respectively. As we get more and more data, the overall lower bound will keep moving to 0, while the overall upper bound will keep moving towards 1, and there's no reason to assume that high values of $W$ will be associated with high values of $Z$. Therefore, my sampling argument is wrong (since the correlation is positive), but I don't see how.



    Any help or clarification would be greatly appreciated, especially a geometric one.










    share|cite|improve this question











    $endgroup$














      0












      0








      0


      1



      $begingroup$


      This question is slightly unusual because I've obtained the answer but I am looking for an interpretation of it as it's not intuitive. If we have $X,Y : Uniform(0,1)$ and independent, let $Z=max(X,Y) , W=min(X,Y)$. Find $Cov(Z,W)$. The answer can be found here (or here), and it is $Cov(Z,W)=frac136$. I went ahead and computed the correlation coefficient and it's $rho(Z,W)=0.5$.



      If we take a step back, this result is not intuitive at all: if we think of sampling intervals from $[0,1]$ then $(W,Z)$ represent the lower bound and upper bound, respectively. As we get more and more data, the overall lower bound will keep moving to 0, while the overall upper bound will keep moving towards 1, and there's no reason to assume that high values of $W$ will be associated with high values of $Z$. Therefore, my sampling argument is wrong (since the correlation is positive), but I don't see how.



      Any help or clarification would be greatly appreciated, especially a geometric one.










      share|cite|improve this question











      $endgroup$




      This question is slightly unusual because I've obtained the answer but I am looking for an interpretation of it as it's not intuitive. If we have $X,Y : Uniform(0,1)$ and independent, let $Z=max(X,Y) , W=min(X,Y)$. Find $Cov(Z,W)$. The answer can be found here (or here), and it is $Cov(Z,W)=frac136$. I went ahead and computed the correlation coefficient and it's $rho(Z,W)=0.5$.



      If we take a step back, this result is not intuitive at all: if we think of sampling intervals from $[0,1]$ then $(W,Z)$ represent the lower bound and upper bound, respectively. As we get more and more data, the overall lower bound will keep moving to 0, while the overall upper bound will keep moving towards 1, and there's no reason to assume that high values of $W$ will be associated with high values of $Z$. Therefore, my sampling argument is wrong (since the correlation is positive), but I don't see how.



      Any help or clarification would be greatly appreciated, especially a geometric one.







      probability probability-theory geometric-interpretation






      share|cite|improve this question















      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Mar 19 at 22:09







      baibo

















      asked Jan 27 at 17:41









      baibobaibo

      34316




      34316




















          2 Answers
          2






          active

          oldest

          votes


















          2












          $begingroup$

          It is true that if you observe $U_1,cdots,U_nin[0,1]$, then $min(U_1,cdots,U_n)$ converges to $0$ when you get more and more date, that is $nto+infty$, and $max(U_1,cdots,U_n)$ converges to $1$.



          However, here you look at $Z=max(X,Y)$ and $W=min(X,Y)$, which corresponds in a situation in which only two observations $X$ and $Y$ on the intervals $[0,1]$ are made. So there is no connection to make with the idea of getting more and more data because you are in the context of two observations only (no more, no less).



          Let us come back to the interpretation of $rho$: it can be seen as a measure of the linear dependence between two random variables. When $rho(Z,W)$ is positive (resp. negative), we expect that $Z$ takes higher values (resp. lower values) when $W$ takes higher values.



          So the question here is: when $W$ is high, do we expect high values or low values of $Z$. Well, given that $Wle Z$, I would find it hard to believe that $Z$ can be low when $W$ is high. That is not in favour of a negative correlation.



          Imagine two experiments in which you observe two values in the intervals $[0,1]$. In the first experiment, I tell you that the lowest value is $0.2$. In the second experiment, I tell you that the lowest value is $0.8$. In both cases, I ask you to guess what is the highest value. In the first case, the lowest value is $0.2$, so the highest value is between $0.2$ and $1$. If someone had to guess what the highest value is, I am sure he would say something like $0.6$ (the middle of $0.2$ and $1$). In the second case however, we are told that $0.8$ is the lowest value, so the highest value has no chance at all to be something like $0.6$: it has to be somewhere between $0.8$ and $1$, so if you had to guess, I think you would say something like $0.9$. What we just illustrated is that the higher $W$ is, the higher $Z$ is likely to be. Therefore, we expect a positive correlation.






          share|cite|improve this answer











          $endgroup$




















            1





            +50







            $begingroup$

            Here is a geometric explanation. See if it makes sense to you.



            $(X,Y)$ is a uniform random point in the square $[0,1] times [0,1]$.



            But what is $(W,Z) equiv (min(X,Y), max(X,Y))$?



            Imagine folding the square along the $y=x$ line, aka $+45°$ line, folding the lower triangle onto the upper triangle. Then $(W,Z)$ is where $(X,Y)$ would end up:



            • If $Y>X$, the original point is in the upper triangle to begin with, and remains there (folding has no effect). In this case $(W,Z) = (X,Y)$.


            • If $X > Y$, the original point is in the lower triangle to begin with, and ends up in the upper triangle at $(Y,X)$ (that's the effect of folding). In this case $(W,Z) = (Y,X)$.


            By symmetry, therefore, sampling $(X,Y)$ uniformly in the square and then computing $W=min(X,Y)$ and $Z=max(X,Y)$ is equivalent to sampling $(W,Z)$ uniformly from just the upper triangle.



            I hope it is now obvious, because a triangle is not a square, that $W, Z$ will be dependent.



            As for the sign of the correlation: Very roughly speaking positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope". I hope it is also obvious that the upper triangle implies a positive correlation.



            (Incidentally, as you probably know, dependence can still mean zero correlation, and some examples would be uniformly sampling from e.g. a circle, a $45°$ rotated square, etc.)



            Sorry the geometric argument is kinda vague, but I thought this kind of "intuitive" picture is what your question is asking (since you already know the answer is $1/36$ and just need an explanation which is not the actual calculation of covariance). If you want a slightly less "visual" reason, simply note that, conditioned on $W=w, Z sim Uniform(w,1)$, so as $w$ increases $Z$ certainly tends to also increase.



            Re: your multiple samples argument: that doesn't quite apply. If you sample many $(W,Z)$ in the upper triangle, then sure $min_j W_j rightarrow 0$ and $max_j Z_j rightarrow 1$, but so what? That doesn't say anything about how in one particular sample, knowing $W$ gives you info about $Z$ and vice versa. (In fact, just for fun, it is also true that $max_j W_j rightarrow 1$ and $min_j Z_j rightarrow 0$, so what do you make of that? ;) )






            share|cite|improve this answer











            $endgroup$












            • $begingroup$
              What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
              $endgroup$
              – baibo
              Mar 20 at 20:29






            • 1




              $begingroup$
              Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
              $endgroup$
              – antkam
              Mar 20 at 20:37












            Your Answer





            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "69"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            noCode: true, onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













            draft saved

            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3089871%2finterpretation-of-result-of-covariance-of-two-functions-of-two-random-variables%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            2












            $begingroup$

            It is true that if you observe $U_1,cdots,U_nin[0,1]$, then $min(U_1,cdots,U_n)$ converges to $0$ when you get more and more date, that is $nto+infty$, and $max(U_1,cdots,U_n)$ converges to $1$.



            However, here you look at $Z=max(X,Y)$ and $W=min(X,Y)$, which corresponds in a situation in which only two observations $X$ and $Y$ on the intervals $[0,1]$ are made. So there is no connection to make with the idea of getting more and more data because you are in the context of two observations only (no more, no less).



            Let us come back to the interpretation of $rho$: it can be seen as a measure of the linear dependence between two random variables. When $rho(Z,W)$ is positive (resp. negative), we expect that $Z$ takes higher values (resp. lower values) when $W$ takes higher values.



            So the question here is: when $W$ is high, do we expect high values or low values of $Z$. Well, given that $Wle Z$, I would find it hard to believe that $Z$ can be low when $W$ is high. That is not in favour of a negative correlation.



            Imagine two experiments in which you observe two values in the intervals $[0,1]$. In the first experiment, I tell you that the lowest value is $0.2$. In the second experiment, I tell you that the lowest value is $0.8$. In both cases, I ask you to guess what is the highest value. In the first case, the lowest value is $0.2$, so the highest value is between $0.2$ and $1$. If someone had to guess what the highest value is, I am sure he would say something like $0.6$ (the middle of $0.2$ and $1$). In the second case however, we are told that $0.8$ is the lowest value, so the highest value has no chance at all to be something like $0.6$: it has to be somewhere between $0.8$ and $1$, so if you had to guess, I think you would say something like $0.9$. What we just illustrated is that the higher $W$ is, the higher $Z$ is likely to be. Therefore, we expect a positive correlation.






            share|cite|improve this answer











            $endgroup$

















              2












              $begingroup$

              It is true that if you observe $U_1,cdots,U_nin[0,1]$, then $min(U_1,cdots,U_n)$ converges to $0$ when you get more and more date, that is $nto+infty$, and $max(U_1,cdots,U_n)$ converges to $1$.



              However, here you look at $Z=max(X,Y)$ and $W=min(X,Y)$, which corresponds in a situation in which only two observations $X$ and $Y$ on the intervals $[0,1]$ are made. So there is no connection to make with the idea of getting more and more data because you are in the context of two observations only (no more, no less).



              Let us come back to the interpretation of $rho$: it can be seen as a measure of the linear dependence between two random variables. When $rho(Z,W)$ is positive (resp. negative), we expect that $Z$ takes higher values (resp. lower values) when $W$ takes higher values.



              So the question here is: when $W$ is high, do we expect high values or low values of $Z$. Well, given that $Wle Z$, I would find it hard to believe that $Z$ can be low when $W$ is high. That is not in favour of a negative correlation.



              Imagine two experiments in which you observe two values in the intervals $[0,1]$. In the first experiment, I tell you that the lowest value is $0.2$. In the second experiment, I tell you that the lowest value is $0.8$. In both cases, I ask you to guess what is the highest value. In the first case, the lowest value is $0.2$, so the highest value is between $0.2$ and $1$. If someone had to guess what the highest value is, I am sure he would say something like $0.6$ (the middle of $0.2$ and $1$). In the second case however, we are told that $0.8$ is the lowest value, so the highest value has no chance at all to be something like $0.6$: it has to be somewhere between $0.8$ and $1$, so if you had to guess, I think you would say something like $0.9$. What we just illustrated is that the higher $W$ is, the higher $Z$ is likely to be. Therefore, we expect a positive correlation.






              share|cite|improve this answer











              $endgroup$















                2












                2








                2





                $begingroup$

                It is true that if you observe $U_1,cdots,U_nin[0,1]$, then $min(U_1,cdots,U_n)$ converges to $0$ when you get more and more date, that is $nto+infty$, and $max(U_1,cdots,U_n)$ converges to $1$.



                However, here you look at $Z=max(X,Y)$ and $W=min(X,Y)$, which corresponds in a situation in which only two observations $X$ and $Y$ on the intervals $[0,1]$ are made. So there is no connection to make with the idea of getting more and more data because you are in the context of two observations only (no more, no less).



                Let us come back to the interpretation of $rho$: it can be seen as a measure of the linear dependence between two random variables. When $rho(Z,W)$ is positive (resp. negative), we expect that $Z$ takes higher values (resp. lower values) when $W$ takes higher values.



                So the question here is: when $W$ is high, do we expect high values or low values of $Z$. Well, given that $Wle Z$, I would find it hard to believe that $Z$ can be low when $W$ is high. That is not in favour of a negative correlation.



                Imagine two experiments in which you observe two values in the intervals $[0,1]$. In the first experiment, I tell you that the lowest value is $0.2$. In the second experiment, I tell you that the lowest value is $0.8$. In both cases, I ask you to guess what is the highest value. In the first case, the lowest value is $0.2$, so the highest value is between $0.2$ and $1$. If someone had to guess what the highest value is, I am sure he would say something like $0.6$ (the middle of $0.2$ and $1$). In the second case however, we are told that $0.8$ is the lowest value, so the highest value has no chance at all to be something like $0.6$: it has to be somewhere between $0.8$ and $1$, so if you had to guess, I think you would say something like $0.9$. What we just illustrated is that the higher $W$ is, the higher $Z$ is likely to be. Therefore, we expect a positive correlation.






                share|cite|improve this answer











                $endgroup$



                It is true that if you observe $U_1,cdots,U_nin[0,1]$, then $min(U_1,cdots,U_n)$ converges to $0$ when you get more and more date, that is $nto+infty$, and $max(U_1,cdots,U_n)$ converges to $1$.



                However, here you look at $Z=max(X,Y)$ and $W=min(X,Y)$, which corresponds in a situation in which only two observations $X$ and $Y$ on the intervals $[0,1]$ are made. So there is no connection to make with the idea of getting more and more data because you are in the context of two observations only (no more, no less).



                Let us come back to the interpretation of $rho$: it can be seen as a measure of the linear dependence between two random variables. When $rho(Z,W)$ is positive (resp. negative), we expect that $Z$ takes higher values (resp. lower values) when $W$ takes higher values.



                So the question here is: when $W$ is high, do we expect high values or low values of $Z$. Well, given that $Wle Z$, I would find it hard to believe that $Z$ can be low when $W$ is high. That is not in favour of a negative correlation.



                Imagine two experiments in which you observe two values in the intervals $[0,1]$. In the first experiment, I tell you that the lowest value is $0.2$. In the second experiment, I tell you that the lowest value is $0.8$. In both cases, I ask you to guess what is the highest value. In the first case, the lowest value is $0.2$, so the highest value is between $0.2$ and $1$. If someone had to guess what the highest value is, I am sure he would say something like $0.6$ (the middle of $0.2$ and $1$). In the second case however, we are told that $0.8$ is the lowest value, so the highest value has no chance at all to be something like $0.6$: it has to be somewhere between $0.8$ and $1$, so if you had to guess, I think you would say something like $0.9$. What we just illustrated is that the higher $W$ is, the higher $Z$ is likely to be. Therefore, we expect a positive correlation.







                share|cite|improve this answer














                share|cite|improve this answer



                share|cite|improve this answer








                edited Mar 19 at 23:49









                Ian

                68.9k25392




                68.9k25392










                answered Mar 19 at 23:14









                WillWill

                5115




                5115





















                    1





                    +50







                    $begingroup$

                    Here is a geometric explanation. See if it makes sense to you.



                    $(X,Y)$ is a uniform random point in the square $[0,1] times [0,1]$.



                    But what is $(W,Z) equiv (min(X,Y), max(X,Y))$?



                    Imagine folding the square along the $y=x$ line, aka $+45°$ line, folding the lower triangle onto the upper triangle. Then $(W,Z)$ is where $(X,Y)$ would end up:



                    • If $Y>X$, the original point is in the upper triangle to begin with, and remains there (folding has no effect). In this case $(W,Z) = (X,Y)$.


                    • If $X > Y$, the original point is in the lower triangle to begin with, and ends up in the upper triangle at $(Y,X)$ (that's the effect of folding). In this case $(W,Z) = (Y,X)$.


                    By symmetry, therefore, sampling $(X,Y)$ uniformly in the square and then computing $W=min(X,Y)$ and $Z=max(X,Y)$ is equivalent to sampling $(W,Z)$ uniformly from just the upper triangle.



                    I hope it is now obvious, because a triangle is not a square, that $W, Z$ will be dependent.



                    As for the sign of the correlation: Very roughly speaking positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope". I hope it is also obvious that the upper triangle implies a positive correlation.



                    (Incidentally, as you probably know, dependence can still mean zero correlation, and some examples would be uniformly sampling from e.g. a circle, a $45°$ rotated square, etc.)



                    Sorry the geometric argument is kinda vague, but I thought this kind of "intuitive" picture is what your question is asking (since you already know the answer is $1/36$ and just need an explanation which is not the actual calculation of covariance). If you want a slightly less "visual" reason, simply note that, conditioned on $W=w, Z sim Uniform(w,1)$, so as $w$ increases $Z$ certainly tends to also increase.



                    Re: your multiple samples argument: that doesn't quite apply. If you sample many $(W,Z)$ in the upper triangle, then sure $min_j W_j rightarrow 0$ and $max_j Z_j rightarrow 1$, but so what? That doesn't say anything about how in one particular sample, knowing $W$ gives you info about $Z$ and vice versa. (In fact, just for fun, it is also true that $max_j W_j rightarrow 1$ and $min_j Z_j rightarrow 0$, so what do you make of that? ;) )






                    share|cite|improve this answer











                    $endgroup$












                    • $begingroup$
                      What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
                      $endgroup$
                      – baibo
                      Mar 20 at 20:29






                    • 1




                      $begingroup$
                      Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
                      $endgroup$
                      – antkam
                      Mar 20 at 20:37
















                    1





                    +50







                    $begingroup$

                    Here is a geometric explanation. See if it makes sense to you.



                    $(X,Y)$ is a uniform random point in the square $[0,1] times [0,1]$.



                    But what is $(W,Z) equiv (min(X,Y), max(X,Y))$?



                    Imagine folding the square along the $y=x$ line, aka $+45°$ line, folding the lower triangle onto the upper triangle. Then $(W,Z)$ is where $(X,Y)$ would end up:



                    • If $Y>X$, the original point is in the upper triangle to begin with, and remains there (folding has no effect). In this case $(W,Z) = (X,Y)$.


                    • If $X > Y$, the original point is in the lower triangle to begin with, and ends up in the upper triangle at $(Y,X)$ (that's the effect of folding). In this case $(W,Z) = (Y,X)$.


                    By symmetry, therefore, sampling $(X,Y)$ uniformly in the square and then computing $W=min(X,Y)$ and $Z=max(X,Y)$ is equivalent to sampling $(W,Z)$ uniformly from just the upper triangle.



                    I hope it is now obvious, because a triangle is not a square, that $W, Z$ will be dependent.



                    As for the sign of the correlation: Very roughly speaking positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope". I hope it is also obvious that the upper triangle implies a positive correlation.



                    (Incidentally, as you probably know, dependence can still mean zero correlation, and some examples would be uniformly sampling from e.g. a circle, a $45°$ rotated square, etc.)



                    Sorry the geometric argument is kinda vague, but I thought this kind of "intuitive" picture is what your question is asking (since you already know the answer is $1/36$ and just need an explanation which is not the actual calculation of covariance). If you want a slightly less "visual" reason, simply note that, conditioned on $W=w, Z sim Uniform(w,1)$, so as $w$ increases $Z$ certainly tends to also increase.



                    Re: your multiple samples argument: that doesn't quite apply. If you sample many $(W,Z)$ in the upper triangle, then sure $min_j W_j rightarrow 0$ and $max_j Z_j rightarrow 1$, but so what? That doesn't say anything about how in one particular sample, knowing $W$ gives you info about $Z$ and vice versa. (In fact, just for fun, it is also true that $max_j W_j rightarrow 1$ and $min_j Z_j rightarrow 0$, so what do you make of that? ;) )






                    share|cite|improve this answer











                    $endgroup$












                    • $begingroup$
                      What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
                      $endgroup$
                      – baibo
                      Mar 20 at 20:29






                    • 1




                      $begingroup$
                      Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
                      $endgroup$
                      – antkam
                      Mar 20 at 20:37














                    1





                    +50







                    1





                    +50



                    1




                    +50



                    $begingroup$

                    Here is a geometric explanation. See if it makes sense to you.



                    $(X,Y)$ is a uniform random point in the square $[0,1] times [0,1]$.



                    But what is $(W,Z) equiv (min(X,Y), max(X,Y))$?



                    Imagine folding the square along the $y=x$ line, aka $+45°$ line, folding the lower triangle onto the upper triangle. Then $(W,Z)$ is where $(X,Y)$ would end up:



                    • If $Y>X$, the original point is in the upper triangle to begin with, and remains there (folding has no effect). In this case $(W,Z) = (X,Y)$.


                    • If $X > Y$, the original point is in the lower triangle to begin with, and ends up in the upper triangle at $(Y,X)$ (that's the effect of folding). In this case $(W,Z) = (Y,X)$.


                    By symmetry, therefore, sampling $(X,Y)$ uniformly in the square and then computing $W=min(X,Y)$ and $Z=max(X,Y)$ is equivalent to sampling $(W,Z)$ uniformly from just the upper triangle.



                    I hope it is now obvious, because a triangle is not a square, that $W, Z$ will be dependent.



                    As for the sign of the correlation: Very roughly speaking positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope". I hope it is also obvious that the upper triangle implies a positive correlation.



                    (Incidentally, as you probably know, dependence can still mean zero correlation, and some examples would be uniformly sampling from e.g. a circle, a $45°$ rotated square, etc.)



                    Sorry the geometric argument is kinda vague, but I thought this kind of "intuitive" picture is what your question is asking (since you already know the answer is $1/36$ and just need an explanation which is not the actual calculation of covariance). If you want a slightly less "visual" reason, simply note that, conditioned on $W=w, Z sim Uniform(w,1)$, so as $w$ increases $Z$ certainly tends to also increase.



                    Re: your multiple samples argument: that doesn't quite apply. If you sample many $(W,Z)$ in the upper triangle, then sure $min_j W_j rightarrow 0$ and $max_j Z_j rightarrow 1$, but so what? That doesn't say anything about how in one particular sample, knowing $W$ gives you info about $Z$ and vice versa. (In fact, just for fun, it is also true that $max_j W_j rightarrow 1$ and $min_j Z_j rightarrow 0$, so what do you make of that? ;) )






                    share|cite|improve this answer











                    $endgroup$



                    Here is a geometric explanation. See if it makes sense to you.



                    $(X,Y)$ is a uniform random point in the square $[0,1] times [0,1]$.



                    But what is $(W,Z) equiv (min(X,Y), max(X,Y))$?



                    Imagine folding the square along the $y=x$ line, aka $+45°$ line, folding the lower triangle onto the upper triangle. Then $(W,Z)$ is where $(X,Y)$ would end up:



                    • If $Y>X$, the original point is in the upper triangle to begin with, and remains there (folding has no effect). In this case $(W,Z) = (X,Y)$.


                    • If $X > Y$, the original point is in the lower triangle to begin with, and ends up in the upper triangle at $(Y,X)$ (that's the effect of folding). In this case $(W,Z) = (Y,X)$.


                    By symmetry, therefore, sampling $(X,Y)$ uniformly in the square and then computing $W=min(X,Y)$ and $Z=max(X,Y)$ is equivalent to sampling $(W,Z)$ uniformly from just the upper triangle.



                    I hope it is now obvious, because a triangle is not a square, that $W, Z$ will be dependent.



                    As for the sign of the correlation: Very roughly speaking positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope". I hope it is also obvious that the upper triangle implies a positive correlation.



                    (Incidentally, as you probably know, dependence can still mean zero correlation, and some examples would be uniformly sampling from e.g. a circle, a $45°$ rotated square, etc.)



                    Sorry the geometric argument is kinda vague, but I thought this kind of "intuitive" picture is what your question is asking (since you already know the answer is $1/36$ and just need an explanation which is not the actual calculation of covariance). If you want a slightly less "visual" reason, simply note that, conditioned on $W=w, Z sim Uniform(w,1)$, so as $w$ increases $Z$ certainly tends to also increase.



                    Re: your multiple samples argument: that doesn't quite apply. If you sample many $(W,Z)$ in the upper triangle, then sure $min_j W_j rightarrow 0$ and $max_j Z_j rightarrow 1$, but so what? That doesn't say anything about how in one particular sample, knowing $W$ gives you info about $Z$ and vice versa. (In fact, just for fun, it is also true that $max_j W_j rightarrow 1$ and $min_j Z_j rightarrow 0$, so what do you make of that? ;) )







                    share|cite|improve this answer














                    share|cite|improve this answer



                    share|cite|improve this answer








                    edited Mar 20 at 3:39

























                    answered Mar 20 at 3:31









                    antkamantkam

                    2,597312




                    2,597312











                    • $begingroup$
                      What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
                      $endgroup$
                      – baibo
                      Mar 20 at 20:29






                    • 1




                      $begingroup$
                      Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
                      $endgroup$
                      – antkam
                      Mar 20 at 20:37

















                    • $begingroup$
                      What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
                      $endgroup$
                      – baibo
                      Mar 20 at 20:29






                    • 1




                      $begingroup$
                      Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
                      $endgroup$
                      – antkam
                      Mar 20 at 20:37
















                    $begingroup$
                    What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
                    $endgroup$
                    – baibo
                    Mar 20 at 20:29




                    $begingroup$
                    What do you mean by this : "positive correlation means the area is "biased / slanted with a positive slope" and negative correlation means the area is "biased / slanted with a negative slope"" Is there a way to quantify this property of an area with some sort of formula?
                    $endgroup$
                    – baibo
                    Mar 20 at 20:29




                    1




                    1




                    $begingroup$
                    Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
                    $endgroup$
                    – antkam
                    Mar 20 at 20:37





                    $begingroup$
                    Quantifying this would probably be equivalent to calculating the covariance and correlation, and I presume that's not the kind of explanation you want. See if you find this helpful: check out the second picture ("Several sets of $(x,y)$ points...") at en.wikipedia.org/wiki/Pearson_correlation_coefficient -- especially the first row with the various ellipse-like blobs. What I'm saying is the upper triangle is kinda "similar" to an ellipse that is tilted with a positive slope.
                    $endgroup$
                    – antkam
                    Mar 20 at 20:37


















                    draft saved

                    draft discarded
















































                    Thanks for contributing an answer to Mathematics Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3089871%2finterpretation-of-result-of-covariance-of-two-functions-of-two-random-variables%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Lowndes Grove History Architecture References Navigation menu32°48′6″N 79°57′58″W / 32.80167°N 79.96611°W / 32.80167; -79.9661132°48′6″N 79°57′58″W / 32.80167°N 79.96611°W / 32.80167; -79.9661178002500"National Register Information System"Historic houses of South Carolina"Lowndes Grove""+32° 48' 6.00", −79° 57' 58.00""Lowndes Grove, Charleston County (260 St. Margaret St., Charleston)""Lowndes Grove"The Charleston ExpositionIt Happened in South Carolina"Lowndes Grove (House), Saint Margaret Street & Sixth Avenue, Charleston, Charleston County, SC(Photographs)"Plantations of the Carolina Low Countrye

                    random experiment with two different functions on unit interval Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern)Random variable and probability space notionsRandom Walk with EdgesFinding functions where the increase over a random interval is Poisson distributedNumber of days until dayCan an observed event in fact be of zero probability?Unit random processmodels of coins and uniform distributionHow to get the number of successes given $n$ trials , probability $P$ and a random variable $X$Absorbing Markov chain in a computer. Is “almost every” turned into always convergence in computer executions?Stopped random walk is not uniformly integrable

                    How should I support this large drywall patch? Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern) Announcing the arrival of Valued Associate #679: Cesar Manara Unicorn Meta Zoo #1: Why another podcast?How do I cover large gaps in drywall?How do I keep drywall around a patch from crumbling?Can I glue a second layer of drywall?How to patch long strip on drywall?Large drywall patch: how to avoid bulging seams?Drywall Mesh Patch vs. Bulge? To remove or not to remove?How to fix this drywall job?Prep drywall before backsplashWhat's the best way to fix this horrible drywall patch job?Drywall patching using 3M Patch Plus Primer