About Sampling and Random Variables

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;







up vote
1
down vote

favorite












So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions



1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?



2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?



Thank you!










share|cite|improve this question







New contributor




A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

























    up vote
    1
    down vote

    favorite












    So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions



    1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?



    2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?



    Thank you!










    share|cite|improve this question







    New contributor




    A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.





















      up vote
      1
      down vote

      favorite









      up vote
      1
      down vote

      favorite











      So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions



      1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?



      2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?



      Thank you!










      share|cite|improve this question







      New contributor




      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions



      1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?



      2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?



      Thank you!







      distributions mean random-variable sample population






      share|cite|improve this question







      New contributor




      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|cite|improve this question







      New contributor




      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|cite|improve this question




      share|cite|improve this question






      New contributor




      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked 4 hours ago









      A.Asad

      1083




      1083




      New contributor




      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.




















          2 Answers
          2






          active

          oldest

          votes

















          up vote
          2
          down vote



          accepted










          The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.



          You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.



          The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.



          We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.



          Note the difference between



          $$
          s = y_1, dots, y_N
          $$



          and



          $$
          S = Y_1, dots, Y_N
          $$



          $S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.



          Therefore the sample mean itself can be restated as a random variable $bar S$.



          Compare



          $$
          bar s = frac y_1 + cdots + y_NN
          $$



          with



          $$
          bar S = frac Y_1 + cdots + Y_NN
          $$



          $bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.



          Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.



          With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.



          To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.






          share|cite|improve this answer




















          • Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
            – A.Asad
            3 hours ago










          • @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
            – shadowtalker
            3 hours ago











          • So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
            – A.Asad
            3 hours ago










          • @A.Asad that's correct
            – shadowtalker
            3 hours ago










          • Thank you so much!!
            – A.Asad
            2 hours ago

















          up vote
          0
          down vote













          For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.



          Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.



          And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.



          An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).



          • Assume we know it is fair to avoid a rabbit hole.





          share|cite|improve this answer




















            Your Answer




            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "65"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            convertImagesToLinks: false,
            noModals: false,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );






            A.Asad is a new contributor. Be nice, and check out our Code of Conduct.









             

            draft saved


            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368492%2fabout-sampling-and-random-variables%23new-answer', 'question_page');

            );

            Post as a guest






























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            2
            down vote



            accepted










            The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.



            You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.



            The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.



            We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.



            Note the difference between



            $$
            s = y_1, dots, y_N
            $$



            and



            $$
            S = Y_1, dots, Y_N
            $$



            $S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.



            Therefore the sample mean itself can be restated as a random variable $bar S$.



            Compare



            $$
            bar s = frac y_1 + cdots + y_NN
            $$



            with



            $$
            bar S = frac Y_1 + cdots + Y_NN
            $$



            $bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.



            Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.



            With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.



            To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.






            share|cite|improve this answer




















            • Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
              – shadowtalker
              3 hours ago











            • So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad that's correct
              – shadowtalker
              3 hours ago










            • Thank you so much!!
              – A.Asad
              2 hours ago














            up vote
            2
            down vote



            accepted










            The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.



            You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.



            The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.



            We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.



            Note the difference between



            $$
            s = y_1, dots, y_N
            $$



            and



            $$
            S = Y_1, dots, Y_N
            $$



            $S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.



            Therefore the sample mean itself can be restated as a random variable $bar S$.



            Compare



            $$
            bar s = frac y_1 + cdots + y_NN
            $$



            with



            $$
            bar S = frac Y_1 + cdots + Y_NN
            $$



            $bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.



            Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.



            With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.



            To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.






            share|cite|improve this answer




















            • Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
              – shadowtalker
              3 hours ago











            • So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad that's correct
              – shadowtalker
              3 hours ago










            • Thank you so much!!
              – A.Asad
              2 hours ago












            up vote
            2
            down vote



            accepted







            up vote
            2
            down vote



            accepted






            The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.



            You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.



            The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.



            We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.



            Note the difference between



            $$
            s = y_1, dots, y_N
            $$



            and



            $$
            S = Y_1, dots, Y_N
            $$



            $S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.



            Therefore the sample mean itself can be restated as a random variable $bar S$.



            Compare



            $$
            bar s = frac y_1 + cdots + y_NN
            $$



            with



            $$
            bar S = frac Y_1 + cdots + Y_NN
            $$



            $bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.



            Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.



            With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.



            To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.






            share|cite|improve this answer












            The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.



            You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.



            The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.



            We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.



            Note the difference between



            $$
            s = y_1, dots, y_N
            $$



            and



            $$
            S = Y_1, dots, Y_N
            $$



            $S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.



            Therefore the sample mean itself can be restated as a random variable $bar S$.



            Compare



            $$
            bar s = frac y_1 + cdots + y_NN
            $$



            with



            $$
            bar S = frac Y_1 + cdots + Y_NN
            $$



            $bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.



            Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.



            With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.



            To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.







            share|cite|improve this answer












            share|cite|improve this answer



            share|cite|improve this answer










            answered 3 hours ago









            shadowtalker

            7,32323180




            7,32323180











            • Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
              – shadowtalker
              3 hours ago











            • So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad that's correct
              – shadowtalker
              3 hours ago










            • Thank you so much!!
              – A.Asad
              2 hours ago
















            • Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
              – shadowtalker
              3 hours ago











            • So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
              – A.Asad
              3 hours ago










            • @A.Asad that's correct
              – shadowtalker
              3 hours ago










            • Thank you so much!!
              – A.Asad
              2 hours ago















            Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
            – A.Asad
            3 hours ago




            Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,…,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
            – A.Asad
            3 hours ago












            @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
            – shadowtalker
            3 hours ago





            @A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
            – shadowtalker
            3 hours ago













            So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
            – A.Asad
            3 hours ago




            So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
            – A.Asad
            3 hours ago












            @A.Asad that's correct
            – shadowtalker
            3 hours ago




            @A.Asad that's correct
            – shadowtalker
            3 hours ago












            Thank you so much!!
            – A.Asad
            2 hours ago




            Thank you so much!!
            – A.Asad
            2 hours ago












            up vote
            0
            down vote













            For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.



            Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.



            And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.



            An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).



            • Assume we know it is fair to avoid a rabbit hole.





            share|cite|improve this answer
























              up vote
              0
              down vote













              For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.



              Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.



              And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.



              An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).



              • Assume we know it is fair to avoid a rabbit hole.





              share|cite|improve this answer






















                up vote
                0
                down vote










                up vote
                0
                down vote









                For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.



                Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.



                And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.



                An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).



                • Assume we know it is fair to avoid a rabbit hole.





                share|cite|improve this answer












                For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.



                Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.



                And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.



                An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).



                • Assume we know it is fair to avoid a rabbit hole.






                share|cite|improve this answer












                share|cite|improve this answer



                share|cite|improve this answer










                answered 49 mins ago









                gary

                238210




                238210




















                    A.Asad is a new contributor. Be nice, and check out our Code of Conduct.









                     

                    draft saved


                    draft discarded


















                    A.Asad is a new contributor. Be nice, and check out our Code of Conduct.












                    A.Asad is a new contributor. Be nice, and check out our Code of Conduct.











                    A.Asad is a new contributor. Be nice, and check out our Code of Conduct.













                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368492%2fabout-sampling-and-random-variables%23new-answer', 'question_page');

                    );

                    Post as a guest













































































                    Comments

                    Popular posts from this blog

                    Long meetings (6-7 hours a day): Being “babysat” by supervisor

                    Is the Concept of Multiple Fantasy Races Scientifically Flawed? [closed]

                    Confectionery