About Sampling and Random Variables
Clash Royale CLAN TAG#URR8PPP
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;
up vote
1
down vote
favorite
So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions
1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?
2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?
Thank you!
distributions mean random-variable sample population
New contributor
add a comment |Â
up vote
1
down vote
favorite
So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions
1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?
2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?
Thank you!
distributions mean random-variable sample population
New contributor
add a comment |Â
up vote
1
down vote
favorite
up vote
1
down vote
favorite
So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions
1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?
2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?
Thank you!
distributions mean random-variable sample population
New contributor
So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions
1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?
2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?
Thank you!
distributions mean random-variable sample population
distributions mean random-variable sample population
New contributor
New contributor
New contributor
asked 4 hours ago
A.Asad
1083
1083
New contributor
New contributor
add a comment |Â
add a comment |Â
2 Answers
2
active
oldest
votes
up vote
2
down vote
accepted
The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.
You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.
The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.
We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.
Note the difference between
$$
s = y_1, dots, y_N
$$
and
$$
S = Y_1, dots, Y_N
$$
$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.
Therefore the sample mean itself can be restated as a random variable $bar S$.
Compare
$$
bar s = frac y_1 + cdots + y_NN
$$
with
$$
bar S = frac Y_1 + cdots + Y_NN
$$
$bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.
Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.
With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.
To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
Thank you so much!!
â A.Asad
2 hours ago
 |Â
show 1 more comment
up vote
0
down vote
For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.
Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.
And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.
An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).
- Assume we know it is fair to avoid a rabbit hole.
add a comment |Â
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
2
down vote
accepted
The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.
You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.
The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.
We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.
Note the difference between
$$
s = y_1, dots, y_N
$$
and
$$
S = Y_1, dots, Y_N
$$
$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.
Therefore the sample mean itself can be restated as a random variable $bar S$.
Compare
$$
bar s = frac y_1 + cdots + y_NN
$$
with
$$
bar S = frac Y_1 + cdots + Y_NN
$$
$bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.
Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.
With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.
To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
Thank you so much!!
â A.Asad
2 hours ago
 |Â
show 1 more comment
up vote
2
down vote
accepted
The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.
You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.
The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.
We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.
Note the difference between
$$
s = y_1, dots, y_N
$$
and
$$
S = Y_1, dots, Y_N
$$
$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.
Therefore the sample mean itself can be restated as a random variable $bar S$.
Compare
$$
bar s = frac y_1 + cdots + y_NN
$$
with
$$
bar S = frac Y_1 + cdots + Y_NN
$$
$bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.
Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.
With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.
To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
Thank you so much!!
â A.Asad
2 hours ago
 |Â
show 1 more comment
up vote
2
down vote
accepted
up vote
2
down vote
accepted
The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.
You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.
The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.
We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.
Note the difference between
$$
s = y_1, dots, y_N
$$
and
$$
S = Y_1, dots, Y_N
$$
$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.
Therefore the sample mean itself can be restated as a random variable $bar S$.
Compare
$$
bar s = frac y_1 + cdots + y_NN
$$
with
$$
bar S = frac Y_1 + cdots + Y_NN
$$
$bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.
Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.
With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.
To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.
The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.
You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.
The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.
We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.
Note the difference between
$$
s = y_1, dots, y_N
$$
and
$$
S = Y_1, dots, Y_N
$$
$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.
Therefore the sample mean itself can be restated as a random variable $bar S$.
Compare
$$
bar s = frac y_1 + cdots + y_NN
$$
with
$$
bar S = frac Y_1 + cdots + Y_NN
$$
$bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.
Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.
With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.
To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.
answered 3 hours ago
shadowtalker
7,32323180
7,32323180
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
Thank you so much!!
â A.Asad
2 hours ago
 |Â
show 1 more comment
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
Thank you so much!!
â A.Asad
2 hours ago
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â A.Asad
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â shadowtalker
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â A.Asad
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
@A.Asad that's correct
â shadowtalker
3 hours ago
Thank you so much!!
â A.Asad
2 hours ago
Thank you so much!!
â A.Asad
2 hours ago
 |Â
show 1 more comment
up vote
0
down vote
For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.
Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.
And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.
An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).
- Assume we know it is fair to avoid a rabbit hole.
add a comment |Â
up vote
0
down vote
For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.
Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.
And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.
An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).
- Assume we know it is fair to avoid a rabbit hole.
add a comment |Â
up vote
0
down vote
up vote
0
down vote
For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.
Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.
And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.
An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).
- Assume we know it is fair to avoid a rabbit hole.
For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.
Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.
And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.
An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).
- Assume we know it is fair to avoid a rabbit hole.
answered 49 mins ago
gary
238210
238210
add a comment |Â
add a comment |Â
A.Asad is a new contributor. Be nice, and check out our Code of Conduct.
A.Asad is a new contributor. Be nice, and check out our Code of Conduct.
A.Asad is a new contributor. Be nice, and check out our Code of Conduct.
A.Asad is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368492%2fabout-sampling-and-random-variables%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password