About Sampling and Random Variables

.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;

up vote
1
down vote

favorite

So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions

1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?

2) I understand the concept behind the distribution of the sample mean: Basically you do repeated sampling, find the mean of each sample and then draw the curve of the mean i.e. what values it can take with what probabilities. I'm assuming my concept is clear, can anyone confirm?

Thank you!

asked 4 hours ago

A.Asad

1083

New contributor

add a commentÂ |Â

up vote
1
down vote

favorite

So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions

1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?

Thank you!

asked 4 hours ago

A.Asad

1083

New contributor

add a commentÂ |Â

up vote
1
down vote

favorite

So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions

1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?

Thank you!

asked 4 hours ago

A.Asad

1083

New contributor

So I've recently started an introductory course in econometrics and I'm having trouble grasping the idea of Random Variables and Sample distributions

1) If we have a population and we take a sample $Y= Y1, Y2,...,Yn$, is $Y$ a random variable? Because what I usually read in texts is that Y is a random variable but I don't get how?

Thank you!

distributions mean random-variable sample population

asked 4 hours ago

A.Asad

1083

New contributor

asked 4 hours ago

A.Asad

1083

New contributor

asked 4 hours ago

A.Asad

1083

New contributor

asked 4 hours ago

A.Asad

1083

asked 4 hours ago

A.Asad

1083

New contributor

A.Asad is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

add a commentÂ |Â

2 Answers
2

active

oldest

votes

up vote
2
down vote

accepted

The random variable $Y$ describes a relationship between events and the corresponding probabilities of those events. In more practical terms, a random variable describes a data-generating process. When you generate a random data point that is described by the random variable $Y$, the probability distribution of $Y$ describes the probability distribution of values that can result.

You can think of a "population" as an infinite reservoir of values drawn from $Y$. Sampling from a population is analogous to repeatedly drawing new values from $Y$. A sample of size $N$ is a size-$N$ collection of individual draws from $Y$.

The sample is clearly not the same thing as the random variable itself, so we need a different notation for it. Let's call it $s = y_1, y_2, dots, y_N $. Each $y_n$ is a single draw from $Y$. The sample mean is a single number. Let's call it $bar s$. It is the mean of the sequence $s$, i.e. $bar s = fracy_1 + y_2 + dots + y_NN$.

We can make an interesting observation here! $N$ independent, identical draws from a random variable $Y$ is the same thing as one draw from each of $N$ independent, identical random variables $Y_n$. Now, we can talk about the sample itself as a random variable $S = Y_1, dots, Y_N $.

Note the difference between

$$
s = y_1, dots, y_N
$$

and

$$
S = Y_1, dots, Y_N
$$

$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.

Therefore the sample mean itself can be restated as a random variable $bar S$.

Compare

$$
bar s = frac y_1 + cdots + y_NN
$$

with

$$
bar S = frac Y_1 + cdots + Y_NN
$$

$bar s$ is just a number: it is the mean of a sequence of numbers $y_1, cdots, y_N$. But $bar S$ is a random variable! Specifically, it is a statistic, a single quantity that is calculated from a sample. The value of a statistic for a specific sample is a realization of the distribution for that statistic.

Being a random variable, draws from $bar S$ are described by a probability distribution. The distribution of sample means, across all possible samples, is described by the distribution of $bar S$. This distribution is the sampling distribution of the mean.

With regard to your first question, you are probably confused between the random variable $Y$ and the matrix $Y$. It is an unfortunate clash in notation that random variables and matrices are both conventionally written with capital letters. It is often mathematically convenient to express samples as matrices, so that you can do linear algebra operations on observed data (to generate estimates from that data, e.g. with ordinary least squares). The matrix $Y$ would be a matrix of observed values. Take care to observe the context, to avoid this confusion.

To address your 2nd question, there are many ways to derive or describe a sampling distribution. One possible technique is called resampling: repeatedly draw samples from a population that is distributed according to $Y$, and measure the sample mean in each of those samples. The distribution of those sample means should follow the sampling distribution of the mean.

answered 3 hours ago

shadowtalker

7,32323180

Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,Ã¢Â€Â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â€“Â shadowtalker
3 hours ago

So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad that's correct
â€“Â shadowtalker
3 hours ago

Thank you so much!!
â€“Â A.Asad
2 hours ago

Â |Â
show 1 more comment

up vote
0
down vote

For the sake of redundancy and addition, a random value is (the Mathematical modeling of the process of having a ) a measurement or experiment whose value is not predictable/deterministic ; its value can only be understood probabilistically, meaning that it can be tested over the longer run. A standard example is that of throwing a fair coin *: one cannot tell for any throw whether it will land heads or tails, but a pattern should emerge over the long run of limiting values for the probability P(Heads)=P(Tails)=1/2.

Re your $Y$, yes, the layout is a bit confusing. Like Shadowtalker said, $Y_1,Y_2,...,Y_n$ are the implementations of the same process $Y$ , where $Y$ may represent throwing a die, flipping a coin, etc. Then $Y_1, Y_2,..,Y_n$ , if independent, are said to be IID RVs , Independent , Identically-Distributed Random Variables.

And, yes, the sampling mean is the random variable that takes sample (quantitative) values $Y_1, Y_2,....,Y_n $ and assigns to them the value $frac X_1 +X_2+...+X_nn$. There are many other possible sample statistics: Sample variance, Sample error, etc.

An important result to note, I think, is the CLT: Central Limit Theorem which tells you that , no matter what the distribution , if $Z_1,Z_2,....,Z_m$ are independent and identically - distributed, then the sample mean will approach a normal distribution as n becomes large-enough ( $n>30 ; n>40$ for higher accuracy).

Assume we know it is fair to avoid a rabbit hole.

answered 49 mins ago

gary

238210

add a commentÂ |Â

Your Answer

StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\$","\$"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "65"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: false,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

A.Asad is a new contributor. Be nice, and check out our Code of Conduct.

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368492%2fabout-sampling-and-random-variables%23new-answer', 'question_page');

);

Post as a guest

Name

2 Answers
2

active

oldest

votes

2 Answers
2

active

oldest

votes

up vote
2
down vote

accepted

Note the difference between

$$
s = y_1, dots, y_N
$$

and

$$
S = Y_1, dots, Y_N
$$

$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.

Therefore the sample mean itself can be restated as a random variable $bar S$.

Compare

$$
bar s = frac y_1 + cdots + y_NN
$$

with

$$
bar S = frac Y_1 + cdots + Y_NN
$$

answered 3 hours ago

shadowtalker

7,32323180

Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,Ã¢Â€Â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â€“Â shadowtalker
3 hours ago

So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad that's correct
â€“Â shadowtalker
3 hours ago

Thank you so much!!
â€“Â A.Asad
2 hours ago

Â |Â
show 1 more comment

up vote
2
down vote

accepted

Note the difference between

$$
s = y_1, dots, y_N
$$

and

$$
S = Y_1, dots, Y_N
$$

$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.

Therefore the sample mean itself can be restated as a random variable $bar S$.

Compare

$$
bar s = frac y_1 + cdots + y_NN
$$

with

$$
bar S = frac Y_1 + cdots + Y_NN
$$

answered 3 hours ago

shadowtalker

7,32323180

Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,Ã¢Â€Â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â€“Â shadowtalker
3 hours ago

So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad that's correct
â€“Â shadowtalker
3 hours ago

Thank you so much!!
â€“Â A.Asad
2 hours ago

Â |Â
show 1 more comment

up vote
2
down vote

accepted

Note the difference between

$$
s = y_1, dots, y_N
$$

and

$$
S = Y_1, dots, Y_N
$$

$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.

Therefore the sample mean itself can be restated as a random variable $bar S$.

Compare

$$
bar s = frac y_1 + cdots + y_NN
$$

with

$$
bar S = frac Y_1 + cdots + Y_NN
$$

answered 3 hours ago

shadowtalker

7,32323180

Note the difference between

$$
s = y_1, dots, y_N
$$

and

$$
S = Y_1, dots, Y_N
$$

$S$ is random: it is a sequence of random variables. $s$ is not random. It is the realized value of a draw from $S$, i.e. a sequence of realized values of draws from $Y_1, dots, Y_N$.

Therefore the sample mean itself can be restated as a random variable $bar S$.

Compare

$$
bar s = frac y_1 + cdots + y_NN
$$

with

$$
bar S = frac Y_1 + cdots + Y_NN
$$

answered 3 hours ago

shadowtalker

7,32323180

answered 3 hours ago

shadowtalker

7,32323180

answered 3 hours ago

shadowtalker

7,32323180

answered 3 hours ago

shadowtalker

7,32323180

Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,Ã¢Â€Â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â€“Â shadowtalker
3 hours ago

So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad that's correct
â€“Â shadowtalker
3 hours ago

Thank you so much!!
â€“Â A.Asad
2 hours ago

Â |Â
show 1 more comment

Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,Ã¢Â€Â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â€“Â shadowtalker
3 hours ago

So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad that's correct
â€“Â shadowtalker
3 hours ago

Thank you so much!!
â€“Â A.Asad
2 hours ago

Hello! Thank you for the detailed answer. However, I've lost you on this statement :"N independent, identical draws from a random variable Y is the same thing as one draw from each of N independent, identical random variables Yn. Now, we can talk about the sample itself as a random variable S=Y1,Ã¢Â€Â¦,YN."Can you please elaborate on this further? Or maybe simplify it for me as I fail to understand what you're speaking of. Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad , pick a number $N$. If you flip one coin $N$ times, that is the same as flipping $N$ identical coins, once per coin.
â€“Â shadowtalker
3 hours ago

So what you're suggesting is that picking a random sample of size $N$ is the same as picking one number from each from $N$ identically distributed populations? This makes the sample $S=Y1,....,YN$ a random variable as it is a composition of N random variables? Thank you!
â€“Â A.Asad
3 hours ago

@A.Asad that's correct
â€“Â shadowtalker
3 hours ago

Thank you so much!!
â€“Â A.Asad
2 hours ago

Â |Â
show 1 more comment

up vote
0
down vote

Assume we know it is fair to avoid a rabbit hole.

answered 49 mins ago

gary

238210

add a commentÂ |Â

up vote
0
down vote

Assume we know it is fair to avoid a rabbit hole.

answered 49 mins ago

gary

238210

add a commentÂ |Â

up vote
0
down vote

Assume we know it is fair to avoid a rabbit hole.

answered 49 mins ago

gary

238210

Assume we know it is fair to avoid a rabbit hole.

answered 49 mins ago

gary

238210

answered 49 mins ago

gary

238210

answered 49 mins ago

gary

238210

answered 49 mins ago

gary

238210

add a commentÂ |Â

A.Asad is a new contributor. Be nice, and check out our Code of Conduct.

draft saved

draft discarded

A.Asad is a new contributor. Be nice, and check out our Code of Conduct.

draft saved

draft discarded

Post as a guest

Name

Search This Blog

Iyfjky