Split topic from
here.
sum wrote:
I'm going to bump this topic because I am also having trouble getting an intuitive grasp, but this seems intuitive to the contributing posters.
Basically, if you have something like

, where

is a random variable and

is an event, then you get a scalar. However,

is not a scalar, but in fact, a random variable, but I don't know what this random variable represents. In the first case, the intuition is: if I know this event

happens, what is the new expected value of

. However, in the second case, I'm confused by the intuition. It doesn't make sense to say: what is the expected value of

if

happens (because what does it mean for

to happen? This is a collection of events, not just one.)
Can anyone explain this?
It may be useful to think of it this in the other way:
Conditional expectation is about the "best prediction" you can make with incomplete information (especially if you are thinking about stochastic processes --- we know about what happened in the past (well, sort of), but we in general don't know what will happen in the future, but to write

means you know what happens at time t for all t). So

represents the maximal information contained in

about

(again, in stochastic processes your

will typically be something like

and you want

). Since

itself is a random variable, we should still get a random variable back, but this time it has to be

-measurable since all we know is

(so, for example, if

doesn't distinguish between two points, then there is no reason we can make different prediction about

at these two points).
Then

as a scalar quantity, where A is a nonnull event, becomes a special case. It is the value of

at points of

(recall

is the sigma algebra generated by

), since all you know is whether A happened or not (it has). And of course, the "best" prediction we got about

in this case is its expected value "when A happens", which is your scalar (if

is scalar-valued)

.