Estatísticas suficientes em conjunto: uniforme (a, b)

Vamos cuidar do cálculo de rotina para você, para que você possa chegar ao cerne do problema e aproveitar a formulação de uma solução. Tudo se resume a construir retângulos como uniões e diferenças de triângulos.

Primeiro, escolha valores de $a$ e $b$ que fazem os detalhes mais simples possível. Gosto de $a=0,b=1$ : a densidade univariada de qualquer componente de $X=(X_1,X_2,\ldots,X_n)$ é apenas a função indicadora do intervalo $[0,1]$ .

Vamos encontrar a função de distribuição $F$ de $(Y_1,Y_n)$ . Por definição, para qualquer número real $y_1 \le y_n$ isso é

\begin{matrix} (1) & F (y_{1}, y_{n}) = Pr (Y_{1} \leq y_{1} and Y_{n} \leq y_{n}) . \end{matrix}

$F(y_1,y_n) = \Pr(Y_1\le y_1\text{ and } Y_n \le y_n).\tag{1}$

Os valores de são obviamente ou , caso algum de ou esteja fora do intervalo , então vamos assumir que ambos estão nesse intervalo. (Vamos também assumir para evitar discutir trivialidades.) Nesse caso, o evento pode ser descrito em termos das variáveis originais $F$ $0$ $1$ $y_1$ $y_n$ $[a,b] = [0,1]$ $n\ge 2$ $(1)$ como "pelo menos um dos é menor ou igual a e nenhum dos excede ". Equivalentemente, todos os estão em mas não é o caso em que todos eles se encontram . $X=(X_1,X_2,\ldots,X_n)$ $X_i$ $y_1$ $X_i$ $y_n$ $X_i$ $[0,y_n]$ $(y_1,y_n]$

Because the $X_i$ are independent, their probabilities multiply and give $(y_n-0)^n = y_n^n$ and $(y_n-y_1)^n$ , respectively, for these two events just mentioned. Thus,

F (y_{1}, y_{n}) = y_{n}^{n} - (y_{n} - y_{1})^{n} .

$F(y_1,y_n) = y_n^n - (y_n-y_1)^n.$

The density $f$ is the mixed partial derivative of $F$ ,

f (y_{1}, y_{n}) = \frac{\partial^{2} F}{\partial y_{1} \partial y_{n}} (y_{1}, y_{n}) = n (n - 1) (y_{n} - y_{1})^{n - 2} .

$f(y_1,y_n) = \frac{\partial^2 F}{\partial y_1 \partial y_n}(y_1,y_n) = n(n-1)(y_n-y_1)^{n-2}.$

The general case for $(a,b)$ scales the variables by the factor $b-a$ and shifts the location by $a$ . Thus, for $a \lt y_1 \le y_n \lt b$ ,

F (y_{1}, y_{n}; a, b) = ({(\frac{y_{n} - a}{b - a})}^{n} - {(\frac{y_{n} - a}{b - a} - \frac{y_{1} - a}{b - a})}^{n}) = \frac{(y_{n} - a)^{n} - (y_{n} - y_{1})^{n}}{(b - a)^{n}} .

$F(y_1,y_n; a,b) = \left(\left(\frac{y_n-a}{b-a}\right)^n - \left(\frac{y_n-a}{b-a} - \frac{y_1-a}{b-a}\right)^n\right) = \frac{(y_n-a)^n - (y_n-y_1)^n}{(b-a)^n}.$

Differentiating as before we obtain

f (y_{1}, y_{n}; a, b) = \frac{n (n - 1)}{(b - a)^{n}} (y_{n} - y_{1})^{n - 2} .

$f(y_1,y_n; a,b) = \frac{n(n-1)}{(b-a)^n}(y_n-y_1)^{n-2}.$

Consider the definition of completeness. Let $g$ be any measurable function of two real variables. By definition,

\begin{matrix} (2) & \begin{aligned} E [g (Y_{1}, Y_{n})] & = \int_{y_{1}}^{b} \int_{a}^{b} g (y_{1}, y_{n}) f (y_{1}, y_{n}) d y_{1} d y_{n} \\ \propto \int_{y_{1}}^{b} \int_{a}^{b} g (y_{1}, y_{n}) (y_{n} - y_{1})^{n - 2} d y_{1} d y_{n} . \end{aligned} \end{matrix}

$\eqalign{E[g(Y_1,Y_n)] &= \int_{y_1}^b\int_a^b g(y_1,y_n) f(y_1,y_n)dy_1dy_n\\ &\propto\int_{y_1}^b\int_a^b g(y_1,y_n) (y_n-y_1)^{n-2} dy_1dy_n.\tag{2} }$

We need to show that when this expectation is zero for all $(a,b)$ , then it's certain that $g=0$ for any $(a,b)$ .

Here's your hint. Let $h:\mathbb{R}^2\to \mathbb{R}$ be any measurable function. I would like to express it in the form suggested by $(2)$ as $h(x,y)=g(x,y)(y-x)^{n-2}$ . To do that, obviously we must divide $h$ by $(y-x)^{n-2}$ . Unfortunately, for $n\gt 2$ this isn't defined whenever $y-x$ . The key is that this set has measure zero so we can neglect it.

Accordingly, given any measurable $h$ , define

g (x, y) = {\begin{matrix} h (x, y) / (y - x)^{n - 2} & x \neq y \\ 0 & x = y \end{matrix}

$g(x,y) = \left\{\matrix{h(x,y)/(y-x)^{n-2} & x \ne y \\ 0 & x=y}\right.$

Then $(2)$ becomes

\begin{matrix} (3) & \int_{y_{1}}^{b} \int_{a}^{b} h (y_{1}, y_{n}) d y_{1} d y_{n} \propto E [g (Y_{1}, Y_{n})] . \end{matrix}

$\int_{y_1}^b\int_a^b h(y_1,y_n) dy_1dy_n \propto E[g(Y_1,Y_n)].\tag{3}$

(When the task is showing that something is zero, we may ignore nonzero constants of proportionality. Here, I have dropped $n(n-1)/(b-a)^{n-2}$ from the left hand side.)

This is an integral over a right triangle with hypotenuse extending from $(a,a)$ to $(b,b)$ and vertex at $(a,b)$ . Let's denote such a triangle $\Delta(a,b)$ .

Ergo, what you need to show is that if the integral of an arbitrary measurable function $h$ over all triangles $\Delta(a,b)$ is zero, then for any $a\lt b$ , $h(x,y)=0$ (almost surely) for all $(x,y)\in \Delta(a,b)$ .

Although it might seem we haven't gotten any further, consider any rectangle $[u_1,u_2]\times [v_1,v_2]$ wholly contained in the half-plane $y \gt x$ . It can be expressed in terms of triangles:

[u_{1}, u_{2}] \times [v_{1}, v_{2}] = Δ (u_{1}, v_{2}) ∖ (Δ (u_{1}, v_{1}) \cup Δ (u_{2}, v_{2})) \cup Δ (u_{2}, v_{1}) .

$[u_1,u_2]\times [v_1,v_2] = \Delta(u_1,v_2) \setminus\left(\Delta(u_1,v_1) \cup \Delta(u_2,v_2)\right)\cup \Delta(u_2,v_1).$

In this figure, the rectangle is what is left over from the big triangle when we remove the overlapping red and green triangles (which double counts their brown intersection) and then replace their intersection.

Consequently, you may immediately deduce that the integral of $h$ over all such rectangles is zero. It remains only to show that $h(x,y)$ must be zero (apart from its values on some set of measure zero) whenever $y \gt x$ . The proof of this (intuitively clear) assertion depends on what approach you want to take to the definition of integration.

whuber
fonte

I tried to set equation 3 equal to zero, take the derivative on both sides and interchange the signs (a reflex action I guess) but the results look quite scary [1]. Is there a more reasonable approach? [1] en.wikipedia.org/wiki/Leibniz_integral_rule#Higher_dimensions

mugen

Consider finite collections of smaller and smaller triangles all lying along the hypotenuse in the picture and take the limit as the diameter of the largest triangle in the collection goes to zero.

whuber

Estatísticas suficientes em conjunto: uniforme (a, b)

Respostas: