User:IssaRice/Pareto distribution: Difference between revisions

From Machinelearning
No edit summary
No edit summary
Line 11: Line 11:
The above can also be expressed as <math>\frac{\mathbb E[X \mid X > c]}{\mathbb E[X]} = 0.80</math> where the cutoff c is chosen by <math>\mathbb P(X > c) = 0.20</math>.
The above can also be expressed as <math>\frac{\mathbb E[X \mid X > c]}{\mathbb E[X]} = 0.80</math> where the cutoff c is chosen by <math>\mathbb P(X > c) = 0.20</math>.


Using <math>\alpha = (\log 5)/(\log 4)</math> in the Pareto distribution, we get <math>c = \frac1{0.20^{1/\alpha} = \frac{1}{0.20^{(\log 4)/(\log 5)}}</math>. The mean is <math>\frac{\alpha}{\alpha - 1} \approx 7.213</math> and <math>\int_c^\infty xf(x)\, dx \approx 5.77</math>. Sure enough, 5.77/7.213 is very close to 0.80.
Using <math>\alpha = (\log 5)/(\log 4)</math> in the Pareto distribution, we get <math>c = \frac1{0.20^{1/\alpha}} = \frac{1}{0.20^{(\log 4)/(\log 5)}}</math>. The mean is <math>\frac{\alpha}{\alpha - 1} \approx 7.213</math> and <math>\int_c^\infty xf(x)\, dx \approx 5.77</math>. Sure enough, 5.77/7.213 is very close to 0.80.

Revision as of 20:21, 5 February 2020


I still don't know how this is derived, but here's how to make sense of the 80/20 rule:

What does it mean to say that the top 20% own 80% of the wealth? It means that where we choose the cutoff c according to .

Why? is the fraction of people with wealth above the cutoff c; we're just integrating the pdf of the distribution.

is the amount of wealth owned by the people with wealth above c. This is because at each wealth level x, such people are a fraction f(x)dx of people, and they each own wealth x, so the wealth owned by them is xf(x)dx. Now just sum over all of them from c to infinity.

The above can also be expressed as where the cutoff c is chosen by .

Using in the Pareto distribution, we get . The mean is and . Sure enough, 5.77/7.213 is very close to 0.80.