User:IssaRice/Pareto distribution: Difference between revisions

From Machinelearning
No edit summary
Line 18: Line 18:


from the graph of the inverse cdf, it's clear that the y axis isn't what we want (it doesn't correspond to income). it seems natural to scale the whole graph in this setting, so given the inverse cdf <math>x = \frac1{(1-F)^{1/\alpha}}</math>, scale this by a constant <math>\beta > 0</math> to get <math>x = \frac\beta{(1-F)^{1/\alpha}}</math>. Solving this for F gives <math>F = 1 - \beta^\alpha x^{-\alpha}</math> and differentiating wrt x gives the pdf <math>f(x) = \frac{\beta^\alpha\alpha}{x^{\alpha+1}}</math>.
from the graph of the inverse cdf, it's clear that the y axis isn't what we want (it doesn't correspond to income). it seems natural to scale the whole graph in this setting, so given the inverse cdf <math>x = \frac1{(1-F)^{1/\alpha}}</math>, scale this by a constant <math>\beta > 0</math> to get <math>x = \frac\beta{(1-F)^{1/\alpha}}</math>. Solving this for F gives <math>F = 1 - \beta^\alpha x^{-\alpha}</math> and differentiating wrt x gives the pdf <math>f(x) = \frac{\beta^\alpha\alpha}{x^{\alpha+1}}</math>.
since the y-axis of the inverse cdf was shifted uniformly, the "80/20"-ness of the distribution doesn't change.


==three ways to picture the pareto distribution==
==three ways to picture the pareto distribution==

Revision as of 22:13, 5 February 2020


I still don't know how this is derived, but here's how to make sense of the 80/20 rule:

What does it mean to say that the top 20% own 80% of the wealth? It means that where we choose the cutoff c according to .

Why? is the fraction of people with wealth above the cutoff c; we're just integrating the pdf of the distribution.

is the amount of wealth owned by the people with wealth above c. This is because at each wealth level x, such people are a fraction f(x)dx of people, and they each own wealth x, so the wealth owned by them is xf(x)dx. Now just sum over all of them from c to infinity.

The above can also be expressed as where the cutoff c is chosen by .

Using (the specific value of alpha needed to make the 80/20 rule hold) and in the Pareto distribution, we get . The mean is [1] and [2]. Sure enough, 5.77/7.213 is very close to 0.80.

Another way to look at this is to look at the inverse CDF, which takes the percentile of wealth to the actual value of the wealth. We should get [3].

deriving the scaled pareto

from the graph of the inverse cdf, it's clear that the y axis isn't what we want (it doesn't correspond to income). it seems natural to scale the whole graph in this setting, so given the inverse cdf , scale this by a constant to get . Solving this for F gives and differentiating wrt x gives the pdf .

since the y-axis of the inverse cdf was shifted uniformly, the "80/20"-ness of the distribution doesn't change.

three ways to picture the pareto distribution

  • pdf
  • inverse cdf; examples: [4], [5]
  • lorenz curve; examples: [6]