User:IssaRice/Pareto distribution

From Machinelearning
Revision as of 21:39, 5 February 2020 by IssaRice (talk | contribs)


I still don't know how this is derived, but here's how to make sense of the 80/20 rule:

What does it mean to say that the top 20% own 80% of the wealth? It means that where we choose the cutoff c according to .

Why? is the fraction of people with wealth above the cutoff c; we're just integrating the pdf of the distribution.

is the amount of wealth owned by the people with wealth above c. This is because at each wealth level x, such people are a fraction f(x)dx of people, and they each own wealth x, so the wealth owned by them is xf(x)dx. Now just sum over all of them from c to infinity.

The above can also be expressed as where the cutoff c is chosen by .

Using (the specific value of alpha needed to make the 80/20 rule hold) and in the Pareto distribution, we get . The mean is [1] and [2]. Sure enough, 5.77/7.213 is very close to 0.80.

Another way to look at this is to look at the inverse CDF, which takes the percentile of wealth to the actual value of the wealth. We should get [3].

three ways to picture the pareto distribution

  • pdf
  • inverse cdf; examples: [4], [5]
  • lorenz curve; examples: [6]