Distributions
Distributions are parametrized random generators with well-defined statistical properties. Each distribution describes a family of random variables characterized by specific parameters.
Notation
- — is distributed as standard normal
- — estimate computed from sample
- — true value (asymptotic limit)
- — asymptotic case (large sample approximation)
Additive (Normal)
- : location parameter (center of the distribution), consistent with
- : scale parameter (standard deviation), can be rescaled to


- Formation: the sum of many variables under mild CLT (Central Limit Theorem) conditions (e.g., Lindeberg-Feller).
- Origin: historically called Normal or Gaussian distribution after Carl Friedrich Gauss and others.
- Rename Motivation: renamed to to reflect its formation mechanism through addition.
- Properties: symmetric, bell-shaped, characterized by central limit theorem convergence.
- Applications: measurement errors, heights and weights in populations, test scores, temperature variations.
- Characteristics: symmetric around the mean, light tails, finite variance.
- Caution: no perfectly additive distributions exist in real data; all real-world measurements contain deviations. Traditional estimators like and lack robustness to outliers; use them only when strong evidence supports approximate additivity with no extreme measurements.
Multiplic (LogNormal)
- : mean of log values (location parameter; equals the geometric mean)
- : standard deviation of log values (scale parameter; controls multiplicative spread)


- Formation: the product of many positive variables with mild conditions (e.g., finite variance of ).
- Origin: historically called Log-Normal or Galton distribution after Francis Galton.
- Rename Motivation: renamed to to reflect its formation mechanism through multiplication.
- Properties: logarithm of a (LogNormal) variable follows an (Normal) distribution.
- Applications: stock prices, file sizes, reaction times, income distributions, biological growth rates.
- Caution: no perfectly multiplic distributions exist in real data; all real-world measurements contain deviations. Traditional estimators struggle with the inherent skewness and heavy right tail.
Exponential
- : rate parameter (, controls decay speed; mean = )


- Formation: the waiting time between events in a Poisson process.
- Origin: naturally arises from memoryless processes where the probability of an event occurring is constant over time.
- Properties: memoryless (past events do not affect future probabilities).
- Applications: time between failures, waiting times in queues, radioactive decay, customer service times.
- Characteristics: always positive, right-skewed with a light (exponential) tail.
- Caution: extreme skewness makes traditional location estimators like unreliable; robust estimators provide more stable results.
Power (Pareto)
- : minimum value (lower bound, )
- : shape parameter (, controls tail heaviness; smaller values = heavier tails)


- Formation: follows a power-law relationship where large values are rare but possible.
- Origin: historically called Pareto distribution after Vilfredo Paretos work on wealth distribution.
- Rename Motivation: renamed to to reflect its connection with power-law.
- Properties: exhibits scale invariance and extremely heavy tails.
- Applications: wealth distribution, city population sizes, word frequencies, earthquake magnitudes, website traffic.
- Characteristics: infinite variance for many parameter values; extreme outliers are common.
- Caution: traditional variance-based estimators completely fail; robust estimators are essential for reliable analysis.
Uniform
- : lower bound of the support interval
- : upper bound of the support interval ()


- Formation: all values within a bounded interval have equal probability.
- Origin: represents complete uncertainty within known bounds.
- Properties: rectangular probability density, finite support with hard boundaries.
- Applications: random number generation, round-off errors, arrival times within known intervals.
- Characteristics: symmetric, bounded, no tail behavior.
- Note: traditional estimators work reasonably well due to symmetry and bounded nature.