Musings on the Weibull distribution

Introduction

The Weibull distribution has two parameters: \(\alpha\) parameter sets the time scale whereas the shape parameter \(\beta\) sets the shape of the distribution. The failure probability density, cumulative failure distributions and the hazard rate (failure rate) are defined as follows:

\[\begin{equation} f_{\alpha,\beta}(t)=\frac{\beta }{\alpha} \left( \frac{t}{\alpha}\right)^{\beta -1} e^{-\left( \frac{t}{\alpha}\right)^{\beta}},\tag{1} \end{equation}\] \[\begin{equation} F_{\alpha,\beta}(t)= \int_{0}^{t} d\tau f_{\alpha,\beta}(\tau)=1-e^{-\left(\frac{t}{\alpha}\right)^{\beta}},\tag{2} \end{equation}\] \[\begin{equation} h_{\alpha,\beta}(t)= \frac{f_{\alpha,\beta}(t)}{1-F_{\alpha,\beta}(t)}=\frac{\beta }{\alpha} \left( \frac{t}{\alpha}\right)^{\beta -1}.\tag{3} \end{equation}\]

There is a misconception that Weibull distribution has a theoretical physical basis, which is clearly not true. In fact, Weibull was first to acknowledge that in the very original paper [1]:

“… The objection has been stated that this distribution function has no theoretical basis. But in so far as the author understands, there are-with very few exceptions-the same objections against all other [functions], applied to real populations from natural or biological fields, at least in so far as the theoretical basis has anything to do with the population in question. Furthermore, it utterly hopeless to expect a theoretical basis for distribution functions of random variables…”

Ernst Hjalmar Waloddi Weibull

I think one of the root causes of this misconception is the way this topic is introduced. Typically, the functions are defined in the order as in Eqs. (1) -(3). \(F_{\alpha,\beta}\), for example, is so neat and so simple, it almost looks like it comes off of a theoretical derivation. A more natural approach would have started from \(h_{\alpha,\beta}\), i.e., the hazard rate. It is the failure rate normalized to the survivors. What can the simplest form of \(h\) be? Let’s say it just a constant, and see where it gets us:

\[\begin{eqnarray} h_{\alpha,\beta}(t)& \equiv& \lambda=\frac{f_{\alpha,\beta}(t)}{1-F_{\alpha,\beta}(t)}= \frac{\frac{d}{dt} F_{\alpha,\beta}(t)}{1-F_{\alpha,\beta}(t)}=-\frac{d}{dt}\left[ln\left( 1-F_{\alpha,\beta}(t)\right)\right]\nonumber\\ &\implies&F_{\alpha,\beta}(t)=1-e^{-\int_0^td\tau \lambda}=1-e^{-\lambda t}. \tag{4} \end{eqnarray}\]
There you have it: if you go with the simplest assumption on \(h_{\alpha,\beta}\), you get the exponential distribution.

Let’s try a bit more generic function for \(h\). Let’s say it is a function of \(t\), and try \(t^\kappa\), where \(\kappa\) is a constant. It also needs a coefficient to fix the units, so let’s try the following:

\[\begin{eqnarray} h_{\alpha,\beta}(t)&\equiv& \zeta t^\kappa=-\frac{d}{dt}\left[ln\left( 1-F_{\alpha,\beta}(t)\right)\right]\equiv \lambda\nonumber\\ &\implies&F_{\alpha,\beta}(t)=1-e^{-\zeta \int_0^td\tau \tau^\kappa}=1-e^{-\frac{\zeta}{\kappa+1} t^{\kappa+1}}. \tag{5} \end{eqnarray}\]
Again, \(\kappa\) and \(\zeta\) are some constants, however we do not know what they are. Just to simplify the expression we can redefine \(\kappa+1\rightarrow\beta\), and and \(\zeta\rightarrow \frac{\beta}{\alpha^\beta}\), which gives \(F_{\alpha,\beta}(t)=1-e^{-\left(\frac{t}{\alpha}\right)^{\beta}}\).

The take away from this analysis is that, there is nothing mysterious about the Weibull function. It is basically the second simplest guess one can make.

Visualizing failure distributions

In the interactive plot below, \(\alpha\) and \(\beta\) values can be adjusted. The toggles on the right change the scales.
\(\beta\)
\(\alpha(hours)\)
\(x:\rm{lin/log}\)
\(y:\rm{lin/prob}\)
An interactive plot of Weibull distributions defined in  Eqs.<span class='myeqref'> \@ref(eq:weipdf) </span>,<span class='myeqref'> \@ref(eq:weicdf) </span> and  <span class='myeqref'>\@ref(eq:weihazard)</span>. Use the toggles for linear/log scales.  The vertical dashed lines mark the value $t=\alpha$, and the horizontal one on $F$ marks the value $0.63$. The $y$ axis can be transformed to $ln\left[-ln(1-F_{\alpha,\beta})\right]$, and  the time axis  can be transformed to $ln(t)$.   The effect of $\beta$ on $F$ is best observed with  the log time scale, and $y$ in the probability scale: notice how $F_{\alpha,\beta}$ becomes a line with the slope $\beta$. Selecting the log scale for time and linear scale for the $y$ axis, shows that the changing scale parameter $\alpha$ moves the functions horizontally while preserving their shapes.

Figure 1: An interactive plot of Weibull distributions defined in Eqs. (1) , (2) and (3). Use the toggles for linear/log scales. The vertical dashed lines mark the value \(t=\alpha\), and the horizontal one on \(F\) marks the value \(0.63\). The \(y\) axis can be transformed to \(ln\left[-ln(1-F_{\alpha,\beta})\right]\), and the time axis can be transformed to \(ln(t)\). The effect of \(\beta\) on \(F\) is best observed with the log time scale, and \(y\) in the probability scale: notice how \(F_{\alpha,\beta}\) becomes a line with the slope \(\beta\). Selecting the log scale for time and linear scale for the \(y\) axis, shows that the changing scale parameter \(\alpha\) moves the functions horizontally while preserving their shapes.

[1]
W. Weibull, “A statistical distribution function of wide applicability,” Journal of Applied Mechanics, vol. 18, pp. 293–297, 1951 [Online]. Available: http://web.cecs.pdx.edu/cgshirl/Documents/Weibull-ASME-Paper-1951.pdf

Related