4.6: Public Health Statistics
- Page ID
- 116195
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\dsum}{\displaystyle\sum\limits} \)
\( \newcommand{\dint}{\displaystyle\int\limits} \)
\( \newcommand{\dlim}{\displaystyle\lim\limits} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Public health uses data -- including a lot of numerical data -- to understand what's happening at the level of population health. Public health relies on a number of different types of statistics that are routinely collected, such as births, deaths, hospitalizations, and rates of disease. All of these are discussed below.
Health statistics are analyzed by epidemiologists and biostatisticians (think of it as "math for the life sciences") to (a) describe what's happening in a population and (b) understand the relationships among different exposures and the diseases or illnesses associated with them.
Vital statistics, prevalence, incidence, life expectancies -- all these are examples of health statistics.
Like all sciences, epidemiology has to deal with a degree of uncertainty. That isn't because science isn't working -- it's because it is the nature of science to be constantly evolving. When I was a kid, the idea that earthquakes were due to movement of huge plates on the earth (plate tectonics) was just one theory -- now it is the dominant explanation for why earthquakes arise. When I was a kid, we thought ulcers were caused by stress and if you had an ulcer, you were told to drink milk and rest more. Now we know that stomach ulcers are caused by bacteria and you take an antibiotic, not milk, to cure it. Knowledge is expanding through a slow accumulation of many different studies that clarify the answer to scientific questions over time. In the Covid-19 epidemic, you may remember, there have been a number of scientific discoveries -- and also some scientific mistakes. For example, it took a very long time for epidemiologists to confirm that covid could spread via aerosols, not just droplets. (Both aerosols and droplets are small water drops in the air that contain the virus, but they behave very differently. Droplets can only travel about 6 feet and they fairly quickly fall to the ground. Aerosols are much smaller and can stay suspended in air for a longer time and can move around with air currents. Masks are ideal for protecting against the transmission of covid through droplets. Ventilation is probably our best tool for protecting against the transmission of covid through aerosols -- although masks also help.) Uncertainty is a natural part of the scientific process.
Statistical analysis often produces a result with a "margin of error." For example, in elections, you might see that Candidate A is leading over Candidate B by a margin of "two to five percentage points." That's a margin of error -- they know the real number is somewhere between two and five, but they can't quite pin it down. Still, it provides a useful snapshot of what's happening in the election campaign. Similarly, in public health, we might have a "margin of error" around the measurement of how often an exposure is associated with a disease. For example, in the US, people who smoke tobacco die an average of 10 years younger than people who never smoked tobacco. That 10 years isn't a precise number -- there's an association between tobacco and premature death that falls in a margin of error around 10 years. When the "margin of error" is all above zero, we can still be certain that the exposure to tobacco (in this case) is truly related to the health outcome (premature death). When the "margin of error" includes the number 0, then we can't be sure. For example, in that election example, if we said that Candidate A is ahead of Candidate B by 3 percentage points, give or take 4 points, then our margin of error is -1 to 7% (taking that 3% and looking at the numbers 4 points below and 4 points above.) If Candidate A is -1% "ahead" then they aren't really ahead! It would be more accurate to say the two Candidates right now are tied, and we can't tell who is ahead, though it seems Candidate A might be taking the lead.
Below, we discuss various types of health statistics that public health uses most often.
OPTIONAL Video defining epidemiology
The epidemiologist in this video speaks rather fast, but he does an excellent job of explaining what epidemiology is, in just four minutes with useful diagrams. You can turn on captions (the CC symbol on the bottom bar of the video) if you like. (It may take a moment to load, because I set it to start about 2 minutes into the video.) I appreciate how he breaks down epidemiology's focus on examining the connection between an exposure to a germ or a condition and the health outcome that it might produce.


