Publishable Stuff

Rasmus Bååth's Research Blog

The Most Comprehensive Review of Comic Books Teaching Statistics

As I’m more or less an autodidact when it comes to statistics, I have a weak spot for books that try to introduce statistics in an accessible and pedagogical way. I have therefore collected what I believe are all books that introduce statistics using comics (at least those written in English). What follows are highly subjective reviews of those four books. If you know of any other comic book on statistics, please do tell me!

I’ll start with a tl;dr version of the reviews, but first here are the four books:

  • ☆☆☆☆☆ / 5
    The Cartoon Guide to Statistics by Larry Gonick and Woollcott Smith
    Witty, pedagogical and comprehensive, this is the best book of the bunch! It provides a historical perspective and covers quite advanced topics such as confidence intervals, regression analysis and probability theory. The book contains a fair deal of mathematical notation but still manages to be accessible.
  • ☆☆☆☆ / 5
    The Manga Guide to Statistics by Shin Takahashi
    This book is perhaps a bit more manga than statistics but still manages to cover the basics of data analysis and hypothesis testing. The artwork is great (if you like manga, that is) and the storyline is cute (again, if you like manga). If you don’t like manga and/or hypothesis testing, then this might not be your cup of bubble tea.
  • ☆☆ / 5
    The Cartoon Introduction to Statistics by Grady Klein and Alan Dabney
    Does a decent job at introducing hypothesis testing and confidence intervals, but I found both the presentation and the content lacking in many respects. The writing is constantly “chopped up”, where a sentence can span several panels, which makes the book tedious to read. The artwork is ok but looks (to me) like it is missing outlines. The content is very focused on normal sampling distributions, unwarrantedly so if it aims to be a general introduction to statistics.
  • ☆ / 5
    Introducing Statistics, a Graphic Guide by Eileen Magnello and Borin Van Loon
    The artwork is absurdist. Perhaps it is too cool for me, but I found it annoying. The content is a mess, a haphazard mix of historical facts, probability theory and statistical concepts. It is also full of obscure explanations and outright errors. Would not read again.

The Cartoon Guide to Statistics

Written in 1993 by Larry Gonick and Woollcott Smith, this is still the book, out of the four reviewed, that feels most up to date. It covers a wide range of topics, starting with summary statistics and basic probability and working its way through probability distributions, experimental design, confidence intervals and linear regression. It even touches on more advanced subjects such as resampling methods. The book is not written as a standard comic book with panels and a story line; rather, it is a well-written, easygoing text accompanied by fun and lively sketches by Larry Gonick.

The book doesn’t skimp on the mathematical notation, which might put some off, but the presentation is still more accessible than most of the introductory stats books I’ve come across. If you are interested in the statistical programming language R, a thing to note is that many of the graphs in the book are made with S, the precursor to R, like the scatter plot below:

Like all the books reviewed here the Cartoon Guide to Statistics is focused on frequentist statistics, but at least the Bayesian perspective gets a mention:

This is a great book which is witty and pleasant to read (+ it is pretty cheap). Highly recommended!

Amazon link: The Cartoon Guide to Statistics

The Manga Guide to Statistics

This book, written by Shin Takahashi and illustrated by Iroha Inoue, is one in a long series of Manga Guides translated and published by No Starch Press. Like the Manga Guide to Databases or the Manga Guide to Calculus, the Manga Guide to Statistics takes a subject with a reputation for being difficult and technical and adds a clichéd story full of huge eyes, naive schoolgirls and geeky geeks without any social ability.

As opposed to the Cartoon Guide to Statistics, the Manga Guide reads more like a standard comic book with panels and a story line. The story centers around the schoolgirl Rui, who wants to learn statistics to impress the handsome Mr. Igarashi. To her rescue comes Mr. Yamamoto, a stats nerd with thick glasses. The story and the artwork are archetypal manga (including very stereotypical gender roles), but if you can live with that, it is a pretty fun story.

Quite a lot of space in the book is devoted to the storyline rather than to teaching statistics, and to compensate for this there are text passages interspersed between the manga passages. Still, the book doesn’t cover that much ground, and except for basic graphs and summary statistics it is focused on classical hypothesis testing.

The book also describes the basic probability distributions, but mostly from a hypothesis testing perspective, which feels a bit odd. I could imagine that without any background in statistics, a reader of the Manga Guide would find it quite hard to understand the rationale for actually performing hypothesis tests. Maybe a good companion will be the soon-to-be-released Manga Guide to Regression Analysis.

Amazon link: The Manga Guide to Statistics

The Cartoon Introduction to Statistics

A book by Grady Klein and Alan Dabney with a similar aim and a deceptively similar name to The Cartoon Guide to Statistics. I, however, found the Cartoon Introduction lacking in almost all aspects compared to the Cartoon Guide. The artwork didn’t click with me. Somehow it looks like the characters lack outlines, and the illustrations distract from the text rather than enhance it. The most irritating… aspect of the… book is that… most sentences are spread out… over many panels.

This “dilution” of the text makes it hard to follow, and for a book of ~200 pages it covers relatively few concepts. I’ve also got a gripe with what the book covers, as it is extremely focused on normal sampling distributions. Sure, it is an important concept in classical statistics, but it takes up a really large part of the book while other important concepts, such as probability, get little attention. This being a new book (2013), it is strange that it does not mention Bayesian statistics at all; it is actually the most frequentist book of all the books reviewed here. However, when looking at how confidence intervals are characterized it seems like the book actually is describing a Bayesian credible interval…
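For readers who want the distinction spelled out: a frequentist 95% confidence interval is a statement about the procedure (over many repeated samples, roughly 95% of the intervals cover the true value), not a 95% probability statement about the parameter given one data set, which is what a credible interval gives you. Here is a minimal simulation sketch of the frequentist reading. It is my own illustration, not something taken from the book, and all the numbers (true mean 10, known standard deviation 2, samples of 30, 2000 repetitions) and the function name `coverage` are made-up assumptions.

```python
import random
import statistics


def coverage(true_mean=10.0, sd=2.0, n=30, trials=2000, seed=1):
    """Fraction of 95% z-intervals (known sd) that contain the true mean.

    All parameter values are illustrative assumptions, not from the book.
    """
    rng = random.Random(seed)
    z = 1.96                      # approximate 97.5% quantile of the standard normal
    se = sd / n ** 0.5            # standard error of the mean when sd is known
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(true_mean, sd) for _ in range(n)]
        m = statistics.mean(sample)
        # Does this particular interval cover the fixed, true mean?
        if m - z * se <= true_mean <= m + z * se:
            hits += 1
    return hits / trials


rate = coverage()
print(f"Share of intervals covering the true mean: {rate:.3f}")
```

The printed share should land close to 0.95. Any single interval either contains the true mean or it doesn't; the 95% describes the long-run behaviour of the recipe.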

Amazon link: The Cartoon Introduction to Statistics

See also this review of the Cartoon Introduction to Statistics by Christian P. Robert.

Introducing Statistics, a Graphic Guide

Wow! This is an odd book that might be one of the worst introductions to statistics you can get as a beginner. The artwork by the surrealist painter Borin Van Loon is truly surreal. Perhaps it is hip and cool (and perhaps I’m not cool enough), but it definitely does not add clarity to the subject the book aims to introduce. The worst part of the book is the text, which introduces statistical concepts and historical facts in a completely haphazard order. The book is very focused on early statistical scientists such as Galton, Pearson, Gosset and Fisher, and here is one of the many surreal versions of Pearson’s head:

Notice the speech balloon, coming out of Pearson’s mouth, which does not make sense. What does it mean to measure a group, and why would the coefficient of variation help when measuring groups with “a great deal of variation”? In general there are a lot of statements in this book that do not make sense or that are plain wrong. A random sample:

  • “Vital statistics is concerned with averages whereas mathematical statistics deals with variation.” When characterizing the two “types” of statistics.

  • “There are two types of statistical distributions: probability distributions, which describe the possible events in a sample and the frequency with which each occur; and frequency distributions.”

  • “The Poisson distribution […] is a discrete probability distribution used to describe the occurrence of unlikely events in a large number of independent repeated trials.” Why unlikely? Why a large number of trials?

  • “The standard deviation shows the deviation from the mean and the frequency of this deviation.” The frequency of a deviation?

  • “The variance is also a measure of variation, but [as opposed to the standard deviation,] it is used for random variables and indicates the extent to which its values are spread around the expected values.” The many expected values? So the variance is only for random variables…

The list could go on, so why not let it?

  • “[The Bayesian approach] is a means of calculating from the number of times that an event has not occurred to determine the probability that it will occur in future trials.” I can’t even begin to understand what this means…

  • “[Relative frequency] is a more scientific and objective approach than the other types of probability, and is used for finding out about the world and assessing actual existing objects. One can flip a coin 100 times and record the number of heads and tails and the ratio of the number of heads to the total number of flips.” Assessing actual existing objects? More scientific? What does this have to do with flipping a coin 100 times?

  • “[The chi-square distribution and the chi-square goodness of fit test’s] overriding significance was that statisticians could now use statistical methods that did not depend on the normal distribution to interpret their findings.”

  • “The expected values represent the average amount one ‘expects’ as the outcome of the random trial when identical odds are repeated many times.”

Or what about this panel on “higher mathematics”:

If it is not clear by now, this is not a book I recommend unless you really are looking for a surreal introduction to statistics…

Amazon link: Introducing Statistics, a Graphic Guide

All images and quotes included in this review are copyrighted by their respective copyright holders; however, I believe that the inclusion of these quotes and images in this review constitutes fair use.
