Real Numeracy

December 31, 2021

Happy birthday to you

Everyday is somebody’s birthday. Image from wishbirthday.org

by George Taniwaki

I have a Facebook friend whose birthdate was listed as January 1. When I saw this, I thought, “That can’t be true. Nobody is born on January 1.” But later, I realized that isn’t true. People are born on every day of the year. So just how likely is it that my friend’s birthdate in January 1?

Distribution of actual birthdates

If birthdates are distributed evenly, then every day should have the same number of births. Even if births are distributed evenly on all days, not every birthdate will have the same number of births because of leap years. Thus, except February 29, each day has a probability of 1 out of 365.242, or about 0.00274. Having a birthdate on February 29 would have a probability of 0.242 out of 365.242 or about 0.00066.

Birthdates are not randomly distributed, there is some seasonality. More children are born in the summer than in the winter (Wikipedia). Separately, studies show that childhood mortality may be dependent on birthdate. For example, death after preterm birth may be higher for babies born in the summer Epidemiology Sep 2009 and juvenile skin cancer deaths may be higher for babies born in the spring Brit J. Cancer Oct 2014.

Further, delivery date can be strongly influenced by desire of both the mother and the care giver. For instance, an obstetrician may not want to deliver a baby or make post-delivery rounds on weekends. If so, they may induce labor early in the work week. In the U.S., more babies are born on Monday through Wednesday than on Thursday through Sunday (Wikipedia). This has no effect on day of year distribution though because weekends can fall on any day. However, the desire to avoid non-work day births can affect holiday births. Many holidays are observed on Mondays, which will spread the distribution effect. However, some holidays are still observed on a specific date, including New Year’s Day, U.S. Independence Day, and Christmas. One would expect fewer births on or the days leading up to January 1.

A distribution of birthdates from 480,040 life insurance policy application forms submitted between 1981 to 1994 is available at https://www.panix.com/~murphy/bdata.txt. The counts and probability are plotted in Figure 1 below.

Figure 1. Distribution of birthdates from life insurance policy applications

The dip on February 29 is expected. There is also a dip between December 22 through December 26, likely days that both pregnant patients and caregivers want to avoid spending time in a hospital.

There are 1482 birthdates listed as January 1 (p = 0.00308) which is significantly higher than expected. It is also significantly higher than the number of births on December 31 (0.00281) or January 2 (0.00252). It appears that both pregnant patients and care givers like to delivery babies on New Year’s Day. However, given a choice between a December 31 and January 1 birth, this is poor tax planning strategy (IRS).

Anyway, it isn’t that unlikely that my friend’s birthdate is actually January 1.

Here we assume that the life insurance birthdate data is accurate and truthful. That’s because people do not have much incentive to lie about their birthdate, especially since a false statement on an application can cause the company to reject a claim.

Distribution of stated birthdates

Before we conclude that my friend was born on January 1, there is a second issue. That is, why did I think that my friend’s birthdate is not January 1? Shouldn’t I believe everything people post on Facebook (Independent, Mar 2015, PLOS One Feb 2015)?

My friend’s birthdate may not be January 1 even if they say it is. Perhaps they don’t know when their actual birthdate is (they were adopted and have never seen their birth certificate) and list it as January 1 because it seems like a good default date to use. Maybe they consider their birthdate to be private information and use January 1 as their publicly stated birthdate. Perhaps they think their real birthdate is uninteresting and use January 1 to make it more interesting. Finally, maybe they just like to lie.

Table 1 below shows all the combinations of actual birthdates (rows) and self-reported birthdates (columns).

Table 1. Probability of all combinations of self-reported and actual birthdates

The cells are color coded. The legend is shown in Figure 2. The yellow cell in the upper left corner is the probability that a person is telling the truth; they say their birthdate is January 1 and it actually is. The green shaded cells represent people who lie. They say their birthdate is January 1, but it is not. The blue shaded cells also represent people who lie. They are born on January 1 but state their birthdate is a different date. The remaining yellow shaded cells and the white cells represent all the other combinations that do nor involve January 1.

Figure 2. Legend for Table 1

The sum of all the probabilities in the first row (yellow shaded and blue shaded cells) will give the probability that the actual birthdate is January 1. The sum for any row gives the probability the actual birthdate is the i^th day of the year. We can estimate this probability using the life insurance data listed above.

The sum of all the probabilities in the first column (yellow shaded and green shaded cells) will give the probability that the self-reported birthdate is January 1. The sum for any column gives the probability a person will say their birthdate is the j^th day of the year. Facebook has this data for Facebook users, but I was not able to find any reference to it.

Measuring the nonsampling error

To find the probability that my friend is telling the truth, we need to know the ratio of truth telling to lying about birthdate for everyone who claims to be born on January 1. Unfortunately, we cannot estimate it by simply combining the life insurance birthdate data with the Facebook birthdate data. The life insurance dataset and the Facebook dataset each contain 366 values. The probability matrix has 366 x 366 = 133,956 cells. Even if we make some simplifying assumptions about truth-telling vs lying, there are too many degrees of freedom in the matrix to fill it.

Conclusion

We cannot tell if my friend is telling the truth or not and do not know if their birthdate is January 1 or not. So happy birthday!

[Correction1: Changed “women” to “patients”

Correction2: Clarified that some holidays are on Monday and others on a specific date]

February 27, 2021

Is that salad fresh?

Posted by gtaniwaki under Personal | Tags: Food |
Leave a Comment

by George Taniwaki

The local Kroger store sells sushi and wakame salad. Is it fresh? You bet, it’s good until 02-30. And that bright green color not found in nature. It contains enough blue and yellow food coloring to scare away any self-respecting bacteria. Tastes pretty good, just don’t look at it too carefully.

February 25, 2021

Celebrating Purim

Posted by gtaniwaki under Personal | Tags: Food |
Leave a Comment

Poppy Hamantashen just like my mom used to make. Photo by George Taniwaki

by George Taniwaki

Because of the pandemic I can’t join in big crowds and dance in the streets to celebrate Purim. Instead, I will remain in isolation and snack on poppy Hamantashen from Kroger.

Unfortunately, I have never participated in a Purim celebration in the past. Though, the Japanese have a custom of bestowing sweets when traveling, called omiyage. Luckily, I don’t need an excuse like a holiday or a visit to stuff my face with sweets.

December 6, 2020

Graphic Medicine—The Best of 2020

Posted by gtaniwaki under Health care, Real Numeracy | Tags: Access to healthcare |
Leave a Comment

Comics are fun. That makes them a good way to explain behavioral changes needed to stay healthy and safe. Image by Gemma Corell from Creativity in Captivity

by George Taniwaki

The latest issue of JAMA (Nov 2020) has a short article about the best nontechnical graphic work in medicine this past year. Naturally, the most popular topic has been the coronavirus pandemic.

I’m a big fan of visual displays of data and of comics, so this story really piqued my interest. In case you cannot access the JAMA article, I reproduce the links below.

July 11, 2020

Thinking about mental errors

Posted by gtaniwaki under Health care, Real Numeracy | Tags: Cognitive bias, Covid-19 |
Leave a Comment

Don’t let mental errors cloud your thinking. Image by Jan Buchczik for The Atlantic

by George Taniwaki

Arthur Brooks is a conservative social scientist. He is on the faculty of Harvard Business School and was formerly president of the American Enterprise Institute. Since 2019, he has been writing a series of articles in The Atlantic, now called “How to Build a Life.” With the onset of the Covid-19 pandemic, the articles have included advice on how to live a a happier and better life by understanding our life circumstances.

In his Apr 23, 2020 article entitled “Two Errors Our Minds Make When Trying to Grasp the Pandemic”, he makes the case that we would be happier if we understood the difference between two experiences that make us unhappy and two conditions that make us nervous. It is a very thought provoking article and I highly recommend it.

Regret and disappointment

Regret and disappointment both lead to unhappiness. They seem similar but are not. We should only feel regret for bad decisions that we have made. Then we should work hard to develop strategies to do better next time. But we should not feel disappointment.

In contrast, we should only feel disappointment when we are in situations where we had no control, like the Covid-19 pandemic. And once we recognize we have no control, we should endeavor to stop our disappointment and get on with other thoughts that will make us happy. As Brooks says, “rumination on what you would be doing if it weren’t for the coronavirus is a destructive waste of your time.”

Risk and uncertainty

Most people dislike risk and uncertainty. Again, these conditions seem similar but are not. As Secretary of Defense Donald Rumsfeld famously stated, “There are known unknowns. That is to say, there are things that we know we don’t know. But there are also unknown unknowns. There are things we don’t know we don’t know.”

Risks can be thought of as the known unknowns. These are outcomes that we cannot accurately predict, but understand well enough that we can forecast them using stochastic models. We can also mitigate and manage risks by working hard using the appropriate strategies and interventions.

Uncertainty are the unknown unknowns. How many people will die from Covid-19? Is it safe to open schools in the fall? Will I or a family member get the disease? We don’t know and can’t predict these with the information currently available. That is, we as laypersons cannot convert uncertainty into risk. Thus, we should not spend a lot of time worrying about these questions. Doing so will exhaust us and make us unhappy without leading us to a better prediction.

Acknowledge, distinguish, resolve

Mr. Brooks has a three step solution to overcoming these two cognitive errors. He calls his solution “acknowledge, distinguish, resolve.” As he writes, “Disappointment and uncertainty are inevitable, but we don’t have to turn them into suffering.”

June 8, 2020

Trump as war hero

Posted by gtaniwaki under Personal | Tags: History |
Leave a Comment

This is the end… Image from MGM United Artists

by George Taniwaki

On June 1, 2020, President Donald Trump led a group of White House cabinet members and advisors across the street to St. John’s Episcopal Church. Once there he staged a photo op of him holding a bible. He took a few questions but did not have any prepared statements. Then everyone walked back to the White House.

Prior to the walk, National Park Service police cleared out mostly peaceful protesters from Lafayette Square through use of rubber bullets and pepper munitions. Once they “captured” the square, they formed a cordon around the path for the president and his entourage.

I was rather startled by this event and immediately thought of the parallels to a scene in the movie Apocalypse Now. This classic Vietnam War movie, released in 1979, was directed by Francis Ford Coppola. In the scene I am thinking of, Lieutenant Colonel Bill Kilgore, played by Robert Duvall, is the leader of a helicopter cavalry unit. He decides he wants to go surf with his men, so he calls in a napalm strike against a fishing village sympathetic to the Viet Cong. Once the beach is “neutralized” he ends up unhappy because the napalm and helicopters are causing the wind to blow the wrong way, ruining the waves.

One of the most famous quotes from the movie are spoken by Duvall’s character during the scene. “I love the smell of napalm in the morning… It smells like victory.” I can almost imagine Trump saying it.

More scenes from Apocalypse Now. Images from MGM United Artists

Check out the images below from news sites and compare them to the images at top and above taken from the movie.

Images from President Trump’s photo op (from top to bottom): Park Service Police clearing Lafayette Square (AP Photo Alex Brandon); Trump and his entourage crossing the Square, Trump giving a fist bump to police; Army Gen. Mark Milley, chairman of the Joint Chiefs of Staff in fatigues on the right (AP Photo Patrick Semansky); Trump ignoring the graffiti of FTP (AP Photo Patrick Semansky); Trump holding the bible while pointing at reporter

Update: Corrected first paragraph. President Trump did take questions. But he did not have a prepared statement.

Real Numeracy

Happy birthday to you

Distribution of actual birthdates

Distribution of stated birthdates

Measuring the nonsampling error

Conclusion

Is that salad fresh?

Celebrating Purim

Graphic Medicine—The Best of 2020

Instructional comics

Personal stories

Comics as Therapy

Thinking about mental errors

Regret and disappointment

Risk and uncertainty

Acknowledge, distinguish, resolve

Trump as war hero

Search this blog

RSS feeds

Categories

More by other kidney donors

More by other data crunchers

Archives