Chapter 9 Robust estimation

Puzzle 1

What is a robust estimate?

A robust estimate is one that is, on average, equal to the expected population value even when the normal assumptions of the statistic are not met.

Puzzle 2

What is the difference between trimming data and winsorizing it?

They both give robust estimates, but the trimmed mean is the mean based on scores that have had a percentage of extreme scores removed. For example, removing the highest and lowest 20% of scores and then computing the mean of the remaining scores would give us the 20% trimmed mean. Winsorizing data, on the other hand, is where a percentage of the highest scores are replaced with the next highest score (rather than being discarded) in the data and the same percentage of the lowest scores are replaced with the next lowest score in the data.

Puzzle 3

Zach randomly selected 10 scores from the professional services non-employees (see Figure 9.1 in the book): 14, 15, 13, 11, 16, 13, 21, 12, 11, 15. Calculate the mean, the 20% trimmed mean, the 10% trimmed mean, and the 20% winsorized mean.

First, let’s calculate the mean by adding the scores and dividing by the number of scores:

$$ \begin{aligned} \bar{X} &= \frac{\sum_{i = 1}^n x_i}{n} \\ &= \frac{14+15+13+11+16+13+21+12+11+15}{10} \\ &= \frac{141}{10} \\ &= 14.1. \end{aligned} $$

To trim 20% of the data from the two ends of the distribution, we need to trim 2 scores from each end (because 20% of 10 is 2). The mean of the remaining 6 scores is the 20% trimmed mean. We first need to arrange the scores in ascending order: 11, 11, 12, 13, 13, 14, 15, 15, 16, 21. Then we trim (i.e. delete) 2 scores from each end. The data are now: 12, 13, 13, 14, 15, 15 (note that we trimmed the two 11s from the bottom, and the 16 and 21 from the top). Finally, we calculate the mean of these 6 scores:

$$ \begin{aligned} \bar{X} &= \frac{\sum_{i = 1}^n x_i}{n} \\ &= \frac{12+13+13+14+15+15}{6} \\ &= \frac{82}{6} \\ &= 13.67. \end{aligned} $$

To trim 10% of the data, we need to trim 1 score from each end because 10% of 10 is 1. This involves removing the lowest score (11) and highest score (21). The remaining 8 scores are: 11, 12, 13, 13, 14, 15, 15, 16. The 10% trimmed mean will be the mean of these scores:

$$ \begin{aligned} \bar{X} &= \frac{\sum_{i = 1}^n x_i}{n} \\ &= \frac{11+ 12+13+13+14+15+15+16}{8} \\ &= \frac{109}{8} \\ &= 13.63. \end{aligned} $$

To calculate the 20% winsorized mean, we need to replace the top and bottom 20% of scores with the next highest or lowest score. For these data, the top 2 scores (16 and 21) are both replaced with the next highest score (15), and the bottom two scores (11 and 11) are replaced with the next lowest score (12). So the data becomes: 12, 12, 12, 13, 13, 14, 15, 15, 15, 15. We then calculate the mean of these data:

$$ \begin{aligned} \bar{X} &= \frac{\sum_{i = 1}^n x_i}{n} \\ &= \frac{12+12+12+13+13+14+15+15+15+15}{10} \\ &= \frac{136}{10} \\ &= 13.6. \end{aligned} $$

Puzzle 4

Square-root transform the above scores.

To square root transform the scores we replace each score with its square root.

Scores and their square root transformation
Original score $x_i$	Transformed score $\sqrt{x_i}$
14	3.74
15	3.87
13	3.61
11	3.32
16	4.00
13	3.61
21	4.58
12	3.46
11	3.32
15	3.87

Puzzle 5

Using the data in Table 9.3 (in the book), what was the mean strength of scientists in both the JIG:SAW group and the non-employees?

To calculate the mean strength, we need to add up all the scores in each group and then divide the total by the number of scientists in each group.

Let’s start with the strength scores for JIG:SAW employees:

1161, 1141, 1174, 1112, 1185, 1095, 1102, 1112, 1071, 1244, 1102, 1216, 1884, 1276, 1373, 1145, 1169, 1136, 1313, 1129, 1119, 1197, 1111, 1121, 1274, 1197, 1139, 1233, 1334, 1150, 1138, 1185, 1158, 1445, 1525, 1408, 1128, 1723

$$ \bar{X} = \frac{\sum_{i = 1}^n x_i}{n} = \frac{46725}{38} = 1229.61. $$

The mean scientists’ strength score for JIG:SAW employees was 1229.61.

Now, let’s move into the strength scores for Non-employees:

1321, 1153, 1072, 1218, 1088, 1373, 1135, 1055, 1096, 1007, 1223, 1291, 1171, 1101, 2091, 1308, 1141, 1433, 1141, 1212, 1769, 1071, 1412, 1214, 1031, 1209, 1222, 1241, 1740, 1367, 1313, 1208, 1257, 1376, 1155, 1065, 1147, 1166, 1566, 1436

$$ \bar{X} = \frac{\sum_{i = 1}^n x_i}{n} = \frac{50595}{40} = 1264.88. $$

The mean scientists’ strength score for non-employees was 1264.88.

Puzzle 6

Using the data in Table 9.3 (in the book and reproduced above), what was the 20% trimmed mean strength of scientists in both the JIG:SAW group and the non-employees?

First, we will calculate the 20% trimmed mean strength for the JIG:SAW employees. There are 38 scores in total and 20% of 38 is 7.6. We can’t remove 7.6 scores, so we will take 8 scores from each end of the distribution instead. The table shows the raw scores listed in ascending order, and in the final column I have deleted the bottom and top 8 scores. The 20% trimmed mean is the mean of the scores in this final column

$$ \bar{X}_\text{20% trimmed} = \frac{25896}{22} = 1177.09. $$

Trimming 20% of the strength scores (JIG:SAW employees)
	Participant ID	Strength (complete)	Strength (20% trimmed)
	14	1071
	8	1095
	12	1102
	17	1102
	51	1111
	5	1112
	13	1112
	44	1119
	53	1121	1121
	78	1128	1128
	43	1129	1129
	40	1136	1136
	66	1138	1138
	56	1139	1139
	2	1141	1141
	31	1145	1145
	65	1150	1150
	69	1158	1158
	1	1161	1161
	38	1169	1169
	3	1174	1174
	7	1185	1185
	68	1185	1185
	47	1197	1197
	55	1197	1197
	18	1216	1216
	60	1233	1233
	16	1244	1244
	54	1274	1274
	23	1276	1276
	41	1313
	62	1334
	24	1373
	75	1408
	72	1445
	74	1525
	82	1723
	22	1884
Sum		46,725.00	25,896.00
n		38.00	22.00
Mean		1,229.61	1,177.09

We calculate the 20% trimmed mean strength of the non-employees in exactly the same way. There are 40 scores in total, 20% of 40 = 8, so we will take 8 scores from each end of the distribution (after putting them in ascending order) and then calculate the mean of the remaining scores. The table shows the raw scores listed in ascending order, and in the final column I have deleted the bottom and top 8 scores. The 20% trimmed mean will be the mean of the scores in this column

$$ \bar{X}_\text{20% trimmed} = \frac{29287}{24} = 1220.29. $$

Trimming 20% of the strength scores (Non-employees)
	Participant ID	Strength (complete)	Strength (Winsorized)
	25	1007
	50	1031
	20	1055
	77	1065
	42	1071
	9	1072
	11	1088
	21	1096
	30	1101	1101
	19	1135	1135
	34	1141	1141
	36	1141	1141
	79	1147	1147
	6	1153	1153
	76	1155	1155
	80	1166	1166
	28	1171	1171
	67	1208	1208
	52	1209	1209
	37	1212	1212
	49	1214	1214
	10	1218	1218
	57	1222	1222
	26	1223	1223
	58	1241	1241
	70	1257	1257
	27	1291	1291
	33	1308	1308
	63	1313	1313
	4	1321	1321
	61	1367	1367
	15	1373	1373
	73	1376
	45	1412
	35	1433
	83	1436
	81	1566
	59	1740
	39	1769
	32	2091
Sum		50,595.00	29,287.00
n		40.00	24.00
Mean		1,264.88	1,220.29

Puzzle 7

Using the data in Table 9.3 (in the book and reproduced in Puzzle 5), what was the 20% winsorized mean strength of scientists in both the JIG:SAW group and non-employees?

To calculate the 20% Winsorized mean, we need to replace the top and bottom 20% of scores with the next highest or lowest score. If we start with the JIG:SAW employees, there were 38 in total and 20% of 38 is 7.6, but we would round this up to 8 because we need a whole number. Therefore, we take 8 scores from each end of the distribution and replace them with the next highest or lowest score. First, I put the scores into ascending order. I have done this in the table below. In the final column, I have replaced the largest 8 scores with the next largest score (1276), and replaced the lowest 8 scores with the next lowest score (1121). To get the 20% winsorized mean, calculate the mean of the final column

$$ \bar{X}_\text{winsorized} = \frac{45072}{38} = 1186.11. $$

Winsorizing the strength scores (JIG:SAW employees)
	Participant ID	Strength (complete)	Strength (Winsorized)
	14	1071	1121
	8	1095	1121
	12	1102	1121
	17	1102	1121
	51	1111	1121
	5	1112	1121
	13	1112	1121
	44	1119	1121
	53	1121	1121
	78	1128	1128
	43	1129	1129
	40	1136	1136
	66	1138	1138
	56	1139	1139
	2	1141	1141
	31	1145	1145
	65	1150	1150
	69	1158	1158
	1	1161	1161
	38	1169	1169
	3	1174	1174
	7	1185	1185
	68	1185	1185
	47	1197	1197
	55	1197	1197
	18	1216	1216
	60	1233	1233
	16	1244	1244
	54	1274	1274
	23	1276	1276
	41	1313	1276
	62	1334	1276
	24	1373	1276
	75	1408	1276
	72	1445	1276
	74	1525	1276
	82	1723	1276
	22	1884	1276
Sum		46,725.00	45,072.00
n		38.00	38.00
Mean		1,229.61	1,186.11

I did exactly the same for the non-employees: because there were 40 scores in total and 20% of 40 is 8, I took the raw scores and replaced the largest 8 scores with the next largest score (1373), and replaced the lowest 8 scores with the next lowest score (1101) — see the table below (final column). To get the 20% Winsorized mean, calculate the mean of the final column

$$ \bar{X}_\text{winsorized} = \frac{49079}{40} = 1226.97. $$

Winsorizing the strength scores (Non-employees)
	Participant ID	Strength (complete)	Strength (20% trimmed)
	25	1007	1101
	50	1031	1101
	20	1055	1101
	77	1065	1101
	42	1071	1101
	9	1072	1101
	11	1088	1101
	21	1096	1101
	30	1101	1101
	19	1135	1135
	34	1141	1141
	36	1141	1141
	79	1147	1147
	6	1153	1153
	76	1155	1155
	80	1166	1166
	28	1171	1171
	67	1208	1208
	52	1209	1209
	37	1212	1212
	49	1214	1214
	10	1218	1218
	57	1222	1222
	26	1223	1223
	58	1241	1241
	70	1257	1257
	27	1291	1291
	33	1308	1308
	63	1313	1313
	4	1321	1321
	61	1367	1367
	15	1373	1373
	73	1376	1373
	45	1412	1373
	35	1433	1373
	83	1436	1373
	81	1566	1373
	59	1740	1373
	39	1769	1373
	32	2091	1373
Sum		50,595.00	49,079.00
n		40.00	40.00
Mean		1,264.88	1,226.97

Puzzle 8

Using your answers above, how do the robust estimates of the mean differ from those based on the raw data?

If we collate our answers from the previous Puzzles it will make it easier to compare the robust estimates:

Estimated mean strength
	Mean strength
	Raw score	20% trimmed	20% winsorized
JIG:SAW	1229.61	1177.09	1186.11
Non-employee	1264.88	1220.29	1226.97

Looking at the means based on the raw scores, we can see that there is not much difference between the mean strength of scientists in the JIG:SAW and non-employee groups; the non-employees were slightly stronger than the JIG:SAW employees, but not by very much. Looking at the 20% trimmed and 20% winsorized means, these robust estimates are smaller than the raw mean by about 40–45 units in the non-employee group, and smaller by about 40–50 units in the JIG:SAW group. In other words, the change in the mean is fairly similar in the two groups, and the differences between the groups have stayed fairly similar (raw mean difference = 35.27, trimmed mean difference = 43.2, winsorized mean difference = 40.67). (You might think that 35.27 is quite different to 43.2, and you’d be correct if the scale of measurement perhaps ranged from 0 to 50, but the strength scores range from 1000 to 2000, and in that context a difference of around 8 is not particularly startling.)

Puzzle 9

Log-transform the JIG:SAW data from Table 9.3 (in the book and reproduced in Puzzle 5).

To log transform the JIG:SAW data we need to take the natural log of each score. You can use software such as Excel, SPSS or R to do this for you. I used R to create this table.

Scores and their log transformations
ID	Strength score $x_i$	Nautural log $\ln{x_i}$	Log (base 10) $\log_{10}{x_i}$
4	1321	7.19	3.12
6	1153	7.05	3.06
9	1072	6.98	3.03
10	1218	7.10	3.09
11	1088	6.99	3.04
15	1373	7.22	3.14
19	1135	7.03	3.05
20	1055	6.96	3.02
21	1096	7.00	3.04
25	1007	6.91	3.00
26	1223	7.11	3.09
27	1291	7.16	3.11
28	1171	7.07	3.07
30	1101	7.00	3.04
32	2091	7.65	3.32
33	1308	7.18	3.12
34	1141	7.04	3.06
35	1433	7.27	3.16
36	1141	7.04	3.06
37	1212	7.10	3.08
39	1769	7.48	3.25
42	1071	6.98	3.03
45	1412	7.25	3.15
49	1214	7.10	3.08
50	1031	6.94	3.01
52	1209	7.10	3.08
57	1222	7.11	3.09
58	1241	7.12	3.09
59	1740	7.46	3.24
61	1367	7.22	3.14
63	1313	7.18	3.12
67	1208	7.10	3.08
70	1257	7.14	3.10
73	1376	7.23	3.14
76	1155	7.05	3.06
77	1065	6.97	3.03
79	1147	7.04	3.06
80	1166	7.06	3.07
81	1566	7.36	3.19
83	1436	7.27	3.16

Puzzle 10

Describe the process of bootstrapping.

Bootstrapping is a technique from which the sampling distribution of a statistic is estimated by taking repeated samples (with replacement) from the data set (in effect, treating the data as a population from which smaller samples are taken). The statistic of interest (e.g., the mean, or b coefficient) is calculated for each sample, from which the sampling distribution of the statistic is estimated. The standard error of the statistic is estimated as the standard deviation of the sampling distribution created from the bootstrap samples. From this process, confidence intervals and significance tests can be computed too.

Last updated on Feb 11, 2022