STATA output for tables and figures used in the reanalysis of The Bell Curve data presented at this web site.

. use #NATION.TXT

TABLE 1

. tab race4z samplez

           |             Sample
     Race4 |  Military   Suppleme   XSection |     Total
-----------+---------------------------------+----------
     Black |       251       2172        751 |      3174
      Hisp |        78       1480        444 |      2002
     Other |        80        176        313 |       569
     White |       871       1467       4603 |      6941
-----------+---------------------------------+----------
     Total |      1280       5295       6111 |     12686

tab class samplez

. gen class=1 if how_miss==1
(4384 missing values generated)
#4384 cases with missing values + 8302 without = 12686

. recode class 1=11 if race4z==4
(4613 changes made)

. recode class 1=2 if eschl==1
(3547 changes made)

. recode class 11=22 if eschl==1
(4492 changes made)

. gen classify=1 if class==22 & samplez==3
(9319 missing values generated)

. recode classify .=2 if class==22 & samplez==2
(1067 changes made)

. tab classify class

CASES WITHOUT MISSING VALUES BY SUBSAMPLE & ANALYSIS STATUS

           |             Sample
     class |  Military   Suppleme   XSection |     Total
-----------+---------------------------------+----------
         1 |         1         93         48 |       142   #not white; students
         2 |        74       2475        998 |      3547   #not white; non-students
        11 |         5         20         96 |       121   #white; students
        22 |        58       1067       3367 |      4492   #white; non-students
-----------+---------------------------------+----------
     Total |       138       3655       4509 |      8302

FIGURE 1

  classify |
-----------+--------
         1 |   3367 | #sample HM used in POVERTY analysis
         2 |   1067 | #comparable supplemental subsample white
-----------+--------
               4434
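The gen/recode sequence above can be read as a small decision rule. The following is a minimal Python sketch of that rule, not part of the original Stata session. It assumes, going by the # annotations and the counts in the tables, that how_miss==1 marks cases without missing values, race4z==4 marks whites, eschl==1 marks non-students, and samplez values 2 and 3 are the supplemental and cross-section samples. The function names class_code and classify_code are hypothetical.

```python
from typing import Optional


def class_code(how_miss: int, race4z: int, eschl: int) -> Optional[int]:
    """Mirror the gen/recode steps for `class`; None stands in for Stata's missing."""
    if how_miss != 1:
        return None      # gen class=1 if how_miss==1 leaves all other cases missing
    code = 1
    if race4z == 4:
        code = 11        # recode class 1=11 if race4z==4   (assumed: 4 = white)
    if code == 1 and eschl == 1:
        code = 2         # recode class 1=2 if eschl==1     (assumed: 1 = non-student)
    if code == 11 and eschl == 1:
        code = 22        # recode class 11=22 if eschl==1
    return code


def classify_code(cls: Optional[int], samplez: int) -> Optional[int]:
    """Mirror the gen/recode steps for `classify`."""
    if cls == 22 and samplez == 3:
        return 1         # white non-students, cross-section sample
    if cls == 22 and samplez == 2:
        return 2         # white non-students, supplemental sample
    return None          # every other case stays missing


# Example: a white non-student with complete data in the cross-section sample
print(classify_code(class_code(how_miss=1, race4z=4, eschl=1), samplez=3))
```

Under these assumptions only class 22 cases from samplez 2 or 3 receive a classify value, which matches the 3367 and 1067 counts shown under FIGURE 1.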

FIGURE 2

. gr zage , hist by (classify) normal
. gr zses , hist by (classify) normal
. gr zafqt89 , hist by (classify) normal

FIGURE 3

. gr zses zafqt89 , by (classify)
. gr zafqt89 zage , by (classify)
. gr zafqt89 zses , by (classify)

FIGURE 4

. gr zage zafqt89 zses , box by (classify) yline

. table classify, c (mean zafqt89 n zafqt89 sd zafqt89) format(%9.2f)

----------+--------------------------------------------
 classify |  mean(zafqt89)    N(zafqt89)   sd(zafqt89)
----------+--------------------------------------------
        1 |           0.22          3367          0.90
        2 |          -0.07          1067          1.03
----------+--------------------------------------------

. table classify, c (mean zses n zses sd zses) format(%9.2f)

----------+-----------------------------------
 classify |  mean(zses)    N(zses)   sd(zses)
----------+-----------------------------------
        1 |        0.22       3367       0.87
        2 |       -0.42       1067       1.06
----------+-----------------------------------

. table classify, c (mean zage n zage sd zage) format(%9.2f)

----------+-----------------------------------
 classify |  mean(zage)    N(zage)   sd(zage)
----------+-----------------------------------
        1 |       -0.09       3367       0.97
        2 |        0.19       1067       1.02
----------+-----------------------------------

. table classify, c (mean kwage89 n kwage89 sd kwage89) format(%9.2f)

----------+--------------------------------------------
 classify |  mean(kwage89)    N(kwage89)   sd(kwage89)
----------+--------------------------------------------
        1 |       20242.00          3366      18580.96
        2 |       16598.30          1067      18097.82
----------+--------------------------------------------

. table classify, c (mean pov89 n pov89 sd pov89) format(%9.2f)

----------+--------------------------------------
 classify |  mean(pov89)    N(pov89)   sd(pov89)
----------+--------------------------------------
        1 |         0.07        3367        0.26
        2 |         0.15        1067        0.35
----------+--------------------------------------

TABLE 2

Sample HM used in POVERTY analysis

. logistic pov89 zafqt89 zses zage if classify==1

Iteration 0:  Log Likelihood =-875.34188
Iteration 1:  Log Likelihood =-796.51977
Iteration 2:  Log Likelihood =-784.57788
Iteration 3:  Log Likelihood =-784.40188
Iteration 4:  Log Likelihood =-784.40179

Logit Estimates                                   Number of obs =   3367
                                                  chi2(3)       = 181.88
                                                  Prob > chi2   = 0.0000
Log Likelihood = -784.40179                       Pseudo R2     = 0.1039

------------------------------------------------------------------------------
   pov89 |      Coef.  Std. Err.          z   P>|z|       [95% Conf. Interval]
---------+--------------------------------------------------------------------
 zafqt89 |  -.8376652   .0935074     -8.958   0.000      -1.020936    -.654394
    zses |  -.3300791   .0901006     -3.663   0.000      -.5066731   -.1534851
    zage |  -.0238392   .0723743     -0.329   0.742      -.1656903    .1180119
   _cons |  -2.648768    .076882    -34.452   0.000      -2.799454   -2.498082
------------------------------------------------------------------------------

. logistic pov89 zafqt89 zses zage if race4z==4 & eschl==1 & how_miss==1 & samplez==3

Logit Estimates                                   Number of obs =   3367
                                                  chi2(3)       = 181.88
                                                  Prob > chi2   = 0.0000
Log Likelihood = -784.40179                       Pseudo R2     = 0.1039

------------------------------------------------------------------------------
   pov89 | Odds Ratio  Std. Err.          z   P>|z|       [95% Conf. Interval]
---------+--------------------------------------------------------------------
 zafqt89 |   .4327196   .0404625     -8.958   0.000       .3602574    .5197569
    zses |   .7188669   .0647704     -3.663   0.000       .6024967    .8577136
    zage |   .9764427   .0706694     -0.329   0.742       .8473086    1.125257
------------------------------------------------------------------------------

. lstat
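The two TABLE 2 outputs report the same fitted model on two scales: each odds ratio is the exponential of the corresponding coefficient. The fit statistics also follow from the iteration log, taking the iteration 0 log likelihood as the constant-only model: chi2 is twice the gain in log likelihood, and Pseudo R2 is one minus the ratio of the two log likelihoods. The following Python sketch, not part of the original Stata session, reproduces those figures from the numbers printed above. The variable names are chosen here for illustration only.

```python
import math

# Coefficients copied from the first TABLE 2 output
coefs = {"zafqt89": -0.8376652, "zses": -0.3300791, "zage": -0.0238392}

# Odds ratio = exp(coefficient); compare with the second TABLE 2 output
odds_ratios = {name: math.exp(b) for name, b in coefs.items()}

# Log likelihoods copied from the iteration log
ll_null = -875.34188   # iteration 0, taken here as the constant-only model
ll_full = -784.40179   # final iteration

lr_chi2 = 2 * (ll_full - ll_null)    # likelihood-ratio chi-square, 3 df
pseudo_r2 = 1 - ll_full / ll_null    # reported by Stata as Pseudo R2

for name, ratio in odds_ratios.items():
    print(f"{name:8s} odds ratio = {ratio:.7f}")
print(f"chi2(3) = {lr_chi2:.2f}   Pseudo R2 = {pseudo_r2:.4f}")
```

The same exponential transform applied to the coefficient confidence limits gives the odds-ratio confidence limits.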

TABLE 3

Logistic model for pov89

              -------- True --------
Classified |         D            ~D  |      Total
-----------+--------------------------+-----------
     +     |         0             3  |          3
     -     |       244          3120  |       3364
-----------+--------------------------+-----------
   Total   |       244          3123  |       3367

Classified + if predicted Pr(D) >= .5
True D defined as pov89 ~= 0
--------------------------------------------------
Sensitivity                     Pr( +| D)    0.00%
Specificity                     Pr( -|~D)   99.90%
Positive predictive value       Pr( D| +)    0.00%
Negative predictive value       Pr(~D| -)   92.75%
--------------------------------------------------
False + rate for true ~D        Pr( +|~D)    0.10%
False - rate for true D         Pr( -| D)  100.00%
False + rate for classified +   Pr(~D| +)  100.00%
False - rate for classified -   Pr( D| -)    7.25%
--------------------------------------------------

. use ":Hard Disk:Desktop Folder:this is it:dater:newbell2.dta", clear

Comparable Supplemental subsample whites

. logistic pov89 zafqt89 zses zage if classify==2

Logit Estimates                                   Number of obs =   1067
                                                  chi2(3)       = 147.55
                                                  Prob > chi2   = 0.0000
Log Likelihood = -371.92886                       Pseudo R2     = 0.1655

------------------------------------------------------------------------------
   pov89 | Odds Ratio  Std. Err.          z   P>|z|       [95% Conf. Interval]
---------+--------------------------------------------------------------------
 zafqt89 |   .3250798   .0429961     -8.496   0.000       .2508462    .4212817
    zses |    .848368   .1038151     -1.344   0.179       .6674557    1.078316
    zage |   1.025484   .1005745      0.257   0.798       .8461495    1.242827
------------------------------------------------------------------------------

TABLE 4

. lstat

Logistic model for pov89

              -------- True --------
Classified |         D            ~D  |      Total
-----------+--------------------------+-----------
     +     |        13            21  |         34
     -     |       144           889  |       1033
-----------+--------------------------+-----------
   Total   |       157           910  |       1067

Classified + if predicted Pr(D) >= .5
True D defined as pov89 ~= 0
--------------------------------------------------
Sensitivity                     Pr( +| D)    8.28%
Specificity                     Pr( -|~D)   97.69%
Positive predictive value       Pr( D| +)   38.24%
Negative predictive value       Pr(~D| -)   86.06%
--------------------------------------------------
False + rate for true ~D        Pr( +|~D)    2.31%
False - rate for true D         Pr( -| D)   91.72%
False + rate for classified +   Pr(~D| +)   61.76%
False - rate for classified -   Pr( D| -)   13.94%
--------------------------------------------------
Correctly classified                        84.54%
--------------------------------------------------
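The lstat percentages in TABLE 3 and TABLE 4 follow directly from the four cell counts of each classification table. The following Python sketch, not part of the original Stata session, recomputes them. The function name classification_summary and its argument names are hypothetical. The "correct" entry for TABLE 3 is computed here the same way as for TABLE 4; the posted TABLE 3 output does not show that row.

```python
def classification_summary(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Percentages from a 2x2 table of classified (+/-) by true (D/~D) status.

    tp: classified +, true D     fp: classified +, true ~D
    fn: classified -, true D     tn: classified -, true ~D
    """
    total = tp + fp + fn + tn
    return {
        "sensitivity": 100 * tp / (tp + fn),   # Pr( +| D)
        "specificity": 100 * tn / (tn + fp),   # Pr( -|~D)
        "ppv": 100 * tp / (tp + fp),           # Pr( D| +)
        "npv": 100 * tn / (tn + fn),           # Pr(~D| -)
        "correct": 100 * (tp + tn) / total,    # share classified correctly
    }


# Cell counts copied from the two classification tables
table3 = classification_summary(tp=0, fp=3, fn=244, tn=3120)
table4 = classification_summary(tp=13, fp=21, fn=144, tn=889)

for label, stats in (("TABLE 3", table3), ("TABLE 4", table4)):
    print(label, {name: f"{value:.2f}%" for name, value in stats.items()})
```

In TABLE 3 only 3 of 3367 cases are classified as poor at the .5 cutoff and none of them are actually poor, which is why sensitivity is 0.00% even though specificity is close to 100%.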



STATA program for reading NATION.TXT

Comparison with posted codebook