This file explains the contents of the answers.tar.gz file. Contact Mike Miller if you have any questions about these data. See the Problem3_answers.* files for descriptions of the model. After uncompressing and extracting contents of the answers.tar.gz file you will have a directory called 'answers' (116 MB) that contains files with names like these: answers.0001.ASP.ped answers.0001.CONTROLS.ped There are 100 ASP files (for affected sibling pair nuclear families) and 100 CONTROLS files (for control subjects). These correspond to the 100 replicates of GAW 15 Problem 3 data, but many of the variables avaible in these answers files were not given to other GAW participants. The answer data include trait locus genotypes, fully informative markers at trait locus locations, and several latent variables. Data fields (columns) are separated by a single space. The table below shows which variables are in which fields. All genotypes use two columns and the paternally-inherited allele is always on the left. Columns 41-56 are available only for ASP and not for CONTROLS because they are not useful in CONTROLS. Fields Variable ------ ---------------------------------------------------------------- 1 Family ID 2 Individual ID 3 Father's ID 4 Mother's ID 5 Sex (1=M, 2=F) 6 Rheumatoid arthritis affection status (2=affected, 1=unaffected) 7 Dead (1=dead, 0=not dead) 8 Age at ascertainment (in years) 9 Lifetime smoking (1=smoked, 0=never smoked) 10 Anti-CCP 11 IgM 12 RA Severity (1 to 5; 1=mild, 5=severe) 13 DR allele from father 14 DR allele from mother 15 Latent Hazard Variable 16 Latent Severity (normal mixture) 17 Age at death (only for parents -- given even if death hasn't happened yet) 18 Latent anti-CCP value (normal mixture) 19 Latent IgM (normal mixture) 20-21 DR/C haplotype (1: X/c, 4: X/C, 5: 1/C, 6: 4/C) 22-23 DR Genotype 24-25 Locus A chr 16, 26.3 cM ; Allele 2 is high-risk 26-27 Locus B chr 8, 170.9 cM ; Allele 2 is high-risk 28-29 Locus C chr 6, 49.5 cM ; Allele 2 is high-risk 30-31 Locus D chr 6, 54.6 cM ; Allele 2 is high-risk 32-33 Locus E chr 18, 94.3 cM ; Allele 2 is high-risk 34-35 Locus F chr 11, 115.3 cM ; Allele 2 is high-risk 36-37 Locus G chr 9, 49.4 cM ; Allele 2 is high-risk 38-39 Locus H chr 9, 51.4 cM ; Allele 2 is high-risk 40-41 fully informative marker at Locus A 42-43 fully informative marker at Locus B 44-45 fully informative marker at Locus C 46-47 fully informative marker at Locus D 48-49 fully informative marker at Locus E 50-51 fully informative marker at Locus F 52-53 fully informative marker at Locus G 54-55 fully informative marker at Locus H