SOF ages of "G" or "H" (variable v8age)

matthewbutler +0 points · almost 5 years ago

Hi All,

There are 6 subjects in the SOF dataset (sof-visit-8-dataset-0.3.0.csv) with ages (variable v8age) coded as "G" or "H." Any idea what these codes mean? I have not yet tried to cross-reference them against the data from SOF Online (http://sof.ucsf.edu/interface/Introduction.asp).

  1. In the online histogram, these show up as n=6 in the age 0-8 bin.

  2. In the data dictionary, there is no indication of non-numeric codes (sof-data-dictionary-0.3.1-variables.csv):

Administrative v8age Age numeric years

remomueller +0 points · almost 5 years ago

Hi Matthew,

These look like missing codes to me, which are typically removed during the SAS-to-CSV export for datasets. I'll follow up with @mrueschman to find more documentation on sof:v8age values.

I've also created GitHub Issue #35 to keep track of this.

Thanks for reporting!

mrueschman +0 points · almost 5 years ago

Yep, these are missing codes set by the UCSF group. They were originally SAS missing codes (i.e., .G and .H), which come out as the characters "G" and "H" in the CSV exports. G and H correspond to values that were scrubbed at the low and high extremes. Unfortunately, we won't be able to get the ages of these subjects.
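In the meantime, here is a rough sketch of one way to handle these codes when reading the CSV in Python. It is only an illustration, not an official recipe: it assumes "G" and "H" are the only non-numeric values in v8age, the `id` column and the sample rows are made up, and `parse_age` and `load_ages` are hypothetical helper names.

```python
import csv
import io

# Assumption: "G" and "H" are the only non-numeric codes in v8age,
# as described in this thread (SAS missing codes .G and .H).
MISSING_CODES = {"G", "H"}


def parse_age(raw):
    """Return (age, code): age is a float or None, code is "G"/"H" or None."""
    value = raw.strip()
    if value in MISSING_CODES:
        # Scrubbed value: keep the code so it is not mistaken for a number.
        return None, value
    if value == "":
        # Ordinary blank cell: missing, with no special code.
        return None, None
    return float(value), None


def load_ages(handle, column="v8age"):
    """Read an open CSV file object and parse the age column row by row."""
    reader = csv.DictReader(handle)
    return [parse_age(row[column]) for row in reader]


# Made-up rows for illustration only; these are not real SOF data.
sample = io.StringIO("id,v8age\n1,84\n2,G\n3,H\n4,\n")
ages = load_ages(sample)
print(ages)
```

For the real file, the same function should work on a handle from `open("sof-visit-8-dataset-0.3.0.csv", newline="")`. Keeping the code next to the missing age lets you count or exclude the scrubbed subjects explicitly instead of having them land in a numeric bin.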

We will clarify in the next version of the data dictionary. Thanks!

