We use cookies and other tools to enhance your experience on our website and to analyze our web traffic.
For more information about these cookies and the data collected, please refer to our Privacy Policy.

SOF ages of "G" or "H" (variable v8age)

7 posts
Was this reply useful? Learn more...
matthewbutler +0 points · about 8 years ago

Hi All,

There are 6 subjects in the SOF dataset (sof-visit-8-dataset-0.3.0.csv) with ages (variable v8age) coded as "G" or "H." Any idea what this means? I have not yet tried to cross reference to data from SOF Online (http://sof.ucsf.edu/interface/Introduction.asp).

  1. In the online histogram, these show up at n=6 in the age 0-8 bin.

  2. In the data dictionary, there is no indication of non-numeric codes (sof-data-dictionary-0.3.1-variables.csv):

Administrative v8age Age numeric years

52 posts
Was this reply useful? Learn more...
remomueller +0 points · about 8 years ago

Hi Matthew,

These look like missing codes to me, which are typically removed during the SAS to CSV export for datasets. I'll follow up with @mrueschman to find more documentation on sof:v8age values.

I've also created GitHub Issue #35 to keep track of this.

Thanks for reporting!

446 posts
Was this reply useful? Learn more...
mrueschman +0 points · about 8 years ago

Yep, these are missing codes set by the UCSF group. These were originally SAS missing codes (i.e. .G, .H), which come out as characters in the CSV exports. G and H correspond to values that were scrubbed at the low and high extremes. Unfortunately, we won't be able to get the ages of these subjects.

We will clarify in the next version of the data dictionary. Thanks!