We've updated our privacy policy.

Export of data dictionary into CSV

2 posts
Was this reply useful? Learn more...
 
[-]
on9cloud +0 points · 9 months ago

I am trying to export for each NSRR study the data dictionary.

I used the nsrr R package to get a nice list.

library(nsrr)

df = nsrr_datasets()

a = nsrr_dataset_files("shhs", path = "datasets")

I also see the .json files on github. e.g., https://github.com/nsrr/heartbeat-data-dictionary

The repo shows an option to export all variables into a single CSV file. (using spout). I would rather avoid ruby and the spout tool.

I don't need the actual study data for 14 studies - only the data dictionaries. (which have no sensitive data)

How can obtain CSV dictionaries for all NSRR studies. (either as 15 files or as one HUGE file with column study)

Also, has there been any effort to "tag" corresponding variables across the studies? E.g., sex at birth being a very common variable (=data element)

222 posts
bio
Was this reply useful? Learn more...
 
[-]
mrueschman +0 points · 9 months ago

The data dictionary CSV files are publicly available for each dataset. You'll find them in the Files area under the datasets folder, e.g.

The data dictionary files you see (domains, forms, variables) represent what you get from running the Spout export.

Tags as you describe do not exist, though we do have some data harmonization enhancements in mind for the future.

Thanks for your interest in the site!

2 posts
Was this reply useful? Learn more...
 
[-]
on9cloud +0 points · 9 months ago

Thank you. Your reply enabled us to create an aggregate DD (=data dictionary).

The goal of our funded project tries to do exactly such harmonization. Your data sharing platform is very modern and we are big fans!

If you would be interested, we would love to collaborate perhaps. Even if you would just take time to review our results on your variables/studies and give us our opinion about it. Our project is described here: https://github.com/lhncbc/CDE/tree/master/hiv/#aims

Our data sharing statement set of recomendations are here https://github.com/lhncbc/CDE/tree/master/CONSIDER and short list of CDEs here https://github.com/lhncbc/CDE/tree/master/IGNITE

222 posts
bio
Was this reply useful? Learn more...
 
[-]
mrueschman +0 points · 9 months ago

I'm glad that helped. Thank you for the kind words. I will share your links with other members of the team and bring it up at our next group meeting.

We are familiar with CDEs and have toyed around a bit with linking NSRR data back to established CDEs. What you describe in your aims for HIV/AIDS sounds very similar to what we have contemplated for studies/trials of sleep.

Topic is locked. Start a new topic