Cornell University Cornell University CISER

Cornell Restricted Access Data Center

What is CRADC?

The Cornell Restricted Access Data Center (CRADC) was established in October 1999 as a NSF-sponsored pilot site for providing secure access to confidential research data. In May 2005, the Office of the Vice-Provost for Research designated CRADC as the University Custodian of Restricted Access Data Sets. The center provides Cornell researchers assistance in negotiating restricted-use data agreements with the data providers and it operates a state-of-the-art secure computing system with user friendly computing environment to house and conduct research using those data.

Under the guidelines from the Cornell Office of Sponsored Projects, the data placed on the CRADC computing system may be public use, licensed, restricted access, or HIPAA-designated Protected Health Information (PHI). Restricted access data housed on the CRADC system are provided to Cornell under agreements that are monitored by the Office of Sponsored Programs and by the Institutional Review Board for Human Participants. Protected Health Information consists of medical, treatment, clinical, and insurance records generated by medical, wellness, benefit, and insurance providers (including private employers). As a general policy, public use and licensed data will not be placed on the CRADC computing system; however, they may be occasionally placed there for the convenience of CRADC users.

[Definitions: "Restricted Access Data" - data provided by an authorized supplier that must be used with formal confidentiality protections specified in a data provider agreement. "Licensed data" - data provided by an authorized supplier under a data provider agreement that is used to protect the provider's intellectual property (copyright).]

The CRADC for your Restricted-Use Data

The Cornell Restricted Access Data Center (CRADC), operated by the Cornell Institute for Social and Economic Research (CISER), is the designated University Custodian of Restricted Access Data Sets. The center provides Cornell researchers the following services:

Assistance in negotiating restricted-use data agreements with data providers

Access to a secure computing facility for housing restricted-use data

Access to sophisticated statistical computing tools in a secure computing environment

Assistance in negotiating restricted-use data agreements with data providers -

  • CRADC has been instrumental in acquisition of several restricted-use data sets for the use by Cornell scientific community. It serves as the data custodian for all the data housed on its computing systems and implements all measures necessary to maintain the security of not only the original data but also the reports generated from these data (all reports are disclosure-proofed before their release to a researcher for distribution beyond CRADC). Currently, CRADC houses the restricted-use data from providers such as US Bureau of Labor Statistics, US Equal Employment Opportunity Commission data, European Community Household Panel data (ECHP), NICHD Study of Early Child Care, UNC National Longitudinal Study of Adolescent Health (AddHealth), UNC China Health and Nutrition Survey Data (CHNS), ISR Panel Study of Income Dynamics (PSID), ICPSR Community Tracking Study Physician Survey, CMS Medicare Current Beneficiary Survey, SUNY Office of Institutional Research and Analysis Data, French Linked Employer-Employee data, NLSY data, Washington State data, United States Linked Employer-Employee data, Quarterly Workforce Indicators Data for States, Fragile Families and Child Wellbeing Study Data, Andrew W. Mellon Foundation College and Beyond Dataset, and Andrew W. Mellon Foundation Graduate Education Data. Also coming soon: The Health and Retirement Study (HRS) Restricted Data.
  • Contact the CRADC Manager for potential restricted-use agreements for your research.

Access to a secure computing facility for housing restricted-use data -

  • CRADC maintains a secure computing system, a Windows domain that exceeds the U.S. Defense Department C-2 standards for trusted computing environments. The system is remotely accessible using a Remote Desktop Connection or a Terminal Services Client and the domain controller employs user-based authentication. The system enforces strict guidelines for selecting user passwords, and requires users to change their passwords periodically. The system does not permit connection to the outside world via FTP, E-mail, Web, Print, or disk mapping facility. The system Data Custodians may remove non-confidential summary data and programming at the request of an authorized user after verifying that the files to be removed comply with the data use agreement governing the confidential data used. All CRADC users, system administrators, and custodians, are required to sign an appropriate CRADC Computing System Data User Agreement. Data providers are required to certify their authority to provide the data and to formalize the relationship with CRADC by executing a data provider agreement.
  • CRADC computing accounts are limited to those using restricted-use data for their research. To apply for an account contact the CRADC Manager.

Access to sophisticated statistical computing tools in a secure computing environment -

  • The compute nodes in the CRADC system include a number of sophisticated software packages for data analysis, as well as other tools for organizing researchers' work. Currently, installed software packages include multi-processor enabled versions of SAS, Matlab, Compaq Visual Fortran V6, and MPIPro; and single-processor enabled versions of ASReml, aML, Atlas-ti, Stata SE, SPSS, Limdep/NLogit, GLIM, Genstat, Gauss, and eViews. There are also data conversion software (StatTransfer) and other tools like TextPad, Microsoft Office, Scientific Workplace, and Adobe Acrobat. All software is installed so that temporary files created by the application are saved in the data-user's private disk and not in areas where unauthorized users may have access.
  • CRADC computing accounts are limited to those using restricted-use data for their research. To apply for an account contact the CRADC Manager.
  • note: Researchers who wish to have access to these tools for working with non-resticted data may obtain accounts on the CISER's general computing system.

Contact:

Pinky Chandra
Manager, CRADC
cradc_custodian@cornell.edu
Ph: 607-255-2217
Fax: 607-255-9353