Skip Navigation


American Journal of Epidemiology Advance Access originally published online on December 20, 2006
American Journal of Epidemiology 2007 165(5):597-601; doi:10.1093/aje/kwk049
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
165/5/597    most recent
kwk049v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Howe, H. L.
Right arrow Articles by Shen, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Howe, H. L.
Right arrow Articles by Shen, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

American Journal of Epidemiology Copyright © 2006 by the Johns Hopkins Bloomberg School of Public Health All rights reserved; printed in U.S.A.

PRACTICE OF EPIDEMIOLOGY

Method to Assess Identifiability in Electronic Data Files

Holly L. Howe1, Andrew J. Lake2 and Tiefu Shen3

1 North American Association of Central Cancer Registries, Inc., Springfield, IL
2 Information Management Services, Inc., Silver Spring, MD
3 Illinois State Cancer Registry, Springfield, IL

Reprint requests to Dr. Holly L. Howe, 2121 West White Oaks Drive, Springfield, IL 62704-6495 (e-mail: hhowe{at}naaccr.org).

Received for publication December 19, 2005. Accepted for publication July 28, 2006.

The authors developed the Record Uniqueness (RU) software program to assess electronic data files for risk of confidentiality breach based on unique combinations of key variables. The underlying methodology utilized by the RU program generates a frequency distribution for every variable selected for analysis and for all combinations of the variables selected. In addition, the program provides the regression coefficient that designates the relative contribution of each variable to the unique records on the data file. The authors used RU to evaluate a North American Association of Central Cancer Registries research data set with 4.67 million cases from 34 population-based cancer registries for 1995–2001. To illustrate the process and utility of RU, they describe the evaluation process of the confidentiality risk of adding a county-based socioeconomic measure to the research file. The RU method enables one to be assured of record confidentiality, provides flexibility to adjust record uniqueness thresholds for different users or purposes of data release, and facilitates good stewardship of confidential data balanced with maximum use and release of information for research. RU is a useful data tool that can quantify the risk of confidentiality breach of electronic health databases, including reidentifiability of cases through triangulation of information or linkage with other electronic databases.

confidentiality; medical informatics; neoplasms; privacy; regression analysis; social class


Abbreviations: NAACCR, North American Association of Central Cancer Registries; RU, Record Uniqueness; SEER, Surveillance, Epidemiology, and End Results


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Qual Health ResHome page
K. Kaiser
Protecting Respondent Confidentiality in Qualitative Research
Qual Health Res, November 1, 2009; 19(11): 1632 - 1641.
[Abstract] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.