Examining Population Stratification via Individual Ancestry Estimates versus Self-Reported Race

Posted in Articles, Health/Medicine/Genetics, Latino Studies, Media Archive on 2011-12-09 04:43Z by Steven

Examining Population Stratification via Individual Ancestry Estimates versus Self-Reported Race

Cancer Epidemiology, Biomarkers & Prevention
Volume 14, Issue 6 (June 2005)
pages 1545-1551
DOI: 10.1158/1055-9965.EPI-04-0832

Jill S. Barnholtz-Sloan
Cancer Prevention and Control Program
H. Lee Moffitt Cancer Center and Research Institute

Ranajit Chakraborty
Center for Genome Research, Department of Environmental Health
University of Cincinnati, Cincinnati, Ohio

Thomas A. Sellers
Cancer Prevention and Control Program
H. Lee Moffitt Cancer Center and Research Institute

Ann G. Schwartz
Population Studies and Prevention Program, Karmanos Cancer Institute and Department of Internal Medicine
Wayne State University School of Medicine, Detroit, Michigan

Population stratification has the potential to affect the results of genetic marker studies. Estimating individual ancestry provides a continuous measure to assess population structure in case-control studies of complex disease, instead of using self-reported racial groups. We estimate individual ancestry using the Federal Bureau of Investigation CODIS Core short tandem repeat set of 13 loci using two different analysis methods in a case-control study of early-onset lung cancer. Individual ancestry proportions were estimated for “European” and “West African” groups using published allele frequencies. The majority of Caucasian, non-Hispanics had >50% European ancestry, whereas the majority of African Americans had <20% European ancestry, regardless of ancestry estimation method, although significant overlap by self-reported race and ancestry also existed. When we further investigated the effect of ancestry and self-reported race on the frequency of a lung cancer risk genotype, we found that the frequency of the GSTM1 null genotype varies by individual European ancestry and case-control status within self-reported race (particularly for African Americans). Genetic risk models showed that adjusting for individual European ancestry provided a better fit to the data compared with the model with no group adjustment or adjustment for self-reported race. This study suggests that significant population substructure differences exist that self-reported race alone does not capture and that individual ancestry may be confounded with disease status and/or a candidate gene risk genotype.

Read the entire article here.

Tags: , , , , , , ,

Comparing Genetic Ancestry and Self-Described Race in African Americans Born in the United States and in Africa

Posted in Articles, Health/Medicine/Genetics, Media Archive, United States on 2011-12-09 02:58Z by Steven

Comparing Genetic Ancestry and Self-Described Race in African Americans Born in the United States and in Africa

Cancer Epidemiology, Biomarkers & Prevention
Volume 17, Issue 6 (June 2008)
pages 1329-1338
DOI: 10.1158/1055-9965.EPI-07-2505

Rona Yaeger
Herbert Irving Comprehensive Cancer Center

Alexa Avila-Bront
Department of Medicine
College of Physicians and Surgeons of Columbia University

Kazeem Abdul
Herbert Irving Comprehensive Cancer Center

Patricia C. Nolan
Department of Medicine
College of Physicians and Surgeons of Columbia University

Victor R. Grann
Department of Medicine
College of Physicians and Surgeons of Columbia University

Mark G. Birchette
Department of Biology
Long Island University, Brooklyn, New York

Shweta Choudhry
Department of Biopharmaceutical Sciences and Medicine
University of California-San Francisco, San Francisco, California

Esteban G. Burchard
Department of Biopharmaceutical Sciences and Medicine
University of California-San Francisco, San Francisco, California
 
Kenneth B. Beckman
Children’s Hospital Oakland Research Institute, Oakland, California

Prakash Gorroochurn
Department of Biostatistics
Columbia University Medical Center, New York, New York

Elad Ziv
Division of General Internal Medicine
University of California-San Francisco, San Francisco, California

Nathan S. Consedine
Department of Psychology
Long Island University, Brooklyn, New York

Andrew K. Joe
Herbert Irving Comprehensive Cancer Center

Genetic association studies can be used to identify factors that may contribute to disparities in disease evident across different racial and ethnic populations. However, such studies may not account for potential confounding if study populations are genetically heterogeneous. Racial and ethnic classifications have been used as proxies for genetic relatedness. We investigated genetic admixture and developed a questionnaire to explore variables used in constructing racial identity in two cohorts: 50 African Americans and 40 Nigerians. Genetic ancestry was determined by genotyping 107 ancestry informative markers. Ancestry estimates calculated with maximum likelihood estimation were compared with population stratification detected with principal components analysis. Ancestry was approximately 95% west African, 4% European, and 1% Native American in the Nigerian cohort and 83% west African, 15% European, and 2% Native American in the African American cohort. Therefore, self-identification as African American agreed well with inferred west African ancestry. However, the cohorts differed significantly in mean percentage west African and European ancestries (P < 0.0001) and in the variance for individual ancestry (P ≤ 0.01). Among African Americans, no set of questionnaire items effectively estimated degree of west African ancestry, and self-report of a high degree of African ancestry in a three-generation family tree did not accurately predict degree of African ancestry. Our findings suggest that self-reported race and ancestry can predict ancestral clusters but do not reveal the extent of admixture. Genetic classifications of ancestry may provide a more objective and accurate method of defining homogenous populations for the investigation of specific population-disease associations.

Introduction

Genome-wide case-control association studies provide a powerful tool for investigating possible genetic factors that may contribute to the health disparities observed among different racial and ethnic populations. Populations with different ancestral backgrounds may carry different genetic variants, and these may contribute to the variations in disease incidence and outcomes seen in specific racial and ethnic groups (1). Association studies can most easily identify disease-associated alleles when study groups are genetically similar, sharing a similar ancestral background (2). However, individual ancestry is not an easily assayed, simple category; consequently, race continues to be used as a proxy for genetic relatedness in clinical and other biological studies (3-6). There is currently no consensus on how best to examine or characterize different racial or ethnic groups when designing and conducting such studies.

Two main approaches have been used to approximate individual ancestry in biological studies: (a) using self identified race and ethnicity, which may capture common environmental influences as well as ancestral background, and (b) genotyping a panel of markers that show large frequency differentials between major geographic ancestral groupings (7, 8). Both approaches have limitations. Self-identified racial categories may not always consistently predict ancestral population clusters, and evidence suggests that it may take large sample sizes and numerous markers to describe genetic clusters that correspond to self-identified race and ethnicity groupings (9-11). Racial categories are also imprecise and inconsistent, because they may potentially vary within the same individual over time (12, 13). Furthermore, their use risks reinforcing racial divisions in society. On the other hand, more objective analyses that genotype markers that are highly informative for ancestry may not be economically practical and are limited by the requirement of serum or fresh tissue for DNA extraction. Genetically determined ancestry may not capture unmeasured social factors that may affect differences in health outcomes. There are also unique ethical challenges when linking biological phenotypes with genetic markers for specific racial groups, and caution must always be used when attributing biological differences (e.g., disease risk and treatment response) to different populations.

Understanding the ancestral background of study subjects is most important in genetic studies of admixed populations, such as African Americans, who represent an admixture of Africans, Europeans, and Native Americans (14). Genetic studies have shown that African Americans form a diverse group with percent European admixture estimated to range between 7% and 23% (14-16). Genotyping of self-identified African Americans participating in the Cardiovascular Health Study revealed that among self-reported Africans there are differences in genetic ancestry that are correlated with some clinically important endpoints (15).

…Discussion…

The African American cohort in our study had a mean of 15% European admixture, which is consistent with previous reports of a range of 7% to 23% European admixture among U.S. African Americans (14-16). Of note, the estimates of 4% European and 1% Native American ancestry in the Nigerian population is likely due to bias in MLE due to the limited number of markers. We found that among participants there was a significantly higher proportion of admixture and higher variability in admixture proportions in the U.S.-born African American cohort compared with a population that emigrated from Africa (that is, Nigerians; Table 3). The significant variation in individual ancestry estimates among the African American cohort suggests that this group, like the Cardiovascular Health Study African American cohort (15), represents a diverse population consisting of several subpopulations. For participation in the African American cohort, subjects identified both parents as African Americans who were born in the United States. Although data regarding grandparental race were not used to screen study participation, these data were collected through a three-generation family tree during administration of the questionnaire. In this study population, all African American subjects described that the race of at least three of their four grandparents was consistent with African ancestry. Individuals and society have historically classified children of mixed-race ancestry as African American, even when one parent is Caucasian, Asian, or Native American. For African Americans, this is a remnant of the ‘‘Jim Crow’’ laws and the ‘‘One Drop’’ rule or ‘‘Rule of Hypodescent.’’ Thus, identification as African American would still occur in cases where the parents and grandparents were of mixed-race ancestry. This could also contribute to the greater European admixture and greater admixture variability seen in the African American cohort…

Read the entire article here.

Tags: , , , , , , , , , , , , , , ,