Combining multiple family-based association studies

Hua Tang 1, Jie Peng 2, Pei Wang 3, Marc Coram 4 and Li Hsu3

 1Department of Genetics, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, California 94305, USA

 2Department of Statistics, University of California, Davis, California 95616, USA

3Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, USA

4 Department of Health Policy Science, Stanford, California 94305, USA

Corresponding author: huatang@stanford.edu

กก

Abstract:

While high-throughput genotyping technologies are becoming readily available, the merit of using these technologies to perform genome-wide association studies has not been established. One major concern is that for studies of complex diseases and traits, the whole-genome approach requires such large sample sizes that both recruitment and genotyping pose considerable challenge. Here we propose a novel statistical method that boosts the effective sample size by combining data obtained from several studies. Specifically, we consider a situation in which various studies have genotyped non-overlapping subjects at largely non-overlapping sets of markers. Our approach, which exploits the local linkage disequilibrium structure without assuming an explicit population model, opens up the possibility of improving statistical power by incorporating existing data into future association studies.

Keywords: linkage disequilibrium, transmission disequilibrium test, imputing
กก