GWAS and fine-mapping of 35 production, reproduction, and conformation traits with imputed sequences of 27K Holstein bulls

J. Jiang1*, P.M. VanRaden2, J.B. Cole 2, Y. Da3 and L. Ma1

1Department of Animal and Avian Sciences, University of Maryland, College Park, MD
2Animal Genomics and Improvement Laboratory, ARS, USDA, Beltsville, MD
3Department of Animal Science, University of Minnesota, St. Paul, MN 55108


2018 J. Dairy Sci. (?)
© American Dairy Science Association, 2018. All rights reserved.
Individuals may download, store, or print single copies solely for personal use.
Do not share personal accounts or passwords for the purposes of disseminating this article.
 

ABSTRACT

Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on reference populations of sequenced animals. With the implementation of the 1000 Bull Genomes Project and increasing numbers of animals sequenced, fine-mapping of causal variants is becoming feasible for complex traits in cattle. Using the 1000 Bull Genomes data, we imputed three million selected sequence variants to 27,000 Holstein bulls after quality control edits and LD pruning. These bulls were selected to have highly reliable breeding values (PTAs) for 35 production, reproduction, and body conformation traits. We first performed whole-genome single-marker scan for the 35 traits using the mixed-model based association test in MMAP (https://mmap.github.io). The single-trait association statistics were then merged in multi-trait analyses of three groups of traits, production, reproduction, and body conformation, respectively. Two-Mb long candidate genomic regions were selected based on the multi-trait analyses and used in fine-mapping studies. We implemented a state-of-art fine-mapping procedure with a Bayesian method that can assign a posterior probability of causality to each variant and for each independent association signal generate a minimum set of associated variants whose total posterior probability of causality exceeds a threshold (e.g. 95%). Our fine-mapping identified 36 candidate genes for production traits, 48 for reproduction traits, and 29 for body conformation traits, respectively, including some previously reported causal variants, e.g., Chr6:38027010 in ABCG2 for production traits and Chr7:93244933 in ARRDC3 for reproduction and body conformation traits. The candidate variant list may facilitate follow-up functional validation and expand our understanding of complex traits in dairy cattle. Additionally, our method can be readily applied to other species where large-scale sequence genotypes are available.