Objective: The kernel machine (KM) test has been reported to perform well in set-based association tests of rare variants. Many studies measure phenotypes at multiple time points, but the standard KM methodology is available only for phenotypes measured at a single time point. In addition, family-based designs are widely used in genetic association studies, so the analysis method must appropriately handle familial relatedness. No rare-variant test currently exists for longitudinal data from family samples. In this paper, we therefore introduce a rare-variant association test that accommodates multiple longitudinal phenotype measurements for either population or family samples.

Methods: The approach uses KM regression within the linear mixed model framework and is applicable to longitudinal data from either population samples (L-KM) or family samples (LF-KM).

Results: In our population-based simulation studies, L-KM controlled the Type I error rate well and had higher power than the competing methods in all the scenarios we considered. In the family-based simulation studies, by contrast, the Type I error rate was inflated when L-KM was applied directly to the family samples, whereas LF-KM retained the desired Type I error rate and had the best power performance overall. Finally, we illustrate the utility of the proposed LF-KM approach by analyzing data from an association study between rare variants and blood pressure from the Genetic Analysis Workshop 18 (GAW18).

Conclusion: We propose a method for rare-variant association testing in population and family samples using phenotypes measured at multiple time points for each subject. In our simulation studies, the proposed method had the best power among the approaches compared.
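
To make the kind of statistic referred to under Methods more concrete, the following is a minimal sketch of a generic single-time-point kernel machine score statistic for unrelated subjects. It is not the authors' L-KM or LF-KM procedure: the function names, the weighted linear kernel, the intercept-only null model, and the permutation p-value are all illustrative assumptions. In particular, permutation is valid only when observations are exchangeable, which does not hold for repeated measures or relatives; that is the situation the linear mixed model framework is meant to handle.

```python
import numpy as np


def weighted_linear_kernel(G, weights):
    """Return K = G W W G^T for an (n subjects x m variants) genotype matrix.

    `weights` holds one non-negative weight per variant (hypothetical choice;
    rare-variant tests often up-weight variants with low allele frequency).
    """
    G = np.asarray(G, dtype=float)
    w = np.asarray(weights, dtype=float)
    GW = G * w  # scale each variant column by its weight
    return GW @ GW.T


def km_score_statistic(y, K):
    """Score-type statistic Q = r^T K r with r the residuals of an
    intercept-only null model (assumption: no covariates)."""
    y = np.asarray(y, dtype=float)
    r = y - y.mean()
    return float(r @ K @ r)


def permutation_pvalue(y, K, n_perm=999, seed=0):
    """Permutation p-value for Q; only appropriate for exchangeable
    observations (unrelated subjects, one measurement each)."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    q_obs = km_score_statistic(y, K)
    exceed = sum(
        km_score_statistic(rng.permutation(y), K) >= q_obs
        for _ in range(n_perm)
    )
    return (1 + exceed) / (1 + n_perm)


if __name__ == "__main__":
    # Simulated toy data: 200 unrelated subjects, 10 rare variants.
    rng = np.random.default_rng(1)
    G = rng.binomial(2, 0.02, size=(200, 10))
    y = rng.normal(size=200)
    maf = G.mean(axis=0) / 2.0
    w = np.zeros_like(maf)
    seen = maf > 0  # leave monomorphic variants at weight 0
    w[seen] = 1.0 / np.sqrt(maf[seen] * (1.0 - maf[seen]))
    K = weighted_linear_kernel(G, w)
    print("Q =", km_score_statistic(y, K))
    print("p =", permutation_pvalue(y, K))
```

The sketch uses permutation rather than an analytic null distribution only to keep it short and dependent on NumPy alone.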
