Menu
Currency
GWAS Study

A genome-wide association study coupled with machine learning approaches to identify influential demographic and genomic factors underlying Parkinson's disease.

Rahman MA, Liu J

37842648 PubMed ID
GWAS Study Type
7072 Participants
48 Views
Scroll to explore
Chapter I

Publication Details

Comprehensive information about this research publication

Authors

RM
Rahman MA
LJ
Liu J
Chapter II

Abstract

Summary of the research findings

Despite the recent success of genome-wide association studies (GWAS) in identifying 90 independent risk loci for Parkinson's disease (PD), the genomic underpinning of PD is still largely unknown. At the same time, accurate and reliable predictive models utilizing genomic or demographic features are desired in the clinic for predicting the risk of Parkinson's disease. Methods: To identify influential demographic and genomic factors associated with PD and to further develop predictive models, we utilized demographic data, incorporating 200 variables across 33,473 participants, along with genomic data involving 447,089 SNPs across 8,840 samples, both derived from the Fox Insight online study. We first applied correlation and GWAS analyses to find the top demographic and genomic factors associated with PD, respectively. We further developed and compared a variety of machine learning (ML) models for predicting PD. From the developed ML models, we performed feature importance analysis to reveal the predictability of each demographic or the genomic input feature for PD. Finally, we performed gene set enrichment analysis on our GWAS results to identify PD-associated pathways. Results: In our study, we identified both novel and well-known demographic and genetic factors (along with the enriched pathways) related to PD. In addition, we developed predictive models that performed robustly, with AUC = 0.89 for demographic data and AUC = 0.74 for genomic data. Our GWAS analysis identified several novel and significant variants and gene loci, including three intron variants in LMNA (p-values smaller than 4.0e-21) and one missense variant in SEMA4A (p-value = 1.11e-26). Our feature importance analysis from the PD-predictive ML models highlighted some significant and novel variants from our GWAS analysis (e.g., the intron variant rs1749409 in the RIT1 gene) and helped identify potentially causative variants that were missed by GWAS, such as rs11264300, a missense variant in the gene DCST1, and rs11584630, an intron variant in the gene KCNN3. Conclusion: In summary, by combining a GWAS with advanced machine learning models, we identified both known and novel demographic and genomic factors as well as built well-performing ML models for predicting Parkinson's disease.

6,868 cases, 204 controls

Chapter III

Study Statistics

Key metrics and study information

7072
Total Participants
GWAS
Study Type
No
Replicated
Chapter IV

AI-Generated Summary

AI-generated by DNAGENICS

Independent AI summary of health and genetic findings from the published study

Important: This summary is AI-generated by DNAGENICS for informational purposes only. It was not created by, affiliated with, or endorsed by the researchers behind the original publication, and is based solely on that published research. It may contain errors or omissions. DNAGENICS disclaims all liability for any inaccuracies or consequences arising from use of this information. Verify all information against the original publication. This is not professional scientific review or medical advice.

AI Summary In Progress

Our AI-generated summary of this publication is being prepared. Please check back soon.