The missing data problem in population genomics and statistical methods to address them.

Name: DNAGENICS DNA Analysis Platform
Rating: 4.4 (96 reviews)
Author: DNAGENICS

Sethuraman Arun

doi:10.1093/g3journal/jkaf269

Research Publication

The missing data problem in population genomics and statistical methods to address them.

Sethuraman Arun

41494994 PubMed ID

1 Authors

2026-01-07 Published

76 Views

Explore Publication View Original

Scroll to explore

Chapter I

Publication Details

Comprehensive information about this research publication

Authors 1 researchers

Publication Date 2026-01-07

URL https://pubmed.ncbi.nlm.nih.gov/41494994/

Authors

SA

Sethuraman Arun

View on PubMed More from Journal

Chapter II

Abstract

Summary of the research findings

The "Missing Data" problem is prevalent across all statistical inference, owing to the "absence of some part of a familiar data structure" (Efron 1994).Population genomic datasets are riddled with missing data (Fig. 1)-broadly classified as data missing at random (e.g.due to degradation, sequencing errors), data missing "on purpose" (e.g.due to sequencing strategies like genotyping by sequencing), and data missing due to unknown evolutionary history (e.g.introgression from ancestral ghost populations).Editors and scientific contributors to both the GSA's journals, Genetics and G3 have continually highlighted statistical issues and pitfalls with inference in the presence of missing data (McIntyre 2025), particularly in an age of Biobank scale population genomic datasets.Here I highlight studies, including those that have been recently published in Genetics and G3 towards systematically assessing the effects of missing data problems and addressing them towards inference in a variety of population genomics questions.

Chapter III

AI-Generated Summary

AI-generated by DNAGENICS

Independent AI summary of ancestry and genetic findings from the published study

Important: This summary is AI-generated by DNAGENICS for informational purposes only. It was not created by, affiliated with, or endorsed by the researchers behind the original publication, and is based solely on that published research. It may contain errors or omissions. DNAGENICS disclaims all liability for any inaccuracies or consequences arising from use of this information. Verify all information against the original publication. This is not professional scientific review or medical advice.

The missing data problem in population genomics and statistical methods to address them.

Publication Details

Authors

Abstract

AI-Generated Summary

AI Summary In Progress

Summary

Key Findings

Ancestry Insights

Traits Analysis

Historical Context

Explore More Research

The missing data problem in population genomics and statistical methods to address them.

Publication Details

Authors

Abstract

AI-Generated Summary

AI Summary In Progress

Summary

Key Findings

Ancestry Insights

Traits Analysis

Historical Context

Related Publications

An empirical evaluation of genotype imputation of ancient DNA.

High resolution analysis of recent population structure using rare variants.

A cost-effective, high-throughput, highly accurate genotyping method for outbred populations.

Phase-free local ancestry inference mitigates the impact of switch errors on phase-based methods.

A probabilistic approach to visualize the effect of missing data on PCA in ancient human genomics.

Early medieval genetic data from Ural region evaluated in the light of archaeological evidence of ancient Hungarians

Explore More Research