Menu
Research Publication

Pygenstrat: A Python package for EIGENSTRAT data processing

Dilek Koptekin

1 Authors
2026-01-23 Published
Scroll to explore
Chapter I

Publication Details

Comprehensive information about this research publication

Authors

DK
Dilek Koptekin
Chapter II

Abstract

Summary of the research findings

Motivation: EIGENSTRAT format is widely used in ancient DNA population-genetic analyses but software for processing it is limited. pygenstrat is a Python command-line package that provides memory-efficient, chunked processing for large ancient-DNA EIGENSTRAT datasets and supports extensive filtering, subsetting, file updates, pseudo-haploidisation, allele polarisation, and conversion between text and binary formats. Results: Benchmarking versus convertf on the Allen Ancient DNA Resource (v62.0) reports 2–15× speedups and 90–95% memory reduction while producing equivalent outputs for standard operations. Availability: open-source (GitHub).

Chapter III

Analysis

Comprehensive review of ancestry and genetic findings

Important Disclaimer: This review has been performed semi-automatically and is provided for informational purposes only. While we strive for accuracy, this analysis may contain errors, omissions, or misinterpretations of the original research. DNA Genics disclaims all liability for any inaccuracies, errors, or consequences arising from the use of this information. Users should independently verify all information and consult original research publications before making any decisions based on this content. This analysis is not intended as a substitute for professional scientific review or medical advice.

Analysis In Progress

Our analysis of this publication is currently being prepared. Please check back soon for comprehensive insights into the ancestry and genetic findings discussed in this research.