Variant Frequency
Also known as: allele frequency, mutation frequency, variant allele frequency
The proportion of sequencing reads or cells in a population that carry a specific genetic variant at a given locus.
Variant Frequency is the fraction of a population — measured as the proportion of sequencing reads at a given locus — that carries a specific genetic variant 1.
How It Works
When a mixed population of cells is sequenced, not all cells may share the same genotype at every position. Variant frequency (also called variant allele frequency, or VAF) is calculated as the number of reads supporting the alternative allele divided by the total read depth at that position. A VAF of 1.0 indicates a fixed variant present in all cells, while lower values indicate the variant is present in a subpopulation.
Accurate variant frequency estimation requires sufficient sequencing depth. Low-frequency variants (below 1-5%) require deep sequencing (hundreds to thousands of reads) and careful error modeling to distinguish true variants from sequencing artifacts.
In synthetic biology, variant frequency analysis tracks the emergence and spread of escape mutations in genetic circuits, monitors population heterogeneity during adaptive laboratory evolution, and detects unintended mutations arising during strain construction or passaging.
Computational Considerations
Variant callers such as VarScan2 use statistical tests to distinguish true low-frequency variants from sequencing errors based on base quality, strand bias, and read position 1. For microbial evolution experiments, breseq provides specialized pipelines that identify mutations, track their frequencies across time points, and annotate their predicted functional effects 2. Visualization tools display variant frequency trajectories, revealing selective sweeps, clonal interference, and the dynamics of competing lineages.
Woolf Software builds computational pipelines for biological data analysis and experimental design optimization. Get in touch.
Variant callers estimate allele frequencies from read pileups using Bayesian or likelihood-based models, enabling detection of low-frequency mutations in mixed populations.
Related Terms
References
- Koboldt DC, Zhang Q, Larson DE, et al.. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing . Genome Research (2012) DOI
- Deatherage DE, Barrick JE.. Identification of mutations in laboratory-evolved microbes from next-generation sequencing data using breseq . Methods in Molecular Biology (2014) DOI