Skip to contents

The Tamura 1992 (T92) distance extends the K80 distance by taking GC content into account. It is calculated as \(-h \ln \left(1 - \frac{p}{h} - q\right) - \frac{1}{2} \times (1 - h) \ln\left(1 - 2 q\right)\), where \(p\) is the probability of transition, \(q\) the probability of transversion, \(h = 2\theta (1 - \theta)\) and \(\theta\) is the GC content. See the Wikipedia for more details.

Usage

T92_distance(train_parameters)

Arguments

train_parameters

A list containing the probabilities of the alignment, produced by the readTrainFile() function.

Value

Returns a numeric value show the evolutionary distance between two genomes. the larger the value, the more different the two genomes are.

References

Tamura, K. (1992). "Estimation of the number of nucleotide substitutions when there are strong transition-transversion and G+C-content biases." Molecular Biology and Evolution, 9(4), 678–687. DOI: 10.1093/oxfordjournals.molbev.a040752

Author

Zikun Yang

Examples

parameters <- readTrainFile(system.file("extdata/example.train", package = "GenomicBreaks"))
T92_distance(parameters)
#> [1] 0.01617185