BEAST 2 Help Me Choose

BEAST 2 Help Me Choose Yule-skyline collapse tree prior

Yule-skyline collapse tree prior

The Yule-skyline collapse model is a pure-birth model which is used for identifying boundaries between species (species delimitation) and assumes all tips were sampled at the same time.

Under the Yule skyline model, each epoch is associated with a birth rate. The birth rate of each epoch is sampled from a Gamma(α β) distribution where α is the shape and β is the rate. Also see the Yule sklyline blog.

Under the collapse model, taxa whose divergence time falls below time ε are clustered into a single species (Jones et al. 2015 and 2017).

These two tree priors are combined into the Yule-skyline collapse (YSC) model, overviewed in a previous post.

Collapse threshold ε

The threshold ε describes the maximum amount of divergence that can be tolerated before two samples are considered to be different species. ε is expressed in the same units as the tree height. We recommend the user performs a sensitivity analysis to this term by running multiple MCMC replicates, each with a different value of ε. A guide for performing this exercise can be found in Douglas and Bouckaert 2022.

Collapse weight ω

The collapse weight 0<ω<1 is approximately the proportion of lineages which are clustered into species. The collapse weight prior guide contains further details on this parameter.

Birth rate shape α

The shape of the Gamma distribution is related to how peaked or modal the distribution of birth rates is. This defaults to 2.

Birth rate rate β

The rate of the Gamma distribution is inversely related to the mean birth rate, which is inversely related to the expected tree height and length. The birth rate rate guide contains further details on this parameter.

Equal epochs

If Equal Epochs is false, the model uses birth rate epochs based on ~equally sized groups of lineages, otherwise it uses equal sized epochs that scale with the tree height.

Linked means

If Linked Mean is true, the model uses birth rate rate only for first epoch, and for other epochs uses the posterior mean of the previous epoch. This makes the Yule skyline prior behave like a smoothing prior, which emulates the biologically plausible assumption that birth rates in consecutive epochs are related. If false, each epoch is assumed independently drawn from the same Gamma distribution.

References

Bouckaert, RR. An efficient coalescent epoch model for Bayesian phylogenetic inference. Sys Bio, syac015, 2022 doi:10.1093/sysbio/syac015

Douglas, J., Bouckaert, R. Quantitatively defining species boundaries with more efficiency and more biological realism. Commun Biol 5, 755 (2022). https://doi.org/10.1038/s42003-022-03723-z

Jones, G. Algorithmic improvements to species delimitation and phylogeny estimation under the multispecies coalescent. J. Math. Biol. 74, 447–467 (2017). https://doi.org/10.1007/s00285-016-1034-0

Jones, G., Aydin, Z., Oxelman, B. “DISSECT: an assignment-free Bayesian discovery method for species delimitation under the multispecies coalescent.” Bioinformatics 31.7 (2015): 991-998. https://doi.org/10.1093/bioinformatics/btu770

Bayesian evolutionary analysis by sampling trees

Disclaimer: The above is the opinion of the author RB. If you do not agree, or spot a mistake, contact the author, or discuss this in the issues area or raise a new issue. A link will be added from this page to make sure others can find it.

Served through Jekyll, customised theme based on the twentyfourteen wordpress theme.