Exercise 1 - Parameter estimation

Inferring phylogenies using maximum likelihood

In this tutorial you will be guided in using PhyML and its extension, CodonPhyML, to solve common phylogenetic problems. For some of the following exercises there might be more than one single solution.


Goal: Observing the effect of parameter estimation from data on the inferred tree topology.

In this exercise you are asked to run PhyML twice in order to compare the effect of estimating nucleotide frequencies from the used dataset vs. optimising them with a maximum likelihood (ML) approach.


Datasets
Dataset file:
First Run

Set the model to HKY+Gamma, estimating the transition/transversion ratio and the alpha parameter of the Gamma distribution by maximum likelihood (ML), nucleotide frequencies are estimated by ML.

Second Run

Set the model to HKY+Gamma, estimating the transition/transversion ratio and the alpha parameter of the Gamma distribution by maximum likelihood (ML), nucleotide frequencies are estimated empirically from the data


Questions/Tasks
  1. Do you see much difference in the tree?
  2. In the likelihood value (stat file)?
  3. Which option is best and why do you think so?

phylogenies tree-estimation maximum-likelihood parameter-estimation


This exercise was prepared by Maria Anisimova