lunes, 8 de diciembre de 2014

Homology: Concepts and Inference

what is is and how we can possibly assess it

On the different concepts of homology

Homology is a central concept in biology, and it refers to the correspondence of morphological or molecular attributes due to common ancestry. It is generally useful to sort and evaluate relationships among organisms (Pinna 1991)⁠, depending on the field of study (Brigandt 2003)⁠. In comparative biology, it allows for effective descriptions across species and unified morphological knowledge, which provides comparisons that could be finely used for classifications. In other branches of biology such as evolutionary biology and phylogenetic systematics, it serves to explain the adaptive modifications of characters and to characterize taxa, respectively (Brigandt 2003)⁠. Given its diverse extents, there is also a set of criteria to infer it, among them, the similarity in molecular, structural detail and histology, the connectivity to adjacent structures, correspondence of the developmental origin (Patterson 1982; Patterson 1988)⁠ , and step-counting.

Regarding the concepts of homology, (De Pinna 1991)⁠ made a distinction between “primary” and “secondary” homology, treating the first as an untested supposition that two parts are equivalent by inheritance -definition that (Agnarsson & Coddington 2007)⁠ pondered as the prior of a logical analysis or the background knowledge that is not concerned to morphological data- , and the second as the hypothesis that has passed the congruence test and thus can be treated as a synapomorphy. A large discussion has been taken upon this latter concept, as for authors like (De Queiroz 1985)⁠ it does not equal homology, for he argued that homology applies to instantaneous morphologies, while synapomorphy applies to ontogenetic transformations that characterize monophyletic groups. On the other hand, like (Eldredge, N, Cracraft 1980)⁠, like (De Pinna 1991)⁠, considered these concepts as a single one. However, (Patterson 1982)⁠ and (Rieppel 1980)⁠ assigned a certain hierarchical organization of homologies, distinguishing between taxic (the feature of a monophyletic group) and transformational (the change of a structure into another) homology. Taking this into context, the radula of molluscs or the radular teeth constitute a taxic homology, and the classic example of the incus and the quadrate are a transformational homology (Zelditch M.L., Fink W.L. 1995)⁠.

Patterson's Test








A main question that can be asked is how do we recognize a character as homologous? One of the most outstanding processes was proposed by Patterson (Patterson 1982)⁠, and it is an evaluation that compresses three tests that a character needs to pass to be considered as an homology in a phylogenetic analysis. These tests are: similarity, which refers to the resemblance between structures that could share a common ancestor and thus it is a first “candidate” of homology; conjunction which evaluates that two different states of a character can not be present within the same organism (angels are definitely not an useful species to be included in an analysis using this test); and congruence that is the final test that shows whether a character most probably represents an homology.

Choosing criteria of evaluation

A question that arises when it comes to the evaluation of homology is what test or what methodology to use. In theory, parsimony or other criteria, could guide choice among competing homology hypotheses before assessing their fit to a tree (Agnarsson & Coddington 2007)⁠. Testing the fit of data to trees using various optimality criteria has been widely studied. However, the evaluation of the initial criteria of similarity in the formation of homology hypotheses, such as whether two structures should be considered homologous, remains a key point of attention. The authors proposed an index to make this initial assessment, weighting topology, function and similarity equally, so that multiple points of comparison within each criterion allow them to be averaged or down-weighted. Thence it took each criterion into a comparable unit.

Although morphological data present several points of special attention (Patterson 1982)⁠, molecular data is a powerful source that definitely has to be carefully analyzed. Mainly because it passes through a test of similarity that could render wrong assumptions and results. Taking into account that homologous sequences can be orthologous -as divergent from a common ancestor- or paralogous -as product of gene duplication-, as well as other mechanisms or variations -like xenology- (Fitch 2000)⁠ we can not be sure that a result from a similarity test will not come up as a convergence.

Another issue to consider is the character optimization and analysis, which basically have two different arguments: one that supports sequence alignment as a crucial step (Simmons and Freudenstein, 2003), and another that has proved that aligning base positions is not that fundamental and the optimization can be directly done (Wheeler 2001; Wheeler 2006)⁠. Wheeler introduced the concept of dynamic homology, which states that nucleotide homologies are topology specific and can be identified by optimization processes that determine nucleotide correspondence and transformation via a specific criterion. This method does not rely on a prior alignment, but analyses the variation, rendering likely cladograms.

Overall, it is evident that homology is a wide concept and its assessment is rather complex. There are several types of potential information and methods to analyze it properly. However, it is always necessary to bring the evolutionary histology and biology principles into the discussion, so one could be a little more positive about the obtained phylogenies, which at the end, are hypotheses that should constantly be evaluated until the phylogenetic relationships within a group are considered to be clear. As (Patterson 1982)⁠ stated, homology can be seen as a probability, as a result of any of the three tests, interpreting them is always a delicate task assigned to biologists.

References

Agnarsson, I. & Coddington, J.A., 2007. Cladistics Quantitative tests of primary homology. , 23, pp.1–11.
Brigandt, I., 2003. Homology in Comparative , Molecular , and Evolutionary Developmental Biology : The Radiation of a Concept. Journal of Experimental Zoology, 299, pp.9–17.
Eldredge, N, Cracraft, J., 1980. Phylogenetic Patterns and the Evolutionary Process,
Fitch, W.M., 2000. Homology a personal view on some of the problems. Homology, 16(5), pp.227–231.
Patterson, C., 1988. Homology in Classical and Molecular Biology. Molecular biology and evolution, 5(6), pp.603–625.
Patterson, C., 1982. Morphological characters and homology.
Pinna, M.C.C., 1991. Concepts and Tests of Homology in the Cladistic Paradigm. Cladistics, 7(4), pp.367–394. Available at: http://doi.wiley.com/10.1111/j.1096-0031.1991.tb00045.x.
De Pinna, M.C.C., 1991. Concepts and Tests of Homology in the Cladistic Paradigm. Cladistics, 7(4), pp.367–394. Available at: http://doi.wiley.com/10.1111/j.1096-0031.1991.tb00045.x.
De Queiroz, K., 1985. The ontogenetic method for determining character polarity and its relevance to phylogenetic systematics. Syst. Zool., 34, pp.280–299.
Rieppel, O., 1980. Homology, a deductive concept? Journal of Zoological Systematics and Evolutionary Research, 18(4), pp.315–319.
Wheeler, W., 2001. Homology and the Optimization of DNA Sequence Data. Cladistics, 17(1), pp.S3–S11. Available at: http://doi.wiley.com/10.1006/clad.2000.0154 [Accessed October 30, 2014].
Wheeler, W.C., 2006. Cladistics Dynamic homology and the likelihood criterion. , 22, pp.157–170.
Zelditch M.L., Fink W.L., S.D.L., 1995. Morphometrics, Homology, and Phylogenetics: Quantified Characters as Synapomorphies. Syst Biol, 44(2), pp.179–189.


martes, 7 de octubre de 2014

Evidence and Phylogenetic Inference

-keeping the Popperian and Bayesian spirits alive



Most of problems in phylogeny are not solved by simple hypotheses (i.e., Arthropoda and Onycophora as sister clades), they are rather assessed by composed hypotheses that should not be addressed as it was just one (let's make a reliable phylogenetic reconstruction of the group you are most interested in). Therefore, sorting this out is a matter of two components: Popperian vision of confirmation and Bayesian analysis. It is possible that the aforesaid will always sound odd, since Popper always expressed his disagreement about the Bayesian test. Nonetheless, the purpose of this article is to give some attention on the points that actually join these two approaches.

Evidence and Popper

Searching for evidence is an everyday task for the quizzical spirits who make science as what it really is: a constant pursuit of the truth about the processes that underlie the phenomena of life. However, there is no absolute certain of anything, and what we call truth would rather require a level of belief. This level of belief comes with the evidence of the hypothesis, which is what we think is the possible explanation of a particular event, or a statement that can not be defeated by evidence per se.

Sober (2009), states that the evidence we have do not render our theories true, but if we follow an argument of deductive logic, “the conclusion must be true if the premises are”. Hence, if the premises are true, there is nothing wrong in believing the conclusion. This concept is the antithesis of what Popper has proposed in the past about the power of a hypothesis, which relies on how improbable it is. The aforementioned, because in contrast to a scientist who accepts the most highly probable hypotheses, scientists according to the philosophy of Popper (1963) seek for explanations that are not subjected to a limited number of observations.

At this point, it is very necessary to bring up some of the important concepts of Sir. Karl Popper, which have contributed to this crucial step of the scientific method. First, falsifiability is a necessary criterion for scientific ideas, for it is the logical possibility that a statement could be false because of a particular observation or an experiment (Helfenbein and DeSalle R. 2005). Thereupon, a common feature of every scientific hypothesis and theory is to be “falsiable”, which not necessarily implies that they are false. If there is no degree of falsifiability in the theory or hypothesis you are formulating, it could possibly mean that it behaves as a universal law or you are facing an artifact, which is undesirable in scientific framework. Up to this point, I totally agree with Popper's statement, because one good conclusion is derived from a good hypothesis that could be contrasted with the evidence and other researches could arrive at the very same conclusion using different experiments.

The only issue I take on Popper's proposal is because of the formulated concept of corroboration, which implies that hypotheses should stand up to the most severe tests. In the cases where hypotheses were not falsified under these tests, we call the concept of degree of corroboration, which is the “appraisal of the worth of the hypothesis” (Helfenbein and DeSalle R. 2005). So, if a hypothesis has not been falsified, it has been corroborated.

To put this into context, I am enormously fond on molluscs, whose morphological and behavioral traits have defined them as a monophyletic group. Among several features, the veliger larvae constitutes the synapomorphy of this clade. Therefore the veliger larvae is the evidence of this hypothesis of monophyly. Through several analyses, we could be positive sure that it is a powerful conclusion that suits the given evidence just fine. Thus, the Popperian concept of a good hypothesis could really be in trouble when testing, because there is not such unlimited number of available observations.

Bayes and Popper
“There is not even anything irrational in relying for practical purposes upon well-tested theories, for no more rational course of action is open to us. (Popper, 1963)

First, I agree with the point of Bernardo (1999): "we are able to embed a Popperian take on the goal and methods of science into a genuine Bayesian model of hypothesis testing(...)". This idea is supported mainly because Popper’s judgement that an idea must be falsifiable could interpreted as a manifestation of the Bayesian conservation-of-probability rule (Yudowsky, 2014).

The perks of Bayesian analysis

Then, why is it preferable to use a Bayesian framework in phylogeny? Because it is flexible and allows you to use an evolution model. The prior probability that is calculated is nothing else but the probability of the model when you have not take a look at the data. Thus, the posterior probability is the probability of your model (hypothesis) given the data. These two components are explicit, which allows further assessment. Moreover, the Bayesian approach also takes account on the likelihood, which is included on its theorem. The likelihood is the probability of the data given the model, which gives us useful information, but not all the information we need.

On the other hand, due to the flexibility of the method, it allows using complex models and large sets of data. This property always provides the possibility of enlarge or enhance the method (Heaps et al., 2014). And finally, you can get a clearer grasp on the solution of your problem with the posterior probability.

Thence, the tests that are based on the calculus of probabilities, following an evolution model seem to be a powerful way to evaluate a hypothesis and determine how close we are to get a suitable likely answer. In fact, contemporary science tends to accept robust hypotheses that have been supported, rather than hypotheses of what can not possibly be refuted for its highly degree of improbability. Therefore, using a Bayesian approach would give very informative results and could allow us to perform further analyses. As a matter of being judgemental about an statement, Popper's philosophy is strong and it is an important startpoint for the scientific thinking: hypotheses must stand for severe tests. Nonetheless, in any case, choosing the approach to work with is subjected to personal criteria and to the aims of the study.

References

Bernardo J M. (1999) “Nested Hypothesis Testing: The Bayesian Reference Criterion”, in J. Bernardo et al. (eds.): Bayesian Statistics 6: Proceedings of the Sixth Valencia Meeting, 101–130 (with discussion), Oxford University Press, Oxford.

Helfenbein KG, DeSalle R. (2005) Falsifications and corroborations: Karl Popper’s influence on systematics. Molecular phylogenetics and evolution, 35(1), 271-280.

Heaps SE, Nye TM, Boys RJ, Williams TA, Embley TM. (2014) Bayesian modelling of compositional heterogeneity in molecular phylogenetics. Stat Appl Genet Mol Biol. 13, 589-609.

Popper KR. (1963) Conjectures and Refutations: The Growth of Scientific Knowledge. New York: Harper.

Sober E. (2008) Evidence and Evolution: The Logic Behind the Science. Cambridge University Press.

Addittional references

How Do Hypothesis Tests Provide Scientific Evidence? Reconciling Karl Popper and Thomas Bayes Departmental Seminar, Philosophy Department of Uppsala University, Uppsala.

Yudkowsky E. An Intuitive Explanation of Bayes'Theorem.http://yudkowsky.net/rational/bayes