<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="6.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Zhang, Jian</style></author><author><style face="normal" font="default" size="100%">Zhiyuan Zhao</style></author><author><style face="normal" font="default" size="100%">Jennifer Evershed</style></author><author><style face="normal" font="default" size="100%">Guoying Li</style></author></authors><secondary-authors><author><style face="normal" font="default" size="100%">Hofestädt, Ralf</style></author></secondary-authors></contributors><titles><title><style face="normal" font="default" size="100%">Monophyletic clustering and characterization of protein families</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Integrative Bioinformatics</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Statistics</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2007</style></year></dates><volume><style face="normal" font="default" size="100%">4</style></volume><pages><style face="normal" font="default" size="100%">Article 67</style></pages><isbn><style face="normal" font="default" size="100%">1613-4516</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">A protein family contains sequences that are evolutionarily related. Generally, this is reflected by sequence similarity. There have been many attempts to organize the set of protein families into evolutionarily homogenous clusters using certain clustering methods. How do we characterize these clusters? How can we cluster protein families using these characterizations? In this work, these questions were addressed by use of a concept called group-wide co-evolution, and was exemplified by some real and simulated protein family data. The results have shown that the trend of a group of monophyletic proteins might be characterized by a normal distribution, while the strength and variability of this trend can be described by the sample mean and variance of the observed correlation coefficients after a suitable transformation. To exploit this property, we have developed a monophyletic clustering method called monophyletic k?medoids clustering. </style></abstract></record></records></xml>