ALGORITHM: Because the treatment may take a long time, the result will be sent to you by email along with its WWW demonstration. The source multiple sequence alignment may contain both lower-case and upper-case letters.

The distance between sequences i and j is calculated as

d(i,j) = 1 - (S(i,j) - Sr(i,j)) / (Smax(i,j) - Sr(i,j)),

where S(i,j) is the similarity of sequences i and j, Smax(i,j) is its maximum value, and Sr(i,j) is the random level of similarity. This is the most commonly used formula. It takes into account that the similarity of the sequences cannot be less than some definite (random) level.

Another variant of the formula is

d(i,j) = 1 - S(i,j) / Smax(i,j).

The third variant of the formula smooths the distinction between small similarities:

d(i,j) = -ln(S(i,j) / Smax(i,j)).

We use the following weight matrices for amino acid exchange:
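
For the three distance formulas given above, a minimal Python sketch follows. It assumes that the minus signs, which appear to have been lost from the formulas in this text, belong where shown, so that every distance is 0 when S equals Smax. The function names are hypothetical and are not part of the service.

```python
import math


def distance_random_corrected(s, s_max, s_rand):
    """First variant: d = 1 - (S - Sr) / (Smax - Sr)."""
    return 1.0 - (s - s_rand) / (s_max - s_rand)


def distance_simple(s, s_max):
    """Second variant: d = 1 - S / Smax."""
    return 1.0 - s / s_max


def distance_log(s, s_max):
    """Third variant: d = -ln(S / Smax); the leading minus is an assumption."""
    return -math.log(s / s_max)


# Made-up similarity values, for illustration only.
s, s_max, s_rand = 60.0, 100.0, 20.0
d1 = distance_random_corrected(s, s_max, s_rand)
d2 = distance_simple(s, s_max)
d3 = distance_log(s, s_max)
```

With these made-up values the first variant gives a larger distance than the second, because similarity at the random level is treated as carrying no information.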
The main feature of the topological algorithms is that they first optimize the tree structure (i.e. the way the tree nodes are connected) without considering branch lengths, which are reconstituted once the topological structure has been established. The topological algorithms employed in the module are based on the so-called topological similarity principle. This approach is tailored to represent the internal structure of the analyzed distance matrix as precisely as possible. To do that, the algorithm counts the sequence quartets (i, j, k, l) for which a relation of the form d(i,j) + d(k,l) < d(i,k) + d(j,l) holds in the matrix but does not hold in the tree, and vice versa; this count is the topological deviation value. The algorithm aims to construct the tree for which this number is minimal, i.e. the maximum topological similarity tree. Although in general the algorithm does not guarantee that the global minimum of the topological deviation is obtained, the constructed trees, which correspond to local minima, approximate the maximum topological similarity tree reasonably well.

In the cluster algorithm, the notion of distance between groups of sequences is used to set the branching order. This distance is defined as the arithmetic mean of the pairwise distances between elements of the two groups. Contrary to the topological algorithm, the cluster algorithm reconstitutes the order of node connections together with the corresponding branch lengths. The root is also determined in a natural way, as a point on one of the branches such that the distances from it to all hanging nodes (corresponding to sequences) are equal. This property of cluster trees makes it possible to introduce the distance from the root to each node and to draw the tree using this distance as the node abscissa.

KAG2_CAVPO  CGGVLVDPQWVLTAAHCINDSN
KAG_PIG     CGGVLVNPKWVLTAAHCKNDNY
PLMN_PIG    parvvggcvsiphswpwqislryryrgHFCGGTLISPEWVLTAKHC
TRYP_PIG    nsgsHFCGGSLINSQWVVSAAHCYKSRI
UROK_PIG    CGGSLISPCWVVSATHCFINYQQKEDY

KAG2_CAVPO  QVKLGRHNLFEDEDTAQHFLVSQSVPHPDFN
KAG_PIG     EVGWLRHNLFENENTAQFFGVTADFPHPGFN
PLMN_PIG    lekssspssykvilgaheeyhlge
TRYP_PIG    QVRLGEHNIDVLEGNEQFINAAKIITHPNFN
UROK_PIG    IVYLGRQTLHSSTHGEMKFEVEKLILHEDYSADSLA

Phylogenetic trees can be presented in one or several graphical forms (picture types):
- slanted cladogram, two versions;
- rectangular cladogram, two versions;
- phylogram, that is, a rectangular cladogram with branches scaled by their length (weight);
- unrooted, two versions (unscaled and scaled branches).
All images have the same size.
Width and height of the images may be set from 320 to 2000 and from 240 to 1500 pixels, respectively. The defaults are 640 and 480. The unrooted tree with scaled branches (Unrooted 2) has a special option: the max/min factor. A scaled unrooted tree does not look good when its branches (edges) have very different lengths. This option restrains the difference so that very short branches are plotted with a length only factor times smaller than the maximum plotted length. Such branches are displayed in orange. Also, very long branches (three at most) are plotted as shorter, partly dashed, green lines. The max/min factor can be set separately for the cluster and topological algorithms.

In the bootstrap, the multiple alignment is resampled 100 times, that is, 100 trees are generated. Bootstrap values are expressed as percentages and placed at nodes in all graphical forms except unrooted trees.
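
The resampling step of the bootstrap can be sketched as follows. This is only an illustration: it assumes the usual column-wise bootstrap, in which alignment columns are drawn with replacement, a detail the text does not spell out. The function names, the `seed` parameter and the toy alignment are hypothetical, and tree construction itself is not shown.

```python
import random


def bootstrap_alignments(alignment, replicates=100, seed=None):
    """Yield resampled alignments built from columns drawn with replacement.

    `alignment` maps sequence names to aligned strings of equal length.
    """
    rng = random.Random(seed)
    names = list(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[name]) != length for name in names):
        raise ValueError("all aligned sequences must have the same length")
    for _ in range(replicates):
        # Pick `length` column indices at random, with replacement.
        columns = [rng.randrange(length) for _ in range(length)]
        yield {
            name: "".join(alignment[name][c] for c in columns)
            for name in names
        }


def support_percent(times_seen, replicates=100):
    """Express how often a node was recovered as a percentage."""
    return 100.0 * times_seen / replicates


# Toy alignment, for illustration only.
toy = {"seq_a": "CGGVLV", "seq_b": "CGGSLI", "seq_c": "CGGTLI"}
resampled = list(bootstrap_alignments(toy, replicates=100, seed=1))
```

Each resampled alignment would then be passed to the chosen tree-building algorithm, and the number of replicate trees containing a given node would be converted to a percentage with `support_percent`.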