Tuesday, August 7, 2012

Analysis of Artificial Dataset


Description

This dataset contains 89 True Sequences and around 30 identifiable clusters. As the plots show, some clusters have multiple True Sequences near them, presumably because the separate families are not resolvable, and a few True Sequences appear to lie far from any sequences. Every cluster determined by DA-PWC has one or more True Sequences reasonably near it.
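The "reasonably near" check described above can be illustrated with a minimal sketch. It assumes that cluster centers and True Sequences have already been mapped to coordinates (for example, the 3D points used for plotting) and that Euclidean distance in that space is a fair stand-in for sequence distance. The post specifies neither. The function names, the toy data, and the `radius` threshold are all hypothetical.

```python
import math

def nearest_true_sequence(centers, true_points):
    """For each cluster center, return (index of nearest true point, distance)."""
    result = []
    for c in centers:
        best = min(range(len(true_points)),
                   key=lambda i: math.dist(c, true_points[i]))
        result.append((best, math.dist(c, true_points[best])))
    return result

def unmatched_true_sequences(centers, true_points, radius):
    """Indices of true points farther than `radius` from every cluster center."""
    return [i for i, t in enumerate(true_points)
            if all(math.dist(t, c) > radius for c in centers)]

# Hypothetical toy data, not taken from the actual dataset.
centers = [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
true_points = [(0.5, 0.0, 0.0), (0.0, 0.5, 0.0),
               (10.0, 1.0, 0.0), (50.0, 50.0, 50.0)]

matches = nearest_true_sequence(centers, true_points)
isolated = unmatched_true_sequences(centers, true_points, radius=2.0)
```

In this toy example the first center has two True Sequences close by (the unresolved-families case) and the last True Sequence lies far from every center, mirroring the two situations noted in the description.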


Links

Images

Presented below are images of the length dependence and of the clusters from DA-PWC and from Uclust 0.85 and 0.95. Note that the statistics are not very good, so some refinements are not realistic.
  • Length Study
  • DA-PWC Clusters
  • Uclust 0.85
  • Uclust 0.95
