Tuesday, August 7, 2012

Study of Uclust Vs. DA-PWC for Divergent Dataset

| Description | DA-PWC | Uclust 0.65 | Uclust 0.75 | Uclust 0.85 | Uclust 0.95 |
|---|---|---|---|---|---|
| Total number of clusters | 23 | 4 | 10 | 36 | 91 |
| Total number of clusters uniquely identified, i.e. one original cluster goes to one uclust cluster | 23 | 0 | 0 | 13 | 16 |
| Total number of shared clusters with significant sharing (one uclust cluster goes to > 1 real cluster) | 0 | 4 | 10 | 5 | 0 |
| Total number of uclust clusters that are just part of a real cluster | 0 | 4 | 10 | 17(11) | 72(62) |
| Total number of real clusters that are one uclust cluster but uclust cluster is spread over multiple real clusters | 0 | 14 | 9 | 5 | 0 |
| Total number of real clusters that have significant contribution from > 1 uclust cluster | 0 | 9 | 14 | 5 | 7 |
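
The "uniquely identified" row can be illustrated with a small sketch. This is not the script used to produce the numbers above; the function name `count_uniquely_identified` and the toy labels are hypothetical, and it assumes a strict one-to-one match with no tolerance for stray sequences (the threshold for "significant" sharing is not stated in this post, so it is not modeled here).

```python
from collections import Counter, defaultdict

def count_uniquely_identified(real_labels, uclust_labels):
    """Count real clusters that map one-to-one onto a single uclust cluster.

    Assumption: a match is strict, i.e. every sequence of the real cluster
    lands in one uclust cluster, and that uclust cluster contains no
    sequences from any other real cluster.
    """
    # Count how many sequences fall into each (real, uclust) pair.
    pairs = Counter(zip(real_labels, uclust_labels))

    # Record which clusters on each side touch which clusters on the other.
    real_to_uclust = defaultdict(set)
    uclust_to_real = defaultdict(set)
    for real, uclust in pairs:
        real_to_uclust[real].add(uclust)
        uclust_to_real[uclust].add(real)

    unique = 0
    for real, partners in real_to_uclust.items():
        if len(partners) == 1:
            (only_uclust,) = partners
            # The uclust cluster must not be shared with other real clusters.
            if len(uclust_to_real[only_uclust]) == 1:
                unique += 1
    return unique

# Toy data (hypothetical): real cluster B is split over uclust clusters 2 and 3.
real = ["A", "A", "B", "B", "C", "C"]
uclust = [1, 1, 2, 3, 4, 4]
print(count_uniquely_identified(real, uclust))
```

In the toy data, A and C each correspond to exactly one uclust cluster, while B is split, so only A and C would count toward that row.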


Associated Histograms

Note: Uclust and CD-HIT were run with a similarity cutoff of 0.97 (the default), meaning that two sequences are assigned to different clusters if their similarity is below 0.97.


  • Divergent Dataset


  • Artificial Dataset
