OTU accuracy results

See also
OTU benchmark

Species per OTU on HMP mock communities
Since most methods produce many OTUs per species, I used the inverse ratio, i.e. number of species divided by number of OTUs, to produce a measure that usually falls in the range zero to one. I estimated the number of species as the sum of the number of mock species found, plus the number of OTUs classified as Contaminant.

For the 454 datasets, the QIIME species per OTU values are too small to be visible, ranging from 0.005 (Stag2P) to 0.014 (Stag1P).

For the Illumina datasets (not shown in figure), UPARSE OTUs have 0.95 species per OTU, while QIIME has 0.1 species per OTU.

The much closer agreement between the number of OTUs and the number of mock species plus detected contaminants achieved by UPARSE compared to the other assessed pipelines suggests that UPARSE may generate OTUs that are closer to 1:1 correspondence with species on samples collected in vivo.

Reference
Edgar, R.C. (2013) UPARSE: Highly accurate OTU sequences from microbial amplicon reads, Nature Methods [Pubmed:23955772, dx.doi.org/10.1038/nmeth.2604].