| See also: UPARSE home page, UPARSE pipeline, OTU benchmark methods and data
 
 Introduction
In SSU metagenomics, next-generation reads are clustered into Operational Taxonomic Units (OTUs). This requires quality filtering, dereplication, discarding singletons (optional), and finally clustering into OTUs, typically at a 97% identity threshold.
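The four steps listed above can be sketched in a few lines of Python. This is a toy illustration only, not the UPARSE algorithm: the function names, the expected-error cutoff `max_ee`, and the alignment-free `identity` measure are assumptions made for the example, and the greedy, abundance-ordered clustering shown here does no chimera filtering.

```python
from collections import Counter


def expected_errors(quals):
    """Sum of the per-base error probabilities implied by Phred scores."""
    return sum(10 ** (-q / 10) for q in quals)


def identity(a, b):
    """Toy identity (assumption): matching positions over the longer length.

    A real pipeline would compute identity from a pairwise alignment.
    """
    matches = sum(x == y for x, y in zip(a, b))
    return matches / max(len(a), len(b))


def cluster_otus(reads, max_ee=1.0, min_size=2, threshold=0.97):
    """reads: iterable of (sequence, phred_scores) pairs.

    Returns a list of (centroid_sequence, total_read_count) tuples.
    """
    # 1. Quality filtering: drop reads with too many expected errors.
    kept = [seq for seq, quals in reads if expected_errors(quals) <= max_ee]

    # 2. Dereplication: collapse identical reads and count their abundance.
    counts = Counter(kept)

    # 3. Discard singletons (optional): keep uniques seen at least min_size times.
    uniques = [(seq, n) for seq, n in counts.items() if n >= min_size]

    # 4. Greedy clustering, most abundant unique sequence first.
    uniques.sort(key=lambda item: (-item[1], item[0]))
    otus = []  # each entry is [centroid, size]
    for seq, n in uniques:
        for otu in otus:
            if identity(seq, otu[0]) >= threshold:
                otu[1] += n  # within the threshold: assign to this OTU
                break
        else:
            otus.append([seq, n])  # no match: seq becomes a new centroid
    return [(centroid, size) for centroid, size in otus]
```

Processing unique sequences in order of decreasing abundance means that the most frequently observed sequences become centroids, on the assumption that abundant sequences are more likely to be error-free.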
 Benchmark results
The OTU benchmark uses 454 Titanium and Illumina MiSeq reads of Even and Staggered mock communities used for protocol development in the Human Microbiome Project (HMP). USEARCH results were obtained with the same parameters for all samples. The number of reads per sample ranges from 10,000 (Titanium) to two million (MiSeq). The accuracy of UPARSE was compared to recommended procedures (Sept. 2012) for mothur, QIIME and AmpliconNoise.
 
 
| Accuracy measure | Summary |
| Sequence quality: Are OTUs accurate reconstructions of biological sequences? | Most USEARCH OTUs are >=99% identical to a biological sequence. Most QIIME, mothur and AmpliconNoise OTUs are >3% diverged from a biological sequence. Roughly half are chimeric. |
| Diversity: Does the number of OTUs correspond to the number of species? | USEARCH generated from 0.8 to 1.0 OTUs per detectable species. mothur and AmpliconNoise produced 2.3x to 6.7x more OTUs than species. QIIME produced thousands of OTUs, far more than the number of species. |
 
 |  Reference
Edgar, R.C. (2013) UPARSE: Highly accurate OTU sequences from microbial amplicon reads, Nature Methods [PubMed: 23955772, dx.doi.org/10.1038/nmeth.2604].
 
 |