sortbysize command
 
See also
 
  Abundance sort
  UCLUST sort order
  Global trimming

Sorts sequences in order of decreasing size annotation, which usually refers to the size of a cluster. The size is specified by a field size=N; in the sequence label, where N is an integer. The output file is specified by the -output option. The -minsize option sets a minimum size; sequences whose size annotation is below this value are discarded.
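To make the size=N; convention concrete, here is a minimal Python sketch of the same idea: read the annotation from each label, drop records below a minimum size, and sort the rest by decreasing size. It is an illustration only, not USEARCH's implementation. The function names (parse_size, sort_by_size) are hypothetical, and two behaviours are assumptions made for the sketch: ties keep their input order, and a label without a size=N; field raises an error.

```python
import re

# Matches the "size=N;" field in a sequence label, where N is an integer.
SIZE_RE = re.compile(r"(?:^|;)size=(\d+);")

def parse_size(label):
    """Return the integer N from a 'size=N;' field in the label."""
    m = SIZE_RE.search(label)
    if m is None:
        # Assumption for this sketch: a missing annotation is an error.
        raise ValueError(f"no size=N; annotation in label: {label!r}")
    return int(m.group(1))

def sort_by_size(records, minsize=1):
    """Sort (label, sequence) pairs by decreasing size annotation.

    Records with size < minsize are dropped. Python's sort is stable,
    so records with equal size keep their input order (an assumption
    of this sketch, not documented USEARCH behaviour).
    """
    kept = [(label, seq) for label, seq in records
            if parse_size(label) >= minsize]
    return sorted(kept, key=lambda rec: parse_size(rec[0]), reverse=True)

records = [
    ("A;size=3;", "ACGT"),
    ("B;size=10;", "GGCC"),
    ("C;size=5;", "TTAA"),
]
for label, seq in sort_by_size(records, minsize=4):
    print(label, seq)
```

With minsize=4, record A (size 3) is dropped and B is listed before C.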

For most applications, it is recommended that sequences are globally trimmed before clustering and abundance sorting.

The -relabel option takes a prefix and generates sequential labels for the sorted sequences (requires v6.0.235 or later). Each output label has the form prefixN, where N = 1, 2, 3, and so on. If -sizeout is also used, a size annotation is appended to the sequential label.
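The relabeling scheme can be sketched in Python as follows. This is a hedged illustration rather than USEARCH's own code: the function name relabel and the prefix "OTU_" are hypothetical, and the exact form of the appended annotation (";size=N;") is an assumption based on the size=N; field described above.

```python
import re

def relabel(sorted_labels, prefix, sizeout=False):
    """Replace each label with prefixN, N = 1, 2, 3, ... in sorted order.

    If sizeout is True, the size annotation found in the original label
    is appended to the new label (separator format is an assumption).
    """
    new_labels = []
    for n, label in enumerate(sorted_labels, start=1):
        new_label = f"{prefix}{n}"
        if sizeout:
            m = re.search(r"(?:^|;)size=(\d+);", label)
            if m:
                new_label += f";size={m.group(1)};"
        new_labels.append(new_label)
    return new_labels

labels = ["B;size=10;", "C;size=5;"]
print(relabel(labels, "OTU_"))
print(relabel(labels, "OTU_", sizeout=True))
```

The original identifiers (B, C) are discarded; only the rank in the sorted order and, optionally, the size survive in the new label.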

Example

usearch -sortbysize seqs.fasta -output seqs_sorted.fasta -minsize 4