As next-generation sequencing technologies continue to generate staggering amounts of raw protein sequences, it has become very difficult to thoroughly annotate the emerging protein-sequence space.