Unspecific Peroxygenase Database (UPObase) is a genome mining pipeline based database which consists of all the sequences of fungal unspecific peroxygenases (UPOs) present in the fungal kingdom. The fungal UPOs in this database are obtained by searching against all the fungal genomes present in the Ensembl database. This server uses profile hidden Markov models (HMMs) for homology search and incorporates clustering algorithms to group the most similar sequences.
The complete genome sequences are subjected to sequence-based and graph-based clustering to make large groups of highly similar sequences together, which then later searched for the signature motifs of different subfamilies of UPOs. This database consists of approximately 2,000 UPO encoding sequences.