Gene Ontology Meta Annotator for Plants
Making a genome sequence accessible and useful involves three basic steps: genome assembly, structural annotation, and functional annotation. The quality of data generated at each step influences the accuracy of inferences that can be made, with high- quality analyses produce better datasets resulting in stronger hypotheses for downstream experimentation.
Gene Ontology Meta Annotator for Plants (GOMAP) is a high-throughput pipeline to annotate GO terms to plant protein sequences in a high-confidence and reproducible manner. It combines sequence-similarity, domain-presence and mixed-method based approaches and we are currently applying it to the reference genomes of several agriculturally important plants (maize, wheat, rice, cotton, soybean). All generated annotation datasets as well as the source code are publicly available.
For more information on how the pipeline works as well as instructions on how to run it yourself, please check out the documentation on GitHub.