Article
Open Access
Pannotator integrated with Medpipe provides immunological and subcellular location features using a microservice
Faculty of Computing, Federal University of Uberlândia, Uberlândia, Brazil
  • Volume
  • Citation
    Gonçalves R, Santos A. Pannotator integrated with Medpipe provides immunological and subcellular location features using a microservice. Biomed. Inform. 2024(1):0002, https://doi.org/10.55092/bi20240001. 
  • DOI
    10.55092/bi20240001
  • Copyright
    Copyright2024 by the authors. Published by ELSP.
Abstract

Bacterial and archaea genome sequencing and assembly are trivial tasks nowadays. After assembling contigs and scaffolds from a genome, the subsequent step is annotation. An annotation evidencing the expected features, like rRNA, tRNA, and CDS, is a signal of the quality of our sequencing and assembly. Different techniques to obtain and reproduce DNA samples, as well as sequencing and assembly of genomes, can impact the quality of a genome’s expected features. The Pannotator tool was conceived as an aid annotation tool focusing on the differences between assembling and its reference genome. Some of the key features for bacterial genome annotations are the subcellular location and immunological potential of a CDS. Instead of reimplementing the prediction of these features in Pannotator, we leveraged the capabilities of our microservice to provide them. In the end, Medpipe software was not modified, and Pannotator underwent minor changes to incorporate the subcellular location and immunological potential of all exported proteins annotated by the tool. Moreover, our Medpipe microservice can also be incorporated into other software. The Medpipe microservice is open to anyone, not only to our Pannotator tool. The successful integration of Medpipe to Pannotator, powered by the Medpipe microservice, offers a powerful approach to advanced genomic analysis. The Medpipe microservice, built on Kotlin with the Spring Boot framework, is instrumental in the automation of Medpipe processing. It achieves this using REST endpoints, such as the execution of Medpipe in an asynchronous manner, status retrieval, and prediction generation, which enhance the modularity and scalability of the microservice. The availability of endpoint documentation, detailed request examples, and logs make our microservice user-friendly. The results of this integration demonstrate the value of the information provided by Medpipe, enriching genomic annotation with additional details, such as the density of mature epitopes (MED) and protein subcellular location classification. The Pannotator has evolved beyond basic function annotation and now provides data on immunological potential, structure, and subcellular location after being integrated with our microservice. The Medpipe microservice is available at https://github.com/santosardr/medpipe-ms.git.

Keywords

genome; annotation; medpipe; pannotator; microservices; protein; subcellular location; mature epitope density

Preview
References
  • [1]Gangotia D, Gupta A, Mani I. Role of Bioinformatics in Biological Sciences. In Advances in Bioinformatics, 1st ed. Singapore: Springer, 2021, pp. 37–57.
  • [2]Zhang Y, Zhang Q, Zhou J, Zou Q. A survey on the algorithm and development of multiple sequence alignment. Brief. Bioinform. 2022, 23(3):1–16.
  • [3]Costa SS, Guimarães LC, Silva A, Soares SC, Baraúna RA. First Steps in the Analysis of Prokaryotic Pan-Genomes. Bioinform. Biol. Insights 2020, 14:1177932220938064.
  • [4]Santos A, Barbosa E, Fiaux K, Zurita-Turk M, Chaitankar V, et al. PANNOTATOR: an automated tool for annotation of pan-genomes. Genet. Mol. Res. 2013, 12(3):2982–2989.
  • [5]Santos A, Pereira V, Barbosa E, Baumbach J, Pauling J, et al. Mature Epitope Density—A strategy for target selection based on immunoinformatics and exported prokaryotic proteins. BMC Genomics 2013, 14 Suppl 6(Suppl 6):S4.
  • [6]Fowler M. Microservices. 2014, Available: https://martinfowler.com/articles/microservices.html (accessed on 20 March 2024).
  • [7]Hossain MD, Sultana T, Akhter S, Hossain MI, Thu NT, et al. The role of microservice approach in edge computing: Opportunities, challenges, and research directions. ICT Express 2023, 9(6):1162–1182.
  • [8]Velepucha V, Flores P. A Survey on Microservices Architecture: Principles, Patterns and Migration Challenges. IEEE Access 2023, 11:88339–88358.
  • [9]Fowler SJ. Microsserviços prontos para a produção: Construindo sistemas padronizados em uma organização de engenharia de software, São Paulo: Novatec Editora, 2019.
  • [10]Newman S. Building Microservices, 1st ed. Sebastopol: O’Reilly Media, 2015.
  • [11]Liwei G, Anush M, Li-Heng C, Vinicius C, Aditya M, et al. Rebuilding Netflix Video Processing Pipeline with Microservices | by Netflix Technology Blog | Netflix TechBlog. 2024, Available: https://netflixtechblog.com/rebuilding-netflix-video-processing-pipeline-with-microservices-4e5e6310e359 (accessed on 20 March 2024).
  • [12]H J. 4 Microservices Examples: Amazon, Netflix, Uber, and Etsy. 2023, Available: https://blog.dreamfactory.com/microservices-examples/ (accessed on 20 March 2024).
  • [13]Seemann T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics 2014, 30(14):2068–2069.
  • [14]Tobes R, Pareja-Tobes P, Manrique M, Pareja-Tobes E, Kovach E, et al. Gene calling and bacterial genome annotation with BG7. Methods Mol. Biol. 2015, 1231:177–189.
  • [15]Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, et al. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci. Rep. 2015, 5:8365.
  • [16]Leung E, Huang A, Cadag E, Montana A, Soliman JL, et al. Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations. BMC Bioinformatics 2016, 17(1):43.
  • [17]Kremer FS, Eslabão MR, Dellagostin OA, da Pinto LS. Genix: A new online automated pipeline for bacterial genome annotation. FEMS Microbiol. Lett. 2016, 363(23):fnw263.
  • [18]Casimiro-Soriguer CS, Muñoz-Mérida A, Pérez-Pulido AJ. Sma3s: A universal tool for easy functional annotation of proteomes and transcriptomes. Proteomics 2017, 17(12).
  • [19]Mende DR, Letunic I, Huerta-Cepas J, Li SS, Forslund K, et al. ProGenomes: A resource for consistent functional and taxonomic annotations of prokaryotic genomes. Nucleic Acids Res. 2017, 45(D1):529–534.
  • [20]Tanizawa Y, Fujisawa T, Arita M, Nakamura Y. Generating publication-ready prokaryotic genome annotations with DFAST. In Methods in Molecular Biology, United States: Humana Press, 2019, pp. 215–226.
  • [21]Sallet E, Gouzy J, Schiex T. EuGene: An automated integrative gene finder for eukaryotes and prokaryotes. In Methods in Molecular Biology, United States: Humana Press, 2019, pp. 97–120.
  • [22]Zhao B, Katuwawala A, Oldfield CJ, Dunker AK, Faraggi E, et al. DescribePROT: Database of amino acid-level protein structure and function predictions. Nucleic Acids Res. 2021, 49(D1):298–308.
  • [23]De Souza EV, Dalberto PF, Machado VP, Canedo A, Saghatelian A, et al. µProteInS-A proteogenomics pipeline for finding novel bacterial microproteins encoded by small ORFs. Bioinformatics 2022, 38(9):2612–2614.
  • [24]Jaiswal V, Chanumolu SK, Gupta A, Chauhan RS, Rout C. Jenner-predict server: Prediction of protein vaccine candidates (PVCs) in bacteria based on host-pathogen interactions. BMC Bioinformatics 2013, 14(1):211.
  • [25]Nagpal G, Usmani SS, Raghava GP. A web resource for designing subunit vaccine against major pathogenic species of bacteria. Front. Immunol. 2018, 9:2280.
  • [26]He Y, Racz R, Sayers S, Lin Y, Todd T, et al. Updates on the web-based VIOLIN vaccine database and analysis system. Nucleic Acids Res. 2014, 42(D1):1124–1132.
  • [27]Abby SS, Denise R, Rocha EP. Identification of Protein Secretion Systems in Bacterial Genomes Using MacSyFinder Version 2. In Methods in Molecular Biology, United States: Humana Press, 2024, pp. 1–25.
  • [28]Yero D, Conchillo-Solé O, Daura X. Antigen discovery in bacterial panproteomes. In Methods in Molecular Biology, United States: Humana Press, 2021, pp. 43–62.
  • [29]Saudate A. APIs REST: Seus serviços prontos para o mundo real, São Paulo: Casa do Código, 2021.
  • [30]The Spring Team. Spring Boot Reference Documentation (Version 3.1.6). 2023, Available: https://docs.spring.io/spring-boot/docs/3.1.6/reference/html/ (accessed on 20 March 2024).
  • [31]Miller FP, Vandome AF, McBrewster J. Apache Maven, São Paulo: Alpha Press, 2010.
  • [32]H2 Database Engine - Cant. 2024, Available: http://www.h2database.com/html/main.html (accessed on 20 March 2024).