The draft genome assembly of Psidium guajava L.

Título da Revista

ISSN da Revista

Título de Volume

Editor

Universidade Federal do Espírito Santo

Resumo

Psidium guajava is a major commercial fruit of Myrtaceae family, mainly cropped for fresh fruit consumption, valued for its taste and high C vitamin content. Also, is used for industrialized production and has medicinal applications. Is widely distributed in all tropical and subtropical regions. However, guava conventional breeding could be a difficult and slow process and itshampered by limited molecular resources.This dissertation focuses on establishing the genome sequence for the Psidium guajava, that comprises all inheritable traits of an organism and provides essential information required to determine phenotypic traits and accelerate breeding programns. In this dissertation, I describe the process of creating a de novo assembly of Psidium guajava that was generated using a hybrid assembly strategy with Illumina short reads and long reads produced by Pacbio platforms.The genome assembly comprised 2,881 scaffolds with a total length of 412.5 Mb and a scaffold N50 of 312 Kb. Genome completeness indicated that 95,15% of the expected eudicotyledons genes were present in the genome assembly and cytogenetic analysis confirmed the genome size of 465 Mband GC content estimation around 40%. To detect the genomic features, an automatic annotation was performed, which revealed a large proportion of repetitive elements (57%), which suggests that they have played important roles in the evolution of P. guajava.The richness and distribution of microsatellites was also evaluated, revealing 210,037 potential SSR s allowing the fast and cost‐effective development of molecular markersfor genetic research.A structural sequence annotation resulted in the prediction of about 61,089 putative genes, that are possible related to protein coding regions. The next stage of this project included the annotation of these proteins

Descrição

Palavras-chave

Whole genome sequencing, repetitive elements, microsatellites, PacBio sequencing, Illumina, bioinformatics

Citação

Avaliação

Revisão

Suplementado Por

Referenciado Por