Improved ontology for eukaryotic single-exon coding sequences in biological databases

dc.contributor.authorJorquera, R.
dc.contributor.authorGonzález, C.
dc.contributor.authorClausen, P.
dc.contributor.authorPetersen, B.
dc.contributor.authorHolmes, D.S.
dc.date.accessioned2019-12-20T18:20:44Z
dc.date.available2019-12-20T18:20:44Z
dc.date.issued2018-01
dc.descriptionIndexación: Scopus.es
dc.description.abstractEfficient extraction of knowledge from biological data requires the development of structured vocabularies to unambiguously define biological terms. This paper proposes descriptions and definitions to disambiguate the term 'single-exon gene'. Eukaryotic Single-Exon Genes (SEGs) have been defined as genes that do not have introns in their protein coding sequences. They have been studied not only to determine their origin and evolution but also because their expression has been linked to several types of human cancer and neurological/developmental disorders and many exhibit tissue-specific transcription. Unfortunately, the term 'SEGs' is rife with ambiguity, leading to biological misinterpretations. In the classic definition, no distinction is made between SEGs that harbor introns in their untranslated regions (UTRs) versus those without. This distinction is important to make because the presence of introns in UTRs affects transcriptional regulation and post-transcriptional processing of the mRNA. In addition, recent whole-transcriptome shotgun sequencing has led to the discovery of many examples of single-exon mRNAs that arise from alternative splicing of multi-exon genes, these single-exon isoforms are being confused with SEGs despite their clearly different origin. The increasing expansion of RNA-seq datasets makes it imperative to distinguish the different SEG types before annotation errors become indelibly propagated in biological databases. This paper develops a structured vocabulary for their disambiguation, allowing a major reassessment of their evolutionary trajectories, regulation, RNA processing and transport, and provides the opportunity to improve the detection of gene associations with disorders including cancers, neurological and developmental diseases. © The Author(s) 2018. Published by Oxford University Press.es
dc.description.urihttps://academic.oup.com/database/article/doi/10.1093/database/bay089/5099430
dc.identifier.citationDatabase, 2018(2018).es
dc.identifier.issn1758-0463
dc.identifier.otherDOI: 10.1093/database/bay089
dc.identifier.urihttp://repositorio.unab.cl/xmlui/handle/ria/11582
dc.language.isoenes
dc.publisherOxford University Presses
dc.subjectBase Sequencees
dc.subjectDatabases, Genetices
dc.subjectEukaryotaes
dc.subjectExonses
dc.subjectGene Ontologyes
dc.subjectOpen Reading Frameses
dc.subjectProtein Isoformses
dc.titleImproved ontology for eukaryotic single-exon coding sequences in biological databaseses
dc.typeArtículoes
Archivos
Bloque original
Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
Jorquera_Improved_ontology.pdf
Tamaño:
2.07 MB
Formato:
Adobe Portable Document Format
Descripción:
TEXTO COMPLETO EN INGLES
Bloque de licencias
Mostrando 1 - 1 de 1
No hay miniatura disponible
Nombre:
license.txt
Tamaño:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descripción: