Assessment and improvement of Indian-origin rhesus macaque and Mauritian-origin cynomolgus macaque genome annotations using deep transcriptome sequencing data. Academic Article uri icon

Overview

abstract

  • BACKGROUND: The genome annotations of rhesus (Macaca mulatta) and cynomolgus (Macaca fascicularis) macaques, two of the most common non-human primate animal models, are limited. METHODS: We analyzed large-scale macaque RNA-based next-generation sequencing (RNAseq) data to identify un-annotated macaque transcripts. RESULTS: For both macaque species, we uncovered thousands of novel isoforms for annotated genes and thousands of un-annotated intergenic transcripts enriched with non-coding RNAs. We also identified thousands of transcript sequences which are partially or completely 'missing' from current macaque genome assemblies. We showed that many newly identified transcripts were differentially expressed during SIV infection of rhesus macaques or during Ebola virus infection of cynomolgus macaques. CONCLUSIONS: For two important macaque species, we uncovered thousands of novel isoforms and un-annotated intergenic transcripts including coding and non-coding RNAs, polyadenylated and non-polyadenylated transcripts. This resource will greatly improve future macaque studies, as demonstrated by their applications in infectious disease studies.

publication date

  • May 8, 2014

Research

keywords

  • Hemorrhagic Fever, Ebola
  • Macaca fascicularis
  • Macaca mulatta
  • Monkey Diseases
  • Simian Acquired Immunodeficiency Syndrome
  • Transcriptome

Identity

PubMed Central ID

  • PMC4176519

Scopus Document Identifier

  • 84904657654

Digital Object Identifier (DOI)

  • 10.1111/jmp.12125

PubMed ID

  • 24810475

Additional Document Info

volume

  • 43

issue

  • 5