gene is an essential transcription factor regulating the differentiation of hematopoietic stem cells into mature blood cells. Though at least 12 different alternatively spliced
mRNAs are generated, three splice variants (
c) have been characterized. Here, using the reverse transcription-polymerase chain reaction with outwardfacing primers, we identified a novel non-polyadenylated transcript from the
gene, with exons 5 and 6 scrambled. The novel transcript resisted RNase R digestion, indicating it is a circular RNA structure that may originate from products of mRNA alternative splicing. The expression of the novel transcript in different cells or cell lines of human and a number of other species matched those of the canonical transcripts. The discovery provides additional evidence that circular RNA could stably exist
in human, and may also help to understand the mechanism of the regulation of the
gene transcription. [BMB Reports 2013; 46(3): 163-168]
The Runt-related transcription factor 1 gene
), located on chromosome 21q22, consists of 12 exons and spans 261 kb of genomic DNA. It plays an important role in establishing hematopoiesis and the generation of hematopoietic stem cells in the embryo
. Transcription of
is initiated at two distinct 5’ regions, a distal region-promoter-1 (P1) and a proximal region-promoter-2 (P2). The two promoters generate at least 12 alternatively spliced mRNAs, which differ within the 5’ and 3’ untranslated regions (UTR) and coding regions. Among these variants, the predominant transcripts
c code for a 453-amino acid protein and a 480-amino acid protein, respectively, while the minor
a codes for a protein, which has a higher affinity for DNA binding and suppresses the transcriptional activation of
b. To date, 55 partner chromosome bands and 21 translocations have been described at the molecular level, which are considered as contributors to leukemia, especially acute myeloid leukemia (AML), using different pathways
In our previous study, we used a pair of outward-facing primers to identify a novel partner gene of
in an acute myeloid leukemia patient. We randomly selected 3 healthy individuals as negative control, and to our surprise, three unexpected bands of the same size could be amplified in all samples. By sequencing, we accidentally discovered the sequence of the exons was arranged in a different order compared to the genomic DNA, which consisted of a 5-6-5 exon alignment. These findings raise several fundamental questions: Where does it originate from? What does its whole structure look like?
In this study, we have presented evidence from RT-PCR using outward-facing primers that further examine its presence. Also, through several experiments, we investigated its structure. Finally, the expression of this transcript in a number of human cell types, and several other species was studied.
- AML1 produces the abnormal transcript with the 5-6-5 exon alignment
To examine the abnormal transcript consisting of the 5-6-5 exon alignment, we performed PCR with total RNA from bone marrow or peripheral blood of 10 AML patients and 10 healthy individuals using a pair of outward-facing primers (oF/oR) from exon 5 (as in GenBank D43969). Fragments were observed in all samples at approximately 305 bp, and absent in the negative and blank controls. It was noteworthy that the 305 bp fragment contained an abnormal exon 5 following the original exons 5 and 6 (
To further examine whether the product with the abnormal exon order was an artifact, we used additional pairs of primers (6F/5R2 and 5F2/5R1). Also, primers F65 and R65 were designed to cover the boundary of the exons 6-5 and these were used in further PCRs with the primer pairs F65/5R3 and 5F1/R65. We confirmed that these amplified products contained the expected sequences (
Analysis of RT-PCR results. (A) Detection of the novel AML1 transcript with the exon 5-6-5 alignment using RT-PCR and sequence analysis. Lanes 1 and 4 show the amplification products of the novel transcript in an AML patient and a healthy individual respectively, Lanes 2 and 5 are negative control, and Lanes 3 and 6 are blank control. The band above 305 bp in Lane 4 is nonspecific amplification product. The positions and directions of outward-facing primers (oF and oR) in the cDNA structure are indicated by arrows. The sizes of the expected fragment and the junction sequence of the 3’ end of exon 6 and 5’ end of exon 5 are also shown. (B) Detection of exon 5-6-5 alignment using different pairs of primers. Positions and directions of different pairs in the cDNA structure are indicated by arrows. The amplification products of primers 6F/5R2, 5F2/5R1, F65/5R3 and 5F1/R65 are shown. By sequencing, the bands of 118 bp and 284 bp in Lanes 1 and 4 and the bands of 116 bp and 315 bp in Lanes 7 and 10 respectively are the products, Lanes 2, 5, 8 and 11 are negative controls, Lanes 3, 6, 9 and 12 are blank controls. (C) Analysis of RT-PCR results and schematic diagram of the positions and directions of primers used to seek the neighboring exons of the exon 5-6-5 alignment. Lanes 1 to 4 are products of RT-PCR using 3F/7aR, 3F/7bR, 4F/7aR and 4F/7bR, respectively. Asterisks show the canonical transcripts without exon 6. Lanes 5 to 8 are products of 3F/R65, 4F/R65, F65/7aR and F65/7bR, respectively. Lane 9 shows the amplification product of F65/6R, Lane 10 is negative control.
- Identification of the neighboring sequences upstream and downstream of the 5-6-5 exon alignment
To explore whether the novel transcript possesses the same neighboring sequences upstream and downstream as the canonical sequences, PCR was performed using a further set of primers. The fragment could be amplified using primers F65/6R (
C), while the expected band was not present using the primers 3F/R65, 4F/R65, F65/7aR and F65/7bR. The canonical transcripts
b were successfully amplified using 3F/7aR, 3F/7bR, 4F/7aR and 4F/7bR (
C). Hence, we concluded that the 5-6-5 alignment was followed with an exon 6, but it did not contain exons 3 or 4 upstream or exons 7a or 7b downstream.
- Poly(A) tail is absent in the exons 5 and 6 scrambled AML1 molecule
We performed Rapid Amplification of cDNA Ends (RACE) to obtain either the 3’ or 5’ end of the novel transcript, but were unable to form any fragments (data not shown). To investigate whether a polyadenylated tail was present in this scrambled exon alignment, total RNA was reverse-transcribed with either random primers or oligo(dT) primers. PCR was then performed using the primers shown in
. Only the random primers, and not the oligo(dT) primers, produced the scrambled transcript (
A). Also, poly(A)+ and poly(A)- RNA was prepared from total RNA, and the quantity of poly(A)+ and poly(A)- RNA were equal when reverse transcribed using random primers. Canonical transcripts were amplified to testify the purity of the three samples using the primers 4F/5R1. As a result of amplification using the different pairs of primers shown in
, the expected bands were present in the total RNA and poly(A)- RNA Lanes while absent in the poly(A)+ Lane (
Nucleotide sequence of primers
aPrimers oF and oR are designed outward facing.
- The novel AML1 transcript is probably circular
As the transcript lacked the poly(A) tail, we used the One Step PrimeScript
miRNA cDNA Synthesis Kit (Takara, Catalog No. D350A) to add a segment of polyadenylate to it when it was reverse transcribed, allowing its 3’ end to be amplified using our special forward primer and the Uni-miR qPCR Primer which is reverse to the adaptor sequence supplied in the kit. We were unable to add a tail to the novel transcript, while the reference gene U6 formed the expected mono peak after real-time PCR and the expected band on an agarose gel after conventional PCR (data not shown).
To further testify the conformation of the non-canonical transcript, we determined its sensitivity to the RNA exonuclease R (RNAse R), which specifically digests linear RNA, both structured and non-structured, but does not digest circle or lariat RNA
. We performed real-time PCR to detect the change of its expression after RNase R digestion using cDNAs of five healthy individuals. The results showed a significant increase in the circular transcript. The result of calculation using the equation 2
showed there was on average a 282-fold increase (
), confirming its circular nature.
- The expression of the novel transcript in different cells or cell lines of human and several other species matched those of the canonical transcripts
To determine the ubiquity of the novel transcript, we employed RT-PCR using cDNAs from different cells or cell lines of human and several other species (
C). It turned out that the novel circular transcript of
existed not in lower organisms or plant, but in higher animals we selected.
Observation of poly(A) tail in the novel transcript and expression of the novel transcript in human normal cells, cancer cell lines and several other species. (A) RT-PCR results. 1 μg total RNA was subjected to reverse transcription using either random primers or oligo(dT) primer and amplified by PCR using three pairs of primers oF/oR, F65/6R, and 5F1/R65. The canonical transcript was amplified using 4F/5R1. (B) RT-PCR results of total RNA, poly(A)+ RNA and poly(A)- RNA. Canonical transcripts were amplified using primers 4F/5R1 (shown on the left). Results of PCR amplifying the novel transcript are shown on the right. (C) Lines 1 to 2 respectively represent the amplification product of Oral mucosa and nucleus pulposus cells of an intervertebral disc using primers 5F1/R65; Lines 3 to 7 respectively represent U266, K562, SMMC-7721, AGS and rat NRK52E cells; Line 8 represents tetrahymena; Lines 9 to 10 respectively represent Euplotes elegans and Tartary buckwheat. The positive control represents cDNA from bone marrow of healthy individuals, followed by the negative control.
In this manuscript, we have successfully identified a novel
transcript containing the donor site of exon 6 linked to the acceptor site of exon 5. Our investigations have gained an insight into this novel transcript, characterizing its sequence and conjecturing on its structure. The observation of the novel transcript prompted experimental analysis of several possible interpretations of this phenomenon. Originally, this transcript was thought to arise from either the expression of a duplicate exon 5 to exon 6 genomic segment or an intermolecular splicing (trans-splicing) of two
pre-mRNAs. When reverse transcriptase was omitted from the protocol, the product was not detected, as shown in Lane 2 in
A, indicating that the amplified products did not result from contamination of the genomic DNA. We also excluded the possibility that it was produced by the self-amplification of a single primer (data not shown). Additionally, the novel transcript was non-polyadenylated, and resisted RNase R digestion, a 3’-5’ exoribonuclease that digests linear RNA. These results indicated that the novel
transcript with scrambled exons was a stable, naturally occurring, circular RNA species, as previously reported for other human genes.
The enrichment of the novel transcript after RNase R digestion
aR+ and R- represent the groups with or without RNase R digestion respectively.
identified three authentic transcripts of
using Northern blotting, but did not identify any circular forms. Our group also performed Northern blotting and we were also unable to detect the circular transcript (data not shown). The following two explanations may account for the negative result of the Northern blotting: (i) the content of the circular transcript is indeed insufficient to be detected by Northern blotting; (ii) circular RNA possesses unique secondary structures to give it stability in the cytoplasm, leading to an obstacle in detection via probe labeling. At present, the only techniques to identify that a molecule is circular RNA, are the results of RT-PCR using outward-facing primers and RNase R digestion. If a molecule is linear, it could not exhibit enrichment after RNase R digestion and the value of 2
is approximately 1. However, by detecting the change of the novel
transcript after RNase R treatment in five healthy individuals using SYBR Green based real-time PCR, we observed that there was approximately a 282-fold increase on average (
Circular RNA, first reported in Viroids
, has a particular structure where the 3’ donor splice site is joined precisely to the incongruous 5’ acceptor splice site. This type of structure did not attract great attention from researchers until circular transcripts were identified from the tumor suppressor gene,
, in rodent and human cell lines
. Eukaryotic circular RNA was then reported in the human
and subsequently in the mouse testis-determining gene,
, the rat cytochrome P-450 2C24 gene
, the human cytochrome P-450 2C18 gene
, the Na/Ca exchanger gene 1
, the dystrophin gene
-associated noncoding RNA
. Noteworthily, recently Salzman
., by means of RNA-seq, has found circular transcripts of hundreds of human genes in diverse cell types
. Here, we report a novel circularized transcript from the
gene, and provide additional evidence that circular RNA exists in humans.
Hitherto, it was reported that circular RNA may consist of several contiguous exons
, exons not in succession or only a single exon
. The intriguing mechanism of circularization of scrambled exons was inferred to be a spliceosome-mediated alternative splicing event. It seems reasonable to suppose that they may simply represent errors during the normal splicing process, namely mis-splicing
, or lariat splicing and inverse splicing
, or trans-splicing
-generated trans-spliced molecules are always polyadenylated as are canonical mRNAs
. Exon skipping was considered to be associated with the formation of circular transcripts
, by using ‘molecular patches’ to mask the skipped exon. We propose that there exists a particular structure which favors RNA circularization, a nuclease that recognizes this structure and a ligase that bridges the splice sites. It might be similar to the process of the generation of microRNA, which possess a hairpin structure, in which inverted repeat transcripts are precisely cleaved by RNase III enzyme(s) in the Dicer and/or Drosha protein families
What does the circular species exist for? Is it trash or treasure? Since circular transcripts are reported from different genes, and are stable in the cytoplasm
, we support the hypothesis that they have a biological function beyond the splicing process. Chen and Sarnow
have demonstrated that if circular transcripts contain an internal ribosome entry site (IRES) element, they can be translated into proteins. Moreover, the circular
exon 2 transcript was deduced to encode a truncated 602-amino acid protein
, though the efficiency of translation might be affected by its circular structure as well as by the lack of a cap or poly(A) tail
. However, the majority of circular transcripts, like the circular
transcript, does not possess internal initiation sites for translation and probably do not code for a protein.
The non-coding characteristic of the circular RNA species are reminiscent of the group of non-coding RNAs, which are also products of the splicing process and challenge the central dogma. Recent studies indicate that members of this non-coding RNA family are far more abundant and important than considered initially
. What is the relationship between circular RNAs and non-coding RNAs? Could we suppose that these circular RNA transcripts belong to the non-coding RNA species? Since the discovery of microRNA and the reporting of their meaningful function, more attention has been paid to these non-canonical mRNAs.
Whether the circular transcripts encode protein or not, study on their biological function has been going through a bottleneck period. To date, due to the limited approach and technology, there is no way to figure out what physiological or pathological functions circular RNA may have. Therefore, it still requires us to seek new approach or develop new technology to further disclose the mysterious mask of these circular transcripts in the future.
In summary, we identified a novel transcript from the
gene, which is likely to be a circular RNA molecule. We concluded that the novel circular transcript of AML1 is just consisted of exon5 and exon6. In other words, in the novel transcript exon5 and exon6 were joint head to tail. Therefore, we inferred its full length is length of exon5 plus that of exon6, that is 297 bp. The novel transcript was identified in different cells or cell lines of human and rat cells we selected, but not in lower organisms. The circular RNA transcripts may suggest a novel RNA species, which may help to integrate the genomic context. But this still leads to the debate about their unknown biological role. They should attract our attention, just like the non-coding RNA. We believe there is likely to be circular RNA transcripts of other genes, though there is yet no direct method to detect them. Moreover, the specific mechanism of their formation also needs to be elucidated.
MATERIALS AND METHODS
- Sample preparation
Bone marrow samples were from healthy individuals and AML patients, who were diagnosed using the MICM classification standard. Oral mucosa and nucleus pulposus cells of an intervertebral disc were also obtained from healthy individuals. All samples were collected with informed consent from the subjects. The patients’ IRB number is NO. 2012057 from Shanxi Medical University Medical Ethics Committee.
U266, K562, SMMC-7721, AGS and NRK52E cells were deposited in our laboratory, and cDNAs of tetrahymena, Euplotes elegans and Tartary buckwheat were given by Dr. Shen Quan.
- Preparation of total RNA, poly(A)+ and poly(A)-RNA
Total RNA was extracted using RNAiso Plus (Takara, Catalog#-D9108A) according to the instruction of manufacturer. The mRNA was isolated from total RNA using PolyATtract
mRNA Isolation Systems III (Promega, Cat#Z5300), and the poly(A)-RNA was recycled from the supernatant by ethanol precipitation. Purity and concentration were detected by NanoDrop 2000 (Thermo, USA) before being stored at −80℃.
- Reverse transcription-polymerase chain reaction (RT-PCR)
RNA (1 μg) was
reverse-transcribed into cDNA with reverse transcriptase M-MLV (RNase H-) (Takara, Tokyo, Japan, Cat#D2640A) using random primers (Promega, Madison, MI, USA) and an oligo(dT) primer (Takara, Cat#D511), respectively. As a negative control, cDNA prepared without reverse transcriptase was employed.
The PCR was performed with a set of outward-facing primers designed to bind in exon 5, oF and oR. Amplification conditions were as follows: denaturation at 94℃ for 3 min; 35 cycles of denaturation at 94℃ for 30 s, annealing at 53℃ for 30 s, and extension at 72℃ for 30 s; plus a final extension step at 72℃ for 10 min. PCR was also employed with other sets of primers, using 60℃ as the annealing temperature. The primers were designed by Primer 5.0 (
Each 25 μl PCR reaction contained 1 μl cDNA template, 2.5 μl of 10× Taq buffer, 1.5 μl of 25 mM MgCl
, 0.5 μl of 10 mM dNTPs, 1.5 U of Taq DNA polymerase (Fermentas, Cat#EP0282) and 1 μmol/L of forward and reverse primers. Aliquots (10 μl) of the amplified PCR products were loaded onto 2% agarose gels for electrophoresis and visualization with ethidium bromide. The band was cut from the gel, purified using the Gel Extraction Kit (Omega, Cat#D2500-01), and sequenced by the Beijing Aucg Biotechnology Company.
- RNase R digestion
RNA (20 μg) was incubated with or without 40 U of RNase R (Epicentre Biotechnologies, Cat# RNR07250) for 2 h at 37℃. The resulting RNA was purified by ethanol precipitation and quantified. Equal amounts of RNA with or without RNase R treatment were subjected to reverse transcription using random primers. Expression of the novel transcript was detected using SYBR Green-based real-time PCR, which was performed using the 7300 Real-Time PCR System (Applied Biosystems, USA) with SYBR Green I detection and Tm analysis. SYBR
Premix Ex Taq
II (perfect real time) kit (Cat#DRR081A) was purchased from Takara. The optimized reaction was performed in a 25 μl final reaction volume containing 12.5 μl of kit-supplied SYBR
Premix Ex Taq
II, 1 μl ROX Reference Dye (50×), 0.5 μl of both the forward and reverse primers (each 10 μM), 2 μl cDNA template, and 8.5 μl distilled water giving a final volume of 25 μl. Amplification conditions were as follows: 95℃ for 30 s, followed by 40 cycles of 95℃ for 5 s, 60℃ for 31 s with a dissociation step. The equation 2
was used to measure the change of the novel transcript expression after RNase R digestion.
We would like to thank Jiayan Ran, Zhenhua Qiao, and Linhua Yang, the Second Hospital of Shanxi Medical University, who recruited the patients in the study. This work was supported by grant from the National Natural Science Fund (30772064) of China.
De Braekeleer E.
De Braekeleer M.
RUNX1 translocations and fusion genes in malignant hemopathies.
DOI : 10.2217/fon.10.158
Burd C. E.
Jeck W. R.
Sanoff H. K.
Sharpless N. E.
Expression of linear and novel circular forms of an INK4/ARF-Associated Non-Coding RNA correlates with atherosclerosis risk.
DOI : 10.1371/journal.pgen.1001233
Zhang M. Q.
Characterization of RNase R-digested cellular RNA source that consists of lariat and circular RNAs from pre-mRNA splicing.
Nucleic Acids Res.
DOI : 10.1093/nar/gkl151
Alternative splicing and genomic structure of the AML1 gene involved in acute myeloid leukemia.
Nucleic Acids Res.
DOI : 10.1093/nar/23.14.2762
Majerus M. A.
Kerckaert J. P.
Splicing with inverted order of exons occurs proximal to large introns.
Mis-splicing yields circular RNA molecules.
Circular transcripts of the testis-determining gene Sry in adult mouse testis.
DOI : 10.1016/0092-8674(93)90279-Y
Zaphiropoulos P. G.
Circular RNAs from transcripts of the rat cytochrome P450 2C24 gene: correlation with exon skipping.
Proc. Natl. Acad. Sci.
DOI : 10.1073/pnas.93.13.6536
Zaphiropoulos P. G.
Exon skipping and circular RNA formation in transcripts of the human cytochrome P-450 2C18 gene in epidermis and of the rat androgen binding protein gene in testis.
Mol. Cell. Biol.
Circular dystrophin RNAs consisting of exons that were skipped by alternative splicing.
Hum. Mol. Genet.
DOI : 10.1093/hmg/8.3.493
Multiple exon skipping and RNA circularisation contribute to the severe phenotypic expression of exon 5 dystrophin deletion.
J. Med. Genet.
DOI : 10.1136/jmg.40.8.e100
Li X. F.
A circularized sodium-calcium exchanger exon 2 transcript.
J. Biol. Chem.
DOI : 10.1074/jbc.274.12.8153
Sänger H. L.
Gross H. J.
Kleinschmidt A. K.
Viroids are single-stranded covalently closed circular RNA molecules existing as highly base-paired rod-like structures.
Proc. Natl. Acad. Sci.
DOI : 10.1073/pnas.73.11.3852
Wang P. L.
Brown P. O.
Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types.
DOI : 10.1371/journal.pone.0030733
Heterogeneous Sp1 mRNAs in human HepG2 cells include a product of homotypic trans-splicing.
J. Biol. Chem.
DOI : 10.1074/jbc.M002010200
Natural trans-spliced mRNAs are generated from the human estrogen receptor-α (hERα) gene.
J. Biol. Chem.
Axtell M. J.
Westholm J. O.
Lai E. C.
Vive la différence: biogenesis and evolution of microRNAs in plants and animals.
DOI : 10.1186/gb-2011-12-4-221
Chen C. Y.
Initiation of protein synthesis by the eukaryotic translational apparatus on circular RNAs.
DOI : 10.1126/science.7536344
Sachs A. B.
Davis R. W.
The poly(A) binding protein is required for poly(A) shortening and 60s ribosomal subunit-dependent translation initiation.
DOI : 10.1016/0092-8674(89)90938-0