Nuclear rDNA characteristics for DNA taxonomy of the centric diatom Chaetoceros (Bacillariophyceae)
ALGAE. 2010. Jun, 25(2): 65-70
Copyright ©2010, The Korean Society of Phycology
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • Received : March 28, 2010
  • Published : June 15, 2010
About the Authors
Hye-Young, Oh
Department of Green Life Science, Sangmyung University
Ju-Yong, Cheon
Department of Green Life Science, Sangmyung University
Jin Hwan, Lee
Department of Green Life Science, Sangmyung University
Sung Bum, Hur
Korea Marine Microalgae Culture Center, Department of Aquaculture, Pukyong National University
Jang-Seu, Ki
Department of Green Life Science, Sangmyung University

The genus Chaetoceros provides highly diversified diatoms in marine systems. Morphological descriptions of the genus are well-documented, yet the DNA taxonomy of Chaetoceros has not been satisfactorily established. Here, the molecular divergences of the 18S-28S rDNA of Chaetoceros were assessed. DNA similarities were relatively low in both 18S (93.1 ± 3.9%) and 28S rDNA (81.0 ± 4.6%). Phylogenies of the 18S, 28S rDNAs showed that Chaetoceros was divided according to individual species, clustering the same species into single clades. Statistical analysis with corrected genetic (p-) distance scores showed that nucleotide divergence of Chaetoceros 28S rDNA significantly differed from that of 18S rDNA (Student’s t-test, p < 0.05). This finding suggests that the 28S rDNA may be treated as a more suitable marker for species-level taxonomic distinctions of Chaetoceros.
Chaetoceros Ehrenberg, 1844, is the largest and most species-rich genus of marine planktonic diatoms (Rines and Hargraves 1988). To date, approximately 400 species of Chaetoceros have been morphologically described (Hasle and Syvertsen 1997). Some species are responsible for marine algal blooms. High concentrations of Chaetoceros cells may clog the gills of farmed fish, and the spiny Chaetoceros setae can penetrate the gill tissue (Rensel 1993). These environmentally and economically important effects have spurred many studies on Chaetoceros, which have improved the understanding of their biology, systematics, and ecology (Rensel 1993, Rines and Theriot 2003). For taxonomic and environmental monitoring purposes, Chaetoceros species are generally discriminated by microscopic observation of morphological characters such as the form of the chains, the shape of the apertures, and the shape of the valves. In particular, the fine structures of diatoms, including Chaetoceros, are observed with scanning electron microscopy. However, it is often very difficult to distinguish between Chaetoceros species (von Quillfeldt 2001) because of their small size and morphological similarity, and because they can exhibit morphological changes under different culture conditions. In addition, morphological identification demands specialized, in-depth knowledge.
DNA-based molecular tools can be very effective for discriminating species of microscopic organisms such as diatoms (e.g., Jung et al. 2010). Recently, the concept of DNA barcoding was introduced to diatom taxonomy (Evans et al. 2007, Kaczmarska et al. 2007). The promise of DNA barcoding rests on the divergence of a small DNA fragment coinciding with biological species separation (Moniz and Kaczmarska 2009). Several pioneering studies on diatom barcoding were performed with molecular markers such as nuclear ribosomal DNA (rDNA), chloroplast rbcL, and the mitochondrial cox1 gene (Evans et al. 2007, Moniz and Kaczmarska 2009, 2010). In addition, many DNA-based studies have addressed the evolutionary history and phylogenetic relationships of diatoms (Damste et al. 2004, Alverson et al. 2007, Choi et al. 2008).
Of the molecular markers used in taxonomic studies, nuclear rDNA in eukaryotes is typically composed of tandem arrays of a basic unit that contains the transcription unit (e.g., 18S, 5.8S, 28S) and an intervening intergenic spacer region. The different subunits and regions of the rDNA locus have different degrees of sequence variability and varying suitability for comparison at the inter-generic or inter-species level. Recent data indicated that nuclear rDNA is a suitable molecular marker for DNA-based taxonomy or DNA barcoding of diatoms (Alverson et al. 2007, Evans et al. 2007, Kaczmarska et al. 2007, Moniz and Kaczmarska 2009, 2010, Jung et al. 2010). However, DNA-based discrimination should be applied carefully to the strongly diversified diatoms, because the molecular divergence of the rDNA varies according to the rDNA molecule and the taxonomic category (e.g., Jung et al. 2010). For example, the centric diatoms Cyclotella and Discostella show high divergences of both 18S and 28S rDNA (Jung et al. 2010), while their close relative Stephanodiscus shows highly conserved 18S rDNA sequences within the genus, indicating that the 18S rDNA is not suitable for its DNA taxonomy (Ki 2009). Taking this into account, it is necessary to evaluate the genetic divergences of individual rDNA loci according to taxonomic categories, particularly at the generic level. In the case of Chaetoceros, although morphology-based phylogenetic relationships have been studied (Rines and Theriot 2003), few molecular phylogenetic studies have been attempted to date. Most have been carried out as part of broader diatom phylogenetic analyses (Damste et al. 2004, Choi et al. 2008). Also, little is known of the genetic divergences of Chaetoceros rDNA for DNA taxonomy.
In the present study, we examined molecular characteristics, including genetic divergences and DNA similarity, of the 18S-28S rDNA sequences from several selected Chaetoceros species. In addition, phylogenetic and statistical analyses were performed to evaluate the usefulness of the 18S and 28S rDNA for the DNA taxonomy of Chaetoceros.
- Taxon sampling
In this study, a total of 32 rDNA sequences from Chaetoceros were used for the extensive analyses. The 18S rDNA sequences were determined from eight Chaetoceros species: C. calcitrans (GenBank accession numbers AY485449, AY625894, EU240879, EU240880), C. curvisetus (AY229895), C. debilis (AY229896), C. gracilis (AY625895), C. muellerii (AY485453, AY625896), C. neogracile (EU090012), C. rostratus (X85391), and C. socialis (AY485446). The partial 28S rDNA were from twelve Chaetoceros species: C. atlanticus (EF423454), C. brevis (EF423469), C. compressus (EF423429), C. costatus (EF423471-4), C. curvisetus (EF423476-7), C. danicus (EF423447), C. debilis (EF423466), C. diadema (EF423433), C. lorenzianus (EF423435-6), C. peruvianus (EF423449), C. pseudo-curvisetus (EF423478-9), and C. socialis (EF423467-8).
- DNA sequence characteristics
Intra-specific genetic variations of Chaetoceros were investigated by comparing DNA similarities and genetic distances of both 18S and partial 28S rDNA sequences. For the extensive analyses, we constructed two data matrices of the selected 18S and 28S rDNA sequences. These contained eight sequences for 18S and twelve sequences for 28S. Multiple alignments were performed with each dataset using Clustal W 1.8 (Thompson et al. 1994). The aligned sequences were trimmed at each end to the same length, and obvious base errors that were found only in single strands were manually removed. Finally, we used identical positions (e.g., 1,706 out of 1,815 alignment positions for 18S; 757 out of 800 for 28S) of the aligned sequences. DNA similarities of the 18S-28S rDNA were measured separately in BioEdit version 5.0.6 (North Carolina State University). The corrected pairwise (p-) genetic distances were calculated with the Kimura 2-parameter model in MEGA 4.0 (Tamura et al. 2007). Sequence characteristics, including parsimony informative (PI) sites, were analyzed using MEGA version 4.0. Statistical analyses of the nucleotide comparisons were performed using SPSS version 10.0.7 (SPSS Inc., Chicago, IL, USA).
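The pairwise measures described above can be illustrated with a minimal Python sketch. It is not the BioEdit or MEGA implementation; the function name `pairwise_stats` and the toy sequences are hypothetical, and the sketch assumes that sites containing a gap or an ambiguous base in either sequence are simply skipped (pairwise deletion), which may differ from the settings used in the study. The Kimura 2-parameter distance is computed from the proportions of transitions (P) and transversions (Q) as d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q).

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def pairwise_stats(seq_a: str, seq_b: str) -> dict:
    """Similarity, uncorrected p-distance and Kimura 2-parameter distance
    for two aligned sequences (hypothetical helper, pairwise deletion)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    sites = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        # Assumption: skip gaps and ambiguity codes in either sequence.
        if a not in "ACGT" or b not in "ACGT":
            continue
        sites += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites       # proportion of transitions (P)
    q = transversions / sites     # proportion of transversions (Q)
    k2p = -0.5 * math.log(1 - 2 * p - q) - 0.25 * math.log(1 - 2 * q)
    return {
        "sites": sites,
        "similarity_pct": 100 * (1 - p - q),
        "p_distance": p + q,
        "k2p_distance": k2p,
    }


# Toy example (not real Chaetoceros data): one transition in ten sites.
stats = pairwise_stats("ACGTACGTAC", "ACGTACGTGC")
print(stats)
```

With real data, the same function would be applied to every pair of rows in the trimmed alignment to fill the two triangles of a similarity/distance matrix.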
- Phylogenetic analysis of Chaetoceros
For the phylogenetic analysis of Chaetoceros, DNA sequences were aligned in the same way as for the sequence comparisons, and only unambiguously aligned positions were used: 1,706 out of 1,806 alignment positions for 18S, and 652 out of 806 for 28S. The General Time Reversible plus Gamma distributed model (GTR+G) was selected as the best-fit model for both the 18S (-lnL = 5046.1) and 28S (-lnL = 3933.2) datasets according to the Akaike Information Criterion in MrModeltest2 (Nylander 2004). Bayesian analysis of the 18S rDNA was implemented in MrBayes version 3.1.2 (Huelsenbeck and Ronquist 2001) using the selected GTR+G model with among-site rate variation, in which the rates for variable sites were drawn from a gamma distribution. The Markov chain Monte Carlo process was set at two chains and run for one million generations, with trees sampled every 100 generations. After the analysis, the first 2,000 trees were discarded as burn-in and the consensus tree was constructed. For the 28S rDNA tree, Bayesian analysis was performed in the same way as for the 18S sequences.
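As a quick check of the sampling scheme described above, the following Python sketch works out how many trees the stated settings would yield and what share is discarded as burn-in. The function name `mcmc_sample_summary` is hypothetical, and the sketch assumes the tree at generation 0 is not counted.

```python
def mcmc_sample_summary(generations: int, sample_every: int, burn_in_trees: int) -> dict:
    """Count sampled and retained trees for a single MCMC run.

    Assumption: the starting state (generation 0) is not counted as a sample.
    """
    sampled = generations // sample_every
    if burn_in_trees > sampled:
        raise ValueError("burn-in exceeds the number of sampled trees")
    return {
        "sampled": sampled,
        "kept": sampled - burn_in_trees,
        "burn_in_fraction": burn_in_trees / sampled,
    }


# Settings stated in the text: 1,000,000 generations, sampling every 100,
# first 2,000 trees discarded.
summary = mcmc_sample_summary(1_000_000, 100, 2_000)
print(summary)  # {'sampled': 10000, 'kept': 8000, 'burn_in_fraction': 0.2}
```

Under these assumptions the consensus tree would be built from 8,000 of 10,000 sampled trees, that is, a 20% burn-in.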
In this study, we characterized nuclear 18S and 28S rDNA sequences of Chaetoceros using available DNA sequences (12 sequences of 18S and 20 sequences of 28S) obtained from public databases. These included nearly complete 18S rDNA sequences and partial 28S rDNA sequences. The 28S rDNA, the largest rDNA coding region, contains relatively conserved core segments and 12 hypervariable, divergent (D) domains (Hassouna et al. 1984). The present 28S data covered the 28S rDNA D1 to D3 domains and their adjacent partial core regions.
Fig. 1. Phylogenetic relationships of Chaetoceros inferred from (A) nearly complete 18S rDNA and (B) partial 28S rDNA sequences with Bayesian algorithms. Bayesian likelihood scores were recorded at -lnL = 5067.4 in the 18S tree and at -lnL = 3970.5 in the 28S tree. The numbers at each node represent posterior probability (> 0.50). *Chaetoceros calcitrans f. pumilus.
Intra-specific genetic variations in the rDNA of Chaetoceros were investigated with the DNA similarity scores. In most cases, high DNA similarities (more than 99%) were measured in 18S and partial 28S comparisons within the same species. For example, C. calcitrans, including C. calcitrans f. pumilus, had nearly identical 18S rDNA genotypes (99.9 ± 0.1% similarity) among four different isolates, and C. muellerii showed 99.8% similarity between CCMP 1316 (GenBank accession number AY485453) and CCAP 1010/3 (AY625896). We also detected high DNA similarity in intra-specific comparisons of the 28S rDNA. Although we detected few intra-specific genetic variations in the rDNA, the present data are quite limited and should not be generalized. Further studies are needed to determine the rDNA nucleotide sequences of a larger number of samples collected from different geographical regions worldwide.
Bayesian trees of the 18S and 28S rDNAs showed that the Chaetoceros spp. studied here were divided according to their taxonomic positions (Fig. 1). Sequences of the same species formed single clusters (e.g., C. calcitrans, C. costatus, C. muellerii, C. lorenzianus, and C. socialis), which were separate from other species. This was in accordance with the rDNA comparisons, in which Chaetoceros had different 18S-28S rDNA genotypes between species but nearly identical rDNA genotypes within species (e.g., C. calcitrans, C. muellerii, C. costatus). The 18S tree (Fig. 1A) showed that C. rostratus was the earliest diverging species (1.00 posterior probability [PP]). C. gracilis and C. muellerii, as sister species, formed a clade with C. curvisetus and C. debilis (1.00 PP). The two latter species were not separated by our 18S phylogenetic analysis. On the other hand, the 28S Bayesian tree (Fig. 1B) separated individual species more clearly, with longer branches than in the present 18S rDNA phylogeny. The 28S Bayesian tree showed that Chaetoceros formed a polytomy (1.00 PP), in which species were separated into three clades: one contained C. curvisetus and C. pseudo-curvisetus, another included C. costatus, C. debilis, and C. socialis, and the third included C. atlanticus, C. diadema, C. brevis, and C. lorenzianus.
Table 1. Similarity scores (above diagonal) and genetic distances (below diagonal) between nine pairs of the aligned sequence data (1,734 sites) of the nearly complete 18S rDNA of Chaetoceros
Table 2. Similarity scores (above diagonal) and genetic distances (below diagonal) between 12 pairs of the aligned sequence data (758 sites) of partial 28S rDNA of Chaetoceros
Molecular comparisons and phylogenies showed that sequence variations in the 18S and 28S rDNA within species were not significantly different (Student’s t-test, p > 0.05). Thus, we selected different Chaetoceros species (eight for 18S rDNA; twelve for 28S rDNA) and compared them extensively with one another. Table 1 summarizes the DNA similarity and corrected p-distance scores between pairs of the eight aligned 18S rDNA sequences. Sequence pairs of C. curvisetus, C. debilis, C. gracilis, and C. muellerii showed high DNA similarities (> 99%, or < 0.6% p-distance), indicating that they could not be separated by 18S rDNA divergence; however, the other pairs showed relatively low similarities (< 95%, or > 4.8% p-distance). On the other hand, the Chaetoceros 28S rDNA showed high genetic divergences in the present analysis. Table 2 displays the DNA similarity and p-distance scores among the 12 compared species. In most cases, DNA divergences were considerably high in the 28S rDNA (81.0 ± 4.6% similarity). The highest similarity (94.4%) was recorded between C. danicus and C. peruvianus, and the lowest (77.2%) was recorded between C. debilis and C. pseudo-curvisetus.
In addition, comparative analysis showed that the corrected p-distances of the 18S and 28S rDNAs were 7.3% and 16.3%, respectively (Fig. 2), based on pairwise genetic distance scores (Tables 1 & 2). Statistical testing revealed that divergences of the 28S rDNA differed significantly from those of the 18S rDNA (Student’s t-test, p < 0.05). In further analysis, we found that the 28S rDNA contained more PI sites (28.6%) than the 18S rDNA (11.0%). The 28S variation was approximately 2.60 times that of the 18S as judged from the % PI values, and 2.23 times as judged by p-distance in the present data sets. These statistical and parsimony-based results showed that the 28S rDNA D1-D3 (> 3.5% p-distance, > 5.4% dissimilarity) had a much greater genetic divergence than the 18S rDNA (> 0.4% p-distance, > 0.5% dissimilarity). These results were generally in accordance with those for other centric diatoms, Cyclotella, Discostella, and Stephanodiscus (Ki 2009, Jung et al. 2010). They suggest that the 28S rDNA may be treated as a more suitable marker for species-level taxonomic distinctions of Chaetoceros.
Fig. 2. Nucleotide divergences of Chaetoceros 18S and 28S rDNAs based on corrected p-distances. Values of the p-distances were measured at 7.3 ± 4.01 (n = 36) for 18S and at 16.2 ± 4.34 (n = 66) for 28S.
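The comparison of 18S and 28S divergences can be roughly reproduced from the summary values reported with Fig. 2. The Python sketch below is not the SPSS analysis used in the study: it assumes that the ± values are standard deviations, applies a pooled-variance (Student's) two-sample t statistic, and treats the pairwise distances as independent observations, which is a simplification because distances sharing a sequence are correlated. The function name `pooled_t` is hypothetical.

```python
import math


def pooled_t(mean1: float, sd1: float, n1: int,
             mean2: float, sd2: float, n2: int) -> tuple:
    """Student's two-sample t statistic from summary statistics.

    Assumption: sd1 and sd2 are sample standard deviations and the two
    groups share a common variance (pooled estimate).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    std_err = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean2 - mean1) / std_err, df


# n = 36 and n = 66 are the numbers of unordered pairs among 9 and 12 sequences.
assert math.comb(9, 2) == 36 and math.comb(12, 2) == 66

# Summary values from the Fig. 2 caption (18S first, then 28S).
t_stat, df = pooled_t(7.3, 4.01, 36, 16.2, 4.34, 66)
print(f"t = {t_stat:.2f}, df = {df}")

# Ratios quoted in the text: % PI sites and mean p-distance (28S / 18S).
pi_ratio = round(28.6 / 11.0, 2)
dist_ratio = round(16.3 / 7.3, 2)
print(pi_ratio, dist_ratio)
```

Under these assumptions the t statistic is about 10 with 100 degrees of freedom, far above the two-tailed 5% critical value of roughly 1.98, which is consistent with the reported p < 0.05.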
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST) (No. 2009-0084603).
Alverson A. J , Jansen R. K , Theriot E. C 2007 Bridging the Rubicon: phylogenetic analysis reveals repeated colonizations of marine and fresh waters by thalassiosiroid diatoms Mol Phylogenet Evol 45 193 - 210    DOI : 10.1016/j.ympev.2007.03.024
Choi H. G , Joo H. M , Jung W , Hong S. S , Kang J. S , Kang S. H 2008 Morphology and phylogenetic relationships of some psychrophilic polar diatoms (Bacillariophyta) Nova Hedwigia Beih 133 7 - 30
Damste J. S , Muyzer G , Abbas B , Rampen S. W , Masse G , Allard W. G , Belt S. T , Robert J. M , Rowland S. J , Moldowan J. M , Barbanti S. M , Fago F. J , Denisevich P , Dahl J , Trindade L. A , Schouten S 2004 The rise of the rhizosolenid diatoms Science 304 584 - 587    DOI : 10.1126/science.1096806
Evans K. M , Wortley A. H , Mann D. G 2007 An assessment of potential diatom "barcode" genes (cox1 rbcL 18S and ITS rDNA) and their effectiveness in determining relationships in Sellaphora (Bacillariophyta) Protist 158 349 - 364    DOI : 10.1016/j.protis.2007.04.001
Hasle G. R , Syvertsen E. E 1997 Marine diatoms. In Tomas C (Ed.) Identifying Marine Phytoplankton Academic Press San Diego 5 - 385
Hassouna N , Michot B , Bachellerie J. P 1984 The complete nucleotide sequence of mouse 28S rRNA gene Implications for the process of size increase of the large subunit rRNA in higher eukaryotes Nucleic Acids Res 12 3563 - 3583    DOI : 10.1093/nar/12.8.3563
Huelsenbeck J. P , Ronquist F 2001 MRBAYES: Bayesian inference of phylogenetic trees Bioinformatics 17 754 - 755    DOI : 10.1093/bioinformatics/17.8.754
Jung S. W , Han M. S , Ki J. S 2010 Molecular genetic divergence of the centric diatom Cyclotella and Discostella (Bacillariophyceae) revealed by nuclear ribosomal DNA comparisons J Appl Phycol 22 319 - 329    DOI : 10.1007/s10811-009-9462-5
Kaczmarska I , Reid C , Moniz M 2007 Diatom taxonomy: morphology molecules and barcodes. In Kusber W. H , Jahn R (Eds.) Proc 1st Central-European Diatom Meeting Botanic Garden and Botanical Museum Berlin-Dahlem Freie Universität Berlin Berlin 69 - 72
Ki J. S 2009 Comparative molecular analysis of freshwater centric diatoms with particular emphasis on the nuclear ribosomal DNA of Stephanodiscus (Bacillariophyceae) Algae 24 129 - 138    DOI : 10.4490/ALGAE.2009.24.3.129
Moniz M. B. J , Kaczmarska I 2009 Barcoding diatoms: is there a good marker? Mol Ecol Resour 9 65 - 74    DOI : 10.1111/j.1755-0998.2009.02633.x
Moniz M. B. J , Kaczmarska I 2010 Barcoding of diatoms: nuclear encoded ITS revisited Protist 161 7 - 34    DOI : 10.1016/j.protis.2009.07.001
Nylander J. A. A 2004 MrModeltest v2 Evolutionary Biology Centre Uppsala University Uppsala
Rensel J. E 1993 Severe blood hypoxia of Atlantic salmon (Salmo salar) exposed to the marine diatom Chaetoceros concavicornis. In Smayda T. J , Shimizu Y (Eds.) Toxic Phytoplankton Blooms in the Sea Elsevier New York 625 - 630
Rines J. E. B , Hargraves P. E 1988 The Chaetoceros Ehrenberg (Bacillariophyceae) flora of Narragansett Bay Rhode Island USA Bibl Phycol 79 1 - 196
Rines J. E. B , Theriot E. C 2003 Systematics of Chaetocerotaceae (Bacillariophyceae) I A phylogenetic analysis of the family Phycol Res 51 83 - 98    DOI : 10.1111/j.1440-1835.2003.tb00175.x
Tamura K , Dudley J , Nei M , Kumar S 2007 MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0 Mol Biol Evol 24 1596 - 1599    DOI : 10.1093/molbev/msm092
Thompson J. D , Higgins D. G , Gibson T. J 1994 Clustal W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting position-specific gap penalties and weight matrix choice Nucleic Acids Res 22 4673 - 4680    DOI : 10.1093/nar/22.22.4673
von Quillfeldt C. H 2001 Identification of some easily confused common diatom species in Arctic spring blooms Bot Mar 44 375 - 389    DOI : 10.1515/BOT.2001.048