- Open Access
Genome-wide analysis of the sox family in the calcareous sponge Sycon ciliatum: multiple genes with unique expression patterns
EvoDevovolume 3, Article number: 14 (2012)
Sox genes are HMG-domain containing transcription factors with important roles in developmental processes in animals; many of them appear to have conserved functions among eumetazoans. Demosponges have fewer Sox genes than eumetazoans, but their roles remain unclear. The aim of this study is to gain insight into the early evolutionary history of the Sox gene family by identification and expression analysis of Sox genes in the calcareous sponge Sycon ciliatum.
Calcaronean Sox related sequences were retrieved by searching recently generated genomic and transcriptome sequence resources and analyzed using variety of phylogenetic methods and identification of conserved motifs. Expression was studied by whole mount in situ hybridization.
We have identified seven Sox genes and four Sox-related genes in the complete genome of Sycon ciliatum. Phylogenetic and conserved motif analyses showed that five of Sycon Sox genes represent groups B, C, E, and F present in cnidarians and bilaterians. Two additional genes are classified as Sox genes but cannot be assigned to specific subfamilies, and four genes are more similar to Sox genes than to other HMG-containing genes. Thus, the repertoire of Sox genes is larger in this representative of calcareous sponges than in the demosponge Amphimedon queenslandica. It remains unclear whether this is due to the expansion of the gene family in Sycon or a secondary reduction in the Amphimedon genome. In situ hybridization of Sycon Sox genes revealed a variety of expression patterns during embryogenesis and in specific cell types of adult sponges.
In this study, we describe a large family of Sox genes in Sycon ciliatum with dynamic expression patterns, indicating that Sox genes are regulators in development and cell type determination in sponges, as observed in higher animals. The revealed differences between demosponge and calcisponge Sox genes repertoire highlight the need to utilize models representing different sponge lineages to describe sponge development, a prerequisite for deciphering evolution of metazoan developmental mechanisms.
The Sox genes (Sry related high mobility group, HMG box) are a family of transcription factors with important roles in regulating development and cell fate determination throughout the animal kingdom[1, 2]. The Sox proteins are characterized by the HMG DNA binding domain of 79 amino acids, resembling the mammalian testis determination factor, Sry, which was the first Sox domain identified. There are 20 Sox genes in mammals which have been classified in five groups of Sox proteins (B, C, D, E, and F). However, additional groups have been created to accommodate divergent genes with limited taxonomic distribution, for instance group J. Groups B, C, E, and F are found in all eumetazoan lineages, but group D is found only in the bilaterians.
No Sox genes are present in the sequenced genomes of the unicellular choanoflagellate, Monosiga brevicollis, or the amoeboid holozoan Capsaspora owczarzaki. Since they are present in basal metazoans like sponges (that is, the demosponge Amphimedon queenslandica)[8, 9] and placozoans (Trichoplax adhaerens), they have likely arisen in the last common ancestor to the Metazoa. There is a larger repertoire of Sox genes in cnidarians[11–13] and the ctenophore Pleurobrachia pileus than in the demosponges[8, 9, 15] and the placozoans. Previous phylogenetic analysis of cnidarian Sox genes including the species Hydra magnipapillata, Nematostella vectensis, and Clytia hemisphaerica placed some of these sequences into the previously identified groups of Sox genes; however some of these genes cannot be classified into any specific group[11–13]. The expression patterns of cnidarian Sox genes suggest that they have roles in a wide variety of developmental functions, such as germ layer formation, organ development, cell type specification, and neural development[11–13].
Previous studies on Sox genes in sponges include the two demosponges, Amphimedon queenslandica[8, 9] and Ephydatia muelleri, as well as the calcareous sponge Sycon raphanus. In Amphimedon, four Sox genes have been found, including two members of group B (AmqSoxB1 and AmqSoxB2) and single members of groups C and F. Sox genes from Ephydatia and Sycon raphanus could not be clearly classified due to incomplete domain sequences included in the phylogenetic analyses. As a consequence, the complement of Sox genes in calcareous sponges is still unclear. In addition, apart from an RT-PCR study suggesting dynamic expression of Sox genes during embryonic development in Amphimedon, no expression patterns on a cellular level are published for this or any other sponge. For this reason, more studies in sponges are required to fully understand the function of Sox genes in the phylum Porifera in comparison with the Eumetazoa. The aims of this study were to analyze the repertoire of Sox genes in the calcareous sponge Sycon ciliatum and to trace their expression during development.
Sycon ciliatum is an attractive model system for developmental biology studies. This sponge is a common and abundant (Figure1A) species found in shallow waters in the North-East Atlantic, with a distribution extending from The Channel in the south to Svalbard and Greenland in the north (Rapp, unpublished work). It has the typical body plan of syconoid sponges where choanocyte-lined radial chambers surround an endopinacocyte-lined atrial cavity leading to a single osculum; the outer surface of the body is covered by exopinacocytes (Figure1B, C). Embryogenesis of Sycon ciliatum and related species has been well studied[17–19] and it takes place in the mesohyl, a narrow space located between the pinacocytes and choanocytes (Figure1D). Symmetric cleavage followed by cell differentiation leads to formation of a cup-shaped embryo composed of numerous ciliated micromeres, a lower number of larger macromeres, and four cruciform cells symmetrically distributed among the micromeres (Figure1D, E). The embryo undergoes inversion while it translocates to the radial chamber (Figure1D), and the mature larva (Figure1F) swims through the oscular opening. Both larva and adult display clear single body axes; the larva has unique tetra-radial symmetry (conferred by the cruciform cells) while the adult is radially symmetrical.
Recently generated complete draft genomic sequence and extensive transcriptome resources allow us to perform whole-genome analysis of developmentally important gene families (Adamski et al., unpublished work), and established in situ hybridization protocols allow for studies of gene expression in all life stages.
Phylogenetic analysis of sycon Sox genes
We have identified 12 HMG domains corresponding to 11 Sycon ciliatum Sox- related genes within the genomic and transcriptomic resources generated by a combination of traditional and next generation sequencing (Table1).
We have performed phylogenetic analyses of HMG domain sequences of Sox genes using different combination of taxa and the 12 sequences of Sycon (data not shown). In the initial phylogenetic analysis, most of Sycon Sox genes did not clearly fall into the recognized Sox groups (data not shown). To test whether adding sequences from another sponge closely related to Sycon would help to resolve the phylogenetic tree, we additionally identified and included sequences of Sox genes from another calcareous sponge, Leucosolenia complicata (Adamski et al., unpublished work). Up to date, we have recovered a total of seven Sox and Sox-related sequences from Leucosolenia (Table1).
Another phylogenetic tree was then constructed including the entire repertoire of identified Sycon and Leucosolenia Sox genes (Additional file1). However, this phylogenetic analysis also resulted in a non-resolved tree with multiple long-branch attraction artefacts. To reduce long-branch attraction, the most divergent sequences from both Sycon and Leucosolenia were excluded from further analyses. The excluded genes resemble Sox genes but have a divergent conserved motif within the HMG domain: either at the motif RPMNAF (positions 5 to 10), and/or at YK/R (positions 70 to 72); we named them Sox-like genes: SciSoxL1 to SciSoxL4a/b and LcoSoxL1, LcoSoxL4a/b (Table1, Additional file2).
Overall, the final phylogenetic analyses of the non-divergent set of sequences shown on Figure2 (see Additional file3 for alignment) resolved most of Sycon Sox genes within the known Sox gene groups. The analysis also resolved Sox genes from Bilateria and Cnidaria within SoxB, C, D, E, and F groups supporting previous analyses[5, 8, 9, 11–15]. However, SoxB group did not show a clear division into SoxB1 or SoxB2 clades. Five Sycon HMG domains of Sox genes can be assigned to the known eumetazoan Sox groups B, C, E, and F (Figure2). Although the list of Sox genes in Leucosolenia might still be incomplete, so far all of the identified sequences have clustered with Sycon sequences. A SoxC gene in Leucosolenia has not been identified; this may be due to incomplete sequence resources for this species or represent genuine gene loss in Leucosolenia. In addition, our analysis suggests that an expansion of SoxF genes have occurred in the calcaronean sponges; we named these genes SoxF1 and SoxF2 (Table1).
Notably, our analysis did not reveal orthologous relationships between Amphimedon and calcaronean sequences even in cases where members of the same subfamily are present in both sponges, such as SoxB or SoxC. As reported by Larroux and colleagues the Amphimedon SoxF gene did not cluster with other SoxF sequences in the maximum likelihood analysis. However, conserved motif analysis (see below) indicates that this gene belongs to the SoxF subfamily.
The remaining two Sycon Sox genes named SciSox6 and SciSox7 (Table1) did not fall into any known Sox group, while clustering within the Sox family (Figure2). One ortholog of SciSox6 was found in Leucosolenia, and it was named LcoSox6. In contrast, we have not found a counterpart of SciSox7 in Leucosolenia.
Motif conservation within sponge Sox genes
We compared full length Sox proteins from Sycon, Leucosolenia, and Amphimedon with their homologs from different taxa (Figure2, Additional file4) to find conserved motifs outside the HMG domain. The analysis revealed the presence of a number of motifs that are conserved between the eumetazoan and poriferan sequences. However, the motifs in sponge sequences were often quite divergent as compared to their bilaterian and cnidarian counterparts (Figure2, Additional file4). Amphimedon SoxB1 and B2, Leucosolenia SoxB, and Sycon SoxB genes contained the B-group specific motif. In contrast to the eumetazoan SoxB proteins, the B-group specific motif in sponges was not located directly next to the C-terminal of the HMG domain, but appeared in different positions within the C-terminal part of the protein. Both Amphimedon and Sycon SoxC proteins contained a slightly divergent C-group motif as compared to Homo and Acropora SoxC. Two conserved regions were found for the Sycon SoxE protein while only one region was found in Leucosolenia SoxE. Finally, the conserved short SoxF motif was also found in the three sponge proteins, but was located closer to the HMG domain, while in Acropora and Homo it is located at the C-terminal of the protein.
Sox genes are dynamically expressed during embryogenesis and cell differentiation
We have studied expression of Sox genes in adult sponges containing a wide variety of embryonic stages by whole mount in situ hybridization. Except for SciSoxL3 and SciSoxL4a/b, for which we could not amplify probes suggesting they are not significantly expressed in adult cells or during embryogenesis, all other genes displayed unique patterns during development and/or in adult cells.
The expression of SciSoxB was strong in the oocytes and in blastomeres of early cleavage stages (Figure3A). During pre-inversion it is specifically detected in macromeres and in the cruciform cells (Figure3B, C). This pattern continues until early post-inversion, but when the larva is fully developed no expression can be detected in the cruciform cells (Figure3D). SciSoxC was also detected in oocytes and in all blastomeres during cleavage (Figure3E, F) but became restricted to macromeres during pre-inversion (Figure3G, H). This expression becomes undetectable after inversion and in the larva (Figure3I).
The expression of SciSoxE, SciSoxF1, SciSoxF2, as well as SciSox6 was detected in various adult cells and not during embryonic stages or in the larvae (Figure4). SciSoxE expression was detected in choanocytes, but not in the accessory cells[17, 19] (choanocyte-derived cells surrounding the oocytes and embryos) (Figure4A, B). SciSoxE was also detected in a fraction of the mesohyl cells. Similarly, SciSoxF1 was detected in choanocytes; but in contrast to SciSoxE, its expression was also detected in the accessory cells (Figure4C, D). SciSoxF2 expression was detected in large cells (possibly the myocytes based on the cell shape), which are located in the middle part of the osculum (Figure4E, F). SciSox6 expression was detected in all choanocytes, pinacocytes, and in some mesohyl cells near the rim of the osculum of young sponges (Figure4G, H). SciSox7 was expressed in the choanocytes of adult sponges (Figure4I), as well as in the oocytes (Figure4J) and uniformly in the embryos (data not shown).
Finally, the Sox-like genes SciSoxL1 and SciSoxL2 are expressed during embryonic development and in adult cells. SciSoxL1 is uniformly expressed during early cleavage and during pre-inversion (Figure5A, B); and then detected in the cruciform cells (Figure5C). SciSoxL2 is detected during early embryogenesis (Figure5E, F) and during pre-inversion in macromeres and weaker in micromeres, but not in the cruciform cells (Figure5G). SciSoxL1 and SciSoxL2 expression was also detected in adult sponges in choanocytes, and SciSoxL2 also in mesohyl cells in the osculum (Figure5D-H).
The Sox gene family is significantly larger in Sycon than in Amphimedon
As previously reported by Larroux et al.[8, 9]Amphimedon has four Sox genes corresponding to groups B, C, and F. In the demosponge Ephydatia muelleri only three Sox genes could be identified. In contrast, the genome of the calcareous sponge Sycon ciliatum contains seven Sox genes and four additional Sox-related genes.
In Sycon, five Sox genes correspond to the recognized Sox subfamilies, confirming the presence of Sox genes of the groups B, C, and F in sponges, and adding SoxE to the sponge repertoire. While bootstrap support and posterior probabilities values for assigning the poriferan sequences into eumetazoan subfamilies are generally low, analysis of conserved motifs within the full length proteins consistently confirmed placement of the calcaronean sequences within the recognized subfamilies.
There are several differences between the demosponge and calcaronean Sox genes as evidenced by the comparison between Amphimedon and Sycon. For example, there is only one SoxB gene in Sycon. In contrast, the calcaronean sequences can be classified as belonging to SoxE and SoxF families; while only a single (and difficult to place in phylogenetic analysis) SoxF gene is present in the Amphimedon genome. The Amphimedon SoxE gene might have been lost, or SoxE genes might have evolved after demosponges diverged. It is impossible to differentiate between these two scenarios until the issue of sponge monophyly vs. paraphyly is resolved. On the other hand, our result indicates that SoxF genes in Sycon and Leucosolenia are likely to be a result of lineage-specific duplication.
Interestingly, the Amphimedon genome does not appear to contain the large number of Sox-related genes that we have identified in the two calcaronean genomes. It remains unclear whether this is a result of significant gene loss in Amphimedon, or rather of expansion of the Sox family in the Calcaronea. Only analysis of additional poriferan genomes representing a range of clades (especially homoscleromorphs, calcineans, and a range of demosponges) will help to shed light into this issue.
Dynamic expression of Sox genes in sycon
The expression patterns of Sycon Sox genes fall into two categories: embryonic (SciSoxB and SciSoxC) or predominantly in differentiated adult cells (SciSoxE, SciSoxF1, SciSoxF2, and SciSox6). Sox-like genes are expressed both during development and in adult tissues (Summary on Table2).
Until functional data are obtained in sponges, the specific roles of the identified genes will remain unclear. However, we can hypothesize on their putative function in Sycon and on hypothetical ancestral roles in the metazoan ancestor, by comparing the expression patterns of Sycon and the eumetazoan Sox genes. This is particularly tempting for genes belonging to subfamilies that appear to have a conserved function throughout the Eumetazoa, such as the SoxB group. At least one Sox gene belonging to Group B is expressed in the embryonic ectoderm and the neurogenic region of embryos in early development in most bilaterians (for a review see), cnidarians[12, 13], and in the ctenophore P. pileus.
Sycon SoxB expression is restricted to two cell types of the embryo, the macromeres and the cruciform cells. During settlement and metamorphosis, the macromeres become the outer cells of the post-larva and subsequently differentiate into exopinacocytes, the outer epithelium of the sponge[22, 23]. The SciSoxB expression in the macromeres provides support for the notion that the exopinacoderm of the sponges might be homologous to the ectoderm of higher metazoans.
The cruciform cells are characteristic cells of the calcaronean sponge larvae[19, 24]. They form from four cytoplasm regions segregated during cleavage and differentiate at the pre-inversion stage; they are present in the swimming larva, to later degenerate during settlement and metamorphosis. Their role is not yet clear, but these four cells are the only candidate cells suggested to play a role in larval photoreception. If the cruciform cells are indeed involved in photoreception, the SoxB expression during their differentiation would indicate conservation of SoxB functions in broadly defined neurogenesis and sensory organ formation.
The expression of Sycon SoxC is very prominent in macromeres during pre-inversion, while expression was not detected in larvae. In the cnidarians Acropora and Nematostella, SoxC is expressed during embryogenesis in cell types that are suspected to be sensory neurons[11, 12]. However in Clytia, SoxC (ChSox15) is expressed in stem cells. Therefore it appears that there is no clear conservation of expression pattern among these organisms.
While there is no strong conservation of expression for SoxE and SoxF genes, SoxE genes in bilaterian invertebrates tend to have a role in sex-specific aspects of gonad development, and SoxF genes tend to be associated with endoderm formation[21, 26]. In the cnidarians Nematostella and Acropora, SoxE and SoxF are expressed in endodermal lineages; while in Clytia SoxE is expressed in germline cells, stem cells, and nematoblasts, indicating once again no clear conservation among cnidarians within this group. However, expression in the endoderm (in Anthozoan cnidarians) and mesodermal derivatives (gonads) of bilaterians, together with the observed expression of Sycon SoxE and SoxF in choanocytes and some mesohyl cells, could be used to support a concept of homology of the choanoderm + mesohyl with endomesoderm. Otherwise, these two genes might play roles in cell differentiation in Sycon, as evidenced by the fact that expression of SoxE disappears in choanocytes that transdifferentiate into accessory cells, while expression of SoxF1 becomes stronger in these cells during the process.
Sponges are relatively simple organisms with few cell types, thus the limited number of transcription factors representing conserved metazoan families in the demosponge Amphimedon quenslandica fits neatly with the concept of a simple developmental tool kit patterning a simple body. This study demonstrates that Sycon ciliatum has multiple Sox genes which are dynamically expressed during development and in patterns consistent with governing adult cell differentiation. This indicates that Sox genes were involved in development and cell differentiation from the beginning of multicellular animal evolution. Further analyses of this and other developmental gene families in the Calcarea and in other sponge group are necessary to test whether the identified differences between Sycon and Amphimedon are indicative of global differences in the developmental toolkits. Such studies, now underway in our laboratory and in other groups, will provide insight into the evolutionary history of the animal developmental toolkit.
Identification of Sox genes in Sycon and Leucosolenia
Sox-like genes from Sycon ciliatum were retrieved by searching our recently generated genomic and transcriptome databases (Adamski et al., unpublished work) using HMG domain sequences from Nematostella and Amphimedon. Scaffolds were recovered and annotated using TBLASTN and BLASTX searches. Additionally, we searched in our on-going genome and transcriptome project of another calcaronean, Leucosolenia complicata, using the 12 identified Sycon HMG domain sequences to recover their orthologs from this species. These sequences were used in the phylogenetic analysis.
Sycon Sox genes were amplified by either RACE or RT-PCR using SMARTTM RACE Amplification kit (Clontech). Primer sequences are available upon request. The cDNA used as a template was prepared from a mixture of RNA extracted from juveniles and adult samples containing embryonic stages. PCR products were cloned into pGEM-Teasy (Promega) and sequenced using the BigDye Terminator v3.1 protocol (ABI). Purified PCR products obtained using SP6 and T7 primers during colony PCR were used to produce Dig-labeled antisense RNA probes for in-situ hybridization (see below).
Alignment and phylogenetic analysis
Alignment of HMG domains for phylogenetic analyses: MUSCLE was used for the alignment which included Sycon and Leucosolenia complete HMG domains of candidate Sox genes together with a different combination of taxa (see Additional file2). The alignment was manually modified where needed. In this final dataset, the following sequences were included: two HMG domains from Sycon Tcf genes and out-groups used for phylogenetic analysis as in Jager et al.. We did not include the sponge Sox sequences from the previous study in sponges from Jager et al. as these HMG domains contain only partial information (59 aa).
Phylogenetic calculations: Prottest 3 was used to determine the best suitable model of protein evolution for our alignment. We used two phylogenetic analyses of HMG domains:
Two independent runs of PhyML  were performed. Each run searched for five random starting trees using SPR moves. The tree with the best log likelihood value was selected (Log likelihood = −5686.2). From this tree a bootstrap analysis using 100 replicates was performed.
Bayesian analysis  under LG model, with 5,000,000 generations sampled every 500 generations using four chains. Convergence was reached before 5,000,000 generations. A majority rule of consensus tree of 12,500 trees was generated and posterior probabilities values were calculated from this tree.
Finding conserved motifs within sponge Sox sequences
MEME 3.5.7 was used to find conserved motifs outside the HMG domain within Sycon and Leucosolenia Sox proteins and their closest homologues from Acropora, Homo, Nematostella, and Amphimedon. The following parameters were used for searching possible conserved motifs: minimum motif width, six; maximum width, 100; maximum motifs to find, six. Complete sequences were aligned and their motif locations were compared with previous studies[4, 12]. ‘My domain image creator’ tool included in Prosite was used to visualize the locations of motifs in Sox proteins.
Specimen collection and whole mount in-situ hybridization
Adult Sycon specimens were collected from fjords located near Bergen, Norway (+60° 27' 33", +4° 56' 1") during the reproductive season from May to September (2008 to 2011). For in-situ hybridization, samples were immediately fixed in 100 mM MOPS, pH 7.5; 0.5 M sodium chloride; 2 mM MgSO4; 4% paraformaldehyde; 0.05% glutaraldehyde over night at 4°C, stepped into and extensively washed in 70% EtOH and stored at −20°C until processing. Macro sections of sponges in 24 well plates (Nunc) were rehydrated and washed in PBS/0.1% Tween (PTw). Samples were pretreated with 7.5 μg/mL proteinase K for 10 minutes at 37°C, followed by quenching with glycine (2 mg/mL PTw). Acetylation was performed by serial treatment with 0.1 M triethanolamine containing 0, 1.5, and 3 μl/mL acetic anhydride. Re-fixation was done in 4% paraformaldehyde/0.05% glutaraldehyde in PBS for 1 h at room temperature, followed by extensive washing in PTw. Tissue was prehybridized as previously described in 2 mL-tubes for 90 to 180 min at 51°C. Probe hybridization was done with denatured RNA probe (0.1-0.3 ng/μL, approximately 1 kb) for 12 to 18 h at 51°C. Stringent washes were carried out at 55°C as following: 1 × 10 min in hybridization buffer; 2 × 10 min 50% formamide/4 × SSC/0.1%; 2 × 10 min 50% formamide/2 × SSC/0.1% Tween; 2 × 10 min 25% formamide/2 × SSC/0.1% Tween, followed by 3 × 15 min 2 × SSC/0.1% Tween at room temperature. Samples were transferred to maleic acid buffer and incubated in 2% (w/v) Blocking Reagent (Roche) for 60 min at room temperature. After overnight incubation with AP-coupled anti-Digoxigenin-Fab fragments (Sigma, 1:5,000) at 4°C, samples were washed in maleic acid buffer at least 6 × 30 min. Probe was detected using NBT/BCIP as substrate (Roche) with tissue equilibrated in alkaline phosphatase buffer (100 mM sodium chloride, 50 mM MgCl2, 100 mM Tris pH 9.5, 0.1% Tween, 1 mM Levamisole). The staining reaction (0.5 to 3 days) was stopped with PBS/0.5% Tween, samples were transferred to 100% glycerol for microscopy or ethanol-dehydrated and embedded in epoxy resin (Sigma) for sectioning. Pictures of whole mount samples and sections were taken using a Nikon DS-U3 microscope and processed in Photoshop.
Guth SI, Wegner M: Having it both ways: sox protein function between conservation and innovation. Cell Mol Life Sci. 2008, 65: 3000-3018. 10.1007/s00018-008-8138-7.
Lefebvre V, Dumitriu B, Penzo-Méndez A, Han Y, Pallavi B: Control of cell fate and differentitation by Sry-related high-mobility-group box (Sox) transcription factors. Int J Biochem Cell B. 2007, 39: 2195-2214. 10.1016/j.biocel.2007.05.019.
Gubbay J, Collignon J, Koopman P, Capel B, Economou A, Munsterberg A, Vivian N, Goodfellow P, Lovell-Badge R: A gene mapping to the sex-determining region of the mouse Y chromosome is a member of a novel family of embryonically expressed genes. Nature. 1990, 346: 245-250. 10.1038/346245a0.
Schepers GE, Teasdale RD, Koopman P: Twenty pairs of sox: extent, homology, and nomenclature of the mouse and human sox transcription factor gene families. Dev Cell. 2002, 3: 167-170. 10.1016/S1534-5807(02)00223-X.
Bowles J, Schepers G, Koopman P: Phylogeny of the SOX family of developmental transcription factors based on sequence and structural indicators. Dev Biol. 2000, 227: 239-255. 10.1006/dbio.2000.9883.
King N, Westbrook MJ, Young S, Kuo A, Abedin M, Chapman J, Fairclough S, Hellsten U, Isogai Y, Letunic I, Marr M, Pincus D, Putnam N, Rokas A, Wright KJ, Zuzow R, Dirks W, Good M, Goodstein D, Lemons D, Li W, Lyons JB, Morris A, Nichols S, Richter DJ, Salamov A, Sequencing JG, Bork P, Lim WA, Manning G: The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature. 2008, 451: 783-788. 10.1038/nature06617.
Sebé-Pedrós A, de Mendoza A, Lang BF, Degnan BM, Ruiz-Trillo I: Unexpected repertoire of metazoan transcription factors in the unicellular holozoan capsaspora owczarzaki. Mol Biol Evol. 2011, 28: 1241-1254. 10.1093/molbev/msq309.
Larroux C, Fahey B, Liubicich D, Hinman VF, Gauthier M, Gongora M, Green K, Wörheide G, Leys SP, Degnan BM: Developmental expression of transcription factor genes in a demosponge: insights into the origin of metazoan multicellularity. Evol Dev. 2006, 8: 150-173. 10.1111/j.1525-142X.2006.00086.x.
Larroux C, Luke GN, Koopman P, Rokhsar DS, Shimeld SM, Degnan BM: Genesis and expansion of metazoan transcription factor gene classes. Mol Biol Evol. 2008, 25: 980-996. 10.1093/molbev/msn047.
Srivastava M, Begovic E, Chapman J, Putnam NH, Hellsten U, Kawashima T, Kuo A, Mitros T, Salamov A, Carpenter ML, Signorovitch AY, Moreno MA, Kamm K, Grimwood J, Schmutz J, Shapiro H, Grigoriev IV, Buss LW, Schierwater B, Dellaporta SL, Rokhsar DS: The Trichoplax genome and the nature of placozoans. Nature. 2008, 454: 955-960. 10.1038/nature07191.
Magie CR, Pang K, Martindale MQ: Genomic inventory and expression of Sox and Fox genes in the cnidarian Nematostella vectensis. Dev Genes Evol. 2005, 215: 618-630. 10.1007/s00427-005-0022-y.
Shinzato C, Iguchi A, Hayward DC, Technau U, Ball EE, Miller DJ: Sox genes in the coral Acropora millepora: divergent expression patterns reflect differences in developmental mechanisms within the Anthozoa. BMC Evol Biol. 2008, 8: 311-10.1186/1471-2148-8-311.
Jager M, Queinnec E, Le Guyarde H, Manuel M: Multiple Sox genes are expressed in stem cells or in differentiating neuro-sensory cells in the hydrozoan Clytia hemisphaerica. EvoDevo. 2011, 2: 12-10.1186/2041-9139-2-12.
Jager M, Queinnec E, Chiori R, Le Guyader H, Manuel M: Insights into the early evolution of SOX genes from expression analyses in a ctenophore. J Exp Zool B Mol Dev Evol. 2008, 310: 650-667.
Jager M, Queinnec E, Houliston E, Manuel M: Expansion of the SOX gene family predated the emergence of the Bilateria. Mol Phylogenet Evol. 2006, 39: 468-477. 10.1016/j.ympev.2005.12.005.
Adamska M, Degnan B, Green K, Zwafink C: What sponges can tell us about the evolution of developmental processes. Zoology. 2011, 114: 1-10. 10.1016/j.zool.2010.10.003.
Franzen W: Oogenesis and larval development of Scypha ciliata (Porifera, Calcarea). Zoomorphology. 1988, 107: 349-357. 10.1007/BF00312218.
Leys SP, Eerkes-Medrano D: Gastrulation in Calcareous Sponges: in Search of Haeckel’s Gastraea. Integr Comp Biol. 2005, 45: 342-351. 10.1093/icb/45.2.342.
Ereskovsky AV: The Comparative Embryology of Sponges. 2010, Springer, Netherlands
Bergsten J: A review of long-branch attraction. Cladistics. 2005, 21: 163-193. 10.1111/j.1096-0031.2005.00059.x.
Phochanukul N, Russell S: No backbone but lots of SOX: the invertebrate SOX family. Int J Biochem Cell Biol. 2009, 42: 453-464.
Amano S, Hori I: Metamorphosis of calcareous sponges I. Ultrastructure of free-swimming larvae. Invertebr Reprod Dev. 1992, 21: 81-90. 10.1080/07924259.1992.9672223.
Gallissian MF, Vacelet J: Ultrastructure of the oocyte and embryo of the calcified sponge (Petrobiona massiliana Porifera, Calcarea). Zoomorphology. 1992, 112: 133-141. 10.1007/BF01633104.
Tuzet O: Éponges calcaires. Traité de Zoologie. Anatomie, Systématique, Biologie. Spongiaires. Edited by: Grassé P-P. 1973, Masson et Cie, Paris, 27-132.
Meulemans D, Bronner-Fraser M: The amphioxus soxB family: implications for the evolution of vertebrate placodes. Int J Biol Sci. 2007, 3: 356-364.
Nanda T, DeFalco SHY, Phochanukul LN, Camara N, VanDoren M, Russell S: Sox100B, a drosophila group e sox-domain gene, is required for somatic testis differentiation. Sexual Development. 2009, 3: 26-37. 10.1159/000200079.
Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 2004, 5: 113-10.1186/1471-2105-5-113.
Abascal F, Zardoya R, Posada D: ProtTest: Selection of best-fit models of protein evolution. Bioinformatics. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of phyML 3.0. Syst Biol. 2010, 59: 307-321. 10.1093/sysbio/syq010.
Ronquist F, Huelsenbeck JP, van der Mark P: MrBayes 3.1. 2005, [http://mrbayes.csit.fsu.edu/index.php],
Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS: MEME Suite: tools for motif discovery and searching. Nucleic Acids Research. 2010, suppl 2: W202-W208.
Sigrist CJA, Cerutti L, de Castro E, Langendijk-Genevaux PS, Bulliard V, Bairoch A, Hulo N: PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Research. 2010, 38: 161-166.
Larroux C, Fahey B, Adamska M, Richards GS, Gauthier M, Green K, Lovas E, Degnanet BM: Whole-Mount In Situ Hybridization in Amphimedon. Cold Spring Harbor Protocols. 2008, 10.1101/pdb.prot5096.
This study was funded by the core budget of the Sars International Centre for Marine Molecular Biology. Sequencing has been performed at The Norwegian High-Throughput Sequencing Centre funded by the Research Council of Norway. We thank Lucas Leclère for helpful comments on the manuscript.
1Sars International Centre for Marine Molecular Biology, Thormøhlensgt. 55, Bergen 5008, Norway. 2Department of Biology and Centre for Geobiology, University of Bergen, Thormøhlensgt. 55, Bergen, 5008, Norway.
The authors declare that they have no competing interests.
Conceived and designed the study: MajA and SF. Suggested the model system and provided knowledge about its biology: H-TR. Assembled genomes and transcriptomes and created sequence databases: MarA. Carried out sampling and experiments: SF, BB, MarA, CZ, SL, CG, SJ, and MajA. Analyzed data: SF and MajA. Drafted the manuscript: SF. Edited the manuscript: MajA and SF with input from co-authors. All authors read and approved the final manuscript.