what does it mean to say that hox genes are "conserved" through evolution?

  • Journal List
  • Genome Res
  • 5.13(6a); 2003 Jun
  • PMC403639

Genome Res. 2003 Jun; thirteen(6a): 1111–1122.

Evolutionary Conservation of Regulatory Elements in Vertebrate Hox Gene Clusters

Simona Santini

1 Department of Biology, University of Konstanz, 78457 Konstanz, Germany

Jeffrey Fifty. Boore

2 Department of Evolutionary Genomics, DOE Articulation Genome Institute, Lawrence Berkeley National Laboratory, and University of California, Berkeley, California 94720, United states of america

Axel Meyer

1 Department of Biology, University of Konstanz, 78457 Konstanz, Germany

Received 2002 Aug 8; Accustomed 2003 Mar 24.

Abstract

Comparisons of Dna sequences among evolutionarily distantly related genomes permit identification of conserved functional regions in noncoding DNA. Hox genes are highly conserved in vertebrates, occur in clusters, and are uninterrupted by other genes. We aligned (PipMaker) the nucleotide sequences of the HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human being, and mouse, which are separated by approximately 500 million years of development. In back up of our arroyo, several identified putative regulatory elements known to regulate the expression of Hox genes were recovered. The majority of the newly identified putative regulatory elements incorporate brusque fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac database). The regulatory intergenic regions located between the genes that are expressed most anteriorly in the embryo are longer and apparently more evolutionarily conserved than those at the other finish of Hox clusters. Unlike presumed regulatory sequences are retained in either the Aα or Aβ duplicated Hox clusters in the fish lineages. This suggests that the conserved elements are involved in dissimilar factor regulatory networks and supports the duplication-deletion-complementation model of functional divergence of duplicated genes.

Understanding the mechanisms that underlie gene regulation is i of the major goals of comparative genomics as well as developmental biological science. The functions of cis-acting regulatory sequences that are located in noncoding regions of DNA are still not well understood (Clark 2001). Comparative DNA sequence analyses have become increasingly of import since the loftier degree of conservation of regulatory elements was first recognized (e.g., Aparicio et al. 1995; Manzanares et al. 2000). The conservation of protein coding sequences fifty-fifty among evolutionarily distantly related organisms, presumably as a outcome of stabilizing option, has been noted earlier (e.g., Hardison et al. 1997; Brenner et al. 2002). Yet, only a small portion of organisms' genomes encodes information for proteins. A big portion of the genome (up to 97%, Onyango et al. 2000) is noncoding DNA, and a heretofore unknown part of it plays a role in regulating gene expression. The identification of functional elements in noncoding Dna sequences is frequently complicated by the fact that these elements are typically short (6–15 bp; due east.g., Carroll et al. 2001) and reside at varying distances from their target gene. Functional elements tend to evolve at slower rates than nonfunctional regions, considering they are subject to choice (Tagle et al. 1988; Hardison et al. 1997; Hardison 2000; Cliften et al. 2001). Due to this slower rate of evolution, comparisons amid evolutionarily distantly related genome sequences provide a tool to identify functional regions in the sea of noncoding DNA (Tompa 2001, Blanchette and Tompa 2002), an approach that has been termed phylogenetic footprinting (Roth et al. 1998; Venkatesh et al. 2000; Cliften et al. 2001). Comparisons among closely related organisms, such as different species of Saccharomyces (Cliften et al. 2001) or Drosophila (Bergman and Kreitman 2001) have been successfully used to place regulatory regions, and comparisons betwixt humans and mice (evolutionary distance of approximately fourscore meg years; Pough et al. 1999) revealed many of the functionally relevant binding sites (Onyango et al. 2000). This is because of their high degree of conservation (on average 93.two%; Wassermann et al. 2000).

Comparisons amongst closely related species revealed that many nonfunctional noncoding sequences besides testify a high degree of nucleotide identity, rendering the identification of Deoxyribonucleic acid regions involved in gene regulation more difficult. However, in the alignment of long stretches of Deoxyribonucleic acid sequences from evolutionarily distantly related species, conserved putative regulatory elements volition stand up out from the groundwork of highly variable nonfunctional regions. This beneficial signal-to-dissonance ratio among more distantly related species permits the identification of putative regulatory elements.

The search for regulatory elements through comparative genomic approaches in Hox gene clusters promises to be particularly successful considering their nucleotide sequence and part are extremely conserved in all vertebrates in which they have been studied. Hox genes lawmaking for transcription factors that are responsible for establishing the animal torso programme early in embryonic development. They specify the position for developing fields along the inductive–posterior centrality, and are characterized by a 183-bp motif, the homeobox, which encodes a conserved Dna binding structure, the homeodomain (reviewed in Gehring 1993). Within the homeobox gene family, Hox genes belong to a subfamily whose members are arranged in genomic clusters. Interestingly, their expression in terms of time of activation and boundary of expression along the inductive–posterior centrality is "colinear" with the their chromosomal arrangement (e.g., Krumlauf 1994). Hox genes occur in strictly packed clusters, which aids their identification and alignment. Information technology may not be surprising that Hox genes are highly conserved during evolution because of their importance in development in all animal phyla. Moreover, the degree of conservation in their coding sequences might suggest that their regulatory elements are largely invariant across even not bad evolutionary distances. There is some evidence for this expectation. One of the selective forces that keeps the genes of Hox clusters uninterrupted by reshuffling and insertion of other genes may stalk from the fact that adjacent genes share mutual cis-regulatory elements (Peifer et al. 1987). Therefore, adjacent genes must remain closely linked, because translocations or insertions between them would deprive ane of them of its cis-regulatory elements and, hence, be lethal mutations.

RESULTS

Nosotros compared iv teleost species: tilapia (Oreochromis niloticus), pufferfish (Fugu rubripes), striped bass (Morone saxatilis), and zebrafish (Danio rerio) with two mammalian species (Man sapiens and Mus musculus) and the horn shark (Heterodontus francisci) as an outgroup species. The Hox gene contents for all these species are compared in Figure ane. Highly conserved homeobox domains in the Hox genes permitted "anchoring" of the clusters with each other. Therefore, it was possible to align HoxA clusters on the footing of highly conserved regions of exons and thereby align evolutionarily distantly related genomic sequences to discover putative regulatory elements.

An external file that holds a picture, illustration, etc.  Object name is 62703-19f1_L4TT.jpg

Evolutionary relationships among the species included in this piece of work. The divergence appointment betwixt the lineage leading to Chondrichthyes (to which Heterodontus, the horn shark, belongs) and that leading to the clade of all other taxa on this tree is about 500 millions years. Actinopterygii (the ray-finned fishes) and Sarcopterygii (the tetrapods) diverged virtually 450 one thousand thousand years ago. Teleosts radiated more than than 200 million years ago. The divergence between human and mouse is dated to nigh 80 million years (Pough et al. 1999). Horn shark, mouse, and human being have a single HoxA cluster, while all fishes examined so far take ii (see text for details). Amidst fishes, independent cistron losses took place in zebrafish and pufferfish relative to tilapia. Solid boxes represent individual genes. Duplicated clusters are designated as α or β. Pseudogenes are marked with a cross. Question marks represent genomic regions that are not notwithstanding characterized.

Genomic Architecture of Hox A Clusters

Comparisons of gene lengths and distances betwixt genes of the HoxA clusters are shown in Figure 2. The single Hox cluster region of the cephalochordate amphioxus (haploid Dna content: C= 0.59 pg; Ohno and Atkin 1966) spans over 400 kb (Garcia-Fernandez and Holland 1994; Ferrier et al. 2000), but the HoxA clusters of vertebrates that have been studied are considerably smaller. In the shark (C= 7.25 pg, Stingo et al. 1989), the HoxA region is but approximately 110 kb long ({"type":"entrez-nucleotide","attrs":{"text":"AF224262","term_id":"7271827","term_text":"AF224262"}}AF224262 and {"type":"entrez-nucleotide","attrs":{"text":"AF479755","term_id":"20278974","term_text":"AF479755"}}AF479755). In this species, the cluster was previously named HoxM, but is the ortholog of HoxA (Kim et al. 2000). In humans (C= 3.50 pg; Tiersch et al. 1989), the Hox A cluster is 110 kb long ({"type":"entrez-nucleotide","attrs":{"text":"AC004079","term_id":"2822174","term_text":"AC004079"}}AC004079, {"blazon":"entrez-nucleotide","attrs":{"text":"AC004080","term_id":"14589643","term_text":"AC004080"}}AC004080, and {"type":"entrez-nucleotide","attrs":{"text":"AC010990","term_id":"7243952","term_text":"AC010990"}}AC010990), in the mouse (C = 3.25 pg; Vinogradov 1998; Asif et al. 2002) it is 105 kb ({"blazon":"entrez-nucleotide","attrs":{"text":"AC021667","term_id":"12484331","term_text":"AC021667"}}AC021667), in the tilapia (C = 0.99 pg; Hinegardner 1976) the HoxAα cluster is 100 kb ({"type":"entrez-nucleotide","attrs":{"text":"AF533976","term_id":"31879894","term_text":"AF533976"}}AF533976), in the pufferfish (C= 0.forty pg; Brenner et al. 1993) the HoxAα is 64 kb (JGI public database), in the zebrafish (C= 1.75 pg; Vinogradov 1998) the HoxAα is 62 kb ({"type":"entrez-nucleotide","attrs":{"text":"AC107365","term_id":"18390289","term_text":"AC107365"}}AC107365) and the HoxAβ is 33 kb ({"type":"entrez-nucleotide","attrs":{"text":"AC107364","term_id":"18308323","term_text":"AC107364"}}AC107364). The HoxA cluster of the mouse shows an even base of operations limerick, whereas for all other genomes examined the base composition of the HoxA clusters is AT-biased (Table 1).

An external file that holds a picture, illustration, etc.  Object name is 62703-19f2_L1TT.jpg

Relative sizes of HoxA clusters. Boxes correspond individual genes. The duplicated α and β clusters are shown only for zebrafish. The alignable portion of the pseudogenes HoxA7α of pufferfish, HoxA2α and HoxA10α of zebrafish are shown as well.

Table i.

Percent Base Composition of the HoxA Clusters

Species %A %C %G %T
Tilapia 28.356 21.166 20.981 29.496
Pufferfish 28.476 21.398 21.093 29.033
Zebrafish α 31.231 18.816 eighteen.378 31.574
Zebrafish β 32.891 18.552 16.876 31.680
Horn shark 31.169 18.783 18.666 31.382
Human 31.169 18.783 18.666 31.382
Mouse 24.827 24.778 25.271 25.124

The available striped bass (C= 0.89 pg, Hinegardner 1976) sequence does non cover the unabridged cluster, merely simply the region from HoxA10α to HoxA4α. The region HoxA9α to HoxA4α in striped bass is 24 kb long ({"blazon":"entrez-nucleotide","attrs":{"text":"AF089743","term_id":"5669600","term_text":"AF089743"}}AF089743); the homologous region in the tilapia HoxAa cluster is 23 kb, in the pufferfish HoxAa cluster it is approximately 20 kb, and in the zebrafish HoxAα cluster it is approximately 19 kb (the zebrafish HoxAβ does not incorporate genes 4, five, and seven, so therefore cannot be evaluated). In the shark, man, and mouse clusters the region HoxA9 to HoxA4 is approximately 36 kb. In agreement with the view that Hox clusters are reduced in size in vertebrates, this role of the amphioxus cluster is approximately 135 kb long (Fig. 2).

Genome sizes and lengths of the HoxA clusters seem to be correlated (Fig. three). Lengths of Hox clusters take been previously shown to be independent of the pattern of gene loss amid several fish species (Aparicio et al. 1997; Snell et al. 1999; Chiu et al. 2002). When the same genes are retained, the architecture of HoxA clusters is by and large conserved among the species under examination; this holds true both in regard to relative lengths not only of orthologous genes amidst species, simply also of spacing between genes, that is, the length of intergenic regions (Fig. 2).

An external file that holds a picture, illustration, etc.  Object name is 62703-19f3_L1TT_rev1.jpg

Human relationship between genome size and length of the portion HoxA4 to HoxA9 of HoxA clusters. The length of HoxA clusters is correlated (P = 0.06) with genome size expressed equally C value. The HoxAα cluster lengths are shown. To be able to include also striped bass (HoxA cluster sequence is available only from HoxA4α to HoxA10α) and zebrafish (HoxAα cluster lacks HoxA10α) into the analysis, only the length of the HoxA4 to HoxA9 portion of the cluster is shown.

There is increasing evidence for a fish-specific genome duplication that was shared by all (or most) ray-finned fishes (e.m., Amores et al. 1998; Wittbrodt et al. 1998; Taylor et al. 2001). This genome duplication too caused an initial doubling (and some secondary lineage-specific losses) of the number of Hox clusters from four to viii. Then that, for example, two copies of the initial HoxA cluster resulted in the HoxAα and the HoxAβ clusters, which are now expected to exist plant in all (or most) ray-finned fishes. Contained gene losses in Hox clusters have happened in dissimilar species of fishes (Fig. 2).

The pufferfish HoxAα cluster was initially idea to lack HoxA7α (Aparicio et al. 1997), and information technology was hypothesized that this loss, together with the loss of other members of the entire paralogy grouping vii genes (Aparicio et al. 1997), could have been responsible for the absenteeism of ribs and pelvic fins and girdle in this group of fishes (The netherlands 1997; Meyer 1998; Prince et al. 1998; Meyer and Malaga-Trillo 1999). Our comparisons prove conservation of HoxA7α exons in pufferfish, with the exception of a 84-bp deletion in the homeobox in exon 2. However, the observation that the homeodomain is defective its cardinal and most conserved part might fence that in pufferfish the HoxA7α gene is a pseudogene.

The zebrafish Aα cluster lacks HoxA7α and contains merely a fragment of exon two of HoxA10α. It likewise lacks HoxA2α (Amores et al. 1998), but the cluster region corresponding to both HoxA2α exons, the promoter, and the intron however shows nucleotide conservation, suggesting that its loss was a relatively recent result in the zebrafish lineage. The zebrafish Aβ cluster lacks the HoxA1β and HoxA3β, HoxA4β, HoxA5β, and HoxA7β genes. In zebrafish, the HoxAβ cluster has been subject field to more losses of genes than the HoxAα cluster. Alternatively, the Hox5, 4, and iii genes could have been lost in a single event in HoxAβ cluster. The just genes absent in the HoxAα cluster, just nowadays in the HoxAβ cluster belong to the Hoxx and Hox2 paralogy groups.

Tilapia has an almost consummate HoxAα cluster, in terms of presence of Hox genes, and no lineage-specific gene losses relative to other teleost fishes were observed. The Tilapia HoxAα cluster retains the Hox two, vii, and ten genes, which are absent in the zebrafish HoxAα cluster. Figure 1 summarizes the specific losses of Hox genes in different fish lineages. We also have preliminary evidence for the presence of a HoxAβ cluster in tilapia (HoxA2β and HoxA3β; Malaga-Trillo and Meyer 2001). The increased gene loss of the HoxAβ cluster compared to the HoxAα cluster known from zebrafish may too be repeated in the tilapia genome.

Alignment of Nucleotide Sequences

All Hox clusters were screened with RepeatMasker to highlight interspersed repeats. There is a complete absence of any kind of long repeats between genes of the HoxA clusters in all the examined species. We compared the nucleotide sequence of HoxA homologous genes from HoxA of tilapia, pufferfish, striped bass, shark, human, and mouse clusters, and both HoxAα and HoxAβ clusters from zebrafish. In the Pip output (Fig. four), coding regions are shown with a blue background, introns in yellow, and conserved noncoding sequences (CNSs; Loots et al. 2000) non previously described in the literature in green. The sequence regions in red are conserved regulatory regions that accept been previously described in literature. As expected, coding sequences show a particularly loftier degree of similarity, peculiarly in the second exon (above 75%), which contains the homeobox, while introns are generally less conserved and cannot be aligned for long regions.

An external file that holds a picture, illustration, etc.  Object name is 62703-19f4_L4TT.jpg

Pip output of the comparison of tilapia HoxAα, striped bass HoxAα, pufferfish HoxAα, zebrafish HoxAα and Aβ, horn shark HoxA, human being HoxA, and mouse HoxA clusters. The tilapia sequence has been used as reference sequence. Kilobase (kb) markings are based on the tilapia sequence. Blue groundwork indicates coding regions, yellow indicates introns, red indicates conserved noncoding sequences (CNSs) previously described in literature, and the greenish groundwork indicates heretofore undescribed CNSs. Horizontal arrows betoken the direction of transcription, tall blackness boxes showexons, curt open boxes indicate a CpG/GpC ratio between 0.6 and 0.75, and short gray boxes indicate a CpG/GpC ratio over 0.75. Interspersed echo elements are shown as triangles (e.1000., in position 91 kb).

Identification of CNSs

Several stretches of sequence exterior of the recognized coding regions of the Hox genes are highly conserved in all species examined (Fig. iv; Table two). These CNSs were maintained for a menses of near 500 meg years of evolution. The fraction of CNSs for each intergenic region for the HoxA clusters is shown in Table 3. Interestingly, several 5′ and iii′ untranslated regions adjacent to the Hox genes of the clusters are conserved as well, suggesting that they may play an important role in the transcriptional regulation of the genes that they are flanking. A summary of the identified conserved regions is shown in Table 2. All identified CNSs have been tested individually by using BLASTN to exclude their presence in other positions of the genomes. No matches accept been found to sequences outside the Hox clusters (at the significance threshold of Eastward value < 0.01). Several stretches of sequence involved in the regulation of Hox genes have been previously described in the literature (cavalcade eleven in Tabular array ii), and these known regulatory sequences were also identified by our method.

Table two.

CNSs Identified Through the Comparative Approach

Position Length (bp) Striped bass Pufferfish Zebrafish a Zebrafish b Horn shark Human Mouse Over 95% Literature New/similar homebox binding sites
1 kb upstream thirteen 63 86 73 82 78 79 80 1 × 19 NF-one (Rossi et al. 1988)
Imm. upstream 13 188 83 65 63 66 75 71 0 New
xiii-xi 192 89 26 sixty 68 67 2 × ten New
Imm. upstream 11 230 89 66 68 63 seventy 71 5 × 7-28 New
xi-x 121 96 84 85 64 63 2 × 6-viii New
Imm. upstream 10 391 92 86 66 68 lxx 68 ii × 8-25 New
10-9 a 96 95 86 63 66 61 65 2 × 6-7 Abd B (Ekker et al. 1994); RNA politician. Ii cap signal (Bucher 1990)
ten-9 b 95 98 98 89 81 79 73 72 1 × 24 Murine homeotic proteins b.s. (Catron et al. 1993)
Imm. upstream 9 191 94 87 63 56 69 61 63 ii × 5-6 Target sequences chicken CdxA (Margalit et al. 1993)
9-7 a 62 100 98 92 72 78 79 79 two × 11-15 Abd B (Ekker et al. 1994)
9-vii b 276 96 93 71 71 seventy 69 iii × 6-11 c-ETS-1 poly peptide b.south. (Woods et al. 1992)
Imm. upstream 7 185 95 88 79 78 78 3 × 9-14 HoxA7 enhancer regulatory element, H. sapiens (Knittel et al. 1995)
7-5 163 81 78 77 78 81 81 2 × 8-11 H8/7-half-dozen FCS (Kim et al. 2000)
Imm. upstream 5 529 93 84 69 38 76 76 8 × 6-39 RARE (box c and box d), H. sapiens, Chiliad. musculus (Odenwald et al. 1989)
5-4 a 280 99 94 77 82 83 83 7 × 9-33 Pax b.s., (Epstein et al. 1994); Ultrabithorax b.s. (Ekker et al. 1991); target sequences of chicken CdxA homeobox gene (Margalit et al. 1993)
v-iv b 63 97 98 83 83 81 83 2 × 9-19 Dof b.s. (Yanagisawa and Schmidt, 1999)
5-4 c 209 95 93 67 71 69 69 5 × viii-xv NF of C-EBP family (Grange et al. 1991)
five-4 d 239 92 89 81 84 79 78 7 × 7-24 RARE (HoxA4 promoter, H. sapiens, Doerksen et al. 1996)
Imm. upstream 4 83 100 100 89 91 78 76 3 × 6-30 RARE (HoxA4 promoter, H. sapiens, Doerksen et al. 1996)
4-iii a 78 91 69 76 72 66 ii × 6-seven Dof b.s. (Yanagisawa and Schmidt, 1999)
4-3 b 480 87 67 71 66 63 5 × six-10 New
4-three c 51 93 72 lxxx 75 73 2 × vi-10 New
4-3 d 136 96 76 76 66 65 4 × 8-12 New
Imm. upstream 3 235 86 ninety 82 73 79 half-dozen × vii-13 New
iii-2 a 476 79 61 66 60 0 New
three-ii b 189 93 72 81 68 69 67 five × 6-9 New
Imm. upstream 2 382 89 64 78 77 77 77 8 × viii-43 HoxA2 promoter, 1000. musculus (Tan et al. 1992)
two-1 190 93 61 72 72 3 × 10-11 RARE, M. musculus (Frasch et al. 1995) New
Imm. upstream 1 352 78 threescore 59 64 61 2 × half-dozen-8 RARE, M. musculus (Frasch et al. 1995) New
Total 6233

Table 3.

Base Conservation of the Intergenic Regions of the Tilapia HoxA Cluster

Intergenic fragment % of total noncoding bases % identified as CNS % described in literature % of full CNSs
Evx-13 xiii 3 0 four
xiii-11 14 4 0 7
xi-ten 9 9 0 8
10-9 7 nine 0 6
9-seven 6 13 12 8
7-5 8 14 4 11
5-4 thirteen 10 4 14
four-3 17 9 0 sixteen
3-2 vii 23 10 17
2-1 6 14 14 9

The intergenic regions betwixt genes located 3′ in the clusters are better conserved than those between genes located 5′ in the cluster (Fig. v; Tabular array 3; and the alignment in the Supplementary data files available online at www.genome.org). The total number of conserved nucleotides (over lx% identity) is significantly higher (P = 0.007; Fig. 5) in the intergenic regions in the 3′ end of the cluster, and the detected CNSs are longer here.

An external file that holds a picture, illustration, etc.  Object name is 62703-19f5_L1TT.jpg

Total lengths of CNSs for each intergenic region. The intergenic regions located 3′ in the cluster are better conserved than those betwixt genes located 5′ in the cluster. The graph shows the number of conserved bases (CNS as divers in text) per intergenic region. At that place is a pregnant relationship between the number of conserved bases per intergenic region and the position of the region in the cluster (P = 0.007).

Description of Some Putative Regulatory Elements

Due to the nature of cis-regulating elements, which can be as short as 6 bp (Hardison et al. 1997), we were interested in finding where such sequences reach the highest degree of conservation for even a small number of nucleotides.

The first part of the intron of HoxA11α (51 bp) of the tilapia sequence is over 80% identical amidst tilapia, pufferfish, zebrafish Aα and Aβ, horn shark, humans, and the mouse (information for this region in striped bass are not available). The fragment presents the consensus homeodomain bounden sites HB1 located in the intron of the mouse genes HoxA4 and vii (Haerry and Gehring 1996). The HB1-element consists of three homeodomain binding sites (HB1), and it is an evolutionary conserved Deoxyribonucleic acid sequence previously described from the intron of HoxA7 (Haerry and Gehring 1996), in the leader (putative autoregulatory element) of its Drosophila homolog Ubx and in the introns of the paralogy group 4 Hox genes in medaka, chicken, the mouse, and humans (Morrison et al. 1995). The HB1 chemical element binds Drosophila CAD homeoprotein and CDX-i, its homolog in the mouse, and information technology therefore is supposed to be a target for various homeodomain proteins in both vertebrates and invertebrates. Our comparative analyses bear witness that the HB1 element is present non simply in the introns of HoxA4 and 7 equally already described in the literature, but also in the intron of HoxA11 in the HoxAα cluster of all the species examined. Interestingly, it is as well present in the intron of HoxA11β of zebrafish.

The region responsible for the cis-regulation of the HoxA7 gene has previously been described by Knittel et al. (1995) every bit an enhancer located 1.6 kb upstream of the coding sequence in human and mouse. These authors hypothesized that another proximal regulatory element can cooperate in the expression of HoxA7. Immediately upstream of the HoxA7 gene we highlighted a 185 bp stretch with more than than 84% sequence identity. Our comparison (Fig. 4) shows that there are several completely conserved sequences within this fragment, characterized by the short motif GTAAA. This long conserved region might exist the regulatory element that Knittel et al. (1995) hypothesized.

In the intron of the HoxA7 the HB1-element has a sequence identity of over 80% among the examined species. The region immediately upstream of the HoxA5 gene (490 bp) is between 70% and 85% similar. The RARE elements described equally "box c" and "box d" by Odenwald et al. (1989) in humans and the mouse were recognized (Fig. 6). These elements are present, with small variations, among all Hox genes of paralogy group v, and are known regulatory binding sites in the mouse Hox i.3 (HoxA5) (Odenwald et al. 1989). The conservation percentages inside the single boxes are 88% for the "box c" and 96% for the "box d".

An external file that holds a picture, illustration, etc.  Object name is 62703-19f6_F1TT.jpg

Alignment of known RARE elements. The alignment shows the RARE chemical element described as "box c" and "box d" (Odenwald et al. 1989) immediately upstream of the HoxA5 genes. In zebrafish, the RARE element is present but in HoxAα cluster (indicated every bit zebrafish α).

Downstream of the HoxA5 gene (1.iii kb) a region of 259 bp has an average similarity of xc%, with 2 100% identical stretches of 25 and 33 bp length. The motifs found in this region are ATGAAT (with a repeat following subsequently 13 bp), ATAAA, (AAGT)2, and (ACATA)2. The motifs identified past our comparisons are similar to those described as binding sites of the paired domain of the Pax genes (Epstein et al. 1994) and also of the Ultrabithorax gene of Drosophila (Ekker et al. 1991). This extremely conserved region was not previously described as being involved in Hox5 and iv regulation, but the nature and conservation of the long stretches highlighted through this comparison suggest that it might exist a good candidate region for functional tests.

Upstream of the HoxA4 gene nosotros identified a stretch 154 bp that has a similarity of 85% containing a RARE element (17 bp) that is part of the HoxA4 promoter, described by Doerksen et al. (1996). In the intron of factor HoxA4 a 68 bp long stretch was found containing the previously described HB1 element (Haerry and Gehring 1996).

Downstream of HoxA4 (1.7 kb) a 127 bp-long sequence is, on boilerplate, 78% conserved with a 26 bp-long stretch that is 96% conserved containing the AAATAAAA (position 63576–63583) and ATTTAA motifs and a sixteen-bp stretch that is 94% conserved containing the motif TTTTATTT (position 63882–63889). This is possibly a palindromic sequence for the complementary 1 in position 63576. Palindromes are frequently associated with regulatory elements (Chu et al. 2001).

Immediately upstream of the cistron HoxA2 nosotros found a 352-bp region that is 85% conserved that constitutes role of the HoxA2 promoter described by Tan et al. (1992) in the mouse HoxA cluster. The Krx20 element and the nearby "box a", described by Nonchev et al. (1996) as existence involved in HoxA2 trans-activation in mouse, and nowadays in tilapia HoxA cluster (Fig. 7A), was not identified by our alignment. To ostend this result nosotros searched specifically for these elements in zebrafish, pufferfish, and horn shark clusters, but could non identify them.

An external file that holds a picture, illustration, etc.  Object name is 62703-19f7_L1TT.jpg

Alignment of known regulatory elements. (A) Sequence of Krx20 bounden sites in different species. Krox20 binding sites are involved in Hox2 regulation and they are conserved in HoxA and B clusters from human, mouse, pufferfish, and HoxA from tilapia. Both Krx20 and the "box a" are widely conserved. The degree of identity is 67% amongst the species in this comparing. (B) Alignment of sequences of the "box a" motif.

Identification of Previously Described Functional Elements

All-encompassing searches of the transcription factor database (Transfac) revealed that several of these brusque 100% conserved sequences match previously described transcription gene binding sites (column 12 in Table 2). The matches most frequently obtained are: nuclear cistron NF1 binding sites (Rossi et al. 1988), abdominal B (AbdB) homeobox cistron binding sites (Ekker et al. 1994), CdxA homeobox gene binding sites (Margalit et al. 1993), and murine homeodomain binding sites (Catron et al. 1993).

Several of the most conserved sequences are highly similar to known transcription factors binding site motifs. Ane of these is the Krx20 binding site, which was found in humans, the mouse, pufferfish, and tilapia clusters (Fig. 7A). Krx20 binding sites accept been described by Nonchev et al. (1996) as beingness involved in HoxA2 regulation every bit an r3/r5 enhancer that upregulates the expression of those genes in rhombomere3/rhombomere5, where Krx20 is expressed in humans, chicks, the mouse, and pufferfish. The Krx20 binding site is 9 bp long and occurs effectually ii kb upstream of the genes HoxA2 and HoxB2, with a high degree of conservation (Fig. 7A). It is closely followed by a 12 bp-long conserved sequence motif called "box a", which is highly like to "box1", the corresponding chemical element associated with Krx20 binding site in cluster B (Fig. 7B). Box 1 is required for r3/r5 enhancer function in transgenic mice (Vesque et al. 1996).

DISCUSSION

Our analyses confirm the value of comparative evolutionary genomic approaches in the identification and description of regulatory elements in genomes. Nosotros expect that this type of analysis will help to increment the noesis base about the characteristics, evolutionary conservation, and the position of functional elements with respect to the genes that they control.

We conducted several comparative analyses of the unabridged HoxA clusters for vii species of vertebrates. We compared the position and nucleotide sequence of the genes that constitute the HoxAα cluster from O. niloticus with those of the other species in this study. The consummate absence of long repetitive elements supports the idea that 1 of the selective forces keeping the genes in Hox clusters tightly arranged stems from the fact that adjacent genes share common cis-regulatory elements. Interestingly, it has been suggested that repetitive elements are frequently involved in chromosomal rearrangement processes, such equally inversion, translocation, and excision (Moran et al. 1999; Tomilin 1999). Hence, the absenteeism of repetitive elements might be the result of selections against them, to reduce the run a risk of events that may interrupt Hox cluster firmness.

Caste of Conservation of Intergenic Regions

Teleost fishes, horn shark, and mammals were included in this study, to ensure comparisons of distantly related genomes, because their lineages separated approximately 450–500 millions years ago (due east.thousand., Pough et al. 1999). Our comparative analyses were directed toward identifying conserved blocks of nucleotides among evolutionarily distantly related species that might be cis-interim sites for Hox cistron-regulating factors. Intergenic regions prove varying degrees of conservation (Table iii). Intergenic spaces betwixt genes located 3′ in the clusters are significantly more conserved than those in the 5′ portion of the clusters (Fig. v; Table 3). This pattern might be explained by the dissimilar Hox genes' expression patterns during evolution. Genes located in 5′ position in the cluster are expressed more posteriorly in the embryo and afterward in its evolution, while genes located in position 3′ in the cluster are expressed more anteriorly in the embryo and earlier in its development (Duboule and Dollé 1989). Genes located 3′ in the cluster, namely Hox1–4, are expressed in the developing hindbrain. Their regulatory elements are evolutionarily highly conserved every bit was demonstrated through transgenic experiments (e.g., Frasch et al. 1995; Manzanares et al. 2000). The intergenic regions of Hox genes 3′ in the clusters are responsible for the activation of the first and more than rostral genes to be expressed during evolution, and therefore their extreme conservation might exist necessary for the correct activation of the subsequent Hox expression system. We found a significant increase in length of the CNSs between pairs of 3′ genes compared to intergenic regions of genes located 5′ and not involved in hindbrain sectionalization (Fig. 5; P = 0.007).

In our analyses nosotros also included the noncoding regions upstream of the Hoxthirteen gene and downstream of the Hox1 gene. Intergenic regions between ii Hox genes contain regulatory elements for genes both upstream and downstream (e.g., Peifer et al. 1987). In addition, likewise if the region upstream of the Hoxthirteen gene contains just regulatory elements for this gene, and the same holds true for the region downstream of the Hoxane gene, the trend of increasing length of CNSs from 5′ to three′ within intergenic regions is still meaning.

Search for Regulatory Sequences

Several conserved noncoding regions take been identified in this study. All the identified CNSs are specific to Hox clusters (no matches with whatsoever other region of the genome when aligned by using BLASTN).

Some of these regions reside immediately 5′ and three′of the genes of the Hox clusters, and this feature is generally related to functional roles (due east.g., reviewed by Maconochie et al. 1996). Promoters are located immediately 5′ upstream of genes (e.1000., HoxA2 promoter; Tan et al. 1992) and RAREs are located 3′ of the regulated gene (e.grand., Frasch et al. 1995). However, the largest part of conserved regions we found is located between two genes and is quite afar (by 1–five kb; column 1 in Tabular array 2) from both. Thus, these regions are the most interesting, because cis-regulatory regions in Hox clusters are located in positions that are intermediate betwixt the genes they regulate. An example for this phenomenon is an element named H8/7–vi FCS (Kim et al. 2000) that was shown by Kim et al. (2000) to exist in all iv clusters of mammals and shark that they compared. We showed that this element is too present in the HoxAα cluster of fishes (Fig. iv; Tabular array 2). This element is located 1.2 kb downstream of the HoxA7α factor and 3.6 kb upstream of the HoxA5α gene in tilapia (Tabular array 2). These Hox genes are involved in controlling the development of the branchial region (Krumlauf 1994). The conservation of the nucleotide sequence and relative position in all clusters examined so far makes this chemical element an excellent candidate for an evolutionary conserved cis-regulatory element. Tabular array 2 lists several other CNSs located between two genes that might contain cis-regulatory elements. We could non locate the Krx20 and "box a" in whatever CNS in our alignment, because the Krx20 binding site and "box a" are short sequences that are not embedded in a cake of at least fifty bp with a conservation of at least sixty% in a minimum of iv clusters. In this item case, our criteria defining CNSs were besides strict. Furthermore, HoxA1 RARE elements described by Langston et al. (1997) could not be identified, considering the region downstream of HoxA1α was not available for most of the sequences and, hence, the alignment did non fit the above-mentioned criteria for defining CNSs.

All except i of the CNSs identified through our comparisons are present in at least one of the zebrafish HoxA clusters and some in both of them (Table 2). A specific CNS is generally conserved in the one of the ii zebrafish HoxA clusters that still retains the gene located downstream of its position, that is, the CNS upstream of HoxA10 is present only in HoxAβ cluster, which retains the gene HoxA10, and was lost in HoxAα cluster, which does not accept the Hoxten factor. The same blueprint is constitute in CNSs located upstream of the HoxA5, 4, and 3 genes that are present only in the HoxAα cluster, which still retains those genes. The CNS found immediately upstream of HoxA7 and previously described past Knittel et al. (1995) as an enhancer of HoxA7 in humans and the mouse is absent from both zebrafish Hox clusters. This is specially interesting, because the HoxA7 cistron was lost during zebrafish genome development. Too, the CNS located in the intergenic region between the HoxA3 and 2 genes and indicated as 3–2a in Tabular array ii is absent from both zebrafish clusters. This item CNS has one of the lowest overall conservation levels, with no stretches being over 95% identity. These observations enforce the possibility that the CNSs nosotros identified are actually involved in regulatory functions.

The duplication-deletion-complementation model (DDC; Force et al. 1999) proposes that the two duplicated gene copies retain different sets of regulatory elements, and therefore, presumably dissimilar function. The set of functions of the initial gene might be divided, "subfunctionalized," by the two duplicated "daughter" copies of the factor. The Hox13, 11, and 9 genes are each nowadays in two copies in the zebrafish genome, in the HoxAα and Aβ clusters. The CNSs upstream of these genes are also retained in both clusters, but are different between them. This could signal that they have been preserved because they are important for the regulation of those genes, but control different patterns of expression, and are, hence, an case for the process of subfunctionalization of the duplicated "daughter" copies of the genes.

Chiu et al. (2002) did not observe the same blueprint of conservation in zebrafish HoxA clusters. These differences might be due to a different method of identifying CNS sequences. Chiu et al. (2002) described, by comparison of homo and horn shark HoxA clusters, a great number of Phylogenetic Footprints (PFs). These are defined as brusk blocks of noncoding DNA, typically vi bp or more than, that are 100% conserved in two taxa that take diverged at least 250 1000000 years agone (Tagle et al. 1988; Blanchette and Tompa 2002). Among PFs, they described as Phylogenetic Footprint Clusters (PFCs) those that were found close to each other (within 200 bp) and located at comparable distances from the cistron that is located 3′ of each intergenic region. They found simply a pocket-sized number of PFCs to be present in at least 1 of the two zebrafish HoxA clusters. They ended that the essential Hox gene functions in zebrafish are performed with dissimilar cis-regulatory elements (east.g., phenogenetic drift; Weiss and Fullerton 2000) from those of the ancestral cistron, with cis elements highly conserved in horn shark and human. We defined a sequence as a CNS using the following criteria (see Methods) (1) identity over 60% in at to the lowest degree four out of eight clusters; (2) presence in at least two species known to have just ane HoxA cluster (horn shark, homo, mouse; see Fig. i), and (3) a minimum length of 50 base pairs (bp). We therefore identified a smaller number of longer conserved elements, which are shared by a higher number of species/clusters. Moreover, because of the fact that many trans-regulatory elements recognize a cadre sequence that is fifty-fifty shorter than 6 bp and with a certain caste of tolerance, we accepted a 95% lower threshold for the short highly conserved sequences we described (cavalcade 10 in Table 2).

Regulatory Elements in Introns

Intronic sequences are typically non conserved among evolutionarily diverged species. A articulate exception to this dominion are the HB1 elements, believed to be binding sites for several homeoproteins (Haerry and Gehring 1996, 1997). Our analyses prove that the HB1 elements, which and then far have been described only in the introns of the Hox4 and seven genes, are present also in the intron of the Hox11 gene in the HoxA cluster (in both HoxAα and HoxAβ in zebrafish). The Hox4, 7, and 11 genes are expressed in different regions of the developing embryos (rhombomeres half dozen and 7 in the hindbrain for Hox4 paralogous group, thoracic region for Hox 7, and caudal region for Hox 11) and at different times of development. The spatial regular redundancy of HB1 elements in Hox clusters might be related to the different timing of activation of groups of Hox genes (anterior, central, and caudal) in the developing embryo. Information technology would be of involvement to meliorate characterize the function of different HB1 elements within the aforementioned Hox cluster. Moreover, information technology would be important to know if other Hox clusters also show a similar pattern as the HoxA clusters concerning HB1 regulatory elements.

A long (over 600 bp) stretch of intron of gene Hox2 is threescore–lxx%, and is conserved amid all the species included in this comparison. Role of this sequence matches a previously described POU protein binding site (Verrijzer et al. 1992). The overexpression of homeoprotein POU2 rescues zebrafish Krx20 and valentino mutants (Hauptmann et al. 2002) that are caused past disrupted Hoxii-related patterning of rhombomeres 3/five. It seems likely that Hoxii expression and role is related to the conservation of the putative regulatory element in its intron.

Known Conserved Regions and Regulatory Elements

The reliability of our results was confirmed by the ascertainment that some of the highly conserved, possibly functional, noncoding regions that we have identified have been previously described as regulatory elements (column 11 in Table 2). Moreover, many of them contain homeoprotein binding sites that are believed to be responsible for Hox gene regulation (cavalcade 12 in Tabular array 2). It is reasonable to assume that the elements that are evolutionarily conserved are the ones that regulatory proteins bind to, and this agrees with the testify that other classes of homeobox genes are responsible for Hox genes regulation. Currently, 4 groups of transcriptional regulators accept been identified that directly regulate Hox factor expression in the vertebrate embryo: retinoic acrid receptors, Krx20, members of the Pbx/exd family, and the Hox genes themselves (reviewed past Lufkin 1997). Because Hox genes have a colinear temporal pattern of differential expression (i.e., HoxA1 is expressed before HoxA2, then on), further studies on homeoprotein bounden sites are necessary to define if and how Hox genes expressed before in embryo development could regulate the expression of Hox genes expressed later.

It would be particularly interesting to test some of the and then far undescribed conserved noncoding regions that we have identified through this comparative genomic arroyo for a possible functional role in the activation and regulation of Hox genes. Considering functional studies involve a great deal of effort, for case, transgenic animals, information technology is critical to reduce the number of possible candidates for regulatory function. Sequencing projects of whole genomes (e.g., pufferfish, zebrafish, medaka) offer new possibilities for comparative genomic approaches to report distantly related organisms to uncover putative regulatory elements. Moreover, using distantly related genome comparisons between teleosts and, for instance, mammals or amphioxus, highlights the divergence in gene regulation of paralogous genes that evolved subsequent to gene duplication. Information technology is still a subject field of discussion whether paralogous genes in ray-finned fishes are due to an early on whole genome duplication (Meyer and Schartl 1999; Taylor et al. 2001), or rather to several independent smaller scale duplication events (Robinson-Rechavi et al. 2001). One of the chief mechanisms by which subfunctionalization of duplicated genes occurs may be through a change in their regulatory elements, whereby mutations or differences in deletions in these elements can lead to differential expression patterns of duplicated genes (Strength et al. 1999). The comparing of distantly related genomes may indicate which duplicated genes have divergent regulatory sequences in comparison to organisms for which such a duplication did not occur, for example, mammals. This, in turn, would provide a method by which to elucidate dissimilar evolutionarily new functions for duplicated genes.

METHODS

The Hox clusters included in this report are: tilapia (Oreochromis niloticus {"type":"entrez-nucleotide","attrs":{"text":"AF533976","term_id":"31879894","term_text":"AF533976"}}AF533976, Evxi–HoxA1α), pufferfish (Fugu rubripes, JGI public database http://www.jgi.doe.gov/programs/fugu/fugu_mainpage.html, HoxA13α–HoxA1α), striped bass (Morone saxatilis {"type":"entrez-nucleotide","attrs":{"text":"AF089743","term_id":"5669600","term_text":"AF089743"}}AF089743, HoxA10α–HoxA4α), zebrafish (Danio rerio {"blazon":"entrez-nucleotide","attrs":{"text":"AC107365","term_id":"18390289","term_text":"AC107365"}}AC107365, Evx1–HoxA1α and {"type":"entrez-nucleotide","attrs":{"text":"AC107364","term_id":"18308323","term_text":"AC107364"}}AC107364, HoxA13β–HoxA2β), horn shark (Heterodontus francisci {"type":"entrez-nucleotide","attrs":{"text":"AF224262","term_id":"7271827","term_text":"AF224262"}}AF224262 and {"type":"entrez-nucleotide","attrs":{"text":"AF479755","term_id":"20278974","term_text":"AF479755"}}AF479755 HoxM13–HoxM1, respective to HoxA; Kim et al. 2000), mouse (Mus musculus {"blazon":"entrez-nucleotide","attrs":{"text":"AC021667","term_id":"12484331","term_text":"AC021667"}}AC021667, HoxA13–HoxA1), and Human being sapiens ({"type":"entrez-nucleotide","attrs":{"text":"AC004079","term_id":"2822174","term_text":"AC004079"}}AC004079, {"type":"entrez-nucleotide","attrs":{"text":"AC004080","term_id":"14589643","term_text":"AC004080"}}AC004080, and {"blazon":"entrez-nucleotide","attrs":{"text":"AC010990","term_id":"7243952","term_text":"AC010990"}}AC010990, Evx1–HoxA1).

The tilapia HoxAα cluster sequence (Malaga-Trillo and Meyer 2001) has been used as the template sequence to which the others are compared. It has been filtered for repetitive and other "junk" elements through RepeatMasker, available at University of Washington Genome Center (http://ftp.genome.washington.edu/cgi-bin/RepeatMasker/).

The alignment has been performed using the plan MultiPipMaker available at http://bio.cse.psu.edu/pipmaker/. PipMaker (Schwartz et al. 2000) computes alignments of like regions in two or more Deoxyribonucleic acid sequences. The resulting alignments are summarized with a "percent identity plot", or "pip" for brusque. All pair-wise alignments with the kickoff sequence are computed and so returned equally interleaved pips, and information technology is possible to compute a truthful multiple alignment of the input sequences to produce a nucleotide-level view of the results. The alignment engine is BLASTZ, which is an experimental variant of the Gapped BLAST program (Altschul et al. 1997; Zhang et al. 1998).

Loots et al. (2000) divers conserved noncoding sequences (CNSs) as conserved noncoding elements with greater or equal to 70% identity over at least 100 bp between humans and the mouse. Considering nosotros used eight clusters from seven species more evolutionarily divergent than only humans and the mouse, the following criteria have been used to define CNSs: identity over threescore% in at least four out of viii clusters; presence in at least ii species known to have only one HoxA cluster (horn shark, humans, mouse; Fig. 1) and minimum length of 50 base of operations pairs (bp). Despite this, when taking into account only the comparison between humans and the mouse, our CNSs as well fulfill the definition from Loots et al. (2000). CNSs have been tested in BLASTN (http://world wide web.ncbi.nlm.nih.gov/BLAST/) to confirm that they are specific to Hox clusters.

Within such sequences, stretches between 95% and 100% identity and six nucleotides or more in length, conserved among at least six out of seven examined clusters, have received item attention. The stretches over 95% identity within CNSs have been used to screen the transfac database (http://transfac.gbf.de/TRANSFAC/) to determine if they accept been already described as transcription factors bounden sites in similar or different biological context.

Acknowledgments

This work has been supported by a grant of the Deutsche Forschungsgemeinschaft (to A.M.) and by a Marie-Curie fellowship to S.Due south. The authors thank East. Malaga-Trillo and other members of the Meyer-Lab for library screening, many members of the DOE Joint Genome Found (JGI) for Deoxyribonucleic acid sequencing, and C. Klingenberg for reviewing the manuscript. Function of this work was performed nether the auspices of the U.Southward. Section of Free energy, Office of Biological and Environmental Inquiry, Lawrence Berkeley National Laboratory, under Contract No. DE-AC03-76SF00098.

The publication costs of this article were defrayed in part past payment of page charges. This commodity must therefore exist hereby marked "advertisement" in accord with 18 USC section 1734 solely to signal this fact.

Notes

Commodity and publication are at http://www.genome.org/cgi/doi/10.1101/gr.700503.

Footnotes

[Supplemental material is bachelor online at www.genome.org. The sequence data from this report take been submitted to GenBank under accession no. AF538976.]

References

  • Altschul, S.F., Madden, T.Fifty., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-Smash: A new generation of protein database search programs. Nucleic Acids Res. 25 : 3389–3402. [PMC complimentary article] [PubMed] [Google Scholar]
  • Amores, A., Force, A., Yan, Y.-L., Joly, L., Amemiya, C., Fritz, A., Ho, R.K., Langeland, J., Prince, V., Wang, Y.-50., et al. 1998. Zebrafish Hox clusters and vertebrate genome evolution. Scientific discipline 282 : 1711–1714. [PubMed] [Google Scholar]
  • Aparicio, S., Morrison, A., Gould, A., Gilthorpe, J., Chaudhuri, C., Rigby, P., Krumlauf, R., and Brenner, S. 1995. Detecting conserved regulatory elements with the model genome of the Japanese pufferfish, Fugu rubripes. Proc. Natl. Acad. Sci. 92 : 1684–1688. [PMC gratuitous commodity] [PubMed] [Google Scholar]
  • Aparicio, Due south., Hawker, M., Cottage, A., Mikawa, Y., Zuo, L., Venkatesh, B., Chen, E., Krumlauf, R., and Brenner, S. 1997. Organization of the Fugu rubripes Hox clusters: Show for standing evolution of vertebrate Hox complexes. Nat. Genet. xvi : 79–83. [PubMed] [Google Scholar]
  • Asif, T.C., Melt, 50.L., Delehaunty, Grand.D., Fewell, 1000.A., Fulton, Fifty.A., Fulton, R.S., Graves, T.A., Hillier, L.Due west., Mardis, E.R., McPherson, J.D., et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420 : 520–562. [PubMed] [Google Scholar]
  • Bergman, C.Chiliad. and Kreitman, M. 2001. Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences. Genome Res. 11 : 1335–1345. [PubMed] [Google Scholar]
  • Blanchette, M. and Tompa, M. 2002. Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res. 12 : 739–748. [PMC free article] [PubMed] [Google Scholar]
  • Brenner, S., Elgar, G., Sandford, R., Macrae, A., Venkatesh, B., and Aparicio, South. 1993. Label of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature 366 : 265–268. [PubMed] [Google Scholar]
  • Brenner, S., Venkatesh, B., Yap, W.H., Chou, C.F., Tay, A., Ponniah, S., Wang, Y., and Tan, Y.H. 2002. Conserved regulation of the lymphocyte-specific expression of lck in the Fugu and mammals. Proc. Natl. Acad. Sci. 99 : 2936–2941. [PMC free article] [PubMed] [Google Scholar]
  • Bucher, P. 1990. Weight matrix description of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J. Mol. Biol. 212 : 563–578. [PubMed] [Google Scholar]
  • Carroll, Southward., Grenier, J., and Weatherbee, S. 2001. From Deoxyribonucleic acid to diversity-Molecular genetics and the evolution of animal design. Blackwell Scientific discipline, Malden, MA.
  • Catron, K.K., Iler, N., and Abate, C. 1993. Nucleotides flanking a conserved TAAT core dictate the Deoxyribonucleic acid binding specificity of three murine homeodomain proteins. Mol. Cell. Biol. 13 : 2354–2365. [PMC free article] [PubMed] [Google Scholar]
  • Chiu, C.H., Amemiya, C., Dewar, K., Kim, C.B., Ruddle, F.H., and Wagner, G.P. 2002. Molecular development of the HoxA cluster in the three major gnathostome lineages. Proc. Natl. Acad. Sci. 99 : 5492–5497. [PMC free article] [PubMed] [Google Scholar]
  • Chu, D., Kakazu, North., Gorrin-Rivas, G., Lu, H., Kawata, 1000., Abe, T., Ueda, K., and Adachi, Y. 2001. Cloning and label of LUN, a novel ring finger protein that is highly expressed in lung and specifically binds to a palindromic sequence. J. Biol. Chem. 276 : 14004–14013. [PubMed] [Google Scholar]
  • Clark, A.1000. 2001. The search for significant in noncoding DNA. Genome Res. 11 : 1319–1320. [PubMed] [Google Scholar]
  • Cliften, P.F., Hillier, L.West., Fulton, L., Graves, T., Miner, T., Gish, W.R., Waterston, R.H., and Johnston, Thou. 2001. Surveying Saccharomyces genomes to identify functional elements by comparative Dna sequence assay. Genome Res. 11 : 1175–1186. [PubMed] [Google Scholar]
  • Doerksen, L.F., Bhattacharya, A., Kannan, P., Pratt, D., and Tainsky, Chiliad.A. 1996. Functional interaction betwixt a RARE and an AP-2 bounden site in the regulation of the human HOX A4 cistron promoter. Nucleic Acids Res. 24 : 2849–2856. [PMC costless article] [PubMed] [Google Scholar]
  • Duboule, D. and Dollé, P. 1989. The structural and functional organization of the murine HOX gene family unit resembles that of Drosophila homeotic genes. EMBO J. viii : 1497–1505. [PMC complimentary article] [PubMed] [Google Scholar]
  • Ekker, S.C., Young, Grand.E., von Kessler, D.P. and Beachy, P.A. 1991. Optimal Dna sequence recognition by the ultrabithorax homeodomain of Drosophila. EMBO J. ten : 1179–1186. [PMC costless article] [PubMed] [Google Scholar]
  • Ekker, S.C., Jackson, D.Grand., von Kessler, D.P., Lord's day, B.I., Young, K.E., and Beachy, P.A. 1994. The degree of variation in DNA sequence recognition among 4 Drosophila homeotic proteins. EMBO J. 13 : 3551–3560. [PMC free article] [PubMed] [Google Scholar]
  • Epstein, J., Cai, J., Glaser, T., Jepeal, L., and Maas, R. 1994. Identification of a Pax paired domain recognition sequence and evidence for DNA-dependent conformational changes. J. Biol. Chem. 269 : 8355–8361. [PubMed] [Google Scholar]
  • Ferrier, D.Eastward., Minguillon, C., Holland, P.W., and Garcia-Fernandez, J. 2000. The amphioxus Hox cluster: Deuterostome posterior flexibility and Hoxfourteen. Evol. Dev. 2 : 284–293. [PubMed] [Google Scholar]
  • Force, A., Lynch, M., Pickett, F.B., Amores, A., Yan, Y.L., and Postlethwait, J. 1999. Preservation of indistinguishable genes by complementary, degenerative mutations. Genetics 151 : 1531–1545. [PMC gratuitous article] [PubMed] [Google Scholar]
  • Frasch, M., Chen, Ten., and Lufkin, T. 1995. Evolutionary-conserved enhancers direct region-specific expression of the murine Hoxa-ane and Hoxa-2 loci in both mice and Drosophila. Evolution 121 : 957–974. [PubMed] [Google Scholar]
  • Garcia-Fernandez, J. and Kingdom of the netherlands, P.W. 1994. Archetypal organisation of the amphioxus Hox gene cluster. Nature 370 : 563–566. [PubMed] [Google Scholar]
  • Gehring, W.J. 1993. Exploring the homeobox. Gene 135 : 215–221. [PubMed] [Google Scholar]
  • Grange, T., Roux, J., Rigaud, M., and Pictet, R. 1991. Prison cell-type specific activity of two glucocorticoid responsive units of rat tyrosine aminotransferase gene is associated with multiple binding sites for C/EBP and a novel liver-specific nuclear gene. Nucleic Acids Res. 19 : 131–139. [PMC free commodity] [PubMed] [Google Scholar]
  • Haerry, T.East. and Gehring, W.J. 1996. Intron of the mouse Hoxa-7 cistron contains conserved homeodomain binding sites that tin can function as an enhancer element in Drosophila. Proc. Natl. Acad. Sci. 93 : 13884–13889. [PMC free article] [PubMed] [Google Scholar]
  • Haerry, T.E. and Gehring, W.J. 1997. A conserved cluster of homeodomain binding sites in the mouse Hoxa-4 intron functions in Drosophila embryos as an enhancer that is straight regulated by Ultrabithorax. Dev. Biol. 186 : ane–15. [PubMed] [Google Scholar]
  • Hardison, R.C. 2000. Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet. xvi : 369–372. [PubMed] [Google Scholar]
  • Hardison, R.C., Oeltjen, J., and Miller, W. 1997. Long human–mouse sequence alignments reveal novel regulatory elements: A reason to sequence the mouse genome. Genome Res. seven : 959–966. [PubMed] [Google Scholar]
  • Hauptmann, One thousand., Belting, H.Thou., Wolke, U., Lunde, Grand., Soll, I., Abdelilah-Seyfried, South., Prince, V., and Driever, Westward. 2002. spiel ohne grenzen/pou2 is required for zebrafish hindbrain segmentation. Evolution 129 : 1645–1655. [PubMed] [Google Scholar]
  • Hinegardner, R. 1976. The cellular Deoxyribonucleic acid content of sharks, rays and another fishes. Comp. Biochem. Physiol. B 55 : 367–370. [PubMed] [Google Scholar]
  • The netherlands, P.W. 1997. Vertebrate evolution: Something fishy about Hox genes. Curr. Biol. 7 : R570–R572. [PubMed] [Google Scholar]
  • Kim, C.-B., Amemiya, C., Bailey, W., Kawasaki, Thou., Mezey, J., Miller, W., Minoshima, S., Shimizu, N., Wagner, G., and Ruddle, F. 2000. Hox cluster genomics in the horn shark, Heterodontus francisci. Proc. Natl. Acad. Sci. 97 : 1655–1660. [PMC complimentary article] [PubMed] [Google Scholar]
  • Knittel, T., Kessel, M., Kim, M.H., and Gruss, P. 1995. A conserved enhancer of the man and murine HoxA-seven gene specifies the anterior boundary of expression during embryonal development. Development 121 : 1077–1088. [PubMed] [Google Scholar]
  • Krumlauf, R. 1994. Hox genes in vertebrate evolution. Cell 78 : 191–201. [PubMed] [Google Scholar]
  • Langston, A.Due west., Thompson, J.R., and Gudas, Fifty.J. 1997. Retinoic acid-responsive enhancers located 3′ of the Hox A and Hox B homeobox factor clusters. Functional analysis. J. Biol. Chem. 272 : 2167–2175. [PubMed] [Google Scholar]
  • Loots, G.G., Locksley, R.1000., Blankespoor, C.1000., Wang, Z.Eastward., Miller, Westward., Rubin, E.M., and Frazer, K.A. 2000. Identification of a coordinate regulator of interleukins 4, 13, and 5 past cross-species sequence comparisons. Science 288 : 136–140. [PubMed] [Google Scholar]
  • Lufkin, T. 1997. Transcriptional regulation of vertebrate Hox genes during embryogenesis. Crit. Rev. Eukaryot. Factor Expr. 7 : 195–213. [PubMed] [Google Scholar]
  • Maconochie, M., Nonchev, S., Morrison, A., and Krumlauf, R. 1996. Paralogous Hox genes: Part and regulation. Annu. Rev. Genet. xxx : 529–556. [PubMed] [Google Scholar]
  • Malaga-Trillo, E. and Meyer, A. 2001. Genome duplication and accelerated evolution of Hox genes and cluster architecture in teleosts fishes. Am. Zool. 41 : 676–686. [Google Scholar]
  • Manzanares, M., Wada, H., Itasaki, N., Trainor, P.A., Krumlauf, R., and Holland, P.W.H. 2000. Conservation and elaboration of Hox cistron regulation during evolution of the vertebrate caput. Nature 408 : 854–857. [PubMed] [Google Scholar]
  • Margalit, Y., Yarus, Southward., Shapira, E., Gruenbaum, Y., and Fainsod, A. 1993. Isolation and characterization of target sequences of the chicken CdxA homeobox cistron. Nucleic Acids Res. 21 : 4915–4922. [PMC gratis article] [PubMed] [Google Scholar]
  • Meyer, A. 1998. Hox factor variation and development. Nature 391 : 225–228. [PubMed] [Google Scholar]
  • Meyer, A. and Malaga-Trillo, E. 1999. More fishy tales about Hox genes. Curr. Biol. 9 : R210–R213. [PubMed] [Google Scholar]
  • Meyer, A. and Schartl, One thousand. 1999. Gene and genome duplications in vertebrates: The one-to-four (-to-8 in fish) rule and the evolution of novel gene functions. Curr. Opin. Jail cell. Biol. eleven : 699–704. [PubMed] [Google Scholar]
  • Moran, J.V., DeBerardinis, R.J., and Kazazian Jr., H.H. 1999. Exon shuffling by L1 retrotransposition. Science 283 : 1530–1534. [PubMed] [Google Scholar]
  • Morrison, A., Chaudhuri, C., Ariza-McNaughton, L., Muchamore, I., Kuroiwa, A., and Krumlauf, R. 1995. Comparative analysis of chicken Hoxb-4 regulation in transgenic mice. Mech. Dev. 53 : 47–59. [PubMed] [Google Scholar]
  • Nonchev, S., Vesque, C., Maconochie, Grand., Seitanidou, T., Ariza-McNaughton, L., Frain, M., Marshall, H., Sham, M.H., Krumlauf, R., and Charnay, P. 1996. Segmental expression of Hoxa-two in the hindbrain is directly regulated past Krox-20. Evolution 122 : 543–554. [PubMed] [Google Scholar]
  • Odenwald, W.F., Garbern, J., Arnheiter, H., Tournier-Lasserve, E., and Lazzarini, R.A. 1989. The Hox-one.3 homeo box protein is a sequence-specific DNA-binding phosphoprotein. Genes & Dev. iii : 158–172. [PubMed] [Google Scholar]
  • Ohno, S. and Atkin, N.B. 1966. Comparative DNA values and chromosome complements of eight species of fishes. Chromosoma 18 : 455–466. [PubMed] [Google Scholar]
  • Onyango, P., Miller, Due west., Lehoczky, J., Leung, C.T., Birren, B., Wheelan, S., Dewark, Chiliad., and Feinberg, A.P. 2000. Sequence and comparative assay of the mouse one-Megabase region orthologous to the human 11p15 imprinted domain. Genome Res. 10 : 1697–1710. [PubMed] [Google Scholar]
  • Peifer, M., Karch, F., and Bender, W. 1987. The bithorax circuitous: Command of segmental identity. Genes & Dev. 1 : 891–898. [PubMed] [Google Scholar]
  • Pough, F.H., Janis, C.M., and Heiser, J.B. 1999. Vertebrate life. Prentice Hall, Englewood Cliffs, NJ.
  • Prince, 5.Eastward., Joly, Fifty., Ekker, Thousand., and Ho, R.K. 1998. Zebrafish Hox genes: Genomic organization and modified colinear expression patterns in the torso. Development 125 : 407–420. [PubMed] [Google Scholar]
  • Robinson-Rechavi, Thousand., Marchand, O., Escriva, H., Bardet, P.L., Zelus, D., Hughes, S., and Laudet, V. 2001. Euteleost fish genomes are characterized by expansion of gene families. Genome Res. 11 : 781–788. [PMC complimentary article] [PubMed] [Google Scholar]
  • Rossi, P., Karsenty, G., Roberts, A.B., Roche, Northward.S., Sporn, M.B., and de Crombrugghe, B. 1988. A nuclear gene 1 bounden site mediates the transcriptional activation of a type I collagen promoter by transforming growth gene-β. Prison cell 52 : 405–414. [PubMed] [Google Scholar]
  • Roth, F.P., Hughes, J.D., Estep, P.W., and Church, G.M. 1998. Finding DNA regulatory motifs inside unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nat. Biotechnol. 16 : 939–945. [PubMed] [Google Scholar]
  • Schwartz, S., Zhang, Z., Frazer, Chiliad.A., Smit, A., Riemer, C., Bouck, J., Gibbs, R., Hardison, R., and Miller, W. 2000. PipMaker-A web server for aligning two genomic Deoxyribonucleic acid sequences. Genome Res. x : 577–586. [PMC gratis article] [PubMed] [Google Scholar]
  • Snell, Eastward.A., Scemama, J.-L., and Stellwag, Eastward.J. 1999. Genomic system of the Hoxa4-Hoxa10 region from Morone saxatilis: Implications for Hox gene evolution amidst vertebrates. J. Exp. Zool. (Mol. Dev. Evol.) 285 : 41–49. [PubMed] [Google Scholar]
  • Stingo, 5., Rocco, L., and Improta, R. 1989. Chromosome markers and karyology of selachians. J. Exp. Zool. Suppl. 2 : 175–185. [PubMed] [Google Scholar]
  • Tagle, D.A., Koop, B.F., Goodman, M., Slightom, J.Fifty., Hess, D.L., and Jones, RT. 1988. Embryonic ε and γ globin genes of a prosimian primate (Galago crassicaudatus). Nucleotide and amino acrid sequences, developmental regulation and phylogenetic footprints. J. Mol. Biol. 203 : 439–455. [PubMed] [Google Scholar]
  • Tan, D.-P., Ferrante, J., Nazarali, A., Shao, X., Kozak, C.A., Guo, Five., and Nirenberg, K. 1992. Murine Hox-1.xi homeobox cistron structure and expression. Proc. Natl. Acad. Sci. 89 : 6280–6284. [PMC free article] [PubMed] [Google Scholar]
  • Taylor, J.S., Van de Peer, Y., Braasch, I., and Meyer, A. 2001. Comparative genomics provides evidence for an ancient genome duplication event in fish. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 356 : 1661–1679. [PMC free commodity] [PubMed] [Google Scholar]
  • Tiersch, T.R., Chandler, R.W., Wachtel, S.S., and Elias, S. 1989. Reference standards for flow cytometry and application in comparative studies of nuclear Dna content. Cytometry ten : 706–710. [PubMed] [Google Scholar]
  • Tomilin, Due north.5. 1999. Control of genes by mammalian retroposons. Int. Rev. Cytol. 186 : 1–48. [PubMed] [Google Scholar]
  • Tompa, Grand. 2001. Identifying functional elements past comparative Dna sequence analysis. Genome Res. 11 : 1143–1144. [PubMed] [Google Scholar]
  • Venkatesh, B., Gilligan, P., and Brenner, South. 2000. Fugu: A compact vertebrate reference genome. FEBS Lett. 476 : 3–7. [PubMed] [Google Scholar]
  • Verrijzer, C.P., Alkema, 1000.J., van Weperen, W.Due west., Van Leeuwen, H.C., Strating, Yard.J., and van der Vliet, P.C. 1992. The Deoxyribonucleic acid binding specificity of the bipartite POU domain and its subdomains. EMBO J. 11 : 4993–5003. [PMC free article] [PubMed] [Google Scholar]
  • Vesque, C., Maconochie, M., Nonchev, Due south., Ariza-McNaughton, L., Kuroiwa, A., Charnay, P., and Krumlauf, R. 1996. Hoxb-ii transcriptional activation in rhombomeres iii and 5 requires an evolutionarily conserved cis-acting chemical element in addition to the Krox-20 binding site. EMBO J. xv : 5383–5396. [PMC free article] [PubMed] [Google Scholar]
  • Vinogradov, A.E. 1998. Genome size and GC-pct in vertebrates every bit adamant by period cytometry: The triangular relationship. Cytometry 31 : 100–109. [PubMed] [Google Scholar]
  • Wasserman, Due west.Westward., Palumbo, Yard., Thompson, W., Fickett, J.West., and Lawrence, C.Due east. 2000. Homo–mouse genome comparisons to locate regulatory sites. Nat. Genet. 26 : 225–228. [PubMed] [Google Scholar]
  • Weiss, Thousand.One thousand. and Fullerton, S.Thou. 2000. Phenogenetic drift and the development of genotype–phenotype relationships. Theor. Popul. Biol. 57 : 187–195. [PubMed] [Google Scholar]
  • Wittbrodt, J., Meyer, A., and Schartl, M. 1998. More genes in fish? BioEssays 20 : 511–515. [Google Scholar]
  • Woods, D.B., Ghysdael, J., and Owen, Chiliad.J. 1992. Identification of nucleotide preferences in DNA sequences recognized specifically past c-ETS-1 poly peptide. Nucleic Acids Res. twenty : 699–704. [PMC gratis article] [PubMed] [Google Scholar]
  • Yanagisawa, S. and Schmidt, R.J. 1999. Diversity and similarity among recognition sequence of Dof transcription factors. Plant J. 17 : 209–214. [PubMed] [Google Scholar]
  • Zhang, Z., Berman, P., and Miller, Westward. 1998. Alignments without low-scoring regions. J. Comput. Biol. v : 197–210. [PubMed] [Google Scholar]

Articles from Genome Research are provided here courtesy of Cold Jump Harbor Laboratory Press


luisthenstuthe.blogspot.com

Source: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC403639/#:~:text=It%20may%20not%20be%20surprising,across%20even%20great%20evolutionary%20distances.

0 Response to "what does it mean to say that hox genes are "conserved" through evolution?"

Post a Comment

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel