Opsin evolution: key critters: Difference between revisions

From genomewiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 1: Line 1:
Imaging eyes developed in deuterostomes quite late relative to protostomes, after divergence of echinoderms, acornworms, amphioxus, and tunicates but before divergence of lamprey. Consequently visual systems in these species that bracket the origin of eyes are vital to understand.
See also: [[Opsin evolution]]


Unfortunately we don't have fossil dna nearly this far back and so are forced to work with a limited to non-existent fossil record of soft body parts or genomes of living species 400-500 million years removed from the ancestral chain of events. Thus anatomical transcript-labelling studies of photoreceptor systems or whole-genome recover of opsin genes in modern amphioxus only speak to the current situation. That may be seriously different from the ancestor both in terms of innovations and losses.
Imaging eyes developed in deuterostomes rather late relative to protostomes -- after body plan appearance and divergence echinoderms, acornworms, amphioxus, and tunicates but before divergence of lamprey (and the visually degenerated hagfish sister group) with the next node up cartilaginous fish. Consequently visual systems in these species that bracket the origin of eyes are vital to understand, so the biology and opsin availability for these key species are considered in depth below.


It's worth addressing this perpetual source of confusion by emphasizing [http://genomewiki.ucsc.edu/index.php/Opsin_evolution:_trichromatic_ancestral_mammal again] that contemporary tunicates, lancelets, and lamprey are not ancient, ancestral, antiquated, archaic, character-retaining, dead-end, failed experiments, frozen in time, genetically stationary, living fossil, primitive, primordial, relic, or retro species. They're full modern -- the tree of life is right-justified. Indeed their genes, regulatory signalling systems, and enzymes may be more finely honed than human because of more rapid evolution attributable to larger effective population sizes, reproductive mode, generation time, and marine selective predatory pressures.
However a broader view is needed because rhabdomeric non-imaging opsins of vertebrates have a predominantly protostomal context (where they constitute the primary imaging opsins); conversely cilliary opsins of vertebrates have functionally obscure but definite antecendents in the ecdyzoa and lophotrochozoa wings. The third type of opsin which uses Go gustducin signalling must also have an important evolutionary history. Recall signaling GPCR alpha subunits experienced gene duplication and subfunctionalization in coordination with opsinlike genes -- for that reason opsins classes can be indicated either by localization (apical rhabdome, cilium membrane, retinal ganglial cell), monophyletic clade in gene tree, or type of signalling partner. This works because after a certain degree of co-evolution, an opsin is irrevocably located into a receptor type.


However we hope that ancestral character traits will still be reflected to some extent in these earlier diverging species and that with enough complete opsin repertoires from taxonomically appropriate species, the ancestral genes and even visual systems can be reconstructed at key nodes on the phylogenetic tree. The story describing the evolution of the human eye then amounts to describing the status at these successive nodes and perhaps interpolating between them. There are definitely limits to knowledge here as metazoans provide only 35 nodes between sponge and human -- gaps between nodes average 30 million years but can seriously exceed that.
Some species such as drosophila have lost all ciliary opsins [ref] -- clearly they are not essential for a successful visually complex flying insect with 5-color vision and circadian rhythm. Bees, annelids, and mammals retain ciliary opsins so we know this must be the ancestral bilateran state state. This predicts ciliary opsins in cnidaria and indeed one was [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2013938 just found] in cnidaria. One sees the importance of complete genomes here (versus transcripts or immunostained sections): absence of ciliary opsin evidence in a genome is truly evidence of ciliary opsin absence.  


The plan below is to summarize several most excellent experimental papers on visual systems in echinoderms, amphioxus, tunicates, lamprey, and chondrichtyhes and supplement that with genomewide recoverable opsin sequences, about half of which are new here and never considered experimentally. That genomic data -- which by itself would have limited interpretability -- can also be processed to determine times of gene origin, gene family relationships, and ancestral sequence which yields, by standard theory, ancestral adsorption specta.
When the eye is reduced to a single pigment cell backing a single photoreceptor cell, the opsin of that species will be expressed only in one cell of the entire body. In this situation, the opsin may never show up in transcript collections, even with subtraction of common ones.


In recent years, our whole framing of vision has undergone immense refinement. Many animal species possess multiple photoreception systems that are not conventional high resolution imaging eyes (eg lack cornea, lens, retina) by human standards. In fact some 83% of animal phyla have flourished over geological time scales without such eyes, in seeming contradiction of popular nonsense ([http://books.google.com/books?id=bhgQC92kxN0C&dq=in+the+blink+of+an+eye In the Blink of an Eye]). The supposed deuterostome advantage in the Cambrian needs to contend with the fact that hemichordates, echinoderms, cephalochordates, and urochordates completely lack imaging eyes even today. Moreover, their genomes totally lack any hint of having developed the requisite ciliary opsin family members.  
Vertebrates could never have evolved cilliary opsin vision had the bilateran ancestor possessed the limited opsin repertoire of fruit fly. Thus most pressing question is -- assuming rhabdomeric opsins were thoroughly entrenched in the earliest imaging eyes and photoreception systems -- what kept ciliary opsins around in early bilatera (and even cnidaria) so that they could later be co-opted for ciliary opsin-based vision? We could also ask why vertebrates did not stay on the rhabdomeric track of early deuterostomes but instead underwent this profound switch to the 'untested' ciliary track. It is not at all clear what advantages ciliary offers over rhabdomeric -- ever miss swatting a fly?


First it's worth reviewing the recent experimental literature. Note abstracts are readily available at [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=12435605,15936279,11591373,9427550,17463225,14981504,15096614,9256070,17961206,11874910,15514158,16311335,16291092 PubMed] but access to free full text is unpredictable and so links are collected below. It suffice to note just recent substantial articles because they cite the earlier literature and their citation in turn by more recent is collected by Google Scholar. Most opsin sequences have a source at PubMed as part of their fasta header database and those can  
Fossil dna does not go back nearly this far back, the nearly non-existent fossil record of soft body parts is unhelpful, and transcripts plus genomes of living species are 450-550 million years removed from the crucial ancestral chain of events. For example, transcript-labelled thin sections of photoreceptor systems in modern amphioxus only speak to the current situation, as does extraction of opsin genes from the new assembly, and speak to the history only by inference. The situation today may be seriously different from the ancestor in terms of both innovations and losses. And that history is not necessarily the most parsimonius (even though we will often assume that).
== simply ==
 
be compiled to an [active] link that opens all of them.[http://genomewiki.ucsc.edu/]them
It's worth expanding on this perpetual source of confusion by emphasizing [http://genomewiki.ucsc.edu/index.php/Opsin_evolution:_trichromatic_ancestral_mammal here too] that contemporary tunicates, lancelets, and lamprey are not ancient, ancestral, antiquated, archaic, character-retaining, dead-end, failed experiments, frozen in time, genetically stationary, living fossil, primitive, primordial, relic, or  retro species. They're full modern -- the tree of life is right-justified. Indeed their genes, regulatory signalling systems, and enzymes may be more finely honed than mammals because of more rapid evolution attributable to larger effective population sizes, reproductive mode, generation time, and marine selective predatory pressures.
 
However we can hope that ancestral character traits will still be reflected to some extent in these earlier diverging species and that with enough complete opsin repertoires from taxonomically appropriate species, the ancestral genes and even visual systems can be reconstructed at key nodes on the phylogenetic tree. The story describing the evolution of the human eye then amounts to describing the status at these successive nodes and perhaps interpolating between them. There are definitely limits to knowledge here as extant metazoans provide only 35 nodes between sponge and human -- gaps between nodes may average 30 million years but can seriously exceed that. This is offset by the occasional proposal of new deuterostome branches (Xenoturbella, Convoluta).
 
The ideal set of genomes needed to study the evolution of the metazoan eye is only partly completed, underway, or even proposed. In some cases, the genome size of clade representatives is so large (eg lungfish at 25x human) the species may never be sequenced, though opsin transcripts could still be obtained. In others, the rate of evolution has been so fast so long that very little information about photoreception at ancestral nodes can remain (eg Oikopleura). Hagfish opsins, which would conveniently break up the crucial lamprey long branch, are not available at GenBank but here the animal has adopted a deep water (dark) habitat, meaning that its cone opsin genomic repertoire will be highly reduced, if not gone entirely, in its markedly degenerated eyes. (Its other opsins could still be informative.)
 
Model organism choices do not always coincide with genome sequenceability, transcriptome projects, nor (worst of all) with slow-evolving less derived species. Finally, most sequencing speaks to narrow anthropocentric interests, whereas the more broadly conceived sequencing need is greatest farther back (to break up branches). The evolution of the eye needs a rather different portfolio of genomes than a typical disease gene because of the earlier intrinsic timing of the innovative events. In fact, one product of the investigation here is to spell out these needed genomes. Of course one obvious genome choice is the cubomedusan jellyfish Carybdea marsupialis with its 24 eyes of 6 types.
 
It's worth reviewing genome status and recent experimental literature on key species. While abstracts are readily available at PubMed, access to free full text is unpredictable, so those links are collected when available. It suffice to reference only recent articles because they cite the earlier literature and citation in turn of their paper are collected by Google Scholar (or AbstractsPlus at PubMed). Most opsin sequences in the [[Opsin evolution]] reference sequence collection have a PubMed accession as a field their fasta header database; those can simply be compiled to an active link that opens all of them in one PubMed window.
   
   
<tt>
* <span style="color: #990099;">Ixodes (tick)</span>
under development ...


[http://biology.plosjournals.org/perlserv/?request=get-document&doi=10.1371/journal.pbio.0040254 Melanopsins]
The genome project was completed long ago but has experienced a multi-year bottleneck in assembly release and publication. However contigs built from 19.4 million traces should become available to tblastn of the GenBank "wgs" division by late 2007. Ixodes has a very conservative genome (regretably 2.1 gbp in size), seemingly far less derived than drosophilids in matters such as intron, gene retention,  and protein sequence conservation. This, in conjunction with the helpful phylogenetic position of chelicerate outgroup to the many insect genomes, has improved prospects for reconstructing the ancestal opsin repertoire of Arthropoda and eventually Protostomia and UrBilatera.


[http://www.sciencedirect.com/cache/MiamiImageURL/B6T36-4417HYP-4-1/0?wchp=dGLzVzz-zSkWW Ciona opsins]
A large collection of annotated ESTs is available at the [http://compbio.dfci.harvard.edu/tgi/cgi-bin/tgi/gireport.pl?gudb=i_scapularis DFCI Gene Index] of which 3 are marked up as opsins. Using the [http://genomewiki.ucsc.edu/index.php/Opsin_evolution Opsin Classifier], I recovered the full length gene for the first of these on 24 Nov 07, intronated the transcript at the Trace Archives (4 introns, superb coverage), and added it to the [http://genomewiki.ucsc.edu/index.php/Opsin_evolution classifier fasta collection] as RHAB1_ixoSca. It classifies with rhabdomeric opsins (ie with deuterostome melanopsins) with a very respectable 57% maximal percent protein identity. The second intron and third have classical ancestral position (following GWSR and LAK) and phase (2 and 0).  Synteny awaits assembly of large contigs -- adjacent exons are never spanned by a single trace.


[http://www.sciencemag.org/cgi/content/full/306/5697/869 Vertebrate-type opsin in invertebrate brain]
* <span style="color: #990099;">Daphnia (water flea)</span>


[http://jeb.biologists.org/cgi/content/full/204/14/2527 What good are cone opsin gene duplications without brain rewiring:]
An 8.7x genome assembly was released in July 2007 at [http://www.jgi.doe.gov/Daphnia/ JGI] with further support at [http://wfleabase.org/ wFleaBase]. This crustacean provides a potentially important outgroup to insects (together forming Pancrustacea). However the opsin story, summarized in a [https://dgc.cgb.indiana.edu/display/daphnia/Carla+Caceres meeting abstract] is an embarrassment of riches, not conducive to deducing ancestral arthropod genome content. The total number of opsin genes came in at 37, comprised of 22 rhabdomeric opsins (mostly long wavelength), 7 ciliary opsins (pteropsins), and 8 in a novel family without close affiliates. This seems excessive but Daphnia has ommatidia (compound eyes), circadian rhythms, and a need to assess water turbidity and depth. Planned in situ hybridization studies may illuminate biological roles of these opsins. The pteropsins are probably of most interest here.


</tt>
Gene models have not been submitted yet to GenBank but are likely extractable by text query at wFleaBase. What is needed here however is not the clutter of 37 sequences but their collapse into UV, blue, long, pteropsin, and novel ancestral representatives. This would remove 'noise' from lineage-specific expansions. The intron structure could provide very important support to classification schemes.  
"The nervous systems of vertebrates are not "hard-wired" at birth (or hatch or the end of metamorphosis...). Decisions about which nerve cells should be connected to which other nerve cells are made during a long space of time prior to adulthood, and in some animals (though usually to a much more limited extent) even during adulthood. Genetics seem to specify (in unknown ways) some of the gross features of connectivity--for example in mammals the axons of ganglion cells in the eye mostly grow through the optic nerve to a particular group of cells in the thalamus. However, the fine distinctions about, for instance, which ganglion cells connect to which cells in the thalamus are made initially by the formation of a lot of random connections. Many of these connections are then pruned back so that each ganglion cell stimulates only a small subset of the cells it initially connected with. The "rules" governing the pruning back are largely based on correlations in the activity of different cells--if two cells in the retina are generally active at the same time, then they will probably end up being connected to the same cells in the thalamus."


"This activity-dependent pruning of connections appears to be the way that "maps" are created in higher brain areas. The best indicator of whether or not two cells in the retina will be simultaneously active, is how close they are to each other in space. Cells in the thalamus thus form a map of cells in the retina according to their activity, and hence their connectivity. Now it's easy to imagine that another determinant of whether or not two cells will be active at the same time is whether or not they are connected to cells which express the same pigment (within the retina, the same rules are followed in the creation of connections, so ganglion cells will preferentially be connected to cells which express the same pigment). So in the thalamus and other brain regions, there will be maps of the different receptor types within the maps of retinal location."
The expansions may have arisen through retroprocessing (rather than segmental duplication) of a few master exonic genes, which would then be the orthologs to other arthropod opsins. Indeed the intronation pattern -- typically far more [http://genomewiki.ucsc.edu/index.php/Ancestral_introns:_SGSH deeply conserved] than protein sequence -- could link pteropsins more convincingly to lophotrochoan and deuterostome opsins than alignments with percent identities in the 20's.  


This blast twilight zone is especially dangerous for photoreceptor opsins because they are embedded in much larger gene family of generic rhodopsin and GPCR which share many structural and signaling properties. A slowly evolving generic rhodopsin might well score higher than fast evolving photoreceptor opsins. Gene expansions are noted for markedly enhanced rates as copies neo- or subfunctionalize. The generic rhodopsin might also share diagnostic residues through convergence at least at the level of statistical signficance ambiguity. Consequently intron location/phase and synteny can provide important backup.


[[Category:Comparative Genomics]]
The synteny circle surviving at this phylogenetic depth will be local (optimistically Pancrustacean). That is, the blue opsin of Daphnia might in synteny with Drosophila (ie establish orthology) but not to Platynereis ciliary opsin much less any vertebrate opsin (eg encephalopsin). This could be remedied to some extent by ancestral gene order reconstruction. The degree to which synteny can contribute to validating orthology relations within opsins is not currently known.

Revision as of 17:32, 24 November 2007

See also: Opsin evolution

Imaging eyes developed in deuterostomes rather late relative to protostomes -- after body plan appearance and divergence echinoderms, acornworms, amphioxus, and tunicates but before divergence of lamprey (and the visually degenerated hagfish sister group) with the next node up cartilaginous fish. Consequently visual systems in these species that bracket the origin of eyes are vital to understand, so the biology and opsin availability for these key species are considered in depth below.

However a broader view is needed because rhabdomeric non-imaging opsins of vertebrates have a predominantly protostomal context (where they constitute the primary imaging opsins); conversely cilliary opsins of vertebrates have functionally obscure but definite antecendents in the ecdyzoa and lophotrochozoa wings. The third type of opsin which uses Go gustducin signalling must also have an important evolutionary history. Recall signaling GPCR alpha subunits experienced gene duplication and subfunctionalization in coordination with opsinlike genes -- for that reason opsins classes can be indicated either by localization (apical rhabdome, cilium membrane, retinal ganglial cell), monophyletic clade in gene tree, or type of signalling partner. This works because after a certain degree of co-evolution, an opsin is irrevocably located into a receptor type.

Some species such as drosophila have lost all ciliary opsins [ref] -- clearly they are not essential for a successful visually complex flying insect with 5-color vision and circadian rhythm. Bees, annelids, and mammals retain ciliary opsins so we know this must be the ancestral bilateran state state. This predicts ciliary opsins in cnidaria and indeed one was just found in cnidaria. One sees the importance of complete genomes here (versus transcripts or immunostained sections): absence of ciliary opsin evidence in a genome is truly evidence of ciliary opsin absence.

When the eye is reduced to a single pigment cell backing a single photoreceptor cell, the opsin of that species will be expressed only in one cell of the entire body. In this situation, the opsin may never show up in transcript collections, even with subtraction of common ones.

Vertebrates could never have evolved cilliary opsin vision had the bilateran ancestor possessed the limited opsin repertoire of fruit fly. Thus most pressing question is -- assuming rhabdomeric opsins were thoroughly entrenched in the earliest imaging eyes and photoreception systems -- what kept ciliary opsins around in early bilatera (and even cnidaria) so that they could later be co-opted for ciliary opsin-based vision? We could also ask why vertebrates did not stay on the rhabdomeric track of early deuterostomes but instead underwent this profound switch to the 'untested' ciliary track. It is not at all clear what advantages ciliary offers over rhabdomeric -- ever miss swatting a fly?

Fossil dna does not go back nearly this far back, the nearly non-existent fossil record of soft body parts is unhelpful, and transcripts plus genomes of living species are 450-550 million years removed from the crucial ancestral chain of events. For example, transcript-labelled thin sections of photoreceptor systems in modern amphioxus only speak to the current situation, as does extraction of opsin genes from the new assembly, and speak to the history only by inference. The situation today may be seriously different from the ancestor in terms of both innovations and losses. And that history is not necessarily the most parsimonius (even though we will often assume that).

It's worth expanding on this perpetual source of confusion by emphasizing here too that contemporary tunicates, lancelets, and lamprey are not ancient, ancestral, antiquated, archaic, character-retaining, dead-end, failed experiments, frozen in time, genetically stationary, living fossil, primitive, primordial, relic, or retro species. They're full modern -- the tree of life is right-justified. Indeed their genes, regulatory signalling systems, and enzymes may be more finely honed than mammals because of more rapid evolution attributable to larger effective population sizes, reproductive mode, generation time, and marine selective predatory pressures.

However we can hope that ancestral character traits will still be reflected to some extent in these earlier diverging species and that with enough complete opsin repertoires from taxonomically appropriate species, the ancestral genes and even visual systems can be reconstructed at key nodes on the phylogenetic tree. The story describing the evolution of the human eye then amounts to describing the status at these successive nodes and perhaps interpolating between them. There are definitely limits to knowledge here as extant metazoans provide only 35 nodes between sponge and human -- gaps between nodes may average 30 million years but can seriously exceed that. This is offset by the occasional proposal of new deuterostome branches (Xenoturbella, Convoluta).

The ideal set of genomes needed to study the evolution of the metazoan eye is only partly completed, underway, or even proposed. In some cases, the genome size of clade representatives is so large (eg lungfish at 25x human) the species may never be sequenced, though opsin transcripts could still be obtained. In others, the rate of evolution has been so fast so long that very little information about photoreception at ancestral nodes can remain (eg Oikopleura). Hagfish opsins, which would conveniently break up the crucial lamprey long branch, are not available at GenBank but here the animal has adopted a deep water (dark) habitat, meaning that its cone opsin genomic repertoire will be highly reduced, if not gone entirely, in its markedly degenerated eyes. (Its other opsins could still be informative.)

Model organism choices do not always coincide with genome sequenceability, transcriptome projects, nor (worst of all) with slow-evolving less derived species. Finally, most sequencing speaks to narrow anthropocentric interests, whereas the more broadly conceived sequencing need is greatest farther back (to break up branches). The evolution of the eye needs a rather different portfolio of genomes than a typical disease gene because of the earlier intrinsic timing of the innovative events. In fact, one product of the investigation here is to spell out these needed genomes. Of course one obvious genome choice is the cubomedusan jellyfish Carybdea marsupialis with its 24 eyes of 6 types.

It's worth reviewing genome status and recent experimental literature on key species. While abstracts are readily available at PubMed, access to free full text is unpredictable, so those links are collected when available. It suffice to reference only recent articles because they cite the earlier literature and citation in turn of their paper are collected by Google Scholar (or AbstractsPlus at PubMed). Most opsin sequences in the Opsin evolution reference sequence collection have a PubMed accession as a field their fasta header database; those can simply be compiled to an active link that opens all of them in one PubMed window.

  • Ixodes (tick)

The genome project was completed long ago but has experienced a multi-year bottleneck in assembly release and publication. However contigs built from 19.4 million traces should become available to tblastn of the GenBank "wgs" division by late 2007. Ixodes has a very conservative genome (regretably 2.1 gbp in size), seemingly far less derived than drosophilids in matters such as intron, gene retention, and protein sequence conservation. This, in conjunction with the helpful phylogenetic position of chelicerate outgroup to the many insect genomes, has improved prospects for reconstructing the ancestal opsin repertoire of Arthropoda and eventually Protostomia and UrBilatera.

A large collection of annotated ESTs is available at the DFCI Gene Index of which 3 are marked up as opsins. Using the Opsin Classifier, I recovered the full length gene for the first of these on 24 Nov 07, intronated the transcript at the Trace Archives (4 introns, superb coverage), and added it to the classifier fasta collection as RHAB1_ixoSca. It classifies with rhabdomeric opsins (ie with deuterostome melanopsins) with a very respectable 57% maximal percent protein identity. The second intron and third have classical ancestral position (following GWSR and LAK) and phase (2 and 0). Synteny awaits assembly of large contigs -- adjacent exons are never spanned by a single trace.

  • Daphnia (water flea)

An 8.7x genome assembly was released in July 2007 at JGI with further support at wFleaBase. This crustacean provides a potentially important outgroup to insects (together forming Pancrustacea). However the opsin story, summarized in a meeting abstract is an embarrassment of riches, not conducive to deducing ancestral arthropod genome content. The total number of opsin genes came in at 37, comprised of 22 rhabdomeric opsins (mostly long wavelength), 7 ciliary opsins (pteropsins), and 8 in a novel family without close affiliates. This seems excessive but Daphnia has ommatidia (compound eyes), circadian rhythms, and a need to assess water turbidity and depth. Planned in situ hybridization studies may illuminate biological roles of these opsins. The pteropsins are probably of most interest here.

Gene models have not been submitted yet to GenBank but are likely extractable by text query at wFleaBase. What is needed here however is not the clutter of 37 sequences but their collapse into UV, blue, long, pteropsin, and novel ancestral representatives. This would remove 'noise' from lineage-specific expansions. The intron structure could provide very important support to classification schemes.

The expansions may have arisen through retroprocessing (rather than segmental duplication) of a few master exonic genes, which would then be the orthologs to other arthropod opsins. Indeed the intronation pattern -- typically far more deeply conserved than protein sequence -- could link pteropsins more convincingly to lophotrochoan and deuterostome opsins than alignments with percent identities in the 20's.

This blast twilight zone is especially dangerous for photoreceptor opsins because they are embedded in much larger gene family of generic rhodopsin and GPCR which share many structural and signaling properties. A slowly evolving generic rhodopsin might well score higher than fast evolving photoreceptor opsins. Gene expansions are noted for markedly enhanced rates as copies neo- or subfunctionalize. The generic rhodopsin might also share diagnostic residues through convergence at least at the level of statistical signficance ambiguity. Consequently intron location/phase and synteny can provide important backup.

The synteny circle surviving at this phylogenetic depth will be local (optimistically Pancrustacean). That is, the blue opsin of Daphnia might in synteny with Drosophila (ie establish orthology) but not to Platynereis ciliary opsin much less any vertebrate opsin (eg encephalopsin). This could be remedied to some extent by ancestral gene order reconstruction. The degree to which synteny can contribute to validating orthology relations within opsins is not currently known.