Opsin evolution

From genomewiki
Revision as of 10:13, 8 January 2008 by Tomemerald (talk | contribs)
Jump to navigationJump to search

Introduction to Opsin Evolution

The Curated Set of Metazoan Opsins

Below is the largest set of phylogenetically dispersed hand-curated opsin sequences ever assembled. The sequences are organized into true orthology classes using coding indels, intron location and phase, synteny of flanking genes, diagnostic residues, blast clustering, and experimental characterization when available. The reference set of opsin sequences includes selected GenBank entries but mostly new opsins extracted from dozens of newly completed genome projects.

The set serves as a gene family classifier ... just uBlast an unknown candidate opsin against the full database below and look for consistent labelling of the top hits from the Opsin Classifier. Then validate the apparent orthology class using three independent classes of rare genomic events (indels, introns, signature residues) by including the new sequence in a full-width Multalinment, being sure to include a couple dozen non-opsin GPCRs as controls. Then check for residual bilateral flanking gene adjacency when a genome assembly is available. The whole process takes only 15 seconds per query!

The set of reference sequences is deliberately not exhaustive -- that seriously oversamples in popular experimental clades such as vertebrates and insects. When a given clade has many similar sequences available, those in species with genome assemblies are chosen to represent the group, for example anole is preferred to gecko, and (rightly or wrongly) any experimental results from gecko transfered over. This avoids the uninformative clutter of near-identical sequences. However if a clade reflects a very deep divergence especially important to opsin evolution (such as lamprey or amphioxus), all available sequences are needed to maximally break up long branches.

Sequences not available from GenBank were culled from trace archives, tiled contigs, and genome assemblies, typically by uBlastx against the growing set of reference sequences, as described in the annotation section. The level of error in the curated sequences is very low, declines with time as anomalies are revisited and repaired, but never reaches zero because of problems inherent to experimental data, imperfect sequencing reads, less-than-complete assemblies, and sequence manipulation.

Rare genomic changes can supplement (and even displace) traditional maximal likelihood and bayesian inference in resolving polytomic divergence nodes in gene and species tree topologies. Rare genomic changes applicable to opsins include coding indels (deletions and insertions), intron placement (position and phase comparison), synteny (gene order along the chromosome), and gene copy number change (gene gain from retropositional, tandem, segmental, and whole genome duplications; gene loss from pseudogenization or deletion). Results from these methods must be evaluated for their susceptibility to homoplasy (misleading recurrent independent events that mimic a single event) and incomplete penetration in the population level at the time of speciation (lineage sorting).

Opsins are more informatively stored as proteins since nucleotide sequences are far too diverged at metazoan evolutionary distances and do not explicitly manifest residue properties that experience selection. These protein sequences are parsed into constituent exons using genomic information when available -- fortunately splicing mechanisms are exceeding well conserved across animal phyla. When not directly available (eg the opsin originated as a cDNA in a species lacking a genome project), exon breaks have been inferred from the phylogenetically closest neighbors via hterologous alignment (but not in insects where intron turnover is too high). As an example, lamprey opsins from Geotria australis and Lethenteron japonicum can work as blastn queries to locate orthologs within the Petromyzon maritimus genome project (which consists solely of 19 million traces as of mid-December 2007).Numbers flanking exons, namely 0 1 2, show the phasing of each intron, eg 12 means an overhang of 1 bp at the 3' end of an exon with fragmentary codon completed by a 2 bp overhang at the beginning of the next exon.

Intron position and phasing are conserved over vast evolutionary distances -- human to sponge and beyond -- with the exception of certain rogue lineages (that unfortunately includes two major model organisms). Informative conservation is still available long after protein percent identity has slid into the twilight zone of uncertainty (critical in opsins because they are readily confused with generic GPCR receptors). For example, no variation whatsoever in intron pattern has occured in any vertebrate opsin class since lamprey divergence, no events in many billions of years of branch length. Prior to that, rare events are observed, for example LWS opsins gained an extra intron of phase 12. In a protein having 333 locations with 3 possible phases at each location, convergent evolution (homoplasy) is uncommon, that is, close examination establishes opsin introns are still highly informative even when sequences have greatly diverged.

Insertions and deletions (coding indels) are sufficiently uncommon in opsins that they are a potentially phylogenetically informative class of rare genomic events. We'll harvest these from a massive opsin alignment and stratify by location and phylogenetic depth. Indels are more subject to homoplasy than intron gain or loss but the risk of that varies greatly by region. Indels are less affected by lineage-specific rates than other event classes.

Syntentic relationships can have great value in determining orthology relationships, though gene order is not often conserved over great evolutionary timescales due to chromosomal rearrangements. As with introns, certain rogue lineages lose almost all this information. Tracking synteny is nomenclaturally confused because many genes are unnamed and simply numbered by annotation pipeline procedures in each genome without homological consistency. Here, HUGO-convention names are used for two genes flanking each side of a given human opsin. Strand orientation is noted relative to a fixed convention of plus strand for the human opsin. Then each genome assembly is visited to determine the extent of conservation of these flanking genes and orientation. In the event humans lack the gene, synteny is defined by the nearest diverging species, typically platypus or chicken, that has the gene. Sometimes the original synteny is only partly retained (left or right half-synteny). For deeply diverging species such as amphioxus, flanking genes can be are pushed forward into 'nearby' species. This can also be done without an assembly in species with sufficiently large contigs containing the opsin. Blast clustering can be uncertain because of diminishing percent identity or even loss of these flanking genes; common chromsomal displacements such as inversions eventually disrupt most syntentic arrangements.

The fasta header of each sequence serves as a miniature database that conveniently collects this and other basic information, with fields showing the opsin type, genus, species and common name, accession number, best PubMed citation, indels, intron pattern, sequence length, lambda max adsorption, flanking synteny, and G protein type with which it interacts (all subject to availability and work-in-progress). These fasta headers by themselves provide a quick over view the opsin reference set collection -- simply paste into a blank document and pull lines containing '>'.

Thus the usual querying at GenBank does not remotely compare to the Opsin Classifier. Those sequences -- often mis-annotated by an unattended pipeline or well-intentioned individual with no qualifications in comparative genomics -- are spread out over separate databases not accessible by any single method, with difficult-to-interpret edge creep of genomic blast matches, uncorrected frameshifts, missing stop codons and erroneous amino terminals.

As worst-case scenario, half-baked annotation of the sea urchin genome by pipelines and casual procedures has left a awful legacy of bogus opsin gene structures at GenBank, journal alignments, and genome browsers -- often mis-classified because of an inadequate set of reference sequences and non-opsin GPCR controls, chimeric confusion in tandem duplicates, and non-consideration of intron structure, indels, and synteny. These errors may ripple downstream to errors by subsequent scientists trusting GenBank nr as classifier and lead to a whole subculture taken up with non-existent "virtual" opsins and attendent vacuous speculation on echinoderm photoreception.

Melanopsins, the unexpected rhabdomeric-class Gq-coupled opsin recently found in upper deuterostomes, illustrate some of the difficulties of accurate annotation. They can be confused homologically due to various expansions and contractions. Mammals, human through platypus, have a single melanopsin. However a common ancestor to chicken, lizard, frog, teleost fish, and possibly cartilagenous fish had a multi-gene segmental duplication with both resulting melanopsins retained (though substantially diverged). In ray-finned fish, a processed retrogene arose that may be functional in zebrafish though lost in fugu and stickleback. After whole genome duplication, zebrafish also retained two copies of the original melanopsin. Chondrichthyes also have a second copy of the primary melanopsin but synteny -- which is essential for analysis since intron placement is uninformative in duplications and sequence alignment too dependent on unknown rates -- is not available in the current contig-level assembly.

Amphioxus contains two melanopsins from an apparently independent duplication. Flanking gene order today bears no relation to any vertebrate gene order. The lamprey situation awaits assembly of its traces or targeted transcript studies. At this time, only a four-exon fragmentary melanopsin can be recovered (however with high percent identity, 80%). Possibly orthologs of this melanopsin locus could be tracked into the highly derived tunicates, acorn worm, and sea urchins. The distinctive intron pattern may even allow melanopsin antecedents to be identified in Cnidaria and Protostomia. At this point, the best blastp match to insects stands at 37% with no evident syntenic or intronic support.

While clade-specific proliferations of melanopsins -- and implied role subfunctionalizations -- confounds the situation for chordates, it has little impact on the opsin classifier described here. Unknown sequences readily find their place because of the extensive phylogenetic representation of reference sequence orthology classes and the inherent distance of melanopsins from the ciliary subcollection. At that level of alignment, the melanopsins serve as outgroup to ciliary opsins and so help define motifs specific to Gt-coupled signaling and other structure/function issues.

It appears however that far too much 'lumping' has taken place in nearly all non-imaging opsins, for example encephalopsins, melanopsins, and peropsins. The taxonomic counterpart is too much lumping of species outside of mammal. In all likelihood, additional opsin orthology classes need definition and distinctive naming. These opsins were belatedly discovered through whole-genome homology searches; they may differ one from another just as much as cone and rod opsins. However, it is currently difficult to disentangle phylogenetically short-lived expansions within known families from deep-rooted parallel-evolving subfamilies concentrated (superficially) within 'secondary' photoreception systems. Genome sampling density and completion efforts are overwelmingly concentrated in a few model species and near human.

We should not be too hasty to write off peropsins as mere auxillary retinal isomerases that replenish cis-retinal for 'real' opsins. This reaction is not a simple isomerization (like glucose isomerase) but photoisomerization with an evolutionarily tuned and conserved visual light action spectrum. There is no reason that trans-retinal cannot be the signaling agonist with conversion to all-cis retinal the waste product. Pairing two opposing types of opsins in a single cell or nearby cells then completes a full visual cycle with interesting opportunities for sensory capabilities. Of all the so-called retinal isomerases, peropsins retain more of the diagnostic residues of ciliary opsins; possibly they function similarly but in balance with another opsin class co-expressed in the same cell. Melanopsin, surprisingly, is fully capable of self-replenishment, so perhaps the first-studied ciliary opsins are the true anomaly with their need for an auxillary replenishment cycle in the retinal epithelium.

It's abundantly clear from distinct ancestral introns and alignment clustering that peropsins together with neuropsins and RGR opsins comprise a distinctive subclade within the overall opsin gene tree. The comparative genomics of RGR illustrates the danger of generalizing from all-to-commom phylogenetic undersampling: with much deeper sampling, we can observe that the E/DRY motif -- conserved across all classes of opsins (indeed GPCR) and critical to maintaining non-signalling conformation -- has become GRY in all boreoeutheran mammals (it's ERY in afrotheres and DRY from platypus through shark and even tunicate, and DRY in neuropsins and peropsins).

In other words, after several trillion years of branch length conservation as charged amino acid, a radical amino acid substitution has taken hold -- to glycine with tiny inert side chain. In this subclade of placental mammals, glycine has been conserved without exception for over two billion years of branch length. Given the importance of this motif for maintaining the non-signalling state, this suggests a major change in functional properties of RGR opsins within boreoeutheres. That change might be breakdown to mere isomerase within that clade.

Comparative Genomics of DRY motifs in exon 3 of RGR Opsins:
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 human
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 macaque
1 RWPYGSGGCQAHGFQGFTTALASICGSAAIAWGRYHHYCT 1 lemur
1 RWPYGSDGCKVHGFQGFATALASISGSAAIAWGRYHQYCT 1 treeshrew
1 RWPHGSEGCQVHGFQGFATALASICGSAAVAWGRYHHYCT 1 mouse
1 RWPYGSDGCQAHGFQGFATALASICGSAAIAWGRYHHYCT 1 rabbit
1 RWPYGSDGCQAHGFQGFATALASICGSAAIAWGRYHHYCT 1 pika
1 RWPYGSEGCQAHGFQGFVTALASICSSAAVAWGRYHHFCT 1 cow
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 horse
1 RWPYGSNGCQAHGFQGFVTALASICSSAAIAWGRYHHYCS 1 cat
1 RWPYGPDGCQAHGFQGFATALASICSSAALAWGRYHHYCT 1 dog
1 RWPYGSGGCQAHGFQGFAAALASICGSAAVAWGRYHHYCT 1 bat
1 RWPFGPDGCQAHGFQGFATALASICSSAAIAWGRYHHYCT 1 shrew
1 RWPYGSDGCQAHGFQGFVTALASICSCAAIAWERYHHYCT 1 elephant
1 RWPYGSDGCQAHGFQGFVMALTSICSCAAIAWERYHHYCT 1 hyrax
1 HWPYGSGGCQAHGFQGFTVALASICSCAAIAWERYHHYCT 1 tenrec
1 RWPHGSDSCQAHSFQGFATALASISSSAAIAWERYRHHCT 1 sloth
1 RWPYGSGGCQAHGFQGFVTALASISSSAAIAWERCHRHCI 1 armadillo
1 HWPYGAEGCRLHGFQGFATALASISLSAAIGWDRYLRHCS 1 platypus
1 YWPYGSDGCQIHGFHGFLTALTSISSAAAVAWDRHHQYCT 1 lizard
1 YWPYGSEGCQIHGFQGFLTALASISSSAAVAWDRYHHYCT 1 chicken
1 YWPYGSEGCQIHGFQGFVAALSSIGSCAAIAWDRYHQYCT 1 frog
1 YWPYGSDGCQTHGFQGFVTALASIHFIAAIAWDRYHQYCT 1 stickleback
1 YWPYGSDGCQTHGFQGFMTALASIHFIAAIAWDRYHQYCT 1 zfish1
1 HWPFGSEGCQLHAFQGMVSILAAISFLGAVAWDRYHQYCT 1 zfish2
1 YWPYGSEGCQTHGFQGFVTALASIHFVAAIAWDRYHQYCT 1 tetraodon
1 YWPYGSDGCQTHGFQGFVTALASIHFVAAIAWDRYHQYCT 1 fugu
1 YWPYGSEGCQTHGFHGFLTALASIHFIAAIAWDRYHQYCT 1 medaka
1 YWPYGSDGCQTHGFQGFMTALASIHFIAAIAWDRYHQYCT 1 pimephales
1 YWPYGSDGCQTHGFQGFMTALASIHFVAAIAWDRYHQYCT 1 osmerus
1 YWPYGSEGCQTHGFHGFLMALASINACAAIAWDRYHQNCS 1 elephantshark
1 EWPFGSIGCQLDAFIGMAPTFISIAGAALVAKDKYYRICK 1 tunicate 1
1 RWLFGKFGCYFHGFAGMLFGLGSIGNLTVISIDRYIITCK 1 tunicate 2
1 QWPFGDLGCQVDAFIGMAPTFISIAGAALIAKDKYYRFCK 1 tunicate 2

Excellent recent publications have greatly added to our understanding of the evolutionary origin of light reception capabilities, yet only with the advent of genomic sequencing have we begun to get a full grip on opsins and a few other evo-devo photoreceptor components such as PAX6. Yet critical early-diverging metazoan genomes that might retain information on ancestral characters are greatly under-represented given the branch lengths involved. No purpose is served by elaborate gene tree computations that include a single token peropsin or parietopsin among hundreds of cone opsins; if opsins are central to the endeavor yet so haphazardly annotated, what is the status of dozens of lesser but still critical photoreceptor gene families?

The question of how many times eyes independently arose is ill-posed. We can only wonder how people with no grasp whatsoever of the underlying molecular biology of photoreception could pontificate so boldly on its evolution prior to even sequencing of the first opsin in 1980. First collect comprehensive comparative genomic data sets on all the components in all the relevent clades; then ask what it could mean to homologenize two contemporary systems comprised of partially overlapping sets of heterogenous paralogs from assorted ancient gene families having long complex histories of clade-specific expansion and contraction cycles. If four arrestins serve a thousand GPCR, if even specialized photoreceptor cells express dozens of barely related 7TM signaling molecules, it's difficult to imagine what a cognate arrestin or ectopic evo-devo cassette might be. It's difficult enough to reconstruct an ancestral gene or two; reconstructing ancestral systems biology interactions lies in the future.

After reviewing topics such as ciliary opsin in protostomes, rhabdomeric opsins in deuterostomes,the rich opsin repertoires in cnidarians and probable opsins in sponges, we will consider special topics such as the origin of image-forming eyes beween amphioxus and lamprey divergences, noting throughout that our notion of 'eye' is much more nuanced than earlier. The reconstruction of Urbilateran and Urmetazoan eyes awaits additional cnidarian genomes but no new ones are currently underway. However the plethora of new arthopod and lophotrochozoan genome assemblies has opened up new avenues of research as the realization grows that fly and nematode model species are exceedingly derived, with better ancestral characters retained in other lineages.

Numerous conflicting gene trees have been published for ciliary opsins. Some methodologies have bordered on the preposterous -- thin phylogenetic coverage, dimly related outgroups such as adrenergic receptors, and naive underlying mutational models assumed for maximal likelihood despite great diversity of species and many billions of years of branch length. Nonetheless the resultant trees have only moderate conflict, suggesting that a definitive opsin gene tree is not far off. We'll do this using the multiple types of evidence discussed above.

The first point to be understood in deuterostome ciliary opsin evolution is jawless fish such as lamprey already exhibit a full-blown set of modern rod and cone opsins whereas earlier diverging clades as represented by hemichordate, echinoderm, amphioxus and tunicate genomes totally lack them and indeed lack imaging eyes altogether, while using their rhabdomeric opsins in a very distinct signaling system for their own photorecepton systems. Despite 7 sequenced opsin mRNAs in the amphioxus Branchiostoma belcheri and an initial assembly in Branchiostoma floridae providing counterparts there, no rod/cone opsin can be located there or in earlier diverging deuterostomes with genome projects (3 unicates, 2 urchins, 1 acorn worm). These species may have larval eye spots, ocelli, pigment cells, and related photoreceptors but lack imaging eyes.

Characters in extant (living) species should never be confused with ancestral characters present at the time of divergence nodes (last common ancestors); conceivably these early diverging deuterostomes have lost opsin genes, perhaps due to a habitat shift to deep water, burrowing habitat, or nocturnal lifestyle. However the molecular evidence is quite clear that full-blown pentachromatic color vision and most other modern ciliary opsin classes first appeared during the evolutionary stem preceding lamprey divergence. These are demonstrably not 'new' genes but derived from gene duplication and divergence of still older opsins of ciliary class..

The fossil record is unsatisfactory: less than 1 bilateran in 10,000 in Chengjiang and Burgess Shale fossils is even a candidate for deuterostomy. Low numbers of specimens and poor preservation conspire with career pressures and impact-seeking journals in egregiously misinterpreted data in the view of Hou, discoverer of the Chinese lagerstaette. Myllokunmingia is the best situation with 500 specimens; however Haikouichthys as supposed stem deuterostome, Metaspreggina as supposed post-Ediacaran, and Yunnanozoan generally are problematic, valid only in the eye of some beholder.

While signs of bilaterily disposed eyes are sometimes documentable, it does not follow these were image-forming eyes. Indeed contemporary Branchiostoma and tunicate larva have an eye-spot (ocellus); the genomes contain ciliary opsins but only clustering to ENCEPH and PPIN -- still a long ways from any imaging opsin. Echinoderms and hemichordates genomes also have opsins but even further diverged. Sea urchin genome encodes at least six opsins, four of these cluster classify to rhabdomeric, ciliary and Go-type. Tube feet are apparently the photosensory organ in adult urchins.

The oldest known fossil lamprey, Priscomyzon, dates at 360 myr to the Devonian. Molecular clocks place lamprey appearance at approximately 430 myr, some 100 million years after Chengjiang and Burgess Shales fossil Lagerstatte formed. Like most soft tissues, eyes seldom leave a good fossil record, though bilateral placement might be reflected in bone orbits. Hagfish, sister group to lamprey, have imaging eyes but have not been studied; their opsins situation may be derived due to deepwater marine habitat (similarly deepwater coelocanth opsins are adapted to less scattered wavelengths, centered at420 nm).

The next-diverging chondrichtyes have inadequate data at GenBank -- only a few rhodopsin genes from skates and dogfish. This makes even fragmentary opsins from the partially sequenced elephantfish Callorhyncus milii quite valuable. Those 9 fragments and 3 from the lamprey genome are provided in the data section -- the opsin classifier tool can reliably type a fragment from a single mid-sized exon. Full length genes are preferable but these fragments serve to prove existence of various gene class at the time of a given divergence node. Further, they can validate certain rare genomic events provided the fragment happens to overlap the region of interest.

On the other hand, thousands of high-quality Cambrian arthropod fossils unmistakably show stalked paired eyes. Fossil trilobite eyes have been much studied; these are better preserved due to calcite as lens crystalin. Imaging eyes of contemporary arthropods and lophotrochozoa are rhabdomeric, utilizing depolarizing Gq-type receptor, phospholipase C, phosphoinositola, diacylglycerol, and transient receptor potential TRP and TRPL channel signaling. However their genomes can also contain ciliary opsins, using hyperpolarizing Gt-type transducins and phosphodiesterase cGMP second-messaging (as well as Go-type gustducin ciliary opsins in other types of photoreceptors).

Vertebrates are just the opposite, having crossed over to a near-exclusively ciliary opsin-based imaging system, while retaining rhabdomeric signaling in retinal ganglion cells and elsewhere. (A very recent report demonstrates very rare human and mouse cone cell types expressing exclusive melanopsin; however melanopsin cannot have given rise to ciliary cone opsins.)

It must not be thought that bilaterans invented imaging eyes because much earlier diverging cubomedusan jellyfish such as Carybdea marsupialis has 4 eyestalks each with 6 photoreceptors of 4 types: simple eyespots, pigment cups, complex pigment cups with lenses, and camera-type eyes with a cornea, lens, and retina. This jellyfish chases, captures, and eats 'more advanced' teleost fish. This wing of cnidarians very much needs a genome project.

Cnidarian opsins are becoming available from Hydra and Nematostella genomes and targeted hybridization experiments. Hydra may express a ciliary-type opsin in ectodermal sensory nerve cells whereas Nematostella has opsins classifying between melanopsin and encephalopsin. These distant opsin are very difficult to distinguish bioinformatically from non-opsins in the rhodopsin superfamily within GPCR, so it is exceedingly important to include controls because a somewhat unspecialized photoreceptor cell may also express other sensory system signaling non-opsins that are nonetheless genetically homologous.

In summary, there is no evidence whatsoever -- and every reason to doubt from genomic analysis -- that deuterostomes had imaging eyes during the Cambrian. Despite this, a BBC series "Walking With Monsters" portrayed a school of 25 mm Haikouichthys attacking and wounding an Anomalocaris twenty times their size. It is easy to guess at a scientific advisory panel that envisions deuterostomes triumphing over protostomes. This recurrent anthropocentric fantasy is echoed in museum imagery of early mammals nimbly predating on dinosaur nests -- dioramas quietly dismantled after Yucatan meteorite discovery.

Imaging eyes are not essential to survival; even today subterranean mammals such as blind mole rat flourish without them. Discounting ray-finned fish numbers, a very substantial proportion of all extant animal species lack imaging eyes 525 myr after the Cambrian. Of 33 animal phyla, one-third have no specialized organ for detecting light, one-third have light-sensitive organs, and the remaining 6 have imaging eyes (Cnidaria, Mollusca, Annelida, Onychophora, Arthropoda, and Chordata). Thus 82% of animal phyla have survived well over 500 myr without imaging eyes despite the supposedly unrelenting competition/predation from animals having them.


Opsin Gene and Species Trees

The phylogenetic tree below shows the presence or absence of various opsin genes in clade-representative species, as reflected in the collected reference sequences. The purpose is timing appearance (or disappearance) of a given class of opsin gene. For example, cone and rod opsins first appeared before lamprey divergence; otherwise they are absent from urochordates, cephalochordates, and earlier deuterostomes. Note however a given gene might appear absent because of a genome project gap, lack of experimental effort, insufficient or outdated bioinformatics, or species idiosyncracies (ie be present in a different species of that clade). In other cases (eg platypus SWS1) pseudogene remnants or a syntenically proven deletion establish the gene is definitely absent. Y means yes (present), N means no (absent). The figure needs a few fixes.

Opsin phylo.png


The opsin gene trees below illustrate only a few of the myriad possibilities, even beginning with commonsense ordering (blast nearest neighbors). Because these gene families originated long ago and are only known from remotely related representatives in extant species with wildly differently mutational mechanisms and histories, the true tree cannot be reliably infered from maximal likelihood. Indeed no two such attempts have ever come up with the same gene tree! Instead, we'll keep this set of gene trees in view until analysis of rare genomic events is completed

Opsin gene trees.png


Using the Opsin Classifier

Below is the primary collection of opsing protein sequences. Here "fields" in the fasta header show gene name, genus, species, common name, heterotrimeric G protein alpha subunit used in signaling, intron structure, synteny (2 flanking genes on each side of the opsin), indel status, sequence length, lambda max, and comment field. The 230-odd sequences are now organized into deuterostomes, lophotrochozoans, and ecdysozoan divisions, further broken refined into ciliary, rhabdomeric, or neither. Even with the full set copied into the uBlast, a novel opsin candidate can be classified in 6 seconds over just a conventional DSL internet connection.

On 26 Nov 07, I added 41 new sequences, mostly arthropod rhabdomeric imaging opsins, extracting them from a 2007 pancrustacean opsin paper, using the much-studied accessions in their Table 1, as ordered phylogenetically according to their Fig.3, with subsampling to avoid too-close sequences and narrow lineage-specific expansions. This involved replacing a few defective accessions and partial sequences with comparable complete ones, favoring sequences with completed or planned genome projects which can be directly intronated and their synteny determined. Lambda max values were helpfully compiled for all these opsins by the original authors.

This significantly upgrades the resolving power of the Opsin Classifier vis-a-vis these new classes of protostome opsins. It raises thorny nomenclature issues because of short-sighted historic uses such as rhodopsin or LWS for both fruitfly and human genes. These are indeed vaguely homologous in the distant pre-Bilateran GPCR past but are certainly not orthologous as implied by the same common name. The definition of orthology requires genes under comparison in extant species to descend from a single parent gene in their last common ancestor. If this LCA requirement is relaxed, the entire set of a thousand GPCR all become orthologous.

Additional ecdysozoan and lophotrochozoan opsins are needed, whatever new that can be extracted from invertebrate genome projects; some of these will prove ciliary and conversely some deuterostome opsins rhabdomeric. Melanopsin/enchepalopsin appear at the heart of the Big Switchover that took place in chordates -- their imaging opsins did not arise from gene duplication and divergence of anything we see among contemporary protostomal imaging opsins or any reconstructed ancestor.

In fact, none of the opsin genes in Urbilatera destined to become rhabdomeric imaging opsins in living arthropods (even all of protostomia) seems to have descended directly to any deuterostome. It may turn out that none of the opsin genes in Ur-Bilatera destined to become ciliary imaging opsins in living vertebrates (even all of deuterostomia) survived in any protostome. The pool of GPCR genes was already large and signalling diversified across gene copies. However lophotrochozoa and basal ecdysozoan ciliary opsins are still largely unexplored. A similarly 'bad' Venn diagram could hold in Ureumetazoa. Here the only two cnidarians with sequence data (Hydra and Nematostella) were not the best choices for studying opsin evolution. We may have to wait for genome sequencing of a full-featured cubomedusan.

Please do not add text or edit sequences at this time, even though genomeWiki encourages participation; consider starting a fresh page to hold your contributions. After finishing the first round of articles, everything will be revisited and reorganized at a second pass.

Opsin Dataset

>RHO1_homSap Homo sapiens (human) Gt 0...2.1.0.0 indel -MBD4 +IFT122 +H1FOO -PLXND1 349 aa 497 nm 16565402 NM_000539 rod rhodopsin RHO ciliary all GT-AG   
0 MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG 1
2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSR 2
1 YIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQ 0
0 FRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0

>RHO1_bosTau Bos taurus (cow) Gt 0...2.1.0.0 indel -MBD4 +IFT122 +H1FOO -PLXND1 349 aa 497 nm 2145276 NM_001014890 rod RHO most studied   
0 MNGTEGPNFYVPFSNKTGVVRSPFEAPQYYLAEPWQFSMLAAYMFLLIMLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 1
2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLVGWSR 2
1 YIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDFGPIFMTIPAFFAKTSAVYNPVIYIMMNKQ 0
0 FRNCMVTTLCCGKNPLGDDEASTTVSKTETSQVAPA* 0

>RHO1_monDom Monodelphis domesticus (opossum) Gt 0...2.1.0.0 indel -MBD4 +IFT122 +H1FOO -PLXND1 349 aa 000 nm no_ref genome rod rhodopsin   
0 MNGTEGPNFYVPFSNKTGTVRSPFEEPQYYLADPWQFSCLAAYMFMLIVLGFPINFLTLYVTIQHKKLRTPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTGCNLEGFFATLG 1
2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIIGVAFTWVMALACAFPPLIGWSR 2
1 YIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNFGPIFMTIPAFFAKSSSVYNPVIYIMMNKQ 0
0 FRTCMITTLCCGKNPLGDDEASATASKTETSQVAPA* 0

>RHO1_ornAna Ornithorhynchus anatinus (platypus) Gt 0...2.1.0.0 indel - +IFT122 - -PLXND1 354 aa 000 nm ABN43074 17339011 rod rhodopsin   
0 MNGTEGQDFYIPMSNKTGVVRSPFEYPQYYLAEPWQYSVLAAYMFMLIMLGFPINFLTLYVTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG 1
2 GEIALWSLVVLAIERYIVVCKPMSNFRFGENHAIMGVAFTWIMALACALPPLVGWSR 2
1 YIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTVPAFFAKSSAIYNPVIYIMMNKQ 0
0 FRNCMLTTICCGKNPLGDDEASATASKTEQSSVSTSQVSPA* 0

>RHO1_galGal Gallus gallus (chicken) Gt 0...2.1.0.0 indel -MBD4 +IFT122 +H1FOO -PLXND1 352 aa 000 nm 1385866 NM_205490 rod rhodopsin RH1   
0 MNGTEGQDFYVPMSNKTGVVRSPFEYPQYYLAEPWKFSALAAYMFMLILLGFPVNFLTLYVTIQHKKLRTPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTGCYIEGFFATLG 1
2 GEIALWSLVVLAVERYVVVCKPMSNFRFGENHAIMGVAFSWIMAMACAAPPLFGWSR 2
1 YIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDFGPIFMTIPAFFAKSSAIYNPVIYIVMNKQ 0
0 FRNCMITTLCCGKNPLGDEDTSAGKTETSSVSTSQVSPA* 0

>RHO1_anoCar Anolis carolinensis (lizard) Gt 0...2.1.0.0 indel -MBD4 +IFT122 - -PLXND1 353 aa 000 nm no_ref genome rod rhodopsin   
0 MNGTEGQNFYVPMSNKTGVVRNPFEYPQYYLADPWQFSALAAYMFLLILLGFPINFLTLFVTIQHKKLRTPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVGCNIEGFFATLG 1
2 GEMGLWSLVVLAVERYVVICKPMSNFRFGETHALIGVSCTWIMALACAGPPLLGWSR 2
1 YIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKA 0
0 AAAQQQESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDFGPVFMTIPAFFAKSSAIYNPVIYILMNKQ 0
0 FRNCMIMTLCCGKNPLGDEDTSAGTKTETSTVSTSQVSPA* 0

>RHO1_xenTro Xenopus tropicalis (frog) Gt 0...2.1.0.0 indel -MBD4 +IFT122 - -PLXND1 355 aa 000 nm no_ref genome rod rhodopsin   
0 MNGTEGPNFYIPMSNKTGVVRSPFDYPQYYLAEPWKYSALAAYMFLLILLGFPINFMTLYVTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG 1
2 GEMALWSLVVLAIERYVVVCKPMANFRFGENHAIMGVVFTWIMALSCAAPPLFGWSR 2
1 YIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKE 0
0 AAAQQQESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDFGPVFMTVPAFFAKSSAIYNPVIYIVLNKQ 0
0 FRNCLITTLCCGKNPFGDEEGSSAASSKTEASSVSSSQVSPA* 0

>RHO1_neoFor Neoceratodus forsteri (lungfish) Gt 0...2.1.0.0 indel x x x x 355 aa 000 nm 17961206 EF526299 rod rhodopsin   
0 MNGTEGPNFYVPMTNKTGVVRSPFEYPQYYLADPWKYSALAAYMFFLILTGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTMYTAMNGYFVFGVVGCNLEGFFATFG 1
2 GIIALWCLVVLAIERYIVVCKPISNFRFGENHAIMGVVFTWIMALACAGPPLFGWSR 2
1 YIPEGMQCSCGIDYYTLKPEVNNESFVIYMFIVHFTIPLIIIFFCYGRLMCTVKE 0
0 AAAQQQESATTQKAEKEVTRMVYIMVISYLVCWLPYASVSFYIFTHQGSDFGPVFMTVPAFFAKTASVYNPVIYILMNKQ 0
0 FRNCMITTLCCGKNPFGDEETTSAGTSKTEASSVSSSQVSPA* 0

>RHO1_latCha Latimeria chalumnae (coelacanth) Gt 0...2.1.0.0 indel x x x x 354 aa 478 nm 10339578 AAD30519 rod rhodopsin   
0 MNGTEGPNFYVPMSNKTGVVRNPFEYPQYYLADPWKYSALAAYMFFLILVGFPINFLTLFVTIQHKKLRTPLNYILLDLAVADLCMVFGGFFVTMYSSMNGYFVLGPTGCNIEGFFATLG 1
2 GQVALWALVVLAIERYVVVCKPMSNFRFGENHAIMGVIFTWIMALSCAVPPLFGWSR 2
1 YIPEGMQSSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKD 0
0 AAAQQQESATTQKAEKEVTRMVIVMVISFLVCWVPYASVAAYIFFNQGSEFGPVFMTAPSFFAKSASFYNPVIYILLNKQ 0
0 FRNCMITTLCCGKNPFGDEDATSAAGSSKTEASSVSSSSVSPA* 0

>RHO1_takRub Takifugu rubripes (pufferfish) Gt 0...2.1.0.0 indel -MBD4 +IFT122 - -PLXND1 355 aa 000 nm 12783465 AF201472 rod rhodopsin   
0 MNGTEGPNFYIPMSNKTGVVRSPFEYPQYYLAEPWKYSLVAAYMLFLIITAFPVNFLTLFVTVKHKKLRTPLNYVLLNLAVADLFMVIGGFTVTLYTALHAYFVLGVTGCNIEGFFATLG 1
2 GEIALWSLVVLAVERYIVVCKPMTNFRFGEKHAIAGLVFTWIMALTCATPPLLGWSR 2
1 YIPEGMQCSCGIDYYTPKPEINNTSFVIYMFILHFSIPLAIIFFCYSRLLCTVRA 0
0 AAALQQESETTQRAEKEVTRMVIVMVISFLVCWVPYASVAWYIFANQGTEFGPVFMTAPAFFAKSAALYNPVIYILLNRQ 0
0 FRNCMITTVCCGKNPFGDDDAATTVSKTQSSSVSSSQVAPA* 0

>RHO1_leuEri Leucoraja erinacea (skate) Gt 0...2.1.0.0 indel x x x x 355 aa 000 nm 9256070 U81514 rod rhodopsin   
0 MNGTEGENFYVPMSNKTGVVRSPFDYPQYYLGEPWMFSALAAYMFFLILTGLPVNFLTLFVTIQHKKLRQPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAGCNFEGFFATLG 1
2 GEVGLWCLVVLAIERYMVVCKPMANFRFGSQHAIIGVVFTWIMALSCAGPPLVGWSR 2
1 YIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKE 0
0 AAAQQQESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDFTPFFMTVPAFFAKSSAVYNPLIYILMNKQ 0
0 FRNCMITTICLGKNPFEEEESTSASASKTEASSVSSSQVAPA* 0

>RHO1_calMil Callorhinchus milii (elephantfish) Gt 0...2.1.0.0 indel x x x x 355 aa 000 nm no_ref genome rod rhodopsin complete wgs   
0 MNGTEGENFYIPMSNKTGVVRSPFEYPQYYLAEPWQFSILAAYMFFLIITCFPVNFLTLYVTFEHKKLRQPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTGCNFEGFFATLG 1
2 GEIGLWSLVVLAIERYVVVCKPMSNFRFGTNHAIMGVAFTWVMALACAVPPLMGWSR 2
1 YIPEGLQCSCGVDYYTLKPEINNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKE 0
0 AAAQQQESESTQRAEREVTRMVIIMVIFFLICWVPYASVAFFIFTNQGSEFGPIFMAVPAFFAKSSALYNPLIYILLNKQ 0
0 FRNCMITTLCCGKNPFEEDESTSAAASKTEASSVSSSQVSPA* 0

>RHO1_petMar Petromyzon marinus (lamprey) Gt 0...2.1.0.0 indel x x x x 354 aa 000 nm no_ref genome rod rhodopsin   
0 MNGTEGENFYIPFSNKTGLARSPFEYPQYYLAEPWKYSVLAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTMCNFEGFFATLG 1
2 GEMSLWSLVVLAIERYIVICKPMGNFRFGSTHAYMGVAFTWFMALSCAAPPLVGWSR 2
1 YLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKE 0
0 AAAAQQESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTVPAFFAKTSALYNPIIYILMNKQ 0
0 FRNCMITTLCCGKNPLGDEDSGASTSKTEVSSVSTSQVSPA* 0

>RHO1_geoAus Geotria australis (lamprey) Gt 0...2.1.0.0 indel x x x x 354 aa 497 nm 17463225 AY366493 rod rhodopsin rodRhA   
0 MNGTEGQNFYIPFSNKTDVARSPFEYPQYYLAEPWKFSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTMCSIEGFFATLG 1
2 GEVSLWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVALTWVMALSCAAPPLLGWSR 2
1 YLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKE 0
0 AAAAQQESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ 0
0 FRNCMITTLCCGKNPLGDDDSGASTSKTEVSSVSTSQVAPA* 0

>RHO1_letJap Lethenteron japonicum (lamprey) Gt 0...2.1.0.0 indel x x x x 354 aa 000 nm 15096614 AB116382 cone rhodopsin   
0 MNGTEGDNFYVPFSNKTGLARSPYEYPQYYLAEPWKYSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTMCSIEGFFATLG 1
2 GEVALWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVAFTWIMALACAAPPLVGWSR 2
1 YIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKE 0
0 AAAAQQESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ 0
0 FRNCMITTLCCGKNPLGDDESGASTSKTEVSSVSTSQVSPA* 0

>RHO2_galGal Gallus gallus (chicken) Gt 0...2.1.0.0 indel -IHPK3 -LEMD2 -GRM4 +HMGA1 356 aa 000 nm 2268324 NP_990771 cone rhodopsin   
0 MNGTEGINFYVPMSNKTGVVRSPFEYPQYYLAEPWKYRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG 1
2 GQVALWSLVVLAIERYIVVCKPMGNFRFSATHAMMGIAFTWVMAFSCAAPPLFGWSR 2
1 YMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVRE 0
0 AAAQQQESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAVPAFFSKSSSLYNPIIYVLMNKQ 0
0 FRNCMITTICCGKNPFGDEDVSSTVSQSKTEVSSVSSSQVSPA* 0

>RHO2_anoCar Anolis carolinensis (lizard) Gt 0...2.1.0.0 indel -IHPK3 -LEMD2 -GRM4 +HMGA1 356 aa 000 nm no_ref genome cone rhodopsin   
0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLAEPWKYKVVCCYIFFLIFTGLPINILTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG 1
2 GQVALWSLVVLAIERYIVVCKPMGNFRFSATHALMGISFTWFMSFSCAAPPLLGWSR 2
1 YIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVRE 0
0 AAAQQQESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDFSATLMSVPAFFSKSSSLYNPIIYVLMNKQ 0
0 FRNCMITTICCGKNPFGDEDVSSSVSQSKTEVSSVSSSQVSPA* 0

>RHO2_gekGek Gekko gekko (gecko) Gt 0...2.1.0.0 indel x x x x 356 aa 000 nm 11591478 AY024356 cone rhodopsin in pure rod-retina   
0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLADPWKFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKKLRQPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIGCAIEGFFATIG 1
2 GQVALWSLVVLAIERYIVICKPMGNFRFSATHAIMGIAFTWFMALACAGPPLFGWSR 2
1 FIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVRE 0
0 AAAQQQESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAFSVTFMTIPAFFSKSSSIYNPIIYVLLNKQ 0
0 FRNCMVTTICCGKNPFGDEDVSSSVSQSKTEVSSVSSSQVAPA* 0

>RHO2_neoFor Neoceratodus forsteri (lungfish) Gt 0...2.1.0.0 indel x x x x 356 aa 000 nm 17961206 EF526299 cone rhodopsin   
0 MNGTEGINFYVPHSNKTGVVRSPFEYPQYYLADPWKYSIVCAYMFFLIITGLPINLLTLVVTFKHKKLRQPLNYILVNLAVADLFMVCFGFTVTFSTAINGYFIFGPRGCAIEGFMATLG 1
2 GEVALWSLVVLAIERYIVVCKPMGNFRFSNNHSIIGIVFTWLAALSCAAPPLFGWSR 2
1 YLPEGMQCSCGPDYYTMNPDYHNESFVIYMFVVHFFIPVIVIFVSYGRLICKVKE 0
0 AAAQQQESASTQKAEREVTRMVILMVIGFMTAWTPYATVAFWIFMNKGAEFGATFMAAPAFFSKSSALYNPIIYVLMNKQ 0
0 FRNCMVTTLCCGKNPFGDDDVSSSVSAGKTEVSSVSSSQVSPA* 0

>RHO2_latCha Latimeria chalumnae (coelacanth) Gt 0...2.1.0.0 indel x x x x 355 aa 485 nm 10339578 AH007713 cone rhodopsin RH2   
0 MNGTEGMNFYVPLSNRTGLVRSPFEYTQYYLAEPWKFSVLCAYMFLLIILGFPINFLTLLVTFKHKKLRQPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMGCAMEGFFATLG 1
2 GQVALWSLVVLAIERYIVVCKPMGNFRFASSHAIMGIAFTWIMALACAAPPLVGWSR 2
1 YIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKE 0
0 AAAQQQESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEFTATLMTVPAFFSKSSCLFNPIIYVLLNKQ 0
0 FRNCMITTLCCGKNPLGDDDTSSAVSQSKTDVSSVSSSQVSPA* 0

>RHO2_geoAus Geotria australis (lamprey) Gt 0...2.1.0.0 indel x x x x 355 aa 492 nm 17463225 AY366494 cone rhodopsin RhB no petMar   
0 MNGTEGANFYIPFHNRTGVVRSPYEYPQYYLADPWMYSAISAYVFTLILIGFPVNFMTLFVTFKLKKLRQPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTGCNIEGFFATLG 1
2 GEVSLWSLVMLAIERYIVVCKPMGNFRFATTHAALGVVFTWVMASACAVPPLVGWSR 2
1 YIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKE 0
0 AAAQQQESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILFSATAMTVPAFFSKSSVLYNPIIYVLLNKQ 0
0 FRTCMVTTLFCGKNPFGEDDSSMVSTSKTEVSSVSSSQVSPS* 0

>SWS2_ornAna Ornithorhynchus anatinus (platypus) Gt 0...2.1.0.0 indel -IRAK1 -MECP2 - +TKTL1 364 aa 000 nm 17339011 ABN43074 cone short blue tandem -FLNB--+MECP2 with MWS1   
0 MHKTHRNLQNELPEDFFIPLPLDTDNITSLSPFLVPQTHLGGSGIFMSLAAFMFLLITLGFPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACKIEGFAATLG 1
2 GMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRA 0
0 VAKQQEQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIPSVFSKASTIYNPIIYVFMNKQ 0
0 FRSCMLKLVFCGKSPFGDEDEISGSSQATQVSSVSSSQVSPA* 0

>SWS2_galGal Gallus gallus (chicken) Gt 0...2.1.0.0 indel x x x x 362 aa 000 nm 7975342 NP_990848 cone short2 blue   
0 MHPPRPTTDLPEDFYIPMALDAPNITALSPFLVPQTHLGSPGLFRAMAAFMFLLIALGVPINTLTIFCTARFRKLRSHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTACKIEGFAATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCVATWVLGFVASAPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRA 0
0 VARQQEQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSFEVGLASIPSVFSKSSTVYNPVIYVLMNKQ 0
0 FRSCMLKLLFCGRSPFGDDEDVSGSSQATQVSSVSSSHVAPA* 0

>SWS2_taeGut Taeniopygia guttata (finch) Gt 0...2.1.0.0 indel x x x x 363 aa 000 nm no_ref genome cone short2   
0 MPKPREMRDELPEDFYIPMSLETPNLTALSPFLVPQTHLGSPGIFKAMAAFMFLLVLLGVPINALTVLCTAKYKKLRSHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLACKIEGFTATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCAITWIFGLIASLPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRA 0
0 VAKQQEQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPFDLGLASIPSVFSKASTVYNPIIYVFMNKQ 0
0 FRSCMLKLVFCGRSPFGDEDDVSGSSQATQVSSVSSSQVSPA* 0 

>SWS2_utaSta Uta stansburiana (lizard) Gt 0...2.1.0.0 indel x x x x 364 aa 000 nm 16543463 DQ100326 cone short   
0 MHNSRPHSRDDLPEDFFIPMPLDVANITTLSPFLVPQTHLGSPALFMGMAAFMFLLIILGVPINVLTIFCTFKYKKLRSHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTACKIEGFAATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFSFRGTHAIIGCIITWVFGLVASLPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRA 0
0 VAKQQEQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPFDVRLATIPSVFSKASSVYNPVIYVFMNKQ 0
0 FRSCMLKLVFCGKSPFGDEDDVSGSSQTTQVSSVSSSQVSPA* 0

>SWS2_xenTro Xenopus tropicalis (frog) Gt 0...2.1.0.0 indel -IRAK1 -MECP2 - - 363 aa 000 nm no_ref genome cone short   
0 MSKGRPDLRMEMPDEFYVPIPLETTNISSLSPFLVPQTHLGTPGIFMSISAFMLFTIIFGFPLNLLTIICTVKYKKLRSHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLACKIEGFTATLG 1
2 GIIGLWSLAVVAFERFLVICKPMGNFTFRESHAVLGCILTWVIGLVAAIPPLLGWSR 2
1 YIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHA 0
0 VAKQQEQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELFDLRMSSVPSVFSKASTVYNPFIYIFMNRQ 0
0 FRSCMMKMIFCGKNPLGDDEETSVSGSTQVSSVSSSQIAPS* 0 

>SWS2_neoFor Neoceratodus forsteri (lungfish) Gt 0...2.1.0.0 indel x x x x 364 aa 000 nm 17961206 EF526299 cone short   
0 MHRTKPDPQEDLPDDFYIPVSLNTNNITMLSPFLVPQTHLGSPSVFMVLSVFMFFLLITGIPINVLTIICTFKYKKLRSHLNYILVNLAVANLIVVGFGSTTAFYSFSQMYFAWGPLACKIEGFAATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFTFRSTHAIIGCVATWVFGLISSAPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKWNNESYVIFLFCFCFGFPLSVIIFSYGRLLMTLRA 0
0 VAKQQEQSASTQKAEREVTKMVVVMVLGFLVCWLPYTVFSLWVVTHRGESFELALGSIPAVFSKSSTVYNPLIYVFMNKQ 0
0 FRSCMMKLIFCGKSPFGDEDDASSASQSTQVSSVSSSQVAPA* 0

>SWS2_takRub Takifugu rubripes (pufferfish) Gt 0...2.1.0.0 indel x x x x 351 aa 000 nm no_ref genome cone short2   
0 MRGVRQHEFQEDFYIPIPLDVDNITALSPFLVPQDHLGSPAVFYGMSAFMFFLFVAGTGINVLTIACTIQYKKLRSHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLGCKIEGFAATLG 1
2 GMVSLWSLAVVAFERWLVVCKPLGNFIFKPDHAIVCCIFTWFFALIISAPPLFGWSR 2
1 YIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLKS 0
0 AKAQAESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPFDLRLATIPACFSKASTVYNPIIYVVLNKQ 0
0 FRSCMKKMLGMSGGDDEESSSQSVTEVSKVSPS* 0 

>SWS2_gasAcu Gasterosteus aculeatus (stickleback) Gt 0.2.2.1.0.0 indel x x x x 359 aa 000 nm no_ref genome cone short   
0 MKHGRVPEIPEDFYIPISLDTDNITSLSPFLVPQDHLASKATFYSLAFYMFFILIVGTFINALTVACTVQNKKLRSHLNYILVNLAVSNLLVSGVGAFTAFLSFAARYFVLGTLACKVEGFLATLG 1
2 GMVSLWSLAVIAFERWLVICKPLGNFIFKPDHALVCCAFTWVFALAASAPPLVGWSR 2
1 YIPEGLQCSCGPDWYTTNNKYNNESYVLFLFGFCFAVPFCTICFCYSQLLFTMKMA 0
0 AKAQAESASTQKAEREVTRMVVLMVMGFLVCWMPYASFALWVVNNRGQTFDLRFASIPSVFSKSSAVYNPVIYVLLNKQ 0
0 FRSCMMKMLGMGGGDDEESSTSSVTEVSKVGPA* 0

>SWS2_geoAus Geotria australis (lamprey) Gt 0...2.1.0.0 indel x x x x 362 aa 439 nm 17463225 AY366492 cone short2 blue retinal petMar ps   
0 MYQGKSTQVDDLPEDFYIPIALNVKNMSELSPFLVPQVHLGDSFIFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKKLRSHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLFCKMEGFTATLG 1
2 GMLSLWSLAVLAFERCLVICKPFGNIAFRGTHALIRCGFAWAAAIAASTPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRA 0
0 AAAQQQESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPFDLRLATIPSVFSKASTVYNPVIYIFLNKQ 0
0 FRSCMMKTIFCGKNPLGDDEDATSTTTQVSSVSTSQVAPA* 0

>SWS1_homSap Homo sapiens (human) Gt 0.2.2.1.0.0 indel -FAM137A -CALU -NAG6 -FLNC 348 aa 000 nm 1385866 NP_990769 cone short   
0 MRKMSEEEFYLFKNISSVGPWDGPQYHIAPVWAFYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHVCALEGFLGTVA 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFSSKHALTVVLATWTIGIGVSIPPFFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGLDLRLVTIPSFFSKSACIYNPIIYCFMNKQ 0
0 FQACIMKMVCGKAMTDESDTCSSQKTEVSTVSSTQVGPN* 0

>SWS1_monDom Monodelphis domesticus (opossum) Gt 0...2.1.0.0 indel -FAM137A -CALU -NAG6 -FLNC 347 aa 000 nm no_ref genome cone short   
0 MSGDEEFYLFKNISSVGPWDGPQYHIAPAWAFHFQTVFMGFVFCAGTPLNAVVLVATLRYKKLRQPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHVCAMEAFLGSVA 1
2 GLVTGWSLAFLAFERFIVICKPFGNFRFNSKHAMMVVLATWVIGIGVSIPPFFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRA 0
0 VAAQQQESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0
0 FHACIMEMVCRKPMTDDSDVSSSQKTEVSAVSSSQVGPT* 0

>SWS1_galGal Gallus gallus (chicken) Gt 0...2.1.0.0 indel x x x x 348 aa 000 nm no_ref genome cone short1 violet   
0 MSSDDDFYLFTNGSVPGPWDGPQYHIAPPWAFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKRLRQPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRVCELEAFVGTHG 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFSSRHALLVVVATWLIGVGVGLPPFFGWSR 2
1 YMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0
0 FRACIMETVCGKPLTDDSDASTSAQRTEVSSVSSSQVGPT* 0

>SWS1_taeGut Taeniopygia guttata (finch) Gt 0...2.1.0.0 indel x x x x 347 aa 000 nm no_ref genome cone short1   
0 MDEEEFYLFKNQSSVGPWDGPQYHIAPMWAFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKKLRQPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHMCAFEGFAGATG 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWSR 2
1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGIDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ 0
0 FRACIMETVCGRPMTDDSEVSSSAQRTEVSSVSSSQVGPS* 0

>SWS1_anoCar Anolis carolinensis (lizard) Gt 0.2.2.1.0.0 indel - -CALU - - 347 aa 000 nm no_ref genome cone short   
0 MSGQEDFYLFENISSVGPWDGPQYHIAPMWAFYFQTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCTFSVFTVFMASSQGYFFFGRHVCAMEAFLGSVA 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWSR 2
1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYASLAMYMVNNRDHGLDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ 0
0 FRACILETVCGKPMSDESDVSSSAQKTEVSSVSSSQVSPS* 0

>SWS1_utaSta Uta stansburiana (lizard) Gt 0...2.1.0.0 indel x x x x 348 aa 000 nm 16543463 DQ100325 cone short   
0 MSGEEDFYLFENISSVGPWDGPQYHIAPMWAFYFQTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHICALEAFLGSVA 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSKHALLVVAATWFIGIGVSIPPFFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0
0 FRACIMETVCGKPMTDESDVSSSAQKTEVSSVSSSQVSPS* 0

>SWS1_xenLae Xenopus laevis (frog) Gt 0...2.1.0.0 indel - -CALU - - 348 aa 000 nm no_ref genome cone short   
0 MLEEEDFYLFKNVSNVSPFDGPQYHIAPKWAFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKKLRQPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIACSIDAFVGTLT 1
2 GLVTGWSLAFLAFERYIVICKPMGNFNFSSSHALAVVICTWIIGIVVSVPPFLGWSR 2
1 YMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRA 0
0 VAAQQQESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGLDLRLVTIPAFFSKSSCVYNPIIYSFMNKQ 0
0 FRGCIMETVCGRPMSDDSSVSSTSQRTEVSTVSSSQVSPA* 0

>SWS1_neoFor Neoceratodus forsteri (lungfish) Gt 0...2.1.0.0 indel x x x x 347 aa 000 nm 17961206 EF526299 cone short   
0 MSGEEEFYLFKNISSVGPWDGPQYHIAPKWAFFLQAAFMGFVLFVGTPLNAIVLFVTIKYKKLQQPLNYILVNISLAGFIFCFFGVFAVFIASCQGYFIFGKTVCALEGFTGSVA 1
2 GLVTGWSLAILAFERYLVICKPIGNFRFGSKHSMIAVVAAWVIGVGVSIPPFFGWSR 2
1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIIPLFIICFSYSQLLGALRA 0
0 VAAQQQESATTQKAEREVSRMIIVMVGSFCVCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSSFVYNPIIYCFMNKQ 0
0 FRACIMQTVFGKPMTDDSDISSSGKTEVSSVSSSQVNPS* 

>SWS1_danRer Danio rerio (zebrafish) Gt 0...2.1.0.0 indel - -CALU - - 337 aa 000 nm no_ref genome cone short1   
0 MDAWAVQFGNASKVSPFEGEQYHIAPKWAFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKKLRQPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTLCSMEAAMGSIA 1
2 GLVTGWSLAVLAFERYVVICKPFGSFKFGQGQAVGAVVFTWIIGTACATPPFFGWSR 2
1 YIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRA 0
0 VAAQQAESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNKDYRLVAIPAFFSKSSSVYNPLIYAFMNKQ 0
0 FNACIMETVFGKKIDESSEVSSKTETSSVSA* 0

>SWS1_oryLat Oryzias latipes (medaka) Gt 0...2.1.0.0 indel - - - - 336 aa 000 nm no_ref genome cone short1   
0 MGKYFYLYENISKVGPYDGPQYYLAPTWAFYLQAAFMGFVFFVGTPLNFVVLLATAKYKKLRVPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTLCALEAAVGAVA 1
2 GLVTSWSLAVLSFERYLVICKPFGAFKFGSNHALAAVIFTWFMGVGCACPPFFGWSR 2
1 YIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRA 0
0 VAAQQAESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENKDYRLVTIPAFFSKSSCVYNPLIYAFMNKQ 0
0 FNGCIMEMVFGKKMEEASEVSSKTEVSTDS*0

>SWS1_geoAus Geotria australis (lamprey) Gt 0...2.1.0.0 indel x x x x 346 aa 359 nm 17463225 AY366495 cone short1 UV retinal   
0 MSGDEEFYLFKNISKVGPWDGPQFHIAPKWAFYLQAAFMGFVFICGTPLNAIVLVVTIKYKKLRQPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTICALEAFFGSLA 1
2 GLVTGWSLAFLAAERYIVICKPFGNFRFGSKHALVAVGLTWMLGLSVALPPFFGWSR 2
1 YIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA 0
0 VAAQQQESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ 0
0 FRACILETVCGKPITDESETSSSRTEVSSVSTTQMIPG* 0

>LWS_homSap Homo sapiens (human) Gt 0.2.2.1.0.0 indel -IRAK1 -MECP2 -TEX28 +TKTL1 364 aa 530 nm 12853434 NP_000504 cone long OPN1MW deutan   
0 MAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTR 1
2 GPFEGPNYHIAPRWVYHLTSVWMIFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISVVNQVYGYFVLGHPMCVLEGYTVSLC 1
2 GITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAFSWIWAAVWTAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAIRA 0
0 VAKQQKESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPFHPLMAALPAFFAKSATIYNPVIYVFMNRQ 0
0 FRNCILQLFGKKVDDGSELSSASKTEVSSVSSVSPA* 0

>LWS_monDom Monodelphis domesticus (opossum) Gt 0.2.2.1.0.0 indel -IRAK1 -MECP2 - +TKTL1 368 aa 000 nm no_ref genome cone long   
0 MTQAWDPAGFLARRRDVNEDDNDETTRSSLFVYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNLTSLWMVFVVIASIFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETVIASTISVINQIYGYFILGHPLCVLEGYTVSLC 1
2 GITGLWSLAIISWERWVVVCKPFGNVKFDAKLAMVGIIFSWVWAAVWTAPPLFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMATCCIFPLSIILLCYVQVWLAIRA 0
0 VAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTLFACFAAANPGYSFHPLTASLPAYFAKSATIYNPIIYVFMNRQ 0
0 FRTCILQLFGKKVDDGSEVSSTSRTEVSSVSSVAPA* 0 

>LWS_ornAna Ornithorhynchus anatinus (platypus) Gt 0.2.2.1.0.0 indel -IRAK1 -MECP2 - - 365 aa 000 nm 17339011 ABN43074 cone long LWS green   
0 MTPAWNSGVYAARRRFEDEEDTTRTSVFVYTNSNNTR 1
2 DPFEGPNYHIAPRWAYNVTSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPMCVLEGYTVSLC 1
2 GITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRA 0
0 VAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSELSSTSRTEVSSVSSVSPA* 0

>LWS_galGal Gallus gallus (chicken) Gt 0.2.2.1.0.0 indel x x x x 363 aa 000 nm 12716987 NM_205438 cone long green iodopsin missing in assembly   
0 MAAWEAAFAARRRHEEEDTTRDSVFTYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNLTSLWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPMCVVEGYTVSAC 1
2 GITALWSLAIISWERWFVVCKPFGNIKFDGKLAVAGILFSWLWSCAWTAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRA 0
0 VAAQQKESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCILQLFGKKVDDGSEVSTSRTEVSSVSNSSVSPA* 0

>LWS_anoCar Anolis carolinensis (lizard) Gt 0.2.2.1.0.0 indel - - -TEX28 +TKTL1 366 aa 000 nm no_ref genome cone long   
0 MAGTVTEAWDVAVFAARRRNDEDDTTRDSLFTYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNITSVWMIFVVIASIFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPMCVLEGYTVSTC 1
2 GISALWSLAVISWERWVVVCKPFGNVKFDAKLAVAGIVFSWVWSAVWTAPPVFGWSR 2
1 YWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRA 0
0 VAAQQKESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSELSSTSRTEVSSVSNSSVSPA* 0

>LWS_xenTro Xenopus tropicalis (frog) Gt 0.2.2.1.0.0 indel -IRAK1 -MECP2 - - 370 aa 000 nm no_ref genome cone long   
0 MASHWNEAVFAARRRNDDDDTTRSSVFTYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNISSLWMIFVVLASVFTNGLVLVATLKFKKLRHPLNWILVNMAIADLGETVIASTISVCNQIFGYFVLGHPMCILEGYTVSVC 1
2 GIAALWSLTVIAWERWFVVCKPFGNIKFDGKLAATGIIFSWVWAAGWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMLVLMITCCIIPLAIIVLCYMHVWLTIRQ 0
0 VAQQQKESESTQKAEREVSRMVVVMIIAYIFCWGPYTFFACFAAFNPGYNFHPLAAAMPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIYQLFGKKVDDGSEVSSTSRTEVSSVSNSSVSPA* 0

>LWS_neoFor Neoceratodus forsteri (lungfish) Gt 0.2.2.1.0.0 indel x x x x 365 aa 000 nm 17961206 EF526299 cone long   
0 MAEPWDAVLAARRRHQDEETTRSTIFVYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNLTSLWMIFVVFASCFTNGLVLMATYKFKKLRHPLNWILVNLAIADLGETLIASTISVTNQIFGYFILGHPMCMLEGFTVATC 1
2 GITGLWSLTIIAWERWVVVCKPFGNIKFDGKWAAGGIIFSWVWSAFWCAMPLFGWSR 2
1 FWPHGLKTSCGPDVFSGEDKYGTRSFMIALMITCCIIPLGVIILCYIQVWWAIRT 0
0 VAKQQKESESTQKAEKEVSRMVVVMIFAYCFCWGPYTFMACFGAAYPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIYQLLGKKVDDGSELSSTSKTEVSSVSNSSVSPA* 0

>LWS_takRub Takifugu rubripes (pufferfish) Gt 0...2.1.0.0 indel x x x x 358 aa 000 nm no_ref genome cone long   
0 MAEEWGKQSFAARRYHEDTTRGSAFVYTNSNHTR 1
2 DPFEGPNYHIAPRWVYNVATVWMFIVVVLSVFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYTVSTC 1
2 GIAALWSLTIISWERWVVVCKPFGNVKFDAKWATGGIVFSWVWAAVWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCIIPLAIIILCYLAVWLAIRS 0
0 VAMQQKESESTQKAEKEVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ 0
0 FRVCIMKLFGKEVDDGSEVSTSKTEVSSVAPA* 0

>LWS_gasAcu Gasterosteus aculeatus (stickleback) Gt 0.2.2.1.0.0 indel - - - - 358 aa 000 nm no_ref genome cone long   
0 MAEEWGKQAFAARRYNEDTTRGSMFVYTNSNNTK 1
2 DPFEGPNYHIAPRWVYNLSTLWMFIVVALSVFTNGLVLVATAKFKKLQHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYVVSVC 1
2 GITALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWIWSAVWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRA 0
0 VAMQQKESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ 0
0 FRSCIMQLFGKEVDDGSEVSTsKTEVSSVAPA* 0

>LWS_calMil Callorhinchus milii (elephantfish) Gt 0.2.2.1.0.0 indel x x x x 262 aa 000 nm no_ref genome fragment exon break   
2 dPFEGPNYHIAPRWAYNLTSVWMVGVVVASVFTNGLVLVATVRFKKLRHPLNWILVNMALADLGETVLASTVSVANQFFGYFILGHPLCVFEGFVVSLC 1
2 GITALWSLTIIAWERWVVVCKPFGNVKFDGKWAAFGIIFSWVWSIGWCLPPVFGWSR 2
0 AEKEVSRMVVVMVAAFCLCWGPYACFAMFSALNPGYAFHPLVASIPSYFAKSSTIYNPIIYVFMNRQ 0
0 FRNCILQLFGKKVDDGSELSSTSKTDVSSVSNSSVSPA* 0

>LWS_petMar Petromyzon maritimus (lamprey) Gt 0.2.2.1.0.0 indel x x x x 366 aa 000 nm no_ref genome cone traces key to intron 3 position and gapping   
0 MTASWQGAMFAARRRQDDEDTTMESLFRYTNENNTK 1
2 DPFEGPNYHIAPRWVFNLTSVWMIIVVVLSLFSNGLVLVATVKFKKLRHPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPMCVFEGYVVSTC 1
2 GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIATILIVFSWVWPASWCSLPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHS 0
0 VAQQQKESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSFHPIAAALPAYFAKGATIYNPIIYVFMNRQ 0
0 FRNCILQLFGKKVDDGSEVSSSSRTEVSSVSNSSVSPA* 0

>LWS_letJap Lethenteron japonicum (lamprey) Gt 0.2.2.1.0.0 indel x x x x 365 aa 000 nm 15096614 AB116381 cone long   
0 MTASWHGAVFAARRRNDDEDTTKDSIFRYTNENNTR 1
2 DPFEGPNYHIAPRWMFNLTSVWMIIVVVLSLFTNGLVLVATMKFKKLRHPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPMCVFEGYVVSTC 1
2 GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIAIILIVFSWVWPACWCSLPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHS 0
0 VAQQQKESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAFHPLTAALPAYFAKSATIYNPVIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSEVSSASRTEVSSVSNSSISPA*

>LWS_geoAus Geotria australis (lamprey) Gt 0.2.2.1.0.0 indel x x x x 365 aa 560 nm 17463225 AY366491 cone long red retinal   
0 MAQSWERAMFAARRRQDEDTTKGDLFRYTNENNTR 1
2 DPFEGPNYHIAPRWMYNLTSFWMIIVVILSLFTNGLVLVATLKFKKLRHPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPLCVFEGFTVSVC 1
2 GITALWSLAIISFERWMVVCKPFGNLKFDGKVAIVLIIFSWAWSAGWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHT 0
0 VAQQQKESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSEVSSSARTEVSSVSNSSVSPA* 0

>PIN_galGal Gallus gallus (chicken) Gt 0...2.2.0.0 indel x x x x 352 aa 000 nm no_ref genome pinopsin pineal non-visual   
0 MSSNSSQAPPNGTPGPFDGPQWPYQAPQSTYVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRMCELEGFMVSLT 1
2 GIVGLWSLAILALERYVVVCRPLGDFQFQRRHAVSGCAFTWGWALLWSTPPLLGWSSYVPE 1
2 GLRTSCGPNWYTGGSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRA 0
0 AAAQQKEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIIIQPVLASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FQSCLLEMLCCGYQPQRTGKASPGTPGPHADVTAAGLRNKVMPAHPV* 0

>PIN_utaSta Uta stansburiana (lizard) Gt 0...2.2.0.0 indel x x x x 359 aa 000 nm 16543463 DQ100321 pinopsin pinopsin missing Anole genome   
0 MVNEWSNATPGPFDGPQWPYLAPRSIYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKKLRSPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTACEFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPVGDFRFQQRHAVFGCVFTWMWSLVWTLPPLFGWSSYVPE 1
2 GLRTSCGPNWYTGGSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRA 0
0 VATQQKEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVIQPALASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRSCLLSTMSCGHRPRGAQETTPAMISIPQGPTSALQGSRNKVTPSASEGSGNEAIPS* 0

>PIN_pheMad Phelsuma madagascariensis (gecko) Gt 0...2.2.0.0 indel x x x x 358 aa 000 nm no_ref AB022881 pinopsin   
0 MHVQMANASQASLKNGTLSPFDGPQWPHRASRRVYTSLAALMGVVVLSASLANGLVIAVSVRFKRLRSPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTACRFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPVGDFQFQRRHAVIGCLYTWGWSLIWTVPPLFGWSSYVPE 1
2 GLGTSCGPNWYMGGTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRA 0
0 VAAQQKEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSIQPGLASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRSCLLNTVSCGRIPQTMPGTPATTAVRGGFVLTSEGRGNKVASTELHS* 0

>PIN_podSic Podarcis sicula (lizard) Gt 0...2.2.0.0 indel x x x x 354 aa 000 nm 16688437 DQ013042 pinopsin pinopsin mRNA   
0 MQASNASWVEVRNRTPGPFEGPQWPYLAPQSTYISVAVLMGLVVISATLVNGLVIVVSVQFKKLRSPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQATCEFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPVGDFRFPARHAVLGCAFTWGWSFVWTVPPLLGWSSYVPE 1
2 GLRTSCGPNWYSGGSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRT 0
0 VAAQQKEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAIRPALASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRSCLLYKMSCGHRALSSQDTTPAGISLPGRLTTSASKGSRNQVSPS* 0

>PIN_xenTro Xenopus tropicalis (frog) Gt 0...2.2.0.0 indel x x x x 346 aa 000 nm no_ref genome pinopsin   
0 MRAGNMSAYEAPGPYDGPQWPHLAPRSTFLTVAAVMCMVVILAFFVNGLVIVVTLKYKKLRSPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTMCEFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPMGDFRFQQKHAILGCSFTWVWSFIWTSPPLFGWCSYVPE 1
2 GLRTSCGPNWYTGGTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRA 0
0 VAAQQKDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVIEPTVASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRNCLMTLLCCGRSFGDDETSSASGRTDVTSVSEAGGNKVTPA* 0

>PIN_bufJap Bufo japonicus (toad) Gt 0...2.2.0.0 indel x x x x 347 aa 000 nm 9537517 AF200433 pinopsin classifies oddly   
0 MHSANMSALETPGPFEGPQWPHVAPRSTYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKKLRSPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLVCELEGFVVSLT 1
2 GIVGLWSLAILAFERYIVICKPMGDFRFQQRHAVMGCAFTWIWAFLWTSPPLIGWCSYVPE 1
2 GLGTSCGPNWYTGGTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRA 0
0 VAAQQKESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVIDPTLASMPSYFSKTATVYNPVIYVFMNKQ 0
0 FRDCLTKLLCCGRNPFGEDETSTTSGRTDVTSVSEGGGNKVTPA* 0

>PIN_calMil Callorhinchus milii (elephantfish) Gt 0...2.2.0.0 indel x x x x 093 aa 000 nm no_ref genome fragment no petMar
0 FGSTVSFSNNINGYFVLGETVCQFEGFMVSLT 1
2 GIVGLWSLAILAFERYIVICKPMGDFRFQQKHAVWGCLFTWLWSLFWTLPPLFGWCSYVPE 1

>VAOP_galGal Gallus gallus (chicken) Gt 0...2.1.0.0 indel +INPP5A -NXK6 +C10orf61 +ALDH18A1 393 aa 000 nm no_ref genome TCTN3 exon 1 genbank error
0 MDVFRALGNESLLSNSSGPARWDPFHHPLDSIQPWHFRLVAAVMFVVTSLSLAENLAVILVTFKFKQLRQPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWACVLEGFAVTFF 1
2 GIVALWSLALLAFERYIVICRPVGNMRLRGKHAAQGIAFVWTFSFIWTIPPTMGWSSYTTSKIGTTCEPNW 2
1 YSGAYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRK 0
0 VSNTQGRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIELDPHLAAIPAFFSKTATVYNPIIYVFMNKQ 0
0 FRMCLIQMFKCSAIETAESNMNPTSERATLTQDKRDSQLSVMAVRSTILKRKTGDEHRADDLWLFRQLQKPKCVPCRAGDGS* 0

>VAOP_anoCar Anolis carolinensis (lizard) Gt 0...2.1.0.0 indel +INPP5A -NXK6 +GPR125 +KNDC1 389 aa 000 nm no_ref genome vertebrate ancient   
0 MAGLRREAENDSWLFDPSSSSAPFDPFLQPLDIIEPWNFHLISALMFVVTLFSLSENFTVILVTIKFKQLRQPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWACVLEGFAVTFF 1
2 GIVALWSLALLAFERYVVICRPLGNMRLNGKHAALGVAFVWIFSFIWTVPPTMGWSSYTTSKIGTTCEPNW 2
1 YSGDYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRK 0
0 VSDTQGRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIELDPRLAAIPAFFSKTATVYNPVIYVFMNNQ 0
0 FRKCLVQLFQCSSQETMDANVNPISEKDTLTHTKHCGEMSTVAAHVIVFNPRSEDEQGSCQSFAQLAISENKVYPL* 0

>VAOP_xenTro Xenopus tropicalis (frog) Gt 0...2.1.0.0 indel - +GSTO2 -C10orf92 - 383 aa 000 nm no_ref genome vertebrate ancient new   
0 MPTNVSLLATPENSTVWNPFTGPLKTIEAWNFHLLAALMFVVTSLSIAENFIVILVTAKFKQLRQPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWACVLEGFAVTFF 1
2 GIVALWSLSVLAFERYIVICRPLGNLRLQGKHSALAIIFVWVFSFVWTIPPTMGWSSYTTSKIGTTCEPNW 2
1 YSGEMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRK 0
0 VSDTQGRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDLDPRLAAIPAFFAKTASMYNPIIYVYMNKQ 0
0 FRRCLYQMFNINDPEAKESNLNPTSERGVLTRNNNGGEMLAIATHITSSAVTNREEEKSSSNSFAHIPVSDNKVCPM*

>VAOP_danRer Danio rerio (zebrafish) Gt 0...2.1.0.0 indel - - - - 378 aa 000 nm 17067577 NM_131586 vertebrate ancient valop vertebrate assembly missing exon 3   
0 MEASSAAVNAVSPAEDPFSAPLSSIAPWNYSVLAALMFVVTALSLSENFTVMLVTFRFQQLRQPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWACVLEGFAVTFF 1
2 GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLVFVWSFSFIWTVPPVLGWSSYTVSRIGTTCEPNW 2
1 YSGNFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRK 0
0 VSNTHGRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHVDPRLAAIPAFVAKTAAVYNPIIYVFMNKQ 0
0 FRKCLVQLLSCSKVTVVEGNNNQTTERAGMTSGSNTGEMSAIAARVSVPKTEENPGDRSTFSHIPIPENKVCPM*

>VAOP_takRub Takifugu rubripes (teleost) Gt 0...2.1.0.0 indel +INPP5A -NXK6 - +KNDC1 362 aa 000 nm no_ref genome vertebrate ancient   
0 MESLSLSVNGVSYTVAAELAPTNDPFTGPINNIAQWNFTILAVLMFVVTSLSLCENFLVMFITFKFKQLRQPLNYIIVNLAIADFLVSLTGGLISFLTNARGYFFLGRWACVLEGFAVTYF 1
2 GIVAMWSLAVLSFERFFVICRPLGNMRLQAKHAAIGLLFVWTFSFVWTFPPVLGWNRYTVSKIGTTCEPDW 2
1 YSNNMTSHSYIITFFSTCFILPLGIIFFCYGKLLRKLRK 0
0 VSHGRLATARKPERQVTRMVVVMIVAFMVAWTPYATFAILVTIHPTIELDPRLASIPAFFSKTAAVYNPIIYVFMNKQ 0
0 FRKCLIQHFIGMGVMAESNMNPTSERPGITAESQTGEMSAIAARVPVGATAALHSDGSPTDCGSLAQLPIPENKVCPI* 0

>VAOP_rutRut Rutilus rutilus (minnow) Gt 0...2.1.0.0 indel x x x x 383 aa 000 nm 12906786 AY116411 vertebrate ancient vertebrate   
0 MELFPVAVNGVSHAEDPFSGPLTFIAPWNYKVLATLMFVVTAASLSENFAVMLVTFRFTQLRKPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWACVLEGFAVTYF 1
2 GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLLFVWTFSFIWTIPPVLGWSSYTVSKIGTTCEPNW 2
1 YSGNFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRK 0
0 VSNTHGRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHLDPRLAAAPAFFSKTAAVYNPVIYVFMNKQ 0
0 FRKCLVQLLRCRDVTIIEGNINQTSERQGMTNESHTGEMSTIASRIPKDGSIPEKTQEHPGERRSLAHIPIPENKVCPM* 0

>VAOP_calMil Callorhinchus milii (elephantfish) Gt 0...2.1.0.0 indel x x x x 080 aa 000 nm no_ref genome fragment   
0 VASTQGRLGVARKPEKQVTRMVIVMILAFLFCWTPYAAFSITVTACPTIKLDPRLAAIPAFFSKTATVYNPIIYVFMNKQ 0

>VAOP_petMar Petromyzon marinus (lamprey) Gt 0...2.1.0.0 indel x x x x 445 aa 000 nm 9427550 U90667 vertebrate ancient exons 123 in traces pineal gland-specific   
0 MDALQESPPSHHSLPSALPSATGGNGTVATMHNPFERPLEGIAPWNFTMLAALMGTITALSLGENFAVIVVTARFRQLRQPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHACVLEGFAVTYF 1
2 GVVALWSLALLAFERYFVICRPLGNFRLQSKHAVLGLAVVWVFSLACTLPPVLGWSSYRPSMIGTTCEPNW 2
1 YSGELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKK 0
0 ASETQRGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHLDPLLAAVPAFFSKTATVYNPVIYIFMNKQ 0
0 FRDCFVQVLPCKGLKKVSATQTAGAQDTEHTASVNTQSPGNRHNIALAAGSLRFTGAVAPSPATGVVEPTMSAAGSMGAPPNKSTAPCQQQGQQQQQQGTPIPAITHVQPLLTHSESVSKICPV* 0

>PPIN_anoCar Anolis carolinensis (lizard) Gt 0...2...0.0 indel -CPEB2 -CACNA2D3 +SELK +ACTR8 346 aa 000 nm no_ref genome parapinopsin syntenic deleted in chicken   
0 MDSLDTNTLSPNASTVRVVLMPRIGYTIIAIIMATSCTLSVILNTAVIAITIKYRQLRQPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVGCVTEGFAMAFF 1
2 GIVALCTIAVIAVDRAIVIAKPMGTITFTTRKAMIGVAVSWIWSLVWNTPPLFGWGGYQMEGVMTSCAPDWANSDPINVSYIICYFLFCFTIPFITILASYGYLIWTLRQ 0 
0 VAKVGLAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYINPIIATIPMYMAKSSTFYNPIIYIFMNKQ 0
0 FRDCLVRCLLCGRNPCASEQTDEDDLEVSTIAPAPSSRRGKVAPV* 0

>PPIN_xenTro Xenopus tropicalis (frog) Gt 0...2...0.0 indel - - +SELK - 349 aa 000 nm no_ref genome parapinopsin bistable UV lamprey pineal broken contigs  
0 MADEALLPPMMNVTNEEMHPGKVLMPRIGYTILALIMAVFCAAALFLNVTVIVVTFKYRQLRHPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVGCVIEGFAVAFF 1
2 GIAALCTIAVIALDRVFVVCKPMGTLTFTPKQALAGIAASWIWSLIWNTPPLFGWGSYELEGVMTSCAPNWYSADPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQ 0
0 VAKLGVAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHIDPIIATVPMYLTKTSTVYNPIIYIFMNKQ 0
0 FQECVIPFLFCGRNPWAAEKSSSMETSISVTSGTPTKRGQVAPA* 0

>PPIN_ictPun Ictalurus punctatus (catfish) Gt 0...2...0.0 indel x x x x 347 aa 000 nm no_ref genome parapinopsin parapinopsin index sequence   
0 MASIILINFSETDTLHLGSVNDHIMPRIGYTILSIIMALSSTFGIILNMVVIIVTVRYKQLRQPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVGCVLEGFAVAFF 1
2 GIAGLCSVAVIAVDRYMVVCRPLGAVMFQTKHALAGVVFSWVWSFIWNTPPLFGWGSYQLEGVMTSCAPNWYRRDPVNVSYILCYFMLCFALPFATIIFSYMHLLHTLWQ 0
0 VAKLQVADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYINPVIGTIPAYLAKSSTVFNPIIYIFMNRQ 0
0 FRDYALPCLLCGKNPWAAKEGRDSDTNTLTTTVSKNTSVSPL* 0

>PPIN_danRer Danio rerio (zebrafish) Gt 0...2...0.0 indel - - +SELK - 338 aa 000 nm no_ref XM_681591 parapinopsin parapinopsin   
0 MESETSTAASGSIAEVMPRMGYTILAVIIGVFSVCGVILNVTVITVTLKYKQLRQPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVGCVLEGFAVAFF 1
2 GIAALCSVAVIALERCMVVCRPVGSISFQTRHAVFGVAVSWLWSFIWNTPPLFGWGRLQLEGVRTSCAPDWYSRDLANVSFIVCYFLLCFALPFSVIVYSYTRLLWTLRQ 0
0 VSRLQVCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYIDPVIATVPMYLTKSSTVFNPIIYIFMNRQ 0
0 FRDRALPFLLCGRNPWAAEAEEEEEETTVSSVSRSTSVSPA* 0

>PPIN_oncMyk Oncorhynchus mykiss (trout) Gt 0...2...0.0 indel x x x x 347 aa 000 nm no_ref genome parapinopsin   
0 MDHQQLLPNLHGNISSSPGSVSEALLSRTGFTILAVIIGVFSVSGVCMNVLVIMVTMRHRKLRQPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLGCVLEGFAVAFF 1
2 GIAGLCSVAVIAVDRYVVVCRPMGAVMFQTRHAVGGVVLSWVWSFLWNTPPLFGWGSFELEGVRTSCSPNWYSREPGNMSYIILYFLLCFAIPFSIIMVSYARILFTLHQ 0
0 VSKLKVLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHINPLIATVPMYLAKSSTVYNPIIYVFMNRQ 0
0 FRDCAVPFLLCGLNPWASEPVGSEADTALSSVSKNPRVSPQ*

>PPIN_calMil Callorhinchus milii (elephantfish) Gt 0...2...0.0 indel x x x x 109 aa 000 nm no_ref genome fragment   
0 MDPHNRSANLSEGPGLGGGGAVPGWGPSVRAPLSLVMAVISLSSIVLNSLAIAVVLRFQVLQQPLNYALLSLASADLGTAATGGVLSTVCTALGSFVLGRHSCVAEGFF 1 

>PPIN_petMar Petromyzon maritimus (lamprey) Gt 0...2...0.0 indel x x x x 344 aa 000 nm no_ref genome parapinopsin bistable pineal UV/green   
0 MENLTSLDLLPNGEVPLMPRYGFTILAVIMAVFTLASLVLNSTVIIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF 1
2 GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGITWAWLWSFVWNTPPLFGWGSYKLEGVRTSCAPDWYSRDPANVSYIVSFFSFCFAIPFLVIVVAYGRLLWTLHQ 0
0 VAKLGMGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVAKPGVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ 0
0 FRDCAVPFLLCGRNPWAEPSSESATTASTSATSVTLASVPGQVSPS* 0

>PPIN_letJap Lethenteron japonicum lamprey Gt 0...2...0.0 indel x x x x 344 aa 000 nm 14981504 AB116380 parapinopsin bistable pineal UV/green   
0 MENLTSLDLLPNGEVPLMPRYGFTILAVIMAVFTIASLVLNSTVVIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF 1
2 GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGIAWAWLWSFVWNTPPLFGWGSYELEGVRTSCAPDWYSRDPANVSYITSYFAFCFAIPFLVIVVAYGRLMWTLHQ 0
0 VAKLGMGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ 0
0 FRDCAVPFLLCGRNPWAEPSSESATAASTSATSVTLASAPGQVSPS* 0

>PPINa_cioInt Ciona intestinalis (tunicate) Gt 0...2...0.0 indel -HOXB1 +HHEX +CUL4A - 391 aa 000 nm 11591373 NM_001032555 parapinopsin Ci-opsin1 odd exons larval ocellus   
0 MDHDVTPTVDLTDGVPQCKDLNPYVLKGDGWVPQHISRANRSTYSFLCVYMTFVFLLSCSLNILVIVATLKNK 0
0 VLRQPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTMCQIEGYFVSNF 1
2 GVTGLLSIAVMAFERYFVICKPFGPVRFEEKHSIF 1
2 GIVITWVWSMFWNTPPLIFWDGYDTEGLGTSCAPNWFVKEKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQ 0
0 IAKESSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQ 0
0 IDYALGAAPAFFAKTATIYNPLIYIGLNRQ 0
0 FRDCVVRMIFNGRNPWVDELVGSQVSSTGSQLTAVSSNKVAPA* 0

>PPINb_cioInt Ciona intestinalis (tunicate) Gt 0...2...0.0 indel -TMEM165 +FUT4 - - 353 aa 000 nm no_ref genome parapinopsin jgi gene model wrong both ends  
0 MTTAETTTECYEKNPYIRNEMGWVPKHILIAERHIYTILAVYMTFIFLLAVSLNGFVIIATMKNK 0
0 KLRQPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTVCILEGYIVSVA 1
2 GVCGLMSISVMAFERYFVVCKPYGPFTLTNTHAAL 1
2 GIGFTWTWSVLWSTPGLIWLDGYVPEGLGTSCAPNWFSKNK 2
1 SERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQ 0
0 ATRQSSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQ 0
0 LDYGLGAVPVFFAKTANIYNPLIYIGLNKQ 0
0 FRDGVIKMVFRGRNPWAEEMSTQQRQRSTEAGQPIVSNEV* 0

>PPINa_cioSav Ciona savignyi (tunicate) 88%
0 MPTEASIAVDVSPTMGIPQCKDINPYVLKGDGWVPQHISRADRSVYSFLAVYMTFICLISCSLNILVITATLKNK 0
0 VLRQPLNYIIVNLAVVDLLSGLVGGVISIFANGAGYFFWGKFMCQVEGYTVSNF 1
2 GVTGLLSIAVMAFERYFVICKPFGPVRFEEKHAVI 1
2 GIAVTWIWAMFWNTPPLIFWDGYDTEGLGTSCAPNWFVKGNTERLFIILYFVFCFLIPLAIIVLCYGKLILQLRQ 0
0 IAKESSLSGGTSPEGEVTKMVVVMVTAFVICWLPYAAFAMYNVVNPEAQ 0
0 IDYALGAAPAFFAKTATIYNPLIYIGLNRQ 0
0 FRDCVVRMIFNGRNPWVDEMVGSQVSSSASQMTAVSSNKVAPA* 0

>PPINb_cioSav Ciona savignyi 59%
0 MSSIPQNYSNGNPYATTDSGWVPEHIEIANRSTYSGLCVFMSFVFVLAVPLNLLVIVATYKNK 0
0 DLRRPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSVCIFEGYVMSVT 1
2 GVCGILSITVMAFERYFVVCKPFGQTNLKWSHAIT 1
2 GIVFTWTWSVIWHTPGLFFWNGYEPEGFGTSCAPNWFSQQK 2
1 SERIFIFAYFAFCFLTPLTIIFACYLKLILFIRK 0
0 VSKKSMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPDNL 0
0 LSYGIGSVPAFFAKTATIYNPIIYMGLNKK 0
0 FRDGVIRMLFKGRNPWLDGRNTTSSTSTRAQGSLINREVDI* 0

>PARIE_utaSta Uta stansburiana (lizard) Gd+Go 0...2...0.0 indel x x x x 347 aa 522 nm 16543463 DQ100320 parietopsin shift in counterion Gt + Go   
0 MENDSSLATELAEGAIVKPTIFPKAGYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQLRNPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKFCIFQGFAVNYF 1
2 GIVSLWSLTILAYERYNVVCQPLGTLQMSTKRGYQLLGFIWVFCLFWAVVPLFGWSSYGPEGVQTSCSIGWEERSWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG 0
0 LNKKVEQLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKQ 0
0 FRDCAVEFITCGQVVLTSPEEDISTSAIPVEGKGPCKINQVTPV* 0

>PARIE_anoCar Anolis carolinensis (lizard) Gd+Go 0...2...0.0 indel +EEA1 -FLJ46688 +BTG1 - 347 aa 000 nm no_ref genome parietopsin Go like scallop, gusducin not transducin   
0 MENESSLVLEGAEGYIVRPTIFPRAGYGVLAFLMFINALFSLFNNFLVIAVTLKNPQLRNPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRFCIFQGFAVNYF 1
2 GIVSLWSLTILAYERYNVVCQPLGTLQMSTQRAYQLLGFIWVFCLFWAVVPLFGWSSYGPEGVQTSCSIGWEERSWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG 0
0 LNKKVEQLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKE 0
0 FRECAVEFITCGKVVLTSPEEDISTSAISDEGIAPCKINQVTPV* 0

>PARIE_xenTro Xenopus tropicalis (frog) Gd+Go 0...2...0.0 indel -lum -DCN - - 346 aa 000 nm 16543463 NM_001045791 parietopsin   
0 MDGNSTTPGIAVNLTVMPTIFPRSGYSILSFLMFLNAVFSICNNAIVILVTLKHPQLRNPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQFCIFQGFAVNYF 1
2 GIVSLWSLTLLAYERYNVVCEPIGALKLSTKRGYQGLVFIWLFCLFWAIAPLFGWSSYGPEGVQTSCSIGWEERSWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQ 0
0 LNRKIEQQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYISPLAATLPTYFAKTSPVYNPIIYIFLNKQ 0
0 FRTYAVQCLTCGHINLDSLEEDTESVSAQAENMLTPKTNQVAPA* 0

>PARIE_takRub Takifugu rubripes (teleost) Gd+Go 0...2...0.0 indel -HSP90B1 +NT5DC2 -KCND3 -FLNC 351 aa 000 nm 16543463 genome parietopsin   
0 MDSNSTPWSSPPAPLQAEAVTVAPTIFPRVGYSILSFLMFINTVLSVFNNSLAIAVMLKNPSLLQPINIFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTACVFQGFAVNYF 1
2 GLVSLCTLTLLAYERYNVVCKPRAGLKLTMRRSIIGLLFVWTFCLFWAVTPLLGWSSYGPEGVQTSCSLAWEERSWNNYSYLILYTLLCFIFPVGVIIYCYCKVLTSMNK 0
0 LNKSVELQGGLSCRRENKHAINMVLAMIIAFFVCWLPYTALSVVVVVDPELHIPPLVATMPMYFAKTSPVYNPIIYFLSNKQ 0
0 FRDATLEVLSCSRYIPHASSRVSINMRSLNRRSVNTHSKVSPL* 0

>PARIE_gasAcu Gasterosteus aculeatus (stickleback) Gd+Go 0...2...0.0 indel -HSP90B1 +NT5DC2 -KCND3 -FLNC 361 aa 000 nm no_ref genome parietopsin   
0 MDSNSTLWSSGSPPPSIHGKMLTITPTIFPRVGYSILSFLMFINTVLTVFNNVLVITVLVRNPSLLQPMNVFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTACIFQGFAVNYF 1
2 GLVSLCTLTLLSYERYNVVCRPRNALKLSMRRSIHGLLIVWTFCLFWAVAPLFGWSGYGPEGVQTSCSLAWEERSWSNYSYLVLYTLLCFIVPVAVIIYCYAKVLTSMNT 0
0 LNRSVEVQGGRSSQKENDHAVSMVLAMIIAFFSCWLPYTALSVVVVVDPTLYIPPLVATMPMYFAKTSPVYNPIIYFLSNKQ 0
0 FRDAALEMLSCGRYIAHMPNTVSINMRSLNRRSRLSSLSRNVNSHSKVLPL* 0

>PARIE_danRer Danio rerio (zebrafish) Gd+Go 0...2...0.0 indel - +NT5DC2 +FBXL13 - 337 aa 000 nm 16543463 genome parietopsin   
0 MENFAKTELTMMVQPTIFPRVGYSILSYLMFINTTLSVFNNVLVIAVMVKNLHFLNAMTVIIFSLAVSDLLIATCGSAIVTVTNYEGSFFLGDAFCVFQGFAVNYF 1
2 GLVSLCTLTLLAYERYNVVCKPMAGFKLNVGRSCQGLLLVWLYCLFWAVAPLLGWSSYGPEGVQTSCSLGWEERSWRNYSYLILYTLMCFILPTVIITYCYSNVLLTMRK 0
0 INKSIECQGGKNCAEDNEHAVRMVLAMIIAFFICWLPYTAISVLVVVNPEISIPPLIATMPMYFAKTSPVYNPIIYFLTNKR 0
0 FRESSLEVLSCGRYISRETGGPLMGSSMQRGQSRVNPV* 0

>PARIE_petMar Petromyzon marinus (lamprey) Gd+Go 0...2...0.0 indel x x x x 082 aa 000 nm no_ref genome fragment   
0 LNKKIKRVGGHPDPREEMRATVMVLAMVGAFLACWLPYTVLALCVVLAPGTQIPPLVATLPMYFAKTSPMYNPIIYFFLNPQ 0

>ENCEPH_homSap Homo sapiens (human) Gt 0...2...0.0 indel -EXO1 -WDR64 -KMO +FH 403 aa 000 nm 12242008 NM_014322 parietopsin OPN3 with intron loss   
0 MYSGNRSGGHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLALLLGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF 1
2 GIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDVHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRM 0
0 LRCVEDLQTIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRK 0
0 FRRSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESLSVDDSDKTNGSKVDVIQVRPL* 0

>ENCEPH_monDom Monodelphis domestica (opossum) Gt 0...2...0.0 indel -EXO1 -WDR64 -KMO +FH 411 aa 000 nm no_ref genome encephalopsin OPN3 extra intron alt splicing   
0 MYSDNSSDDGGGGYWGSGRAGGASGTGVTGEPGPEGSPRQAPLFSPGTYELLALLIATIGLLGLCNNLLVLVLYYKFQRLRTPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVGCAWDGFSNTLF 1
2 GIVSIMTLTVLAYERYNRIVHAKVINFSWAWRAITYIWLYSLVWTGAPLLGWNRYTLEIHGLGCSVDWKSKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRM
0 LRCVEELQTIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLVTPTVAIIASLFAKSSTAYNPIIYIFMSRK 0
0 FRRCLLQLLCFRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVTFSSSSIIFIITSDETQMIDENDKNSGTKVNVIQVRPL* 0

>ENCEPH_galGal Gallus gallus (chicken) Gt 0...2...0.0 indel -EXO1 -WDR64 -PIGM +RGS7 396 aa 000 nm no_ref genome encephalopsin OPN3   
0 MHSGNGTGATSRPQLAAAGHEVPGERPLFSAGTYELLALLIATIGTLGVCNNLLVLVLYYKFKRLRTPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAGCVWDGFSNSLF 1
2 GIVSIMTLTVLAYERYIRVVHAKVIDFSWSWRAITYIWLYSLAWTGAPLLGWNRYTLEIHGLGCSMDWKSKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRM 0
0 LRCVEDFQTSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLVTPTVAIIPSFFAKSSTAYNPVIYIFMSRK 0
0 FRQCLLQLLCFRLMRFQRIMKEPSGAGNVKPIRPIVMSQKVGDRPKKKVTFSSSSIIFIIASDDTQQIDDNSKHNGTKVNVIQVKPL* 0

>ENCEPH_anoCar Anolis carolinensis (lizard) Gt 0...2...0.0 indel -EXO1 -WDR64 -PIGM +RGS7 408 aa 000 nm no_ref genome encephalopsin OPN3   
0 MFSANGTRSGAGSDLEPGPGQQQQQREASEEEERGAGLSPFSAGTYELLALLVAAIGLLGLCNNLLVLVLYAKFKRLRTPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAGCVWDGFSNSLF 1
2 GIVSIMTLTVLAYERYIRVVHARVIDFSWSWRAITYIWLYSLAWTGAPLLGWNHYTLEIHGLGCSVDWQSKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRM 0
0 LRCVEDLQSIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLITPTVAIIPSFFAKSSTAYNPVIYIFMSRK 0
0 FRRCLVQLFCVQFLRFKRTLKEQPAIESNKPIRPIVMSQKVGDRPKKKVTFSSSSIIFIITSDDTEQIDVSTKCSDTKINVIQVKPL* 0

>ENCEPH_xenTro Xenopus tropicalis (frog) Gt 0...2...0.0 indel -ASAH3L +RPS6 -ADFP -DENND4C 388 aa 000 nm    
0 MPVTNGSHNNSISWLHSKDMFTEDTYHFLALIVATVGFLGLVNNLLVLILYCKFKRLQTPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEMCVFHGFSKNLL 1
2 GIVSFGTLTVVAYERYARVVYGKYVNSSWSKRSITFVWVYSLAWTGFPLIGWNLYTFETHKLDCSFEWTATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQK 0
0 LRSVKNIQNFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFITPTITVMPSLLAIASAAYNPVIHIFTIKK 0
0 FRQCLVQLLPPINFHPPINPPINNFWRLLKNLNGRLAMKKVKPVLGKGRSHNRPEKKVPPINFSSSDFFTRTTSDTGTHGITESTKGKRTNVRLIQVHPL* 0

>ENCEPH4a_takRub Takifugu rubripes (teleost) Gt 0...2...0.0 indel -CALD1 +TNK2 -RAB18 +ABI1 403 aa 000 nm 12670711 AF402774 encephalopsin TMT multiple tissue circadian clock
0 MIVSNVSLSGCAGVNGAVCAAEGHQAGGSDRSTLTPTGNLVVSVFLGFIGTFGLVNNLLVLVLFCRYKMLRSPINLLLMNISISDLLVCVLGTPFSFAASTQGRWLIGEAGCVWYGFANSLF 1
2 GVVSLISLAVLSFERYSTMMTPTEADPSNYCKVCLGITLSWVYSLVWTVPPLFGWSSYGPEGPGTTCSVNWTAKTTNSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ 0
0 VSGINASTSRKREQRVLCMVVIMVICYLLCWLPYGVVALLATFGPPDLVTPEASIIPSVLAKSSTVINPIIYVFMNKQ 0
0 FYRCFLALLCCQDPRSGSSMKSSSKVATKAKGVTPTGQRRTDFLYMVASLGRPAATIPQLGPSFDATNDFTKPPSSDTIKPVVVSLAAHCDG*

>ENCEPH4b_takRub Takifugu rubripes (teleost) Gt 0...2...0.0 indel +TFRC +CHES1 -MYEOV2 -ARHGAP21 407 aa 000 nm no_ref genome encephalopsin   
0 MIVCNVSLSCAHCPGEGTAANDAYAQASGSLATPTLSQRGHLVVAVCLGFIGTVGFLSNFLVLALFCRYRALRTPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLIGRAGCVWYGFVNACL 1
2 GIVSLISLAVLSYERYCTMVSSTIASNRDYRPVLGGICFSWFYSLAWTVPPLLGWSRYGPEGPGTTCSVDWRTQTPNNISYIVCLFTFCLLLPFFVILYSYGKLLHTIRQ 0 
0 VRRVSSTVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLLTPEATITPSLLAKFSTVINPFIYIFMNKQ 0
0 FYRCFRAFLNCSTPKRDSTVRTFTRISLRALRQDQQQKGSALAPSSARPTPNSIHESSLKGSHSTPSNGGAAAAKSPAANRSKPKLILVAHYRE* 0

>ENCEPH_gasAcu Gasterosteus aculeatus (stickleback) Gt 0...2...0.0 indel -LDOC1L +CDC42EP3 -KMO +IDE 389 aa 000 nm no_ref genome encephalopsin OPN3   
0 MNPDNGTREERSTDHSIFAVGTYKLLAFAIGTIGVFGFCNNVVVIVLYCKFKRLRTPTNLLVVNISLSDLLVSVIGINFTFVSCIRGGWTWSRATCIWDGFSNSLF 1
2 GIVSIMTLASLAYERYIRVVHAQVVDFPWAWRAIGHIWLYSLVWTGAPLLGWNRYTLEIHRLGCSLDWASKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQM 0
0 LRSIQDLQTVQIIKILRYEKKVAVMFLLMISCFLLCWTPYAVVSMMEAFGRKNMVSPTVAIIPSFFAKSSTAYNPLICVFMSRK 0
0 FRRCLMQLLCSRVTCLQCNLKERPLAPVQRPIRPIVVSAACGGGRVRPKKRVTFSSSSIVFIITRNDIRHTDVTSNTRESSEANVFQVRPL* 0

>ENCEPH_calMil Callorhinchus milii (elephantfish) Gt 0...2...0.0 indel x x x x 097 aa 000 nm no_ref genome fragment   
0 MNPTNSTEPQEEHLFSPNTYKLLAVIIGTIGIVGFCNNILVLLLYYKFKRLRTPTNLLLVNISVSDLLVSVFGLSFTFVSCTQGRWGWDSAACVWDG

>ENCEPH4_calMil Callorhinchus milii (elephantfish) Gt 0...2...0.0 indel x x x x 177 aa 000 nm no_ref genome fragment   
0 MLNSSPNSSPSLPLSQVGWTGLSRTGLTVVAVCLGIIMVLGFLNNLLVLVLFCKYKVLRSPMNMLLLNISVSDMLVCICGTPFSFAASVQGRWLVGEQGCKWYGFANSLF 1
0 REHRILLMVISMVTFYLLCWLPYGTVALIGTFGNADLITPTCSVIPSILAKSSTVINPVIYVIMNKQ 0

>ENCEPH5_calMil Callorhinchus milii (elephantfish) Gt 0...2...0.0 indel x x x x 070 aa 000 nm no_ref genome fragment   
AQTREHRILLMVISMVTFYLLCWLPYGTVALIGTFGNADLITPTCSVIPSILAKSSTVINPVIYVIMNKQ 0

>ENCEPH_squAca Squalus acanthias (dogfish) Gt 0...2...0.0 indel x x x x 202 aa 000 nm no_ref genome fragment   
0 MNAANSTDTREESLFSPGTYQVLAVIIGTIGVVGFCNNLLMLVLYCKFKRLRTPTNLFLVNISISDLLLSVFGVIFTFVSCVKGRWVWDSAACVWDGFSNCLF 1
2 GISSIMSLTVLAYERYIRVVNATAIDFSWAWRAITYIWLYSLAWTGAPLIGWNSYTLELHRLGCSVNWDSRNPSDTSFVLFLFLGCLLCPIGVIAYCYG

>ENCEPH_petMar Petromyzon marinus (lamprey) Gt 0...2...0.0 indel x x x x 293 aa 000 nm no_ref genome fragment   
0 MQSPKQDSLHYAGDTGAKAAPDSAQGNASALGSNFLLHGGDLGEGSTAFSAATFRLLAGVVGTIGVAGFLNNLLLVALFVGFKRLQTPTNLLLVNISLSDLLVSVFGNTLTLVSCVRRRWVWGNGGCVWDGFSNSLF 1
2 GIVSISTLTALSYERYARLIKAQVLDFSWAWRAVTYTWLYSAAWTGAPLLGWSRYVLEKHGLGCSIDWASSNPPDAAFVLFFFLGCLAAPLLVMGFCFGRIALAITQ 0
0 CWSPYAVASLFVASGFEHLVSPPVSIVPSLLAKSNAVCNPLLFLLMSGN 0

>ENCEPH4_braFlo Branchiostoma floridae (amphioxus) Gt 0...2...0.0 indel -ZFYVE1 +RTF1 -CES1 -POMT2 402 aa 000 nm 12435605 AB050608 encephalopsin Amphiop4 new exon 12 and 34 + perfect fit   
0 MALYNNTSSPSQDLLWDAPYSQGHIWDNSSASNSSEDVMDQGKVELQDFSDAGYTAIATCLALI 1
2 GFVGFTNNFVVILLIGCHRQLRTPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANSLF 1
2 GIVSLVTLSALAFERYCVVVRSSDMLTYKSSLVVITFIWLYSLLWTSLPLLGWSSYQFEGHN 0
0 VGCSVNWVQHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM 0
0 SSEAKPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ 0
0 FREFLLARLQRVCCRQQAVPRVTPMDDNVHVRLGGEGPSQSQQFLPAGENVENVDMLEYVQENCKPKADSLSTISE* 0

>ENCEPH4_braBel Branchiostoma belcheri (amphioxus) Gt 0...2...0.0 indel x x x x 401 aa 000 nm no_ref genome encephalopsin Amphiop4 introns from braFlo   
0 MPLYNTSSGPTQGLPWDTPYSQDPIWNDSSPSNSSEDAVVDQGRGELQDFSDAGYTAIATGLALI 1
2 GLVGSMNNFVVILLIGCHRQLRTPFNLLLLNVSVADLLVSVCGNTLSFASAVQHRWLWGRPGCVWYGFANSLF 1
2 GIVSLVTLSALAFERYCVVVRSSEMLTYKSSLGMIAFIWMYSLLWTSLPLLGWSSYQFEGHS 0
0 VGCSVNWVKHNVNNVSYIITLMVTCFFVPMVVVCWSYACIWRTVRM 0
0 SAEMKSEFGNPQNTGRLVTTMVVVMIVCFLVCWTPYTVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ 0
0 FREFLLARLRTFCCRQPRMLRVTPMDDNAHARLVGEGPSHAQQVIPSEENGENVEMRKVQGNQLKADSLSTISE* 0

>ENCEPH5_braFlo Branchiostoma floridae (amphioxus) Gt 0...2...0.0 indel -ZFYVE1 +RTF1 +ATP6V0E1 -Etf1 409 aa 000 nm no_ref genome encephalopsin extra 0 intron   
0 MLGMHNVMNATDYDNNNATFAAWNFQRNGTTEEEVEFSGFDTVAVVIAAIGIAGFLSNGAVVLLFLKFRQLRTPFNMLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF 1
2 GLVSLISLAVISYERYRMVVKPKGPGSSYLTYNKVGLAIIFIYLYCLLWTTLPIVGWSSYQLE 0
0 GPKISCSVAWEEHSLSNTSYIVAIFIMCLLLPLLIIIYSYCRLWYKVKK 0
0 GSQNLPPAIRKSSQKEQKIARMVVVMITCFLVCWLPYGAMALVVSFGGESLISPTAAVVPSLLAKSSTCYNPLVYFAMNNQ 0
0 FRRYFQDLLCCGRRLFDASASVNTCNTSAMPRHSPVFQKPDSDQYNGIQKSREPQMRTTGQNAPYRQWIEMQTIAVVVKADEVNNKFGEVKT* 0

>ENCEPH5_braBel Branchiostoma belcheri (amphioxus) Gt 0...2...0.0 indel x x x x 421 aa 000 nm 12435605 AB050609 encephalopsin Amphiop5 extra Nfrag in mrna   
0 MLGIYNVVNATEYGNNTTFAAWDFKRNGTGGEEEVEFFGYDAVAGVIAIIGVVGFVSNGAVVVLFLKFPQLRTPFNLLLLNMAVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF 1
2 GLVSLISLAVISFLRYRMVVKPKGPGSSYLTYTKVGLAILFIYLYCLLWTTLPIAGWSSYQLE 0
0 GPKIGCSVAWEEHSWSNTSYIVVLFITCLFAPLLIIVYSYYRLWHKVKQ 0
0 GSRNLPAAMRKSSQKEQKIAMMVIVMITCFMVCWLPYGAMALVVTFGGERLISHTAAVVPSLLAKSSTCYNPVVYFAMNSQ 0
0 FRRYFQDLLCCGRRLFDVSQSVVTGNTAMPRNNSQGFRKDDSDQKQDNGLPKQSEGPMCDHSSNESQMEGSRHNTAASQQWIEMQTIAVVVKAVEVDTSAANEP* 0

>ENCEPH_strPur Stronglyocentrotus purpuratus GLEAN3_03451 modified terminal exon by extending penultimate to stop codon
0 MSLATKKHFIRNAVEEGGHLLEKWDKGG 2
1 YAFIMTFLGLNSLMSHAVIAVDRYLVITKPHF 1
2 GIVVTYPKAFLMISIPWVFSFAWAVFPLAGWGEFTYEGTGAWCSVRWDSDQPQIMSYVLAMMFLTFISSIVIMMYCYICIFLTTRRMPRWATSNSIKTHERNRRRR 2
1 EQKLLKTLIAIAIAFLVAWSPYAITSMIVVFGGSELLSLTATTLPSLFAKSSVMINPIIYAVTSRVFRKSLKK 0
0 MLTSFFPGCMTYIMTDKSPPSSSRPIQLGLCKYHFLY* 0

>PIN_stoPur Stronglyocentrotus purpuratus GLEAN3_05569 0.2.2.0.0 16311335 opsin1 PIN-type introns no cdna no sacKow 
0 MSNLMTGLVTNVNALSGIGNETPTTIGLSSLVVPVSRTTYNYLTVYTGFLTIFGILNNGIVMILFARFPSLRHPINSFLFNVSLSDLIISCLASPFTFASNFAGRWLFGDLGCTLYAFLVFVA 1
2 GTEQIVILAALSIQRCMLVVRPFTAQKMTHRWALFFISLTWIYSLIICVPPLFGWNRYTYEGPGT 1
2 ACSVAWNSPSPGDTSYIIFIFVLVLVIPFGIIIFCYGLLVYAVKK 0
0 ISRTQAALSSEAKADRKVSKMIFIMILFFLIAWTPYTGFSLYVTFGKNVVITPLAGTFPPFFAKLCTIHNPIIYFLLNKQ 0
0 FKDALIQLFCCGENPFDRDESEHEGRGGRHRHRTAPSATAHIGGRGRASSLPTATSMLDIPQAASTAASSSGKTQNKESLEKGPSTSETTNKRVFELSSKIQKFEISEKNNTPSSSELPGASSLSGALMPPRRAMKNQVGCLPPVDN* 0
>RGR_homSap Homo sapiens (human) ?? 0.2.1.2.1.0.0 indel +PCDH21 -LRIT1 -GRID1 -WAPAL 296 aa 000 NM_001012720 var2 retinal epithelium Mueller exon-skipping splice isoform   
0 MAETSALPTGFGELEVLAVGMVLLVE 1
2 ALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLALADSGISLNALVAATSSLLR 2
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1
2 RSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDR 2
1 NFTSFLFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQ 0
0 VNTTLPARTLLLGWGPYAILYLYAVIADVTSISPKLQM 0
0 VPALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK* 0

>RGR_ornAna Ornithorhynchus anatinus (platypus) missing exon 1 retinal ganglia RGR last D in DRY motif, afros ERY, other placentals GRY
0 1
2 ALLGLCLNGLTIASFRKIKELRTPSNLLVVSLALADSGICLNALMAALSSFLR 2
1 HWPYGAEGCRLHGFQGFATALASISLSAAIGWDRYLRHCS 1
2 RSKPQWGTAVSTVLFAWGFSAFWSMMPILGWGQYDYEPLRTCCTLDYSKGDR 2
1 NFTTYLFAVAFFNFVIPLFIMLTSYQSIEQRFKKSGLFK 0
0 LNTRLPTRTLLFCWGPYALLCFYATVENVTFISPKLRM 0
0 IPALIAKTVPVIDAFTYALRNEDYRGGIWQFLTGQKIERVEVENKIK* 0

>RGR_galGal Gallus gallus (chicken) ?? 0.2.1.2.1.0.0 indel +PCDH21 -LRIT1 +CHAT -PARG 296 aa 000 nm 14985289 NM_001031216 retinal ganglia RGR   
0 MVTSHPLPEGFTEIEVFAIGTALLVE 1
2 ALLGFCLNGLTIISFRKIKELRTPSNLLVLSIALADCGICINAFIAAFSSFLR 2
1 YWPYGSEGCQIHGFQGFLTALASISSSAAVAWDRYHHYCT 1
2 RSKLQWSTAISMMVFAWLFAAFWATMPLLGWGEYDYEPLRTCCTLDYSKGDR 2
1 NYITFLFALSIFNFMIPGFIMMTAYQSIHQKFKKSGHYK 0
0 FNTGLPLKTLVICWGPYCLLSFYAAIENVMFISPKYRM 0
0 IPAIIAKTVPTVDSFVYALGNENYRGGIWQFLTGQKIEKAEVDSKTK* 0

>RGR_xenTro Xenopus tropicalis (frog) ?? 0.2.1.2.1.0.0 indel +PCDH21 -LRIT1 +CHAT -PARG 296 aa 000 nm no_ref BC135113 retinal ganglia RGR   
0 MVTSYPLPEGFTETEVFAIGTTLLVE 0
0 ALLGLLLNGLTLLSFYKIRELRTPSNLFIISLAVADTGLCLNAFVAAFSSFLR 2
1 YWPYGSEGCQIHGFQGFVAALSSIGSCAAIAWDRYHQYCT 1
2 RSKLHWSTAVSVVFFIWGFSAFWSAMPLFGWGEYDYEPLRTCCTLDYSKGDR 2
1 NYISYLFTMAFFEFLVPLFILMTAYQSIYQKMKKSGQIR 0
0 FNTSMPVKSLVFCWGPYCLLCFYAVIQDATILSPKLRM 0
0 IPALLAKTSPAVNAYVYGLGNENYRGGIWQYLTGQKLEKAETDNKTK* 0

>RGR_gasAcu Gasterosteus aculeatus (stickleback) ?? 0.2.1.2.1.0.0 indel +PCDH21 -LRIT1 +CHAT -PARG 296 aa 000 nm no_ref genome retinal ganglia RGR   
0 MVSSYPLPDGFTDFDVFSLGSCLLVE 0
0 GLLGILLNAVTIAAFLKVRELRTPSNFLVFSLAVADIGISMNATIAAFSSFLR 2
1 YWPYGSDGCQTHGFQGFVTALASIHFIAAIAWDRYHQYCT 1
2 RTKLQWSSAITLAVFVWLFTAFWSAMPLIGWGEYDYEPLRTCCTLDYTKGDR 2
1 NYVSYLIPMAIFNMAIQVFVVMSSYQSIAQKFKKTGNPR 0
0 FNPNTPLKAMLFCWGPYGILAFYAAVENATLVSTKLRM 0
0 MAPILAKTSPTFNVFLYALGNENYRGGIWQLLTGEKIDVPQIENKSK* 0

>RGR_calMil Callorhinchus milii (elephantfish) ?? 0.2.1.2.1.0.0 indel x x x x 227 aa 000 nm no_ref genome fragment + frag petMar   
0 EGFTDFEVFGLGTALLVE 0
0 GLVGLLLNGLTLLAFYKIKELRTPSNLLITSLALSDFGISMNAFIAAFSSFLR 2
1 YWPYGSEGCQTHGFHGFLMALASINACAAIAWDRYHQNCS 1
2 RSRLQWSSAITVTVFIWGIAAFWSAMPLLGWGVYDYEPLRTCCTLDYSKGDR 2
1 EFIFPIFIMLSSYQSCKSKFKKTGQVK 0
0 FNTGLPVKTLIFCWGPYSLLCFYATIENITILSPKLRM 0

>RGRa_cioInt Ciona intestinalis (tunicate) Ci-opsin3 mRNA 0.2.0.2.1.0.0 PPT1 NOMO2+ FOXC2- FAHD1- AB079882 12687683 289 aa larval visual cycle photoisomerase last 4 introns like RGR
0 MEVNDKRVYGVLMGLL 1
2 GLLTITGYSLLFVIFAKRPDLKKKNKFLLSLATSDLLITVHVFASTIAAFAPQWPFGDLGCQ 0
0 VDAFIGMAPTFISIAGAALIAKDKYYRFCKPKM 1
2 MVGRNYSFHVYLTWTMGIIGGALPFIGFGRYGFETDDVTWRTGCLLDFKSISA 2
1 KYSFYIILISTVWFVWPVYKLVSSYMKISTKINKFYP 0
0 LLFVVPVQMAIGLLPYAIYAMVSITIGVSAVPYFCVVINN 0
0 LAAKVFVGSNPFIYIYFDPELRESCKQIFCSPPAPTNDKISEDSKDE* 0

>RGRb_cioInt AB078611 12373590 entire neural tube CiNut
0 MEIDFGFARTVYGVALLLM 1
2 VFITLLGYAVYFGAIWRSKTLQTRHIWLTSLACGDIIMMVHLILESLSSLGMGHRPRQNFECQ 0
0 VGALVGLFSGYVTIASITWIAIDRYYRQCKPEK 1
2 VGVNYCFYVIIVWAMSFLAASGPALGFGAYESAEENTVKCLIDLNKKDT 2
1 NSRLYIILVSAVWFVYPFVKMILYNKKLVQEAKEPQP 0
0 MAFAVPLTFFLCYLPFAIYASLKITVGLPPLNSMVVASIY 0
0 MLPKVISVVNPYLYMRSDPELLAACRHVVGLTDGKKAV* 0

>RGRa_cioSav Ciona savignyi (tunicate) Ci-opsin3 mRNA 0.2.0.2.1.0.0 new 68% 288 aa larval visual cycle photoisomerase
0 MDFSSKRTYGIAMAAL 1
2 GFIAWVGYGLLFVIFAKSPDLKKKNRFLFSLAVSDLLITIHVVASVVASFQSEWPFGSIGCQ 0
0 LDAFIGMAPTFISIAGAALVAKDKYYRICKPKM 1
2 LVGRNYSFSIYANWTLGIIGGLLPFFGFGQYGFETDDLSLRTGCLLDFKTVSA 2
1 KYRFYIVFISLVWFVWPLYKLTSHYIKISAKLDRFHP 0
0 LMFVVPLQMLVSLLPYAIYAMISITVGVSSAPYYLVAVNN 0
0 IAAKVFIGTNPFIYIYFDPELRLACKNLFKYSSTPVQDQIQDKKDE* 0

>PER1_homSap Homo sapiens (human) ?? 0.2.0.2.1.0.1 indel -CFI +NOLA1 +EGF -ELOVL6 338 aa 000 nm 17167409 NM_006583 peropsin RRH retinal photoisomerase
0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1
2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0
0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1
2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2
1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0
0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2
1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0

>PER1_monDom Monodelphis domestica (opossum) ?? 0.2.0.2.1.0.1 indel -CFI +NOLA1 +EGF -ELOVL6 326 aa 000 nm no_ref genome peropsin RRH   
0 MFKNNSVKTLAPEKEGPSVFSPIEHKIVAAYLITA 1
2 GVISIVSNVIVLGIFVKYKALRTATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDGCQ 0
0 IYAGLNIFFGMASIGLLTAVAIDRYLTICQPDL 1
2 GRMTSYNYTLMILTAWVNGFFWALMPIVGWAGYAPDPTGATCTINWRKNDV 2
1 SFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNCPDHINRDWSNQVAVTK 0
0 MSVVMILMFLLAWSPYSIVCLWASFGDPKEIPPAMAIVAPLFAKSSTFYNPCIYVAANKK 2
1 FRRAISAMIRCQTHQSMPISNALPMN* 0

>PER1_ornAna Ornithorhynchus anatinus (platypus) XM_001506366 peropsin RRH
0 MRRNDSANLLESEHHDRSAFSQTDHNIVAAYLITA 1
2 GIMSIVSNVIVLGIFVKFEELRTATNAIIINLAVTDIGVSGIGYPMSAASDLHGSWKFGHAGCQ 0
0 IYAGLNIFFGMSSIGLLTVVAVDRYLTICRPAI 1
2 GRKMTRSNYTAMILAAWMNGFFWASMPLLGWASYASDPTGATCTINWRKNDA 2
1 SFISYTMTVIAVNFAVPLIVMFYCYYNVSKAMRQYPASRVLENLNIDWSEQVDVTK 0
0 MSVVMILMFLMAWSPYSIVCLWSSFGDPKKISPAVAIMAPLFAKSSTFYNPCIYVVANKK 2
1 FRRAMLSMVQCQTHREITITDVLPMNRSRSPH* 0

>PER1_xenTro Xenopus tropicalis (frog) ?? 0.2.0.2.1.0.1 indel -CFI +NOLA1 +EGF -ELOVL6 347 aa 000 nm no_ref genome peropsin RRH   
0 METLAEVSTLLPAGTGTVNISDASSEVHSVFSQSEHNIVAAYLITA 1
2 GVISILSNIIVLGIFVKYKELRTATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVGCQ 0
0 IYAGLNIFFGMASIGLLTVVAIDRYLTICRPDI 1
2 GRRISGRHYTAMILAAWINAVFWSVMPVVGWSSYAPDPTGATCTINWRKNDV 2
1 SFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSSLGGINADWSDQTDVTK 0
0 MSMVMIVMFLVAWSPYSIVCLWSSFGDPRKIPPAMAIIAPLFAKSSTFYNPCIYVIANKK 2
1 FRRAILSMVQCKSRQEVTLDNHFPMNVSQSTLTT* 0

>PER1_gasAcu Gasterosteus aculeatus (stickleback) ?? 0.2.0.2.1.0.1 indel +GPR68 -GNPDA1 -ENPEP -C14orf100 338 aa 000 nm no_ref genome peropsin RRH   
0 MGIDPEVNVTDDVTLYGGKSAFTQLEHNIVAGYLITA 1
2 GVISLFSNIVVLLMFWKFKELRTATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAGCQ 0
0 IYAALNIFFGMASIGLLTVVAIDRYLTICRPDI 1
2 GQKMTMQSYNLLILAAWLNAVFWSSMPVVGWASYAPDPTGATCTINWRQNDV 2
1 SFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNCLDSANIDWSDQMDVTK 0
0 MSIVMIIMFLVAWSPYSIVCLWASFGDPKTIPAPMAIIAPLFAKSSTFYNPCIYVIANKK 2
1 FRRAIIGMVRCQTRQRITINSQVPMTTSQQPLTQ* 0

>PER1_calMil Callorhinchus milii (elephantfish) ?? 0.2.0.2.1.0.1 indel x x x x 151 aa 000 nm no_ref genome fragment   
1 LFVSYTMTVIAVNFVVPLSVMFFCYYNVSKTMSRFISSPSPENINLDWSDQLDVTK 0
0 MSVVMIVMFLLAWSPYSIVCLWASFGNPKLIPPAMAIIAPLFAKSSTFYNPCIYVIANKK 2
1 FRKAIMAMICCQNRQEITINHTLPMTISRVPLTE* 0

>PER1a_braFlo Branchiostoma floridae (amphioxus) ?? 0.2.0.2.2.0.0.0 indel x x x x 365 aa 000 nm 12435605 AB050610 peropsin Amphiop3 frag   
0 MDIPTETPYGAGDDPAGTGWRWAETDQNGFHKYDHLIVGLYLFVI 1
2 GIIGTVENGITLATFTKFRSLRSPTTMLLVHLAIADLGICIFGYPFSGASSLR 0
0 SHWLFGGVGCQWYGFNGMFFGMANIGLLTCVAVDRYLVICRQDL 1 
2 VDKVNYNTYGVMAALGWLFAAFWAALPLVGWAEYSLEPS 1
2 GTACTINWQKNDSLYISYVTSCFILGFALPLAVMMFCYWQ 0
0 ASCFVNKVLKGDISGDLTFPVAVNVDWEYQNHFSK 0
0 MCLAMVAAFVVAWTPYSVLFLFAAFGNPADIPAWITLLPPLIAKSSALYNPIIYIIANRRFRSAIFSMVKGQNPDVE 0
0 TLFARDFRISPIEDTGKEMSSMGNANA* 0

>PER1a_braBel Branchiostoma belcheri (amphioxus) ?? 0.2.0.2.2.0.0.0 indel x x x x 365 aa 000 nm 12435605 AB050610 peropsin Amphiop3   
0 MDIPTETPYGAEEDIGESAGWRWTETDKNGFHKYDHLIVGLYLFVI 1
2 GIIGTIENGITLATFSKFRSLRSPTTMLLVHLAIADLGICIFGYPFSGASSLR 0
0 SHWLFGGVGCQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHDL 1
2 VDKVNYNTYGVMAALGWLFAAFWAALPLVGWAEYALEPS 1
2 GTACTINFQKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQ 
0 ASCFVSKVLKGDIAGDLTFPVAANVDWEYQNHFSK 0
0 MCLAMVAAFVVAWTPYSVLFLFAAFWNPADIPAWLTLLPPLIAKSSALYNPIIYIIANRRFRNAICSMMKGQDPDVE 0
0 DDEHADEHRVRSIEDNDKEIISMVNLNMTV* 0

>PER1_sacKol Saccoglossus kowalevskii best hit: PERa_braFlo e = -49 Identities = 97/246 (39%)
IIYYFFLLSTGLTIFGMSLSCVSSFAGRWLFGKFGCYFHGFAGMLFGLGSIGNLTVISIDRYIITCKRNL 1
2 WSYRHYYALLAVAWSNALFWSMMPLFGWSSYALEPEGTSCTIDWMNNDNQYISYVSCVTVTCFILPCAVMTYDYLAAYMKMVKAGYTLSEETEKPNND 0
0 MCIALVAAFLLSWFPSATVFLWAAFGNPGNIPLSFTGVADAFTKIPAVFNPVIYVALNPEFRKYFGKTIGCRRKRKKPIAVRLNGSEQNVENTI* 0

>PER1b_braFlo Branchiostoma floridae (amphioxus) ?? 0.2.0.2.2.1.0.0.0 indel x x x x 522 aa 000 nm 12435605 AB050607 peropsin Amphiop2 PER/NEUR frag   
0 MIPTNNNTENNDLEWGLEKEHGVSATIMGVYLTIV 1
2 GLVSTVGNATVVLMFMLKWRQLCRKAPNLLIINLAAVDLCISVFGYPFSASSGFANQWLFSDAICT 0
0 LYGFSCFLLSMVSMHTLCLISAHRYITICRPEH 1
2 ASKLTMTRTILAVVGAWVYGISVAVPPLFGIA 1
2 GYTYESFGLSCTIDFHGTTVADMVYLSILIILCYVINVAVMGTCYFKIIRK 2
1 FSKHRFREVRDVRTSHQHSFERGVTL 0
0 RCILMTLFYLISWTPYTAVAVWTMVGPPPPVQLGMVAALTAKTHCAFNPILYMLMSE 0
0 VYRKLVLRTMCPCCFNKISNKLVRLPADDSKHSGNLDIFTVGYNTRDQAVQINKNAARRFCFVMET 0

>PER1b_braBel Branchiostoma belcheri (amphioxus) ?? 0.2.0.2.2.1.0.0.0 indel x x x x 522 aa 000 nm 12435605 AB050607 peropsin Amphiop2 RRH   
0 MIPTNNNTENNDLEWGLEKEHGVSATIMGVYLTIV 1
2 GLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFGYPFSASSGYAHQWLFPDAICT 0
0 LYGFSCFLLSMVSMHTLCLISAHRYITICRPEH 1
2 ASKLTMNRTVLAVIGTWLYAIAVAVPPLFNIA 1
2 RYTYEPSGLSCTIDFRVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRK 2
1 FSRHRFRQVRDIRTSHQRSFEMGVTM 0
0 RCILMTLFYLLSWTPYTAVCIWTMVGPPPPVVVSMAAALIAKTHCAFNPILYAFMSE 0
0 VYRKLVFRTMCPCCFNRISCKFVGTPTGGSKVSANPDIFTVDYNSRDQAVQINKAPSRRFCFVMET 0
0 SEDLGSDDTGLTGHSGLWRSGAEVEGLGGLQVTQSPSVSGSELSLSLLDFLPPKPSGRAVSAKLPSPPALNSERATCPESSQQPSDRPATGLRQYQKGDTTRSSVGDLILTEDD
VTNLPPASETWGRKKSENPLSYRQTTRRTFGRSRKHSYIVD* 0

>PER2_braFlo Branchiostoma floridae (amphioxus) Go 0.2.2.2.2.0.0 indel x x x x 391 aa 000 nm 12435605 AB050606 peropsin Amphiop1 RRH no petMar frag   
0 MNASPSSWLPSGELFTDSPENSSEWPWTDGPTDTAWHHHQTVDPVTYGGYLASAVYLTIT 1
2 GLIAFVGNIFAIIVFLTEKEFRKKEHNSFALNLAIADLSVCVFAYPSSTIS 1
2 GYAGEWMLGDVGCTIYGFLCFTFSLTSMVTLCAISVYRYIVICKPQY 1
2 AHLLTHRRTNYVILGIWLYALVFSVPPLFGVNRYTYEPIS 2
1 ITCSLDWNVQHVGETIYTAAVIIIVYVLNVSIMCFCYFNIIFKSANLKFAALASEKTRTAAKKDIWKTSM 0
0 MCLAMVVSFLIAWTPYAVSSTWDILTEEDLPIIATILPTMFAKSSCMMNPIIYSCCNGKFRQAALKTFSK 0

>PER2_braBel Branchiostoma belcheri (amphioxus) Go 0.2.2.2.2.0.0 indel x x x x 391 aa 000 nm 12435605 AB050606 peropsin Amphiop1 RRH no petMar   
0 MNASPSSWLSSGEFFTDSPENSSEWPWTDGPTDTTWRHHQSVDSVSYEGYLASAIYITLT 1
2 GLIAFFGNVITITVFLTEKEFRKKQQNGFVLNLAIADLSVCVFAYPSSAIA 1
2 GYAGRWVLGDVGCTIYGFLCFTFALVSMVTLCVISIYRYILICKPQY 1
2 AHLLTHRRTVYVIIGTWLYALVFTVPPLVGVKRYTYEPMQ 2
1 ITCSLDWNVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIFKSANLKFAALASEKTKMAAKKDTWKTSV 0
0 MCLTMVVSFLIAWTPYAVSSTWDILSAEDLPIIATILPSLFAKSSCMMNPIIYACCNTKFRQAAVKSFRK 0
0 LCGMCKQKVPLSTPQVVLAMQRNTEFTSTVEPTGQAFPMRVLPSISATHTAL* 0

>PER2a_strPur Stronglyocentrotus purpuratus Go GLEAN3_27634 overshoots iMet opsin3.2 XM_778236 spread across tandem inline, introns id PER2_patYes scallop2
0 MAASVTESSATEAISRLEPEYMVPLTRTGYLLTAIYLTIV 1
2 GSIATVGNITVICVLCRYRTFRKRSINLLLINMAASDLGVSVAGYPLTTVSGYWGRWLFGDVGCQFYAFCVYTLSCSTISTHAAIAVYRYIYIVKTDL 1
2 RPKLTANFTSGVIVVIWVYAFFWTVTPFVGWSSYIYEPFGTSCSVNWVGRTISDISYMVACTIGVYLLQIFIMLYCYIRVAKK 2
1 IRGVDPGRTEEKDAGVVVFGRLRKREAKIDTHVTK 0
0 MCFMMMLTFIVVWAPYAVECLRAAHVHRISALSSVLPTMFAKSSCMVNPIIFLTSSSKFRQDLGKLWSRPSSQDSLQLEER
NKTQRSLYVRHSELGSAHGNDTASVYYEKERIYIGEMRATSIQKEAELLQRDPELLSIASSTNSDVKFVVRDRPKRYTKR
PVKPQGPRGPEMFTASGVTNKGSSTSDSGGQSTSSGTTGSKPKRSGRKASRQYSMKSQSEDTGEIFTLDGSALEMMSLRKL* 0

>PER2b_strPur Stronglyocentrotus purpuratus Go 17067569 GLEAN3_27633 opsin3.1 RRH no cdna inline tandem partner of PER2_strPur, introns id PER2_patYes scallop2
0 MNSFSEESYVTDPTTTQPTLFLTPLSQTGYLLTALYLTLV 1
2 GIVSTIGNITVLCVLCRYGTFRKRSVNILLMNMAVSDLGVSVAGYPLTAISGYRGRWVFADIGCQFSGFCVYALSCSTISTHAVVAIYRYIYIVKPYH 1
2 RPRLSSSTSCLAILCIWTFTLFWTITPFFGWSSYTYEPFGTSCSINWYGKSLGDLTYIICCVVFVFILPIIIMLYCYIGVAKK 2
1 IKGIDPLRTEERDIAVVFGRLRKHETKIDTRVTK 0
0 ICFMMMASFIVVWTPYAVGSIWASKIGKISASASVLPTMFAKSSCMINPIIFLTSSSKFRADLGKLWNRPSSLEHTIRVEERSREQRSFF
VRQSALPDAMVSRSASVYYDKERIYIGEMRAASIQKEADLLHRDPEAISIASSTSSSLQFVLKDRQNRYKKKAGEASKKGSNILHFPYDDTE
GSMINNLMRPRSHSVTSDNISRVFAPSLKRPTKKRSMSHPDIPSTSADIFTVSPTTIKNLQKQ* 0

>NEUR_homSap Homo sapiens (human) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 -MUT 355 aa 000 nm 15774036 NM_181744 neuropsin OPN5   
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0

>NEUR_monDom Monodelphis domestica (opossum) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 -MUT 352 aa 000 nm no_ref genome neuropsin OPN5   
0 MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1
2 GTWLKRHHAYICLVIIWAYATFWATMPLAGLGNYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRTYR 2
1 HTVATIRKSSAVSETHQEV* 0

>NEUR_ornAna Ornithorhynchus anatinus (platypus) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 - 351 aa 000 nm no_ref genome neuropsin OPN5   
0 MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVII 1
2 GVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1
2 GTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKNSR 2
1 HSMSTIRKPSAVSGPHQEV* 0

>NEUR_galGal Gallus gallus (chicken) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 - 352 aa 000 nm no_ref genome neuropsin OPN5   
0 MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVI 1
2 GILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGIS 1
2 VGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAY 1
2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKES 2
1 YTISSHRDSAALSGTQLEV* 0

>NEUR_anoCar Anolis carolinensis (lizard) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 +ITSN2 340 aa 000 nm no_ref genome neuropsin OPN5   
0 MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVI 1
2 GILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGIS 1
2 VGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSY 1
2 GTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTK 0
0 VAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSRYVF 2
1 MLQWDKGHDEV* 0

>NEUR_xenTro Xenopus tropicalis (frog) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 - 340 aa 000 nm no_ref genome neuropsin OPN5 truncated   
0 MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAI 1
2 GILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGIS 1
2 VTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRY 1
2 GTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTK 0
0 VAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKDKSLQNTTSRYVFVVYIPFHHYR 2

>NEUR_gasAcu Gasterosteus aculeatus (stickleback) ?? 0.2.2.2.0.1 indel +CD2AP +GPR115 -PTCHD1 - 331 aa 000 nm no_ref genome neuropsin OPN5 truncated   
0 MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICII 1
2 GIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGIS 1
2 VTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1
2 GTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTK 0
0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSR 2

>NEUR_calMil Callorhinchus milii (elephantfish) ?? 0.2.2.2.0.1 indel x x x x 209 aa 000 nm no_ref genome fragment maybe petMar   
2 GLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMS 1
2 VGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQY 1
2 GSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK 0

>PER1_lotGig Lottia gigantea (limpet) peropsin
0 MTAAEFSSFEHSIVGITYMVI 1
2 GISGTLLSLLVALTFIREKGLFKYGRAWLHISLAIANVGVVGAFPFSGSSSFSGR 2
1 WLYGSGMCTFYGFIGMFFGIAAIGNVFALCVERYLVSKKKDS 1
2 VDKVSNQFYWMITALVWINAFFWGIMPALGWTS 2
1 YDIEPSGTSCTIKWQNYDSGYPSFMAMLSLTCFLIPLPVALICLILSGTDKITEDKEEKTYFREDQLRS 0
0 TCTFLLILALIGWGPYCFICIWALFADTTQVSMLAAVIPPLAAKTMVLLYPVAYCQGNKR FKNAFLGMFIFNESPKQQ* 0

>PER1_aplCal Aplysia californica (sea_slug) peropsin from 8.8k contig cdnas EB338056 to traces
0 MNDDGNALDGAGVDPAATAAHVSTALGFTKFEHASVGALYMLF 1
2 CIVGVTLNLLTALTFYKDTKLTKGSQPWLHILLALANVGVVAPSPFPASSSFSGR 2
1 WLYGSTMCQIYAFEGMFIGIAAIGAVIALCIERYIACQRSGA 1
2 NDTQGWFYGWSITLVLGNALFWAIMPLLGWSR 2
1 YSVEHTGTSCSIDWKNPDESFVSYIMTLEVFSFGIPMMSAFFCLISASPRPQPQGGATAAAGSGQEQQGEDTACKGCFSEDQLRL 0
0 LCYVFIGFVLVGWGPFAYLCTLAVFSDARGISMLAAAIPPLACKAMVSAYPLAYAVVSPR FRQSFLALIGGGEKKKE* 0

>PER1_todPac Todarodes pacificus (squid) retinal photoisomerase retinochrome 2226795 peropsin
0 MFGNPAMTGLHQFTMWEHYFTGSIYLVL 1
2 GCVVFSLCGMCIIFLARQSPKPRRKYAILIHVLITAMAVNGGDPAHASSSIVGR 2
1 WLYGSVGCQLMGFWGFFGGMSHIWMLFAFAMERYMAVCHREF 1
2 YQQMPSVYYSIIVGLMYTFGTFWATMPLLGWAS 2
1 YGLEVHGTSCTINYSVSDESYQSYVFFLAIFSFIFPMVSGWYAISKAWSGLSAIPDAEKEKDKDILSEEQLTA 0
0 LAGAFILISLISWSGFGYVAIYSALTHGGAQLSHLRGHVPPIMSKTGCALFPLLIFLLTARSLPKSDTKKP* 0

>PER2_patYes Patinopecten yessoensis Go depolarizing PM 9287291 AB006455 scop2 inner retina layer
0 MPFPLNRTDTALVISPSEFRIIGIFISIC 1
2 CIIGVLGNLLIIIVFAKRRSVRRPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIGCKIYAFLCFNSGVISIMTHAALSFCRYIIICQYGY 1
2 RKKITQTTVLRTLFSIWSFAMFWTLSPLFGWSSYVIEVVPVSCSVNWYGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKNGIRAQQRYTPRFIQDIEQRVTF 0
0 ISFLMMAAFMVAWTPYAIMSALAIGSFNVENSFAALPTLFAKASCAYNPFIYAFTNANFRDTVVEIMAPWTTRRVGVSTLPWPQVTYYPRRRTS
AVNTTDIEFPDDNIFIVNSSVNGPTVKREKIVQRNPINVRLGIKIEPRDSRAATENTFTADFSVI* 0
>MEL1_homSap Homo sapiens (human) Gq 0.0.1.2.2.1.1.1.0.0 indel -GRID1 -WAPAL +LDB3 +BMPR1A 483 aa 000 nm 16961436 NM_033282 melanopsin OPN4   
0 MNPPSGPRVPPSPTQEPSCMATPAPPSWWDSSQSSISSLGRLPSISPT 0 
0 APGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2
1 SRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGET 1
2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1
2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2
1 ALQTFGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2
1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2
1 VAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTLTSHTSNLSWISIRRRQESLGSESEV 0
0 GWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAKAPPRPQGHEAETPGK 0
0 TKGLIPSQDPRM* 0

>MEL1_monDom Monodelphis domestica (opossum) Gq 0.0.1.2.2.1.1.1.0.0 indel -GRID1 -WAPAL +LDB3 +BMPR1A 483 aa 000 nm no_ref genome melanopsin OPN4   
0 MNPSPMLRGLSCPAQDTNCTKIMASMSEWNNTEEDAYHLVDLPSIAPT 0
0 AVVLPPSSQNIFPTADVPDHAHYTIGATILAVGFTGVLGNLLVIYTFCR 2
1 SHSLRTPANMFIINLAISDFFMSFTQAPVFFASSMYKRWIFGEK 1
2 ACEFYAFCGALFGITSMITLMAIALDRYFVITRPLASIGVISKKKTGFILLGVWLYSLAWSLPPFFGW 1
2 SAYVPEGLLTSCSWDYTTFTPSVRAYTMLLFCFVFFIPLIVIIYCYIFIFRAIQDTNK 2
1 AVHSIGSGESTASPRHCQRMKNEWKMAKIALVVILLYVLSWAPYSTVALVAFAG 2
1 YSHILTPYMNSVPAIIAKASAIHNPIIYAISHPKYR 2
1 MAIAQNFPCLRALLCVRHPRTRSFSSYRFTRRSTMTSQASDISWLPRGRRQLSLGSESEI 0
0 GWNNMEAGTTSLTSRNQQGSCRMDQETMETRELAAIAKAKGRSWETLEK 0
0 TLEEMDDSSLLEVSVDMEQ* 0

>MEL1_galGal Gallus gallus (chicken) Gq 0.0.1.2.2.1.1.1.0.0 indel -GRID1 -WAPAL +LDB3 +BMPR1A 529 aa 000 nm 16856781 AY88294 melanopsin OPN4m   
0 MDLPPRAPT 0
0 KMTVKDVRGAFPTVDVPDHAHYTIGTVILIVGITGTLGNFLVIYAFCR 2
1 SRTLQKPANIFIINLAVSDFLMSITQSPVFFTNSLHKRWIFGEK 1
2 GCELYAFCGALFGITSMITLMVIALDRYFVITKPLASVRVMSKKKALIILVGVWLYSLAWSLPPFFGW 1
2 SAYVPEGLLTSCSWDYMTFTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANK 2
1 SVQTFGCKHGNRELQKQYHRMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAG 2
1 YSHVLTPFMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 TAIATYVPCLGFLLRVSPKESRSFSSYPSSRRTTITSQSSETSGLQKGKRRLSSISDSES 0
0 GCTDTETDITSMISRPASSQVSYEMGEDTTQTSDLGGKPKVKSHDSGIFRK 0
0 TVVDADEIPMVEINDTEHSATSTCKTSEKCNVEEIQ 0
0 RSESLSGIGLREGESRHRTSASQIPSIIITYSNVQGVELHSGYSAGFLHPKNKSHKQNKSSNS* 0

>MEL1_xenTro Xenopus tropicalis (frog) Gq 0.0.1.2.2.1.1.1.0.0 indel -GRID1 -WAPAL +LDB3 +BMPR1A 596 aa 000 nm 16856781 DQ384639 melanopsin OPN4m   
0 MNYQSVRKGITCPPQDANCSRILESLNSWNNSEVNSYKLVELPPIVTT 0
0 ETPQYEIHHVYPTVDVPDHVHYVVGAVILAVGITGMLGNFLVIYAFCR 2
1 SRSLRSPANMFIINLAITDFLMSVTQAPVFFATSLHKRWIFGEK 1
2 GCELYAFCGALFGITSMITLMVIAVDRYFVITRPLTSIGVMSKKRAVLILSGVWLYSLAWSLPPFFGW 1
2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNR 2
1 AVQKIGTDNNKESHKQYQKMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAG 2
1 YASILTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 MAIAKYIPCLGSLLRVKRRDSRSYSSYPSSRRSTVTSHCSQSSDVGGHPKLKNHLPSVSDSES 0
0 GWTDTEADSSVNSRPASRQVSYEMGKDTTETNDLKSKAKLKSHDSGIFEK 0
0 TSMDADDISLVELGTVDRSSPIM 0
0 ANKHLNGLGQRKGDSFTRRSPSSRIPSIVVTHSNHQGSPAAVRHNSTLPGIKVSNSQDREKELKRQIEKVKQYVPIVTITSDTENSTGGFSNELLPANTS* 0

>MEL1_danRer Danio rerio (zebrafish) Gq 0.0.1.2.2.1.1.1.0.0 indel - +USP54 +LDB3 +BMPR1A 594 aa 000 nm no_ref AY078161 melanopsin OPN4m   
0 MMSGAAHSVRKGISCPTQDPNCTRIVESLSAWNDSVMSAYRLVDLPPTTTTTTSVA 0
0 MVEESVYPFPTVDVPDHAHYTIGAVILTVGITGMLGNFLVIYAFSR 2
1 SRTLRTPANLFIINLAITDFLMCATQAPIFFTTSMHKRWIFGEK 1
2 GCELYAFCGALFGICSMITLMVIAVDRYFVITRPLASIGVLSQKRALLILLVAWVYSLGWSLPPFFGW 1
2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFIFVFFIPLIVIIYCYFFIFRSIRTTNE 2
1 AVGKINGDNKRDSMKRFQRLKNEWKMAKIALIVILMYVISWSPYSTVALTAFAG 2
1 YSDFLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 LAIAKYIPCLRLLLCVPKRDLHSFHSSLMSTRRSTVTSQSSDMSGRFRRTSTGKSRLSSASDSES 0
0 GWTDTEADLSSMSSRPASRQVSCDISKDTAEMPDFKPCNSSSFKSKLKSHDSGIFEK 0
0 SSSDVDDVSVAGIIQPDRTLTN 0
0 AGDITDVPISRGAIGRIPSIVITSESSSLLPSVRPTYRISRSNVSTVGTNPARRDSRGGVQQGAAHLSNAAETPESGHIDNHRPQYL* 0

>MEL1_danRer Danio rerio (zebrafish) Gq 0.0.1.2.2.1.1.1.0.0 indel - +USP54 +LDB3 +BMPR1A 473 aa 000 nm no_ref genome melanopsin OPN4m   
0 QVAMVQDVRHPFPTVDVPDHAHYTIGSVILAVGITGMVGNLLVMYAFCK 2
1 SRSLRTPANMFIINLAVTDFLMCVTQTPIFFTTSLHKRWIFGEK 1
2 GCELYAFCGALFGICSMITLMIIAVDRYFVITRPLASIGVMSRKRALLILSAAWAYSMGWSLPPFFGW 1
2 SGAYVPEGLLTSCSWDYMTFSPSVRAYTMLLFTFVFFIPLFVIIYCYFFIFKAIRETNR 2
1 AVGKINGEGGPRDSIKKIHRMKNEWKMAKIALIVILLYVISWSPYSCVALTAF 2
1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 SAIAKYIPCLGVLLCVPRRDRFSSSSFISTRRSTLTSQSSETSSNLHRAGKARLSSVSDSES 0
0 GWTDTEADLSTASSRPASRQVSSEIRKDLCDIKHSSSLRLKVKSRDSGIFDR 0
0 0
0 QNDVSEKADEKRPLVRIPSIIVTSETCPAVLPAGHSSRLIPGAPAVTDS* 0

>MEL1_takRub Takifugu rubripes (teleost) Gq 0.0.1.2.2.1.1.1.0.0 indel - +USP54 +LDB3 +BMPR1A 555 aa 000 nm no_ref genome melanopsin OPN4m   
0 MNFGKSALQPPAQQSVVSCGGGGPEPNCTLRLAVTVMMSVRLAELQLHAST 0
0 LQVAMVRPFPTVDVPDHAHYTIGSVILVIGITGMIGNFLVIYAFCR 2
1 SRSLRTPANMFIINLAVTDLLMCVTQTPIFFTTSMYKRWIFGEK 1
2 GCELYAFCGALFGICSMITLTVIAIDRYFVITRPLTSIGVLSRKRAFVILMTVWIYSLGWSLPPFFGW 1
2 SAYVPEGLLTSCTWDYMTFSPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRATNK 2
1 AVGKVNGSVHSHSRRRESVKNFQRLQNEWKMAKIALMVILLYVISWSPYSCVALTAFAG 2
1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 LALAKYIPCLGFLLCISPHELQSTSSSFMSLRRSTVTSQTSDISGQFRPQSKPRRSSASDSES 0
0 CLTDTEADLSSMGSRPASRQVSCDISRDTTELPEYKPASSFNSKVKSPDSGIFEK 0
0 TSFDFDASMAASRERSSIPN 0
0 SGEFPEGHVMRRTLARIPSIIITSESSHFLPNGRKASSTTCIANGSDIKVGPR* 0

>MEL1_gasAcu Gasterosteus aculeatus (stickleback) Gq 0.0.1.2.2.1.1.1.0.0 indel - - +LDB3 +BMPR1A 556 aa 000 nm no_ref genome melanopsin OPN4m   
0 MNAGESELLLPTQQSILPCGDHEPNCPVAQAETLALSAASANGSA 0
0 VQVAMVSRAPHPYPTVDVPDHAHYTIGSVILAIGITGIIGNVLVIYAFSK 2
1 SRSLRTPANMFIINLAITDLLMCVTQAPIFFTTSMHKRWIFGEK 1
2 GCELYAFCGALFGICSMITLTVIALDRYFVITRPLTSIGMMSRRRALLILMGAWTYSLGWSLPPFFGW 1
2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRVTNR 2
1 AVGKMNGSIHSHGSGRDSTKNFHRLQNEWKMAKIALIVILLYVVSWSPYSAVALTAFAG 2
1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 IALAKYIPFLGVLLCVPPRELRSASSSFRSTRRSTVTSQTSDVSSQQRRQGSRNSRLSSASDSES 0
0 CLTDTEADGSSVGSRPASRQVSCDIGRDTAELPEFKPSSSFKSKMKSHDSGIFEK 0
0 SYDTDISMAGVSERGSIPN 0
0 QTDFAEGRDRRSTIGRIPSIVITSETSPFLPTGRNGSCNGRPKTANSSHPGAGSG* 0

>MEL1_oryLat Oryzias latipes (medaka) Gq 0.0.1.2.2.1.1.1.0.0 indel - +USP54 +LDB3 +BMPR1A 504 aa 000 nm no_ref genome melanopsin OPN4m   
0 LQVAMVPQTFHPFPTVDVPDHAHYTIGSVILAIGITGIIGNFLVIYAFSR 2
1 SRSLRTPANMFIINLAITDLLMCVTQSPIFFTTSMHKRWIFGEK 1
2 GCELYAFCGALFGICSMITLTVIAIDRYFVITRPLTSIGVLSRKRALLILSAAWAYSLGWSLPPFFGW 1
2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFIFVFFLPLFIIIYCYVFIFRAIRSTNR 2
1 AVGKINGNTRDAVKSFNRLQNEWKMAKIALIVILLYVISWSPYSTVALTAFAG 2
1 YADMLTPYMNSIPAVIAKASAIHNPIIYAITHPKYR 2
1 MALAKYIPGLGVLLCIHPKDLRSASSSFVSTRRSTVTSQSSDISSQLRRQSTFKSRLSSLSDSES 0
0 GLTDTEADLSSLSSRPASRQVSCEISRDTAELPDFKHTSSFKAKLKNNDSGIFEK 0
0 TSFDTVSIGGVSEHNSIPS 0
0 NRDFGDGNVTRATIGRIPSIVVTSEMSPFLPVGRNGSRTNRSKMANSSAGAGPV* 0

>MEL1_calMil Callorhinchus milii (elephantfish) Gq 0.0.1.2.2.1.1.1.0.0 indel - - - - 369 aa 000 nm no_ref genome melanopsin OPN4m   
0 ASVTDAQHHHMFPTVDVPDHAHYIIGATILAVGVTGMVGNFLVIYAFLR 2
1 SRSLRTPANTFIINLAATDFLMSVTQSPIFFITSIHKRWIFGEK 1
2 GCELYAFCGALFGITSMITLMVIALDRYFVITRPLASIGVLSHRRAGLIILSLWLYSLAWSLPPFFGW 1
2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFCFVFFIPLGVIIYCYIFIFRAIKSTNK 2
1 KVGGSTNRESQKQHQRMKNEWKMAKIALIVILLFVISWSPYSTVALTAFAG 2
1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 MAIAKYVPLLGLLLRVSRRDSRTSGQYYSTRRSTLTSQTSDLSGYPRGKGRLSSASDSES 0

>MEL1b_calMil Callorhinchus milii (elephantfish) Gq 0.0.1.2.2.1.1.1.0.0 indel x x x x 113 aa 000 nm no_ref EB687868 melanopsin OPN4m   
1 SKSLRTPANMFIINLAISDFFMSATQPPVFFVTSLHKRWIFGEK
2 GCKLYAFCGALFGITSMITLMAISIDRYWVITKPLQSISSTTTKKNTLKVIILVWLYSLAWSLPPLLGW 1

>MEL1_petMar Petromyzon marinus (lamprey) Gq 0.0.1.2.2.1.1.1.0.0 indel x x x x 205 aa 000 nm no_ref genome fragment   
1 SKSLRSPANIFIINLAFADFFMSITQTPIFFVTSLHKRWIFGEK 1
2 GCELYAFCGALFGIASMVTLMVIATDRYLVLTRPLASIGAMSKRRAMYITAAVWFYSLAWSLPPFFGW 1
2 SAYVPEGLMTSCTWDYVTFTPAVRSYTMLLFCFVFFIPLIVIIFCYVRIFAAIKNTNR 2
1 YADMLTPYMNSVPAIIAKASAIHNPIVYAITHPKYR 2

>MEL1a_braFlo Branchiostoma floridae (amphioxus) Gq 0.0.1.2.2.1.1.1.0.0 indel - - - - 709 aa 000 nm no_ref genome melanopsin Amphi-mop 12 exons +tandem dup assembly error   
0 MTELPSFQPPTNSTEEENAVFPTALTEWISE 0
0 VGNQVGEAALKLLSGEGDGMEVTPTPGCTGNASVCNGTDSGGGVVWDIPPLAHYIVGTAVFCVGCCGMFGNAVVVYSFIK 2
1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1
2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0
0 VMFAILLLWIWSLVWALPPLFGWSAYVPEGF 1
2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0
0 AQQERQRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLVTPYLQSIPAVFAKSSAVYNPI 1
2 VYAITHPKFRAAVKKHIPCLSGCLPADEEETKTKTRGATTTASMSMTQTTAPTV 0
0 HDPQASVHSGSSVSVDDSSGVSRQDTMMVK 0
0 VEVDNRMEKAGGGAADTAPKDGTSVPTVSAQIEVRPSGNVNTKAEVIPSPQSAAVAHGASASPVPK 0
0 VAELSSSVSLESAAIPGKIPTPLPSQPIAAPIERHMAAMADDPPPKPRGVATTVNVRRSESGYERSQDSLRKK 0
0 AVSETRSRSFNSTKDHFASERQTSTTLNQPRDMYSGDMVKKTRQSPEKQEYDNPAFDAGIAEIDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDMSINLGKASLMLTEAHDETVL* 0

>MEL1a_braBel Branchiostoma belcheri (amphioxus) Gq 0.0.1.2.2.1.1.1.0.0 indel x x x x 707 aa 000 nm 15936279 AB205400 melanopsin Amphi-mop   
0 MTEIPSFQPPINATEVEEENAVFPTALTEWFSE 0
0 VGNQVGEVALKLLSGEGDGMEVTPTPGCTGNGSVCNGTDSGGVVWDIPPLAHYIVGTAVFCIGCCGMFGNAVVVYSFIK 2
1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1
2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0
0 VMFAILLLWIWSLVWALPPLFGWSAYVSEGF 1
2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0
0 AQQERQRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLVTPYLQSIPAMFAKSSAVYSPI 1
2 VYAITYPKFREAVKKHIPCLSGCLPASEEETKTKTRGQSSASASMSMTQTTAPV 0
0 HDPQASVDSGSSVSVDDSSGVSRQDTMMVK 0
0 VEVDKRMEKAGGGAADAAPQEGASVSTVSAQIEVRPSGKVTTKADVISTPQTAHGLSASPVPK 0
0 VAELGSSATLESAAIPGKIPTPLPSQPIAAPIERHMAAMADEPPPKPRGVATTVNVRRTESGYDRSQDSQRKK 0
0 VVGDTHRSRSFNTTKDHFASEQPAALIQPKELYSDDTTKKMARQSSEKHEYDNPAFDEGITEVDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDLAINLGKASLMLSEAHDETVL* 0

>MEL1b_braFlo Branchiostoma floridae (amphioxus) Gq 0.0.1.2.2.1.1.1.0.0 indel - - - - 402 aa 000 nm no_ref genome melanopsin Amphiop6   
0 MSPNLTNTSLLPNRTDRPELSPADVTMQLVFGSMMLVFGLIGVVGNAVALYAFCR 2
1 SRSLRRPKNYLIANLCLTDMVVCLVYSPIIVTRSLSHG 2
1 LPSKESCIVEGFVVGLGSIVSICSLAGIAVERYVTITQPIKSLSILTHRALLGAVSAVWVYAFLLAFPPLVGWGRYVSEESKISCTFDYLSTDDATRAHVIVLVIGAFGLPFS
VITYCYVRSFATVRKCTKERKQMSPLAKSDSRSEVKAAVNSFVITTSFCLCWCPYAVVATMGVSGFTVHSHAVFIAALLAKLSVLFNPVAYVLSIP 1
2 NSNVNIESTELTVPYSASRESCLLSRAATERLAGRSPSLTDIVREFGLQQTASHRE

>MEL1b_braBel Branchiostoma belcheri (amphioxus) Gq 0.0.1.2.2.1.1.1.0.0 indel x x x x 402 aa 000 nm 12435605 AB050611 melanopsin Amphiop6   
0 MSSNLTNVSLVANRTDQTELSPTDVTMQLIFGSMMLVFGLIGVVGNVVALYAFCR 2
1 TRSLRRPKNYVVANLCLTDMFVCLVYCPIVVSRSFSHG 2
1 FPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPLKSLTILTQRKLLVAVLTVWVYSLLLAFPPLVGWGRYVREETYISCTFDYLSTDDATRAYVITLVMGAFGFPLL
TIAYCYIRVFTTARKHAEERKFMSPLKRPESRTEIKTAVTACVITTSFCLCWCPYAVVATLGISGVSVQQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASED
VVLTSLPGKTRRMKKVERSQSSNSNVVIEVKESSMAYSTSRESCLLSRAATKRLAGKTKSIVDLVDEFGLQETAPHKESLV* 0

>MEL2_galGal Gallus gallus (chicken) Gq 0.0.1.2.2.1.1.1.0.0 indel +GRID2+SMARCAD1 -PGDS -SEC24B +COL25A1 544 aa 000 nm 17977531 AY882944 melanopsin   
0 MGTQPHSVTKSEIPDHVLYTVGTCVLVIGSIGIIGNLLVLYAFYS 2
1 NKKLRTPQNFFIMNLAVSDFLMSASQAPICFVNSLHREWILGDI 1
2 GCDLYAFCGALFGITSMMTLLAISVDRYLVITKPLRSIQWTSKKRTIQIIAAVWLYSLGW 1
2 SVAPLLGWSSYVPEGLMISCTWDYVTYSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGR 2
1 DVQKLGSCSRKSFLSQSMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAG 2
1 RGNTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYR 2
1 KTIHNAVPCLRFLIRISKNDLLRGSINESSFRTSLSSHQSLAGRTKNTCVSSVSTGEA 0
0 NWSDVELDTVEPAHEKLQPRRSHSFSSSLRQKRDLLPDSYSCSEETEEK 0
0 VSLSSSYLEKVLGRSAFPSSPVALVTSSLRAASLPVGLNSSSASRGAGSDISQMKTEESHNNGGLDSIVSNTVPQIIIIPTSETNLFQEEPEEEETELFHFHDKKNNLLDLEGLSSSTEFLEAVEKFLS* 0

>MEL2_anoCar Anolis carolinensis (lizard) Gq 0.0.1.2.2.1.1.1.0.0 indel +GRID2+SMARCAD1 -ATOH1 +PDLIM5 +BMPR1B 290 aa 000 nm no_ref genome melanopsin   
0 MGPHHRTKVDVPDHVLYTVGSCVLVIGCIGITGNLLVLYAFYS 2
1 NKRLRTPPNYFIMNLAVSDFLMSATQAPICFLNSMHKEWVLGDI 1
2 GCNLYAFCGALFGITSMITLLAISVDRYCVITKPLQSIKRTSKKRTCIIIVFVWLYSLGWSVCPLFGW 1
2 SSYIPEGLMISCTWDYVTYSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR 2
1 RKSSISHSIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG 2
1 YARTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYR 2
1 RTIRSAVPCLRFLIPISKSDLSTSSMSESSFRASVSSRHSFSYRNKSTYISSISAKET 0
0 TWCDVELDPVESGHKKLQAYRSNSFSAKGVAEEESGLLLRTNNCNVPARKK 0

>MEL2_xenLae Xenopus laevis (frog) Gq 0.0.1.2.2.1.1.1.0.0 indel +SMARCAD1 +PDLIM5 +BMPR1B 535 aa 000 nm no_ref genome melanopsin Xmop  21 0 0  
0 MDLGKTVEYGTHRQDAIAQIDVPDQVLYTIGSFILIIGSVGIIGNMLVLYAFYR 2
1 NKKLRTAPNYFIINLAISDFLMSATQAPVCFLSSLHREWILGDI 1
2 GCNVYAFCGALFGITSMMTLLAISINRYIVITKPLQSIQWSSKKRTSQIIVLVWMYSLMWSLAPLLGW 1
2 SSYVPEGLRISCTWDYVTSTMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGR 2
1 NVQKLGSYGRQSFLSQSMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAG 2
1 HGKSLTPYSKTVPAVIAKASAIYNPIIYGIIHPKYR 2
1 ETIHKTVPCLRFLIREPKKDIFESSVRGSIYGRQSASRKKNSFISTVSTAET 0
0 VSSHIWDNTPNGHWDRKSLSQTMSNLCSPLLQDPNSSHTLEQTLTWPDDPSPKEILLPSSLKSVTYPIGLESIVKDEHTNNSCVR
NHRVDKSGGLDWIINATLPRIVIIPTSESNISETKEEHDNNSEEKSKRTEEEEDFFNFHVDTSLLNLEGLNSSTDLYEVVERFLS* 0

>MEL2_danRer Danio rerio (zebrafish) Gq 0.0.1.2.2.1.1.1.0.0 indel - +FLJ39155 +PDLIM5 - 346 aa 000 nm no_ref genome melanopsin   
0 MEPQRQIYKRLDVPDHVHYIIAFLILIIGTLGVSGNALVMFAFYR 2
1 NKKLRSLPNYFIMNLAVSDFLMAITQSPIFFINCLYKEWMFGEL 1
2 GCKIYAFCGALFGITSMINLLAISIDRYLVITKPLQTIQWNSKRRTGLAILCIWLYSLAWSLAPLIGW 1
2 GSYIPEGLMTSCTWDYVSPSPANKSYTMMLCCFVFFIPLSIILYCYLFMFLSVRQASR 2
1 QKSSFVKQQSMRSEWKLAKIAAVVIVVYVLSWAPYACVTLVAWAG 2
1 HQDVLTPYSKTLPAVLAKSSAIYNPFIYAIIHNKYR 2
1 RTLAEKVPGLSCLSRSQKDGLSSSTNSDASAQDSSVSRQSSVSKNRLHSTMVQ* 0

>MEL2_tetNig Tetraodon nigroviridis (pufferfish) Gq 0.0.1.2.2.1.1.1.0.0 indel - - - +BMPR1B 404 aa 000 nm no_ref genome melanopsin   
0 MEPKDTHITSSFFSKVDVPDHVHYIIAFFVFVIGILGITGNVLVIFAFYS 2
1 NKKLRSLPNYFIVNLAVSDLLMASTQSPIFFINLYKEWMFGET 1
2 ACKMYAFCGALFGITSMINLLAISVDRYVVITKPLQTIRRSSKRRTALAILMVWLYSLAWSLAPLVGW 1
2 GSYIPEGLMTSCTWDYVTYTLANRSYTMMLCCFVFFIPLAIILCCYLLMFLAIRKTSR 2
1 RKSTLIQQKSIRSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG 2
1 YGSTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYR 2
1 KTIRSAVPCLRFLIPISKSDLSTSSMSDSSFRSALSCRHSYRSRSTYISSISAKET 0
0 TWCDVELDPVESGHKKLQAYRSNSFSAKGVAEEESGLLLRTNNCNVPARKK 0

>MEL2_gasAcu Gasterosteus aculeatus (stickleback) Gq 0.0.1.2.2.1.1.1.0.0 indel KNTC2 +FLJ39155 +PDLIM5 +BMPR1B 353 aa 000 nm no_ref genome melanopsin   
0 MEPDNAHTQRSFINKVDVPDHAHYIVAVFVVVIGTLGITGNALVMLAVYS 2
1 NKKLRNLPNYFIMNLAVSDFLMAFTQSPIFFINCLYKEWAFGET 1
2 GCKIYAFCGALFGIASMINLLAISIDRYLVITKPLQAIHWGSKRRTTLAILLVWLYSLAWSLAPLVGW 1
2 GSYIPEGLMTSCTWDYVTYTLANRSYTMMLCCFVFFIPLGIILYCYLFMFLAIRKTSR 2
1 RKSTLIKQKSMKSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG 2
1 HADILSPYSKAVPAIIAKASAIYNPFIYAIIHNKYR 2
1 MTLAAKFPCLRFLSPTPRKDTSSSISESSYRDSVISRQSTASRTHFITACPDTVN 0

>MEL1_strPur Stronglyocentrotus purpuratus GLEAN3_22851 opsin4 no cdna losing introns, expressed in larval postoral arm first exons missing
1 WTKSLRTPPNMLIVNLAISDFGMVITNFPLMFASTIYNRWLFGDA 1
2 GCQFYAFCGALFGIMSIANMTAIALDR 2
1 YYVICWSLEAVRSVTHRRSMIIIIIVWCYAIFWSIPPFFGVGSYVLEGYGLGCTFDFMTKDLNHYLHV
SFLFASSFVVPVTIIIVCFTRIAITVRAHRHELNKMRTKLTEDKDKKHKSSIRRANKAKTEFQIAKVGFQVTIFYVLSWM
PYSIVAVIGQYFDSDLLTPLGTVVPVIFAKCSAIWNPIIYCLSHEKFNAALKEKLMGMCGIEIPSKHRSMGSQESSVTGR
RGMHRQNSSTLSESSVTSTVDQDAIELKDRKQGPATVKVQQEKVEGGTYRRNPGDVTFSKDAGVEVDEKRRGDQGQRDDR
VRPQGEGQMDQWSQPPPAPASASAPTPGVNDKEYLTKM* 0

>MEL2_strPur Stronglyocentrotus purpuratus cdna: S.droebachiensis DQ285097 retained intron? WLEKMKTTQILHKPVTFLRLKSSFEPRFKPRFKRRF tube feet
0 MPTTLMENSTPGWMADDSQMEETHPAFPLIGGYLLVVVLLGTAGNSLVIYTFLRFKKLHSPINLLIVNLSASDLLVATTG
TPLSMVSSFYGRWLFGTNACAFYGFVNYYCGCISLNSLAAISVFRYIIVVRGQAQNNKLSLRSSIYAILVIHLYTLIFST
PPLYGWNRFVLAGYHTSCDIDFHTKTPLFVSYICYMFFFLFFLPLGLISWSYFKIYQRVSKHSNSMRTSFTGVTKEINSDEKHA 2
1 NHRRTASTLFVTIVVFLFAWFPYCIVSLWVLIGDANSISKLSTTIPSLFAKSSVIYNPLIYVVLNSKFRKALIQTLSFLKCLSKHELSESS* 0
>CILI2_plaDum Platynereis dumerilii (ragworm) Gt 0...2...0.0 indel x x x x 310 aa 000 nm 16311335 CT030681 proto cilliary htgs 5 exons
0 MDDLGFLGNSSVNYTVPLLQEDPLLLRILYFGPTSYVITAIYLCIVGVIGTLSNGVIMYLYFKDKSLRSPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLG 1
2 GLASEMNLFIISVERYLAVVRPFDVGNLTNRRVIAGG 1
2 VFVWLYSLVFAGGPLVGWSSYRPEGLGTWCSISWQDRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE 0
0 AADAQGGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGLPIYAEVLPSLFAKSSQVYNPIIYVLMNKP 0
0 YRSALVSLVCRGRNPFDEAGGTAGGTTAKDETLGKGNKVAAA* 0

>CILI1_plaDum Platynereis dumerilii (ragworm) Gt 0...2...0.0 indel x x x x 355 aa 000 nm 15514158 AY692353 lophotrochozoa ciliary polychaeta new genomic   
0 MDGENLTIPNPVTELMDTPINSTYFQNLNAETDGGNHYIYNAFTATDYNICAAYLFFIACLGVSLNVLVLVLFIKDRKLRSPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLG 1
2 GLAALMTLSVIAFVRCLAVLRLGSFTGLTTRMGVAAMA 1
2 FIWIYSLAFTLAPLLGWNHYIPEGLATWCSIDWLSDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK 0
0 VAKTGGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLLHPVATVIPAMFAKSSTMYNPLIYVFMNKQ 0
0 FRRSLKVLLGMGVEDLNSESERATGGTATNQVAAT* 0
>MEL1_plaDum Platynereis dumerilii (polychaete) Gq 0.0.1.2.2.1.1.1.0.0 indel x x x x 383 aa 000 nm 11874910 AJ316544 rhabdomeric melanopsin unavailable genomically   
0 MSRSEVLVPGSMSLDGLLTTAHPIGNDSI 0
0 ETILHPYWQQFDIENTIPDSWHYAVAAWMTFFGILGVSGNLLVVWTFLK 2
1 TKSLRTAPNMLLVNLAIGDMAFSAINGFPLLTISSINKRWVWGKL 1
2 WRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPLGAAQTMTKKRAFIILTIIWANASLWALAPFFGW 1
2 GAYIPEGFQTSCTYDYLTQDMNNYTYVLGMYLFGFIFPVAIIFFCYLGIVRAIFAHHA 2
1 EMMATAKRMGANTGKADADKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIK 2
1 PHSEMFIHPLLAEIPVMMAKASARYNPIIYALSHPKFR 2
1 AEIDKHFPWLLCCCKPKPKAQLPSSTTKGSIASKTEADTSV* 0

>MEL1_capCap Capitella capitata jgi 119596 distally uncertain shares 2 introns with melanopsins
0 MMSYADGYLDNSTAPIEESPYLPHGTFFHPHWRPYREMLLNMNPLIYYGLGLYMAVVGIVGTLGNLVVITLFIK 2
1 TRSLRTPPNMFIINLALSDMGFCATNGFPLMTVASFQKLWRWGPV 1
2 ACELYALAGSITGFNSIATLALISMDRYMVIAKPFYAMKHVSHKRSLIQIILAWTWAFIWSAPPLLRMGYGRYIP
EGFQVSCTFDYLSRDLKNLIFVWCLFVFGFFIPVLAIACSYVGIIRAVGAQSKEMRKTAEKMGAKTGKSDKEKKQDIAMAKVA
AGTIGLFLMSWTPYAAVSMIGIAGNRSWITPYVSQIPVMFAKASAMWNPILYALSHPKFRAALEDHMPWLLVC

>MEL1_helRob Helobdella robusta (leech)
0 MDSVTWYKDFHPHWWKYHDIIVNAPMAFYYFLGTFFAVVGFLGVFGNIIVVWVFSR 2
1 TPSLRTPSNVLVINLAICDILFSALIGFPMSALSCFQRHWIWGNF 1
2 YCQFYSFVAGITGLASINCLAVIAVDRYLVVGQPLAMLNQSHFRRSFYHVLIIWTWACVWSAMPLIGWGEYILEGFGVSCTFD
YLTRTTWNISFNVCLFTFCFGMPVSVIILSYIGIIRSIAKNRKEFSSLTAENSSRARQEIKIAKVFAVCMTAFILCWVPYAT
VAQLGIYGYDQMVSPYTAELPVMLAKTSALWNPIIYAFSHPKYRKCLKELPIFRQKRKFRFLGSKSHTKSSDGVGTIALTSTINRTATRH* 0

>MEL2_helRob Helobdella robusta (leech) fragmentary model from scaffold_39
1 TPILRTHANVLIINLALCDLIFSSLIGFPMTALSCFKRHWIWGDL 1
2 GCDFYGFVAGWTGLGSITCLAFISIDRYMAIVHPFYMLNKKSSSVLTLLQIGAVWSWALIWSVMPLFGWGRFIPEAFGVSCTFDYLTRTWSNICFNYVLITCGFFLPVVIIVTSYIGIVIEVTKS 1
2 VEEDGMKDHMDAQNARCYVFVAVLSM 0
0 LKTAKVLACCFGAFLICWTPYAIVAQLGINGFAHLVTPFTSEVPVLFAKTSSIWNPLIYALSHPRYRRAV 0

>MEL_schMed Schmidtea (planaria) most like RHAB_plaDum extension of AF112361 
0 eVYHYLVGVYISIVGISGVLGNLLVLYIFAR 2
1 AKSLRTPPNMFIMSLAIGDLTFSAVNGFPLLTISSFNTRWAWGKL 1
2 TCEIYGFIGGLFGFISINTMALISLDRYFVIAQPFQTMKSLTIKRAIIMLVFVWLYSLIWSTPPFFGY 1
2 GNYVPEGFQTSCTFDYLTQSKGNIIFNIGMYIGNFIIPVGIIIFCYYQIVKAVRVHELEMLKMAQKMNASHPTSMKTG 1
2 AKKADVQAAKISVIIVFLYMLSWTPYAIIALMALTGRRDHLNPYTAELPVLFAKTSAMYNPFIYAINHPKFRIQLEKKFPCLICCCPPKPK 0
0 * 0
 
>MEL1_schMan Schistosoma mansoni AF155134 11166392 381 aa 6 exons 0.1.2.2.2.0
0 MKQNLTFATLWPDDNDFASIVHSHWHKFIQPDPLYYYLVGIYIGIVGILAVMGNSLVITLFLL 2
1 CKQLRTPPNMLIVSLAISDFSFALINGFPLKTIAAFNHRWGWGKL 1
2 ACELYGFAGSIFGFISLTTMAFIALDRYLVIVQPFETFSRITYGKVIVMIFITWIWSALWSIPPFFGY 1
2 GSYIPEGFHTSCTFDYLSTDLPNLIFNAGLYILGFLCPVFIIIFSYYQIVKTVRLNELELMKMAQSLDLQNPSAMKTg 1
2 GDKKADIEAAKTSIILVLLYLMSWSPYAIVCLMTLIGSRDSLTPFHSELPVLFAKTSAVYNPIVYAVKHPKFRMEIEKRFPFLICCCPPKPK 0
0 ERLQNTIVSKIQVSQIGIGTVSGGNENTLNTVKRED* 0

>MEL2_schMan Schistosoma mansoni CD096414 12973350 (46%)
0 MSSNRTIEMLRPYMKDFDSIVLPYWYKFEQPNPYYQYAIGLFIAVVGITGMCLNLLVIVFFTM 2
1 FKSLRTPSNILVVNLAISDFGFSAVIGFPLKTMAAFNNFWPWGKL 1
2 ACDLYGLAGGLFGFVSLSTIAAVALDRYLVIATPFESVFQTTPRRTLLLMLFLWMWSLMWTIPPLFGF 1
2 GRYVTEGYQTSCTMDYISTDLNNRLFNIGLFGFGFLCPLFLSLFCYARIILIVRSRGKDFIEMAASSKGTNQKEKSANV 1
2 SSSKSDTFVSKSSAILLGVYLICWTPYSFVCLMALIGYADYITPLMVEIPCLCAKTANPCIYAFRYPKFRSLLQQRFGFLRLTKNRVSY 0
0 ERSQHAILSTIHVVTDCQYGTVSGGNENTLNTFILLD* 0

>MEL3_schMan Schistosoma mansoni Smp_180030 
0 MSEIKNFTRSLLLYNRTFSMIKNNIHDSDIIMLNHWIKYTQPDPIYNYLVAIFVALIGIFGTITNLLVIFVFL 2
1 TPKSSISLQCALIINLAISDFGFSAVIGFPLKTIAAFNQYWPWGSV 1
2 ACQLYGFISATFGFLSLTTIAAISFDRYLVIVKDHKTTNFRVICTVIGFLWIWSIIWTIPPFFGF 1
2 GRYVLEGYQTSCTFDYISNDMPSLLFSGGMYIFGFMFPVLLCIYCYVNLLKIVRNNERVVLISLSNDGASKQRESVR 1
2 NRKRLDIEATKSVILSLLFYLMSWTPYAMVCLISILGQSYFLTPTIAEMPHIFAKMAAIYNPILYAFTNRKFKNALGIRKTSSVIMQQQRLLSKGQLKPLVSLLFLVN* 0

>MEL1_lotGig Lottia gigantea (limpet) FC774055 ests long tail omitted 3 exons best:molluscan melanopsins
0 MTTLPPKENTTHLFDTWDDMFVHPHWKNFPPVSAAWHNFIGIFITFVGITGVIGNFVVIYTFSR 2
1 TKSLRTASNMFVVNLALSDLTFSAVNGFPLFSLSSFSHKWIFGRV 1
2 ACELYGLIGGIFGLMSINTMAMISIDRYLVITSPFTAMRNMTHKRAFLMIVGVWIWSILWAIPPIFGWGAYIPEGFQTSCTFDYLTRGDNRRSYIMCLYICGFVVPLGVIIFCYVFIIKSVMNHEKE
MAKMADKLDAKDVRSTKEKAKAEIKIAKVSMTIILLYLMSWTPYAIVALIAQWGPALVVTPYVSEIPVLFAKASAMHNPVIYALSHPKFRDAVSKLMPWFLCCCGLTDAEKKARDELAKSKFTRQT

>MEL1_sepOff Sepia officinalis (octopus) Go? AF000947 492 Mollusca Cephalopoda complete 
MGRDIPDNETWWYNPTMEVHPHWKQFNQVPDAVYYSLGIFIGICGIIGCTGNGIVIYLFTK 2
1 TKSLQTPANMFIINLAFSDFTFSLVNGFPLMTISCFIKKWVFGMA 1
2 ACKVYGFIGGIFGLMSIMTMSMISIDRYNVIGRPMAASKKMSHRRAFLMIIFVWMWSTLWSIGPIFGWGAYVLEGVLCNCSFDYITRDSATRSNIVCMYIFAFCFPILIIFF
CYFNIVMAVSNHEKEMAAMAKRLNAKELRKAQAGASAEMKLAKISIVIVTQFLLSWSPYAVVALLAQFGPIEWVTPYAAQLPVMFAKASAIHNPLIYSVSHPKFREAIAENFPWII
TCCQFDEKEVEDDKDAETEIPATEQSGGESADAAQMKEMMAMMQKMQQQQAAYPPQGAYPPQGGYPPQGYPPPPAQGGYPPQGYPPPPQGYPPAQGYPPQGYPPPQGAPPQGAPPQ
AAPPQGVDNQAYQA* 0

>MEL1_todPac Todarodes pacificus (squid) Go? X70498 480 11106382 Mollusca Cephalopoda complete 'squid rhodopsin'
MGRDLRDNETWWYNPSIVVHPHWREFDQVPDAVYYSLGIFIGICGIIGCGGNGIVIYLFTK 2
1 TKSLQTPANMFIINLAFSDFTFSLVNGFPLMTISCFLKKWIFGFA 1
2 ACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPMAASKKMSHRRAFIMIIFVWLWSVLWAIGPIFGWGAYTLEGVLCNCSFDYISRDSTTRSNILCMFILGFFGPILIIFF
CYFNIVMSVSNHEKEMAAMAKRLNAKELRKAQAGANAEMRLAKISIVIVSQFLLSWSPYAVVALLAQFGPLEWVTPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVL
TCCQFDDKETEDDKDAETEIPAGESSDAAPSADAAQMKEMMAMMQKMQQQQAAYPPQGYAPPPQGYPPQGYPPQGYPPQGYPPQGYPPPPQGAPPQGAPPAAPPQGVDNQAYQA* 0

>MEL1_entDof Enteroctopus dofleini Go? X07797 475 Mollusca Cephalopoda complete
MVESTTLVNQTWWYNPTVDIHPHWAKFDPIPDAVYYSVGIFIGVVGIIGILGNGVVIYLFSK 2
1 TKSLQTPANMFIINLAMSDLSFSAINGFPLKTISAFMKKWIFGKV 1
2 ACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPMAASKKMSHRRAFLMIIFVWMWSIVWSVGPVFNWGAYVPEGILTSCSFDYLSTDPSTRSFILCMYFCGFMLPIIIIA
FCYFNIVMSVSNHEKEMAAMAKRLNAKELRKAQAGASAEMKLAKISMVIITQFMLSWSPYAIIALLAQFGPAEWVTPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWL
LTCCQFDEKECEDANDAEEEVVASERGGESRDAAQMKEMMAMMQKMQAQQAAYQPPPPPQGYPPQGYPPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQGVDNQAYQA* 0

>MEL_aplCal Aplysia californica (sea hare) Gq-coupled 4 exons melanopsin AASC01108363 uncertainties
0 MNVSSSLTSQPYHELLHPHWLEHEEAPEGVHLSVGVFITLVGVLAVCGNSLVIITCIR 2 
1 FKDLRTRSNILIINLAVGDLLMCLIDFPLLAAASFYGEWPYGRQ 1
2 VCQMYAFLTAIAGLVTINTLAVIAADRYWAVVRRPTPGQKLPKCVTSIAVASVWAYSISWALCPILGWGAYVLDGIRTTCTFDFLTRTWENRSFVIGMMIGNFVLPFALMVFSYFRIWVAVRKVKSG 2
1 NVFCAIRHNYNLALGSTLFVKQHRYRLHCEQKTVKIIMFLLIAFTVSWSPYLAVSIIGLFGDRSQLTYQNTLTASLIAKTSMVFNPILYSISHPKVRKRIANLACCYSVRRHQQQTSRIKTGRRSTSSATPSRS* 0

>MEL2_lotGig most like MEL_entDof e-60 84/222 (37%) 338 aa Gq-coupled
0 MSIASHVWTNSSTNHFNFSVLHQHWQNQTPLSTACQYTIGIFISTVAVIAVIGNSIVIWAHVR 2
1 IKSLSTTSNMLILNLCVGCLIMCIVDFPLYATSSFLQKWIFGHK 1
2 VCEIYATITGTAGLLIMNSYSAIAFDRFITVTRYNNPNYPRSKSATMCISGFVWIYSLSWSMAPVVGWSRYQLDGSGTT
CTFDYLSTTWTNRSFILSIAFFNFVLPLCFILFAYSRILHLISSHSREMKSYRSAVIISKGKASIPKRFRSERKTAITLLI
TVVVFCLSWVPYVIIALIGQFGNQSFITPQISVIPQLVAKLSTVTNPILYSLSHPVVRNKLFLRLRHELYRRPSDSVSSSRGIQMKNIEFI* 0

>MEL1_patYes Patinopecten yessoensis Gq 9287291 AB006454 scop1 49% MOLL_RHO_entDof then MEL scallop retina
0 MADNKSTLPGLPDINGTLNRSMTPNTGWEGPYDMSVHLHWTQFP
PVTEEWHYIIGVYITIVGLLGIMGNTTVVYIFSN 2
1 TKSLRSPSNLFVVNLAVSDLIFSAVNGFPLLTVSSFHQKWIFGSL 1
2 FCQLYGFVGGVFGLMSINTLTAISIDRYVVITKPLQASQTMTRRKVHLMIVIVWVLSILLSIPPFFGWGAYIPEGFQTSCTFDYLTKTARTRTYIVVLYLFGFLIPLIIIGVC
YVLIIRGVRRHDQKMLTITRSMKTEDARANNKRARSELRISKIAMTVTCLFIISWSPYAIIALIAQFGPAHWITPLVSELPMMLAKSSSMHNPVVYALSHPKFRKALYQRVPWLF
CCCKPKEKADFRTSVCSKRSVTRTESVNSDVSSVISNLSDSTTTLGLTSEGATRANRETSFRRSVSIIKGDEDPCTHPDTFLLAYKEVEVGNLFDMTDDQNRRDSNLHSLYIPTRVQHRPTTQSLGTTPGGVYIVDNGQRVNGLTFNS* 0
>ENCEPH_apiMel Apis mellifera (bee) Gt 0...2...0.0 indel x x x x 329 aa 000 nm 16291092 NM_001039968 encephalopsin ciliary Gt pteropsin clock   
0 MSLNRSTMEHVIYEDQVSPVMYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILWTPVNVILFNLV 0
0 FGDFLVSIFGNPVAMVSAATGGWYWGYKMCLW 2
1 YAWFMSTLGFASIGNLTVMAVERWLLVARPMQALSIR 2
1 HAVILASFVWIYALSLSLPPLFGWGSYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKKVRKRA 1
2 GASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFN 0
0 AKPSATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRTAVPDSQHTALTALNRQEQRK* 0

>ENCEPH1_anoGam Anopheles gambiae (mosquito) Gt 0...2...0.0 indel x x x x 461 aa 000 nm no_ref XM_312503 encephalopsin GPROP11 adjacent head-to-head tandem GPROP12   
0 MYDVTDAAAINSDHQELMAPWAYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ 0
0 LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSICVAYGFFMSLL 1
2 GIASITTLTVLSYERFCLISRPFAAQNRSKQGACLAVLFIWSYSFALTSPPLFGWGAYVNEAANIS 2
1 CSVNWESQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK 0
0 NSARVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFWRIRRSNGVAGQPDSNNTNNSNRDKESARHTAKEGL
ECSLDFCHWTVRGTRVSISSAERNVPAPAARERSGGHSVTGSREESRDRHVTLKTMLSVGPRSPSSVAPVAADCSTTDVPTSGDGSVRIVRQDSELSVIHDGGGGGGGSSSRVLVIKSQKPRSNML* 0

>ENCEPH2_anoGam Anopheles gambiae (mosquito) Gt 0...2...0.0 indel x x x x 434 aa 000 nm no_ref XM_312502 encephalopsin GPROP12   
0 MNDAPNDVAASAVDYEDLMAPWAYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ 0
0 LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTLCVAYGFFMSLL 1
2 GITSITTLTVLSYERYCLISRPFSSRNLTRRGAFLAIFFIWGYSFALTSPPLFGWGAYVQEAANIS 2
1 CSVNWESQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE 0
0 NSARVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFSRVRNKGQQAAADQNTTTMQRELTKSSRDMVECSF
DFCRKKSRFKISLVKPTAPLAVVDVSSTSHRDKGTSRSPLDQTVLNETNEDVGRERSGGGGGGGAYAGTRFVRPDFELSVINSGKSILIKSKNFRSNLL* 0

>ENCEPH_aedAeg Aedes aegypti opsin XM_001650752 genome frameshifted
0 MESWAYVASAVTLFFIGFFGFFLNLFVIALMCKDVQ 0
0 LWTPINIILFNLVCSDFSVSIIGNPFTLTSAISRHWIFGRTVCIAYGFFMSLL 1
2 GITSITTLTVLSYERFCLISHPFSSRSLSRRGAVFAILFIWSYSFALTSPPLFGWGAYVNEAANIS 2
1 CSVNWESQTLNATSYIIFLFVFGLVVPLVVIVYSYTNIVVNMKR 0
0 NAARVGRINRAEKRVTRMVFVMVLAFMIAWTPYAVFALIEQFGPTDIISPALGVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRNNESVDNNTITNQKDITMNTSK
EIVECSFDFCRKKRLKIKLQSNAKSKNNNNNSRNQSIADPSSTSNGDDLDQPSPAQTVLNSTVANSGSVASFGPKKRLRSDFELSVISSGKSILIKSNTFRSNLV* 0

>ENCEPH_culPip Culex pipiens
0 MPPWAYVATAVVLFFIGFFGFFLNLFVIALMCKEVQV 0
0 LWTPMNIILLNLVCSDFSVSIVGNPFTLSSAISHRWLFGRKLCVAYGFFMSLL 1
2 GITSITTLTVLSYERFYLISRPFSSRSLSRRGALGAVLLIWCYSFALTSPPLFGWGAYVNEAANIS 2
1 CSVNWETQTLNATTYIIYLFVFGLVVPLTVIVYSYTNIIVNMKK 0
0 NAARVGRINRAEKRVTTMVAVMVIAFMVAWTPYSVFALMEQFGPPDVIGPGLAVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRHDPGDMANTTTNQKELT
RSSRDVASVDCSFDFCRKKNRLKMKLHNSIHRSAAIKAGRSQSPHSESERSSSGATQERSTRVQTMLNATVDSRSISGSTGKLMKSDFELSVINSGKSILIKSNTFRSNLV* 0

>ENCEPH_triCas Tribolium castaneum (60%)55 298
0 MKNFNSTEIGDELLIPVEGYIAAAVVLFCIGFFGFSLNLTVIIFMLKERQ 0
0 LWSPLNIILFNLVVSDFLVSVLGNPWTFFSAINYGWIFGETGCTIYGFIMSLL 1
2 SITSITTLTVLAFERYLLIARPFRNNALNFHSAALSVFSIWLYSLSLTIPPLIGWGEYVHEAANLS 2
1 CSVNWEEKSPNSTSYILYLFAFGLFLPLVIITFSYVNIILTMRR 0
0 NAAFRVGQVSKAENKVAYMIFIMIIAFLTAWSPYAIMALIVQFGDAALVTPGMAVIPALLAKSSICYNPVIYIGLNAQVKGAKWVSGLIYLFQFQQAWMQKWKKNRR
GSDALGTSRVMLETIHQACRDEKTDKLLEKKTKFCKDFETDVSML* 0

>ENCEPH_bomMor Bombyx mori moth 
0 0
0 LWTPLNIILFNLVCSDFSVSVLGNPFTLISALFHRWIFGHTMCVLYGFFMALL 1
2 GITSITTLTVISFERYLMVTRPLTSRHLSSKGAVLSIMFIWTYSLALTTPPLLGWGNYVNEAANIQ 2
1 CSVNWHEQSTNTLTYIMFLFAMGQILPLSVITFSYVNIIRTLKR 0
0 NSQRLGRVSRAEARATAMVFIMIIAFTVAWTPYSLFALMEQFATEGIVSVLKV* 0

>ENCEPH_dapPul Daphnia pulex 2nd exon proximal uncertain KFLYVQNVNSCDSTISFFWTEYRT TNDSVSSEIKFYAGT retained intron?
0 MPVWVYWSASAYLLFISIAGLFMNIVVVVIILNDSQVI 0
0 PISAAAALKFTWPFSHELCVAYAMIMSTA 1
2 IGSITTLTVLALWRCQHVVWCPTNRNSNFTDPNGRLDRRQGALLLTFIWTYTLIVTCPPLFGWGRYDREAAHIS 2
1 CSVNWESKMDNNRSYILYMFAMGLFIPLMAIFVSYISILLFIHK 0
0 SQQTSNNSDTVEKRVTFMVAVMIGAFLTAWTPYSIMALVETFTGDNV ISPAVATVPSLFAKTSAVLNPLIYGLLNTQVGSAD* 0
>CHEL_LWS_limPol Limulus polyphemus L03781 520 Arthropoda Chelicerata lateral_eye complete genFut
MANQLSYSSLGWPYQPNASVVDTMPKEMLYMIHEHWYAFPPMNPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFSDFCMMAFMMP
TMTSNCFAETWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGMAAAPLTHKKATLLLLFVWIWSGGWTILPFFGWSRYVPEGNLTSCTVDYLTKDWSSASYVVIYGLA
VYFLPLITMIYCYFFIVHAVAEHEKQLREQAKKMNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIISWAGVFSSGTRLTPLATIWGSVFAKANSCYNPIVYGISHPR
YKAALYQRFPSLACGSGESGSDVKSEASATTTMEEKPKIPEA

>CHEL_LWS_ixoSca Ixodes scapularis ocellar TC19272 UP|OPSO_LIMPO (P35361) (57%)
0 MGSEGQRTNMSLLDELASPYMKNGTLVESVPDEMLYMVHPHWYNFKPMNPLWHSLLGFAMVILGVISVVGNSMVIYIMTTSKSLRSPTNMLVVNLAFSDW 2
1 CMMAFMMPTMAANCFAETWILGPFMCEVYGMVGSLFGCGSIWSMVMITLDRYNVIVRGVAAAPLTHKRAALMIFFVWFWALTWTLLPFFGWSR 2
1 YVPEGNMTSCTIDYLTKALWSASYVVAYAGGVYWTPLFINIYCYSKIVRAVAQHEKQLRLQARKMNVASLRANAEQTKTSAEARLAK 0 
0 IALMTVGLWFMAWTPYLTIAWAGIFSDGSKLTPLATIWGSVFAKANACYNPIVYGISHPKYRAALARRFPSLVCMPPGGDQLDTRSEASGITTIEDKVMTTET* 0

>INSE_LWS1_apiMel Apis mellifera (bee) Gq 0.1.0.0.1.0.0.1 indel x x x x 386 aa 000 nm 16291092 NM_001077825 rhabdomeric Lop2 long wavelength ocelli   
0 MDTLNITTSFFIEVMPSNISTLTTTGPQFARQLMRFNNQTVVSKVPEEMLHLIDLYW 2
1 YQFPPLDPLWHKILGLVMIILGIMGWCGNGVVVYVFIMTPSLRTPSNLLVVNLAFSDFIMMGFMCPPMVICCFYETW 0
0 VLGSLMCDIYAMVGSLCGCASIWTMTAIALDRYNVIVK 0
0 GMSGTPLTIKRAMLQILGIWLFGLIWTILPLVGWNR 2
1 YVPEGNMTACGTDYLSQDWTFKSYILVYSFFVYYTPLFTIIYSYYFIVS 0
0 AVAAHEKAMKEQAKKMNVTSLRSGDNQNTSAEAKLAK 0
0 VALTTISLWFMAWTPYLVINYIGIFNRSLITPLFTIWGSLFAKANAIYNPIVYGIS 2
1 HPKYRAALKEKLPFLVCGSTEDQTAATAGDKASEN* 0

>INSE_LWS2_apiMel Apis mellifera U26026 529 5 exonsArthropoda Insecta 540 complete genNow
0 MIAVSGPSYEAFSYGGQARFNNQTVVDKVPPDMLHLIDANWYQYPPLNPMWHGILGFVIGMLGFVSVMGNGMVVYIFLSTKSLRTPSNLFVINLAISDFLMMFCMSPPM 0
0 VINCYYETWVLGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGLSGKPLSINGALIRIIAIWLFSLGWTIAPMFGWNR 2
1 YVPEGNMTACGTDYFNRGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKNMREQAKKMNVASLRSSENQNTSAECKLAK 0
0 VALMTISLWFMAWTPYLVINFSGIFNLVKISPLFTIWGSLFAKANAVYNPIVYGIS 2
1 HPKYRAALFAKFPSLACAAEPSSDAVSTTSGTTTVTDNEKSNA* 0

>INSE_LWS_triCas Tribolium castaneum (red flour beetle) ES544655 3 exons from AAJJ01000967 5 fusion relative to bee
0 MSVMGEPNFIAWAAQRSGYGGGNLTVVDKVLPDMLHLVDAHWYQFPPMNPLWHGILGFVIGVLGFVSIVGNGMVIYIFSSTKALRTPSNLL
VVNLAFSDFLMMXCMSPAMVINCYNETWVLGPLVCELYGMSGSLFGCASIWTMTFIALDRYNVIVKGLSAQPLTKKGAMLRILIIWVFSTLW
TIAPFFGWNRYVPEGNMTACGTDYLTKDWVSRSYILVYAVWVYFVPLFTIIYSYWFIVQ 0
0 AVAAHEKSMREQAKKMNVASLRSSEAAQTSAECKLAKIALMTITLWFFAWTPYLVTNFTGIFEGAKISPLATIWCSLFAKANAVYNPIVYGIS 2
1 HPKYRQALQKKFPSLVCAGEPDDTTSTASGVTNVTTDEKPATA* 0

>INSE_LWS_papXut Papilio xuthus AB007424 520 Arthropoda Insecta Rh2 complete 
0 MAIANLEPGMGASEAWGGQAAAFGSNQTVVDKVTPDMMHLIDPHWYQFPPMNPMWHGLLGFTIGVLGFISITGNGMVVYIFTSTKSLKTPSNLLVVNLAFSD
FLMMLCMAPPMLINCYYETWVFGPLACELYACAGSLFGSISIWTMTMIAFDRYNVIVKGIAAKPMTINGALLRILGIWLFSLAWTIAPMLGWNRYVPEGNMTACGTDYLSKSWLSR
SYILVYSIFVYYTPLLLIIYSYFFIVQAVAAHEKAMREQAKKMNVASLRSSEAANTSAECKLAKVALMTISLWFMAWTPYLVINYTGVFETAPISPLATIWGSVFAKANAVYNPIV
YGISHPKYRAALYQKFPSLACQPSAEETGSVASGATTACEEKPSA* 0

>INSE_LWS_manSex Manduca sexta L78080 520 Arthropoda Insecta White complete genFut
0 MDPGPGLAALQAWAAKSPAYGAANQTVVDKVPPDMMHMIDPHWYQFPPMNPLWHALLGFTIGVLGFVSISGNGMVIYIFMSTKSLKTPSNLLVVNLAFSDFL
MMCAMSPAMVVNCYYETWVWGPFACELYACAGSLFGCASIWTMTMIAFDRYNVIVKGIAAKPMTSNGALLRILGIWVFSLAWTLLPFFGWNRYVPEGNMTACGTDYLSKSWVSRSY
ILIYSVFVYFLPLLLIIYSYFFIVQAVAAHEKAMREQAKKMNVASLRSSEAANTSAECKLAKVALMTISLWFMAWTPYLVINYTGVFESAPISPLATIWGSLFAKANAVYNPIVYG
ISHPKYQAALYAKFPSLQCQSAPEDAGSVASGTTAVSEEKPAA* 0

>INSE_LWS_bomTer Bombus terrestris AY485301 529 Arthropoda Insecta partial 
 YQFPPLNPMWHGILGFVIGLLGFISVSGNGMVVYIFLSTKSLRTPSNMFVINLAISDFLMMFCMSPPMVINCYYETWVLGPLFCQVYAMLGSLFGCGSIWTM
TMIAFDRYNVIVKGLSGKPLTINGALLRILGIWLFSLIWTIAPMFGWNRYVPEGNMTACGTDYFSKDIVSVSYILLYSIWVYFFPLFLIIWSYWFIXQAVAAHEKNMREQAKKMNV
ASLRSSENQNTSAECKLAKVALMTISLWFMAWTPYLVINWSGIFSLVKISPLYTIWGSLFAKANAV* 0

>INSE_LWS_catBom Cataglyphis bombycinus U32501 510 Arthropoda Insecta complete 
0 MMSIASGPSHAAYTWTAQGGGFGNQTVVDKVPPEMLHLVDAHWYQFPPMNPLWHAILGFVIGILGMISVIGNGMVIYIFTTTKSLRTPSNLLVINLAISDFL
MMLSMSPAMVINCYYETWVLGPLVCELYGLTGSLFGCGSIWTMTMIAFDRYNVIVKGLSAKPMTINGALLRILGIWFFSLGWTIAPMFGWNRYVPEGNMTACGTDYLTKDLLSRSY
ILVYSFFCYFLPLFLIIYSYFFIIQAVAAHEKNMREQAKKMNVASLRSAENQSTSAECKLAKVALMTISLWFMAWTPYLVINYAGIFETVKINPLFTIWGSLFAKANAVYNPIVYG
ISHPKYRAALFQRFPSLACSSGPAGADTLSTTTTVTEGTEKPAA* 0

>INSE_LWS_pieRap Pieris rapae AB177984 540 Arthropoda Insecta complete 
0 MAITNLDPAPGVAAMQSFGIHAEAFGSNQTVIDKVLPEMMHLIDPHWYQFPPLNPLWHALLGFTISVLAFISITGNGMVVYIFTTTKSLKTPSNLLVVNLAF
SDFLMMAMMAPPLVVNSYNETWVFGPTACQFYACFGSLFGCVSIWTMTAIAFDRYNVIVKGIAAKPMTINSALLRILGVWLFSLAWTLAPIFGWSRYVPEGNMTACGTDYLSKDWA
SRSYIILYAIACYFLPLFLIVYSYWFIVQAVAAHERAMREQAKKMNVASLRSSEQANTSAECKLAKVALMTISLWFMAWTPYLVINFAGVFETSPISPLSTIWGSVFAKANAVYNP
IVYGISHPKYRAALYQRFPALACQPSPAEETGSVASAATACTEEKPSA* 0

>INSE_LWS_vanCar Vanessa cardui AF385333 530 Arthropoda Insecta complete 
0 MAITSLDPGAAALQAWGGQMAAFGSNETVVDKVLPDMLHLVDPHWYQFPPMNPLWHGLLGFVIGILGFISITGNGMVIYIFTTTKSLKTPSNILVVNLAFSD
FLMMCVMSPPMVVNCYTETWVFGPLACQLYACAGSLFGCASIWTMTMIAFDRYNVIVKGIAAKPLTINGAMLRVLGIWVFSLAWTVAPLFGWGRYVPEGNMTACGTDYLDKSWFNR
SYILIYSIFCYFSPLFLIIYSYFFIVQAVAAHEKAMREQAKKMNVASLRSSDAANTSAECKLAKVALMTISLWFMAWTPYLVINYAGIFETATITPLATIWGSVFAKANAVYNPIV
YGISHPKYRAALYARFPALACQPSPEDNASVASAATATEEKPSA* 0

>INSE_LWS_helSar Heliconius sara AF126753 550 Arthropoda Insecta partial 
 HQFPPMNPLWHGLLGFVIGVLGFISVTGNGMVVYIFTTTKSLKTPSNILVVNLAFSDFLMMFMMAPPMVINCYNETWVFGPLACQLYACAGSLYGCVSIWTM
TMIAFDRYNVIVKGIAAKPMTINGALLRVFGIWAFSLAWTIAPLFGWGRYVPEGNMTACGTDYFDQSFSNRSYILLYSIACYYAPLFLIIYSYFFIVQAVAAHEKAMREQAKKMNV
ASLRSSDAANTSAECKLAKVALMTISLWFMAWTPYLVINYAGIFKTMT* 0

>INSE_LWS_schGre Schistocerca gregaria X80071 520 Arthropoda Insecta complete 
0 MASASLISEPSFSAYWGGSGGFANQTVVDKVPPEMLYLVDPHWYQFPPMNPLWHGLLGFVIGVLGVISVIGNGMVIYIFSTTKSLRTPSNLLVVNLAFSDFL
MMFTMSAPMGINCYYETWVLGPFMCELYALFGSLFGCGSIWTMTMIALDRYNVIVKGLSAKPMTNKTAMLRILFIWAFSVAWTIMPLFGWNRYVPEGNMTACGTDYLTKDWVSRSY
ILVYSFFVYLLPLGTIIYSYFFILQAVSAHEKQMREQRKKMNVASLRSAEASQTSAECKLAKVALMTISLWFFGWTPYLIINFTGIFETMKISPLLTIWGSLFAKANAVFNPIVYG
ISHPKYRAALEKKFPSLACASSSDDNTSVASGATTVSDEKSEKSASA* 0

>INSE_LWS_pedHum Pediculus humanus AAZO01003609 73% INSE_LWS_schGre INSE_LWS_triCas 
0 MVKSQELLMVRPTQNHHFWSQGLNFGNQTVVDKVLPEMLHMIDPHW 2 tg 
1 FPPMNSLWHGILGFVIGVLGAVSLIGNGVVIYIFTTTKNLRTPSNLLVVNLAFSDFFMMFCMSPIMVVNCYNETWIMG 2
1 GPFMCEIYGLLGSLFGNASIWTMTMIALDRYNVIVKG 1
2 GLSAKPMTIKMALLNILFVWIQALLWTILPLFGWNR 2
1 YVPEGNMTACGTDYLTKDWFSRSYILVYSIFVYFLPLFIIIYSYIFIIQ 0
0 AVIDHENNMRMQAKKMEVASLRSQDDKKKSVEIKLAK 0
0 IALMTIALWFFAWTPYLVINYAGIFETLTISPLFTIWGSVFAKANAVYNPIVYGIS 2
1 HPKYRAALKKKFPSLVCGTLEDDHTSAASGITTCNEENTTKV* 0

>INSE_LWS_droMel Drosophila melanogaster Z86118 508 Arthropoda Insecta Rh6 2 exons complete genNow contains frameshift
0 MASLHPPSFAYMRDGRNLSLAESVPAEIMHMVDPYWYQWPPLEPMWFGIIGFVIAILGTMSLAGNFIVMYIFTSSKGLRTPSNMFVVNLAFSDFMMMFTMFPPVVLN
GFYGTWIMGPFLCELYGMFGSLFGCVSIWSMTLIAYDRYCVIVKGMARKPLTATAAVLRLMVVWTICGAWALMPLFGWNRYVPEGNMTACGTDYFAKDWWNRSYIIVYSLW
VYLTPLLTIIFSYWHIMKAVAAHEKAMREQAKKMNVASLRNSEADKSKAIEIKLAKVALTTISLWFFAWTPYTIINYAGIFESMHLSPLSTICGSVFAKANAVCNPIVYGL 0
0 SHPKYKQVLREKMPCLACGKDDLTSDSRTQATAEISESQA* 0

>INSE_MWS_droMel Drosophila melanogaster NM_079683 478 5 exons Arthropoda Insecta Rh1 ninaE complete genNow
0 MES 2
1 FAVAAAQLGPHFAPLSNGSVVDKVTPDMAHLISPYWNQFPAMDPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS
DFGIMITNTPMMGINLYFETWVLGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAGRPMTIPLALGKIAYIWFMSSIWCLAPAFGWSR 2
1 YVPEGNLTSCGIDYLERDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIA 0
0 AVSAHEKAMREQAKKMNVKSLRSSEDAEKSAEGKLAKVALVTITLWFMAWTPYLVINCMGLFKFEGLTPLNTIWGACFAKSAACYNPIVYGIS 2
1 HPKYRLALKEKCPCCVFGKVDDGKSSDAQSQATASEAESKA* 0

>INSE_MWS_calEry Calliphora erythrocephala M58334 490 Arthropoda Insecta Rh1 complete 
0 MERYSTPLIGPSFAALTNGSVTDKVTPDMAHLVHPYWNQFPAMEPKWAKFLAAYMVLIATISWCGNGVVIYIFSTTKSLRTPANLLVINLAISDFGIMITNT
PMMGINLFYETWVLGPLMCDIYGGLGSAFGCSSILSMCMISLDRYNVIVKGMAGQPMTIKLAIMKIALIWFMASIWTLAPVFGWSRYVPEGNLTSCGIDYLERDWNPRSYLIFYSI
FVYYLPLFLICYSYWFIIAAVSAHEKAMREQAKKMNVKSLRSSEDADKSAEGKLAKVALVTISLWFMAWTPYTIINTLGLFKYEGLTPLNTIWGACFAKSAACYNPIVYGISHPKY
GIALKEKCPCCVFGKVDDGKASDATSQATNNESETKA* 0

>INSE_MWS_droMel Drosophila melanogaster M12896 420 3 exons 381 aa Arthropoda Insecta Rh2 complete genNow
0 MERSHLPETPFDLAHSGPRFQAQSSGNGSVLDN 0
0 VLPDMAHLVNPYWSRFAPMDPMMSKILGLFTLAIMIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFSDFCMMASQSPVMIINFYYETWVLGPLWCDIYAGCGSLFGCVSIWSMC
MIAFDRYNVIVKGINGTPMTIKTSIMKILFIWMMAVFWTVMPLIGWSAYVPEGNLTACSIDYMTRMWNPRSYLITYSLFVYYTPLFLICYSYWFIIAAVAAHEKAMREQAKKMNVKSL
RSSEDCDKSAEGKLAKVALTTISLWFMAWTPYLVICYFGLFKIDGLTPLTTIWGATFAKTSAVYNPIVYGIS 2
1 HPKYRIVLKEKCPMCVFGNTDEPKPDAPASDTETTSEADSKA* 0

>CRUS_LWS_meoOer Neogonodactylus oerstedii DQ646869 489 Arthropoda Crustacea Rh1 complete 
0 MSYWNSNKIVEEYSLPSTNPYGNFTVVDTVPENMLHMIHSHWYQFPPLNPMWYGILAFVVTVVGLCSICGNFVVIWVFMNTKALRSPANTLVVSLAVSDFIM
MACMFPPLVLNCYWGTWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGISGTPLSQKNTTLQVLFVWICSIMWCVFPFFGWNRYVPRGDMTACGTDYLTEDEFSRSYL
YVYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKKMGVKSLRTEEAKKTSAECRLAKVALTTVSLWFMAWTPYLIINWAGMFYPSVVSPLFSIWGSVFAKANAVYNPIVYAI
SHPKYRAALYKKLPCLACSTESADEGSATNSATTTTAEKYESA* 0

>CRUS_LWS_neoOer Neogonodactylus oerstedii DQ646871 522 Arthropoda Crustacea Rh3 complete 
0 MSYWNSNKAMEEYSLPSTNPYGNFTVVDTVPENMLHMVHSHWYQFPPLNPMWYGILAFVVTVVGLCSICGNFVVIWVIMNTKALRSPANTLVVSLAVSDYIM
MTCMFPPLVLNCYWGTWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGVSGKPLSQKNATLQVLFVWICSIMWCVFPFFGWNRYVPEGNMTACGTDYLTEDEFSRSYL
YIYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKKMGVKSLRTEEAKKTSAGCRLAKVALTTVSLWFMAWTPYLIINWAGMFYPSVVSPLFSIWGSVFAKSNAVYNPIVYAI
SHPKYRAALYKKLPCLACSTESADEGSATNSTTTATAEKYESA* 0

>CRUS_LWS_eupSub Euphausia superba DQ852576 487 Arthropoda Crustacea DQ852580 partial 
0 MNPLWYGLLGFVIFCLGCLSVFGNSVVIWVFTSTKTLRSPANMLVVNLALSDFLMMANMSPPTVHSCYHGTWMLGPTYCEYYALVGSLSGCISIWTMVWITL
DRYNVIVKGVAATPLTNKGAFARNIFSWLSALIWCVSPLYGWNRYVPEGNMTACGTDYLTDDWLSHSYLYAYTFWVYLFPFFIIVYCYTYIVSAVFAHEKGMRDQAKKMGVKSLRN
EEAQKTSAECRLAKVALVTVSLWFIAWTPYCVINVTGMWDKTKITPLFTIWGSL* 0

>CRUS_LWS_homGam Homarus gammarus DQ852587 515 Arthropoda Crustacea DQ852590 
0 MNPLWYGLLALWMFVMGTLSVCGNSIVIWVFMNTKALRTPANLLVVNLAISDFLMMFCMCPPLLINCYYQTWVWGAFACEVYGCIGSTVGTCSIFCMVFITM
DRYNVIVKGVSATPLTTNGAMLRNLFSWVTSIGWCLPPFFGFNAYVPEGNLIACGTDYLKESVPYHVYLYLYSVWCYFLPLVIIVYCYTYIVAAVSAHERQMREQAKKMGVKSLRS
EESKKTSNECRLAKVALTTVSLWFIAWTPYLIINWAGMINKPSVSPLLTI* 0

>CRUS_LWS_camLud Cambarus ludovicianus AF003543 529 Arthropoda Crustacea partial 
 LHMIHLHWYQYPPMNPMMYPLLLVFMLITGILCLAGNFVTIWVFMNTKSLRTPANLLVVNLAMSDFLMMFTMFPPMMITCYYHTWTLGATFCEVYAFLGNLC
GCASIWTMVFITFDRYNVIVKGVAGEPLSTKKASLWILTVWVLSFTWCVAPFFGWNRYVPEGNLTGCGTDYLSEDILSRSYLYIYSTWVYFLPLAITIYCYVFIIKAVAAHEKGMR
DQAKKMGIKSLRNEEAQKTSAECRLAKIAMTTVALWFIAWTPYLLINWVGMFARSYLSPVYTIWGYVFAKANAVYNPIVYAIS* 0

>CRUS_LWS_proMil Procambarus milleri AF003546 522 Arthropoda Crustacea partial 
 LHMIHLHWYQYPPMNPMMYPLLLIFMLFTGILCLAGNFVTIWVFMNTKSLRTPANLLVVNLAMSDFLMMFTMFPPMMVTCYYHTWTLGPTFCQVYGFLGNLC
GCASIWTMVFITFDRYNVIVKGVAGEPLSTKKASLWILIVWVLSLAWCMAPFFGWNRYVPEGNLTGCGTDYLSEDILSRSYLYIYSTWVYFLPLTITIYCYVFIIKAVAAHEKGMR
DQAKKMGIKSLRNEEAQKTSAECRLAKIAMTTVALWFIAWTPYLLINWVGMFARSYLSPVYTIWGYVFAKANAVYNPIVYAIS* 0

>CRUS_LWS_arcGre Archaeomysis grebnitzkii DQ852573 496 Arthropoda Crustacea DQ852575 partial 
0 MNPLWYGLLGFVIFCLGILSVCGNAVVIWVFMNTKSLRSPANLLVVNLAFSDFLMMLNMFPPMVHSCYHGTWMLGAFFCEFYGFTGSLFGCISIWTMVFITM
DRYNVIVKGVAAEPLTSKGASIRILFVWTVAFAWTILPFFGWNRYVPEGNLTACGTDYLTEDSTSHLYLYMYASWAYYTPLLYIIYAYTFIVQAVSAHEKGMREQAKKMGVKSLRN
EEAQKTSAECRLAKVALMTVSLWFMAWTPYMIINFTGMNDRTKLTPLCTIWGSL* 0

>CRUS_LWS_holCos Holmesimysis costata DQ852581 512 Arthropoda Crustacea DQ852586 partial 
0 MNPLWYGLLGFWMTVMGTLSVAGNFVVIWVFMNTKSLRTPANLLVVNLAISDFFMMLTMTPPLLANAYWGTWILGAFFCEVYAFLGSFFGCVSIWSMVFITA
DRYNVIVKGVSAEPLTSGGAMMRIAGTWAFTLAWCLPPFFGWNRYVPEGNMLACGTDYLTETELSRSYLYVYSVWVYLFPLAYIIYSYTFIVKAVAAHEKGMREQAKKMGVKSLRS
EEAQKTSAECRLCKVALMTVTLWFMAWTPYFIINWGGMFNKPMVTPLFS* 0

>CRUS_LWS_mysDil Mysis diluviana DQ852591 501 Arthropoda Crustaceapartial 
0 MKSRWYIILGLIISVLAILSVIGNLTVIVVFINTRSLRSPSNLLIVNLAFSDFFMMCNMCPAMLLACIYKTWLLGPTYCAWYAFSGSLFGCLSIWTMVWITL
ERYNVIVKGVSSKPLSVKGAITRIVLTWIFAVIWCSFPLVGWNRYVPEGNLTACGTDYLSDDIYSQSYIYLYSVMVYFIPLGITIYCYSYIVHAVANHEKSMKEQAKKMGVKSFRN
EETQRTSAEFRLAKIALMTVSLWFIAWTPYLVINIVGMVARQQLNPLSTI* 0

>CRUS_LWS_neoAme Neomysis americana DQ852592 520 Arthropoda Crustacea DQ852598 partial 
0 MNPLWYSLVGFWMVIMGVLSVVGNFVVLWVFMTTKSLRTPANLLVVNLALSDFLMMFTMFPPMVISCYWQTWTLGAFFCEVYAFLGSLFGCVSIWSMVWITL
DRYNVIVKGVSGEPLTNSGAMTRIAGTWVTAFAWCLPPFFGWNRYVPEGNMTACGTDYLTDDKFSHSYLYIYSVWVYIFPLFLNIYLYTFIIKAVANHEKQMREQAKKMGVKSLRS
EESQKTSAECRLAKVALMTVSLWFMAWTPYFIINWAGMLSKSNVTPLFSIWGSV* 0

>CHEL_LWS_limPol Limulus polyphemus L03782 530 Arthropoda Chelicerata ocelli complete 
0 MANQLSYSSLGWPYQPNASVVDTMPKEMLYMIHEHWYAFPPMNPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFSDFCMMAFMMP
TMASNCFAETWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGMAAAPLTHKKATLLLLFVWIWSGGWTILPFFGWSRYVPEGNLTSCTVDYLTKDWSSASYVIIYGLA
VYFLPLITMIYCYFFIVHAVAEHEKQLREQAKKMNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIIAWAGVFSSGTRLTPLATIWGSVFAKANSCYNPIVYGISHPR
YKAALYQRFPSLACGSGESGSDVKSEASATMTMEEKPKSPEA* 0

>CRUS_MWS_hemSan Hemigrapsus sanguineus D50583 480 Arthropoda Crustacea D50584 complete 
0 MANVTGPQMAFYGSGAATFGYPEGMTVADFVPDRVKHMVLDHWYNYPPVNPMWHYLLGVVYLFLGVISIAGNGLVIYLYMKSQALKTPANMLIVNLALSDLI
MLTTNFPPFCYNCFSGGRWMFSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGFNGPKLTQGKATFMCGLAWVISVGWSLPPFFGWGSYTLEGILDSCSYDYFTRDMNTIT
YNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAAMRAQAKKMNVTNLRSNEAETQRAEIRIAKTALVNVSLWFICWTPYAAITIQGLLGNAEGITPLLTTLPALLAKSCSCYNPFV
YAISHPKFRLAITQHLPWFCVHEKDPNDVEENQSSNTQTQEKS* 0

>INSE_UVV_camAbd Camponotus abdominalis AF042788 360 Arthropoda Insecta complete 
0 MYNGSFHWEARILPAGPPRLLGWNVPAEELVHIPEHXLVYPEPNPSLHYLLAIVYILFTFVALFGNGLVIWIFCSAKSLRTPSNLFVVNLAFCDFMMMLKAP
IFIYNSFHTGFATGHLGCQIFACMGSLSGIGAGMTNAAIAYDRYSTIARPLDGKLSRGQVLLLIMLIWTYTIPWALMPLMQVWGRFVPEGFLTSCSFDYLTDSQEIRYFVPTIFTF
SYCVPMLLIIYYYSQIVGHVVSHEKALREQAKKMNVESLRSNVNTNAQSAEIRIAKAAITICFLFVLSWTPYGALAMIGAFGNRALLTPGITMIPACACKFVACLDPYVYAISHPR
YRLELQKRLPWLELQEKPVADTQSTTTEMVHTPAS* 0

>INSE_UVV_catBom Cataglyphis bombycinus AF042787 360 Arthropoda Insecta complete 
0 MYTNRSVHWEARILPAGPPRLLGWNVPAEELVHIPEHWLVYPEPNPSLHYLLAILYTLFTFVALLGNGLVIWIFISAKSLRTPSNMFVVNLAFCDFIMMLKA
PIFIYNSFNTGFATGHLGCQIFACMGALSGIGASMTNAAIAYDRYSTIARPLDGKLSRGQVILLIALIWTYTIPWALMPLMHVWGRFVPEGFLTSCTFDYLTDTPEIRYFVATIFT
FSYCIPMSLIIYYYSQIVSHVVNHEKALREQAKKMNVESLRSNTNTNAQSAEIRIAKAAITICFLFVLSWTPYGTLAMIGAFGNKALLTPGVTMIPACTCKFVACLDPYVYAISHP
RYRLELQKRLPWLELQEKPIETQSTTTETVNTASS* 0

>INSE_UVV_manSex Manduca sexta L78081 357 Arthropoda Insecta complete genFut
0 MNNQSENYYHGAQFEALKSAGAIEMLGDGLTGDDLAAIPEHWLSYPAPPASAHTALALLYIFFTFAALVGNGMVIFIFSTTKSLRTSSNFLVLNLAILDFIM
MAKAPIFIYNSAMRGFAVGTVGCQIFALMGAYSGIGAGMTNACIAYDRHSTITRPLDGRLSEGKVLLMVAFVWIYSTPWALLPLLKIWGRYVPEGYLTSCSFDYLTNTFDTKLFVA
CIFTCSYVFPMSLIIYFYSGIVKQVFAHEAALREQAKKMNVESLRANQGGSSESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGNQQLLTPGVTMIPAVACKAVACISPWVYA
IRHPMYRQELQRRMPWLQIDEPDDTVSTATSNTTNSAPPAATA* 0

>INSE_UVV_papXut Papilio xuthus AB028218 --- Arthropoda Insecta Rh5 partial 
0 MIPAAVMDNHTENNYNYGAYFAPYRLEGVELLGAGLTGEDLAAIPEHWLSYPAPPASAHTMLALVYVFFTAAALIGNGLVIFIFSASKSLRTPSNLLVVQLA
VLDFLMMLKAPIFIYNSIKRGFASGVIGCQIFAFMGSVSGTAAGLTNACIAYDRHSTITRPLDGRLSRGKVLLMMVCVWLYTAPWAILPQLQIWGRYVPEGFLTSCTFDYLTTTFD
NKLFVASMFVCVYIFPMIAILYFYSGIVKQVFAHEAALREQAKKMNVDSLRSNQNAAAESAEIRIAKAALTVCFLYVASWTPYGVMSLIGAFGDQNLLTPGVTMIPALACKGVACI
DPWVYAISHPKYRQELQKRMPWLQIDEPDDNASNTTSNTANSSAPA* 0

>INSE_UVV_droMel Drosophila melanogaster NM_057353 375 nm 2 exons 378 aa Arthropoda Insecta Rh4 complete genNow
0 MEPLCNASEPPLRPEARSSGNGDLQFLGWNVPPDQIQYIPEHWLTQLEPPASMHYMLGVFYIFLFCASTVGNGMVIWIFSTSKSLRTPSNMFVLNLAVFDLI
MCLKAPIFIYNSFHRGFALGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPMNRNMTFTKAVIMNIIIWLYCTPWVVLPLTQFWDRFVP 1
2 EGYLTSCSFDYLSDNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKALREQAKKMNVESLRSNVDKSKETAEIRIAKAAITICFLFFVSWTPYG
VMSLIGAFGDKSLLTPGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGVNEKSGEISSAQSTTTQEQQQTTAA* 0

>INSE_UVV_droMel Drosophila melanogaster M17718 345 1 exon Arthropoda Insecta Rh3 complete 
0 MESGNVSSSLFGNVSTALRPEARLSAETRLLGWNVPPEELRHIPEHWLTYPEPPESMNYLLGTLYIFFTLMSMLGNGLVIWVFSAAKSLRTPSNILVINLAF
CDFMMMVKTPIFIYNSFHQGYALGHLGCQIFGIIGSYTGIAAGATNAFIAYDRFNVITRPMEGKMTHGKAIAMIIFIYMYATPWVVACYTETWGRFVPEGYLTSCTFDYLTDNFDT
RLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKALRDQAKKMNVESLRSNVDKNKETAEIRIAKAAITICFLFFCSWTPYGVMSLIGAFGDKTLLTPGATMIPACACKMVACID
PFVYAISHPRYRMELQKRCPWLALNEKAPESSAVASTSTTQEPQQTTAA* 0

>INSE_BLU_manSex Manduca sexta AD001674 450 Arthropoda Insecta complete genFut
0 MATNFTQELYEIGPMAYPLKMISKDVAEHMLGWNIPEEHQDLVHDHWRNFPAVSKYWHYVLALIYTMLMVTSLTGNGIVIWIFSTSKSLRSASNMFVINLAV
FDLMMMLEMPLLIMNSFYQRLVGYQLGCDVYAVLGSLSGIGGAITNAVIAFDRYKTISSPLDGRINTVQAGLLIAFTWFWALPFTILPAFRIWGRFVPEGFLTTCSFDYFTEDQDT
EVFVACIFVWSYCIPMALICYFYSQLFGAVRLHERMLQEQAKKMNVKSLASNKEDNSRSVEIRIAKVAFTIFFLFICAWTPYAFVTMTGAFGDRTLLTPIATMIPAVCCKVVSCID
PWVYAINHPRYRAELQKRLPWMGVREQDPDAVSTTTSVATAGFQPPAAEA* 0

>INSE_BLU_apiMel Apis mellifera AF004168 439 nm 8 exons Arthropoda Insecta complete genNow
0 MLLHNKTLAGKALAFIAEEG 2
1 YVPSMREKFLGWNVPPEYSDLVHPHWRAFPAPGKHFHIGLAIIYSMLLIMSLVGNCCVIWIFST 2
1 SKSLRTPSNMFIVSLAIFDIIMAFEMPMLVISSFMERMIGWEIGCDVYSVFGSISGMGQAMTNAAIAFDRYR 2
1 TISCPIDGRLNSKQAAVIIAFTWFWVTPFTVLPLLKVWGRYTT 1
2 EGFLTTCSFDFLTDDEDTKVFVTCIFIWAYVIPLIFIILFYSRLLSSIRNHEKMLREQ 0
0 AKKMNVKSLVSNQDKERSAEVRIAKVAFTIFFLFLLAWTPYATVALIGVYGNR 2
1 ELLTPVSTMLPAVFAKTVSCIDPWIYAINHPR 2
1 YRQELQKRCKWMGIHEPETTSDATSAQTEKIKTDE* 0

>INSE_BLU_droMel Drosophila melanogaster U67905 437 nm 3 exons Arthropoda Insecta Rh5 complete genNow
0 MHINGPSGPQAYVNDSLGDGSVFPMGHGYPAEYQHMVHAHWRGFREAPIYYHAGFYIAFIVLMLSSIFGNGLVIWIFST 2
1 SKSLRTPSNLLILNLAIFDLFMCTNMPHYLINATVGYIVGGDLGCDIYALNGGISGMGASITNAFIAFDRYKTISNPIDGRLSYGQIVLLILFTWLWATPFSVLPLFQIWGRYQP 1
2 EGFLTTCSFDYLTNTDENRLFVRTIFVWSYVIPMTMILVSYYKLFTHVRVHEKMLAEQAKKMNVKSLSANANADNMSVELRIAKAALIIYMLFILAWTPYSVVALIGCFGEQQLI
TPFVSMLPCLACKSVSCLDPWVYATSHPKYRLELERRLPWLGIREKHATSGTSGGQESVASVSGDTLALSVQN* 0

>INSE_UVV_apiMel Apis mellifera AF004169 353 nm 5 exons Arthropoda Insecta complete genNow
0 MSNDSIHWEARYLPAGPPRLLGWNVPAEELIHIPEHWLVYPEPNPSLHYLLALLYILFTFLALLGNGLVIWIFCA 2
1 AKSLRTPSNMFVVNLAICDFFMMIKTPIFIYNSFNTGFALGNLGCQIFAVIGSLTGIGAAITNAAIAYDRYS 2
1 TIARPLDGKLSRGQVILFIVLIWTYTIPWALMPVMGVWGRFVPEGFLTSCSFDYLTDTNEIRIFVATIFTFSYCIPMILIIYYYSQIVSHVVNHEKALREQAKKMNVDSLRSNANTSSQSAEIRIAK 0
0 AAITICFLYVLSWTPYGVMSMIGAFGNKALLTPGVTMIPACTCKAVACLDPYVYAISHPKYR 2
1 LELQKRLPWLELQEKPISDSTSTTTETVNTPPASS* 0

>INSE_UVV1_pedHum Pediculus humanus AAZO01000117 best:INSE_UVV_apiMel exon 1 uncertain
0 MKITTESENNISLSYYQPF 1
2 IDKEESLIWNVDPSELVHIPDHWFNFSAPHPLSNYLLGFLYFIFFVISCTGNGIVIWIFTT 2
1 SKNLRTASNVFVVNLAIFDFIMMAKTPIMIYNSMNLGFECGFVWCQIFASAGALSGIGASITNTCIAYDRCE 2
1 TITNPLQKSGKKKAFLLAAFTWIYALPWAVLPFLEIWGKFAPEGYLTTCTVDYLTDTSQTRMFIVTIFFAAYVLPLSLIIYFYTKIVLHVINHEKSLKAQ 0
0 AKKMNVESLRSDGNKNYAVEIRITKVAIAMCFLFVISWTPYAVVALIGCFGNK 2
1 HLITPLVSMIPACACKAVACIDPYIYAISHPRFR 2
1 VEVNKRFACLAGCLQEKELQDDAVSKNTVNAENVDT* 0

>INSE_UVV2_pedHum Pediculus humanus AAZO01007270 38% INSE_UVV_apiMel exon 1 uncertain
0 MVVAVAEEV 1
2 EEWKKLGLFDDEYLYKINKYWMKFPPPSPMSHYFMGIIYSVIMVVGVFGNFLIIYLFLR 2
1 KRSLRTPSNVFIFNLAVSDSLLLLKMPVFIINSFYLGPALGNL 1
2 GCSAYGFVGGLTGTVSIMTLAAIAFDRYQVIVHPLERKTKAAVYFQILLIWIYAFFSIIPLLDVGLNKYVPEGYLTSCSFDYLTQDTASRLTIFVFFVAAWIVPLSIILGSYM
ALYKVVLKARGTHFNTVMTRHCKDIEIQRPELKAAVTVICIVCLWTLSWTPYAVALLGITGNEKYISPMSSMIPALFCKTASCIDPFVYAATNRRFRNELKRKYRKRSRYQPSLKTE
RKDFFTLSEDNNDRGKGNTIRIREK* 0
>ENCEPHa_nemVec Nematostella vectensis (anemone) no cdna complete 1 exon 306 aa best:ENCEPH4_braFlo scaffold_465_Cont27987 alt: Nemve1:219988 Nem1
0 MIDNALGRHEANIVLGYYIAIFVIGFVTNTIVVIIFISSQRLHTTPNLILFSMSVCDWLMATMAKSVGIYGNARYWPTVGKVTCDYYAFATSAIGYASILHLAALAVEKRMAVVSPMTNSFNGRRMLVIIATLWGFAILW
AVFPLIGWSSYGPEPGYVSCSITWYTTDHNNVSYIICVSVLFFLIPIVTMTFCFASIYHTIRNLSHEATARWGSDARATQETIRAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGDTHRIPALLGVLPSLFAKLSSCY
NPIIYFFMYTKFRCAGKALLYQEHH* 0

>ENCEPHb_nemVec Nematostella vectensis (anemone) NC-extended 1 exon 275 aa best:ENCEPH4_braFlo scaffold_273_Cont21871 alt:Nemve1:130042 Nem3 
0 MSNEALRRHEANIVLGYYIAIFVIGFVTNTIVVITFIFSKRLHTTPNLILFSMSVCDWLMAAMAKSVGIYGNARYWPTVGKVTCDYYAFATSAIGYASILHLAALAVEKRMAVASPMTNSLNERRMLVIIATLWGFAILWAVFPLIGWSSYGPEPGYV
SCSITWYTTDHNNVSYIICISVLFFFVPIVTMTFSFASIYKAIRNISHEAIARWGSHARATQETIKAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGGTHRNPALLGVLPSLFAKLSSCYNPIIYFFMYTKFRRAAKL
LFIKKVIRPTEAERSRVLSGIRTRAASTFAVKLTVHAEKGQQIAPNITPQGAVKGPVESIEDNLQKIETITPCTSV* 0

>ENCEPHc_nemVec Nematostella vectensis (anemone) C-extended 1 exon 289 aa best: ENCEPH5_braFlo scaffold_11_Cont2404alt: Nemve1:85309 Nem2
0 MELFTSYHAITVMYSLLAAGAFVLNGIVLIIFLATRSLRTIPNMILLSMAWADWLMACLADAVGAYANANNWPSMVGGLCVYYGFITTALGLTSMIHLTALSVERFVTVTIPMTRPITETQMLLVVTFLWAFSFLWAIFP
LVGWSSYGPEPGYAACSIAWYRQDLNNMSYILCLFMFFFFLPIVIMIACFSSIYFTVRKLTRDSMRRWGA
SSDSTQQTLAAERKTAWMSFIMVLAFLFAWVPYAVVSLYASFGGVTTIPKLMSTLPAMLAKTSACYNPII
YFFMYSKFRKAFQRFFFKNVITPSQTGGSSTINSASVIPTSFPRASYAVRSSSVLPSSTASHSFRSRE* 0

>MEL_nemVec Nematostella vectensis best: 25% MEL1_calMil 465 aa non-mel intron FC274660 FC274659 FC285526 XM_001633226
0 MSKSAAGGIDNRGFIRGLPFKDKMNNTNSSGKELLVLESLSHESAVWVIAQVVLWGCIFVISSLGNSLVLLCIVKSNRLHSSIYAFYGSLAASDCIA 1
2 GMLCCPLLLVTALHQLWIMGKVMCHVYSTLLSTSLNASIATLCLISMDRLNAVRKPFEYRGHNTFTQRWCKWLLVLSWVHSIFWAAA
PLGGWGEIITDSATYTCKPNWSAASIVNRSYSLCLALFPFAFPVFLMVAIYCVIYRHTKKCSNLMSGLEDGRNLVAE
QERQMRERRLFRTVLIIIGAFAACWAIYTLATTCKLFIGQTPPTWLVQLGLICAIAGSCVNPVIYTIRDATFARELGRL
HPCLAWLLKQSFSKDSGSREQVYTVTCATRASISRELLDVSRKNSRETLSLMLIDSVRED
RGCDSKSRNSMARDDKARDELPRDNKARDELSHDMTCVDERDCRLSLRSDLTRDMYLQMRETNL* 0

>LWS_nemVec2 Nematostella vectensis best: best: INSE_LWS_pieRap 33% 1 exon FC241088 FC262508 XM_001635772
0 MATNSTAQNLTTLKVTDFATIETSFTAIALLVIMLLTIIGNLMVCYVVLSNKRLWTEMNMFLVNLAFGDLAVGLICMVFPLITAIKREWIFGRGILCQLNACCNSVLFCSTIFTHTVISIDRYI
VIVHPMKKIMTRKKAALMIVGVWVFSVFIVLGPVFGWGRMEYNASTLQCGFGFPRDKMASMYIVIVAIIAFIIPLLIMTYTYIRIYISVLEHTRRMSETATAMQQQAVFSAQKRIVFTFFIALLAF
FACWAPFFSFIAFAVVVKNPHDIPHGLGLASYVCGFINSACNPFIIGLRSKQFKSGFSRILCCCRGRDPKSSDSARRLKADKTISSRTMTDYDPDKD
SVFESDPQKKRPLLPRRTEERVGTDARARNGSSLGTESVTHSDKPLLRDPLDEMDELTLPKKHKVHMVRPRKRSSESSECASQRSAKAVSYKQFLHVKGDSPWRESGV* 0