Opsin evolution: alignment

From genomewiki
Revision as of 21:25, 26 November 2007 by Tomemerald (talk | contribs)
Jump to navigationJump to search

This shows an alignment of 147 opsins, almost all of ciliary type. Unalignable termini have been trimmed. Fragmentary sequences are mostly not shown: these score low and so fall to the bottom. (That can still be useful as two important clades, jawless fish and chondrichthyes, are largely represented by fragments.) Notice the numerous invariant (blue) and nearly invariant sequences (red) -- these anchor the alignment with near-certainty. Some of these are not specific to opsins but are rather properties of GPCR signalling proteins generally.

Opsins have seven alpha helical sections traversing the cell membrane with the intervening sequence alternating as cytoplasmic and extra-cellular. Certain key residues such as the lysine where the retinal is covalently bound, counterions, and recognition sites diagnostic for binding of other proteins require markups that will be added shortly.

Among other things, the alignment by MultAlign shows that the Opsin Classifier has properly named the opsins -- each classifies as expected from its name. Deletions and insertions show up clearly on the alignment, readily resolved as to type using the phylogenetic ordering to establish ancestral conditon. The alignment also exhibits some anomalies where the sequence in question needs re-evaluation back at the primary data source (cDNA and/or genome).

MultAlign is apparently the only alignment software that allows line width to be specified. That's important here because it enables the entire alignment (147 sequences x 413 residues) to be seen in a single window. The numbering schemes allows specific residues and regions to be discussed. Colored text output was also an option (it allows copy and paste of specific residues) but the file again is huge with color markups and awkward to display tightly within genomeWiki. Uncolored text is still available at the main opsin evolution page.


Opsin align.png


Here are those sequences aligned (after some trimming of unalignable regions) to show rare genomic events such as indels and intron gains and losses:

>RHO1_homSap LAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWS
>RHO1_monDom LAAYMFMLIVLGFPINFLTLYVTIQHKKLRTPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTGCNLEGFFATLG GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIIGVAFTWVMALACAFPPLIGWS
>RHO1_ornAna LAAYMFMLIMLGFPINFLTLYVTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG GEIALWSLVVLAIERYIVVCKPMSNFRFGENHAIMGVAFTWIMALACALPPLVGWS
>RHO1_galGal LAAYMFMLILLGFPVNFLTLYVTIQHKKLRTPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTGCYIEGFFATLG GEIALWSLVVLAVERYVVVCKPMSNFRFGENHAIMGVAFSWIMAMACAAPPLFGWS
>RHO1_anoCar LAAYMFLLILLGFPINFLTLFVTIQHKKLRTPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVGCNIEGFFATLG GEMGLWSLVVLAVERYVVICKPMSNFRFGETHALIGVSCTWIMALACAGPPLLGWS
>RHO1_xenTro LAAYMFLLILLGFPINFMTLYVTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG GEMALWSLVVLAIERYVVVCKPMANFRFGENHAIMGVVFTWIMALSCAAPPLFGWS
>RHO1_danRer VAAYMFFLIITGFPVNFLTLYVTIEHKKLRTPLNYILLNLAIADLFMVFGGFTTTMYTSLHGYFVFGRLGCNLEGFFATLG GEMGLKSLVVLAIERWMVVCKPVSNFRFGENHAIMGVAFTWVMACSCAVPPLVGWS
>RHO1_Raja   LAAYMFFLILTGLPVNFLTLFVTIQHKKLRQPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAGCNFEGFFATLG GEVGLWCLVVLAIERYMVVCKPMANFRFGSQHAIIGVVFTWIMALSCAGPPLVGWS
>RHO1_lamp1  LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTMCNFEGFFATLG GEMSLWSLVVLAIERYIVICKPMGNFRFGSTHAYMGVAFTWFMALSCAAPPLVGWS
>RHO1_lamp2  LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTMCSIEGFFATLG GEVSLWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVALTWVMALSCAAPPLLGWS
>RHO1_lamp3  LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTMCSIEGFFATLG GEVALWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVAFTWIMALACAAPPLVGWS
>RHO2_galGal VCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG GQVALWSLVVLAIERYIVVCKPMGNFRFSATHAMMGIAFTWVMAFSCAAPPLFGWS
>RHO2_anoCar VCCYIFFLIFTGLPINILTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG GQVALWSLVVLAIERYIVVCKPMGNFRFSATHALMGISFTWFMSFSCAAPPLLGWS
>RHO2_Gekko  LSFYMFFLIAAGMPLNGLTLFVTFQHKKLRQPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIGCAIEGFFATIG GQVALWSLVVLAIERYIVICKPMGNFRFSATHAIMGIAFTWFMALACAGPPLFGWS
>RHO2_Lat    LCAYMFLLIILGFPINFLTLLVTFKHKKLRQPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMGCAMEGFFATLG GQVALWSLVVLAIERYIVVCKPMGNFRFASSHAIMGIAFTWIMALACAAPPLVGWS
>RHO2_Geo    ISAYVFTLILIGFPVNFMTLFVTFKLKKLRQPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTGCNIEGFFATLG GEVSLWSLVMLAIERYIVVCKPMGNFRFATTHAALGVVFTWVMASACAVPPLVGWS
>SWS1_homSap QAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHVCALEGFLGTVA GLVTGWSLAFLAFERYIVICKPFGNFRFSSKHALTVVLATWTIGIGVSIPPFFGWS
>SWS1_macDom QTVFMGFVFCAGTPLNAVVLVATLRYKKLRQPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHVCAMEAFLGSVA GLVTGWSLAFLAFERFIVICKPFGNFRFNSKHAMMVVLATWVIGIGVSIPPFFGWS
>SWS1_galGal QTAFMGIVFAVGTPLNAVVLWVTVRYKRLRQPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRVCELEAFVGTHG GLVTGWSLAFLAFERYIVICKPFGNFRFSSRHALLVVVATWLIGVGVGLPPFFGWS
>SWS1_Taenio QTIFMGLVFVAGTPLNAIVLIVTIKYKKLRQPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHMCAFEGFAGATG GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWS
>SWS1_Gekko  QTAFMGFVFFVGTPLNAIILFAIVKYKKLRQPLNYILVNISAAGFLFCVVAVFTVFISSSQGYFIFGKHICALEAFLGSLA GLVTGWSLAFLALERYIVICKPFGNFRFSAKHASLVVAATWFIGIGVSIPPYFGWS
>SWS1_Utasta QTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHICALEAFLGSVA GLVTGWSLAFLAFERYIVICKPFGNFRFNSKHALLVVAATWFIGIGVSIPPFFGWS
>SWS1_Xenlae QAIFMGMVFLIGTPLNFIVLLVTIKYKKLRQPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIACSIDAFVGTLT GLVTGWSLAFLAFERYIVICKPMGNFNFSSSHALAVVICTWIIGIVVSVPPFLGWS
>SWS1_Danio  QAAFMGFVFIVGTPMNGIVLFVTMKYKKLRQPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTLCSMEAAMGSIA GLVTGWSLAVLAFERYVVICKPFGSFKFGQGQAVGAVVFTWIIGTACATPPFFGWS
>SWS1_Oryzia QAAFMGFVFFVGTPLNFVVLLATAKYKKLRVPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTLCALEAAVGAVA GLVTSWSLAVLSFERYLVICKPFGAFKFGSNHALAAVIFTWFMGVGCACPPFFGWS
>SWS1_Geotri QAAFMGFVFICGTPLNAIVLVVTIKYKKLRQPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTICALEAFFGSLA GLVTGWSLAFLAAERYIVICKPFGNFRFGSKHALVAVGLTWMLGLSVALPPFFGWS
>SWS2_ornAna LAAFMFLLITLGFPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACKIEGFAATLG GMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPPLFGWS
>SWS2_galGal MAAFMFLLIALGVPINTLTIFCTARFRKLRSHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTACKIEGFAATLG GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCVATWVLGFVASAPPLFGWS
>SWS2_Taenio MAAFMFLLVLLGVPINALTVLCTAKYKKLRSHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLACKIEGFTATLG GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCAITWIFGLIASLPPLFGWS
>SWS2_Utasta MAAFMFLLIILGVPINVLTIFCTFKYKKLRSHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTACKIEGFAATLG GMVSLWSLAVVAFERFLVICKPLGNFSFRGTHAIIGCIITWVFGLVASLPPLFGWS
>SWS2_Xenopu ISAFMLFTIIFGFPLNLLTIICTVKYKKLRSHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLACKIEGFTATLG GIIGLWSLAVVAFERFLVICKPMGNFTFRESHAVLGCILTWVIGLVAAIPPLLGWS
>SWS2_Danio  MSAFMLFLFIAGTAINVLTIVCTIQYKKLRSHLNYILVNLAISNLWVSVFGSSVAFYAFYKKYFVFGPIGCKIEGFTSTIG GMVSLWSLAVVALERWLVICKPLGNFTFKTPHAIAGCILPWCMALAAGLPPLLGWS
>SWS2_Takifu MSAFMFFLFVAGTGINVLTIACTIQYKKLRSHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLGCKIEGFAATLG GMVSLWSLAVVAFERWLVVCKPLGNFIFKPDHAIVCCIFTWFFALIISAPPLFGWS
>SWS2_Geotri MSAFMLFLVLAGFPLNFLTVFVTIKYKKLRSHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLFCKMEGFTATLG GMLSLWSLAVLAFERCLVICKPFGNIAFRGTHALIRCGFAWAAAIAASTPPLFGWS
>LWS_ornAna  TSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPMCVLEGYTVSLC GITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPIFGWS
>LWS_galGal  TSLWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPMCVVEGYTVSAC GITALWSLAIISWERWFVVCKPFGNIKFDGKLAVAGILFSWLWSCAWTAPPIFGWS
>LWS_anoCar  TSVWMIFVVIASIFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPMCVLEGYTVSTC GISALWSLAVISWERWVVVCKPFGNVKFDAKLAVAGIVFSWVWSAVWTAPPVFGWS
>LWS_Lithoch ATLWMFVVVVLSVFTNGLVLVATMKFKKLRHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCIFEGYVVSVC GIAALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWVWAAVWCAPPIFGWS
>LWS_Gastero STLWMFIVVALSVFTNGLVLVATAKFKKLQHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYVVSVC GITALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWIWSAVWCAPPIFGWS
>LWS_Petrom  TSVWMIIVVVLSLFSNGLVLVATVKFKKLRHPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPMCVFEGYVVSTC GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIATILIVFSWVWPASWCSLPIFGWS
>LWS_lamprey TSVWMIIVVVLSLFTNGLVLVATMKFKKLRHPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPMCVFEGYVVSTC GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIAIILIVFSWVWPACWCSLPIFGWS
>LWS_Geotria TSFWMIIVVILSLFTNGLVLVATLKFKKLRHPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPLCVFEGFTVSVC GITALWSLAIISFERWMVVCKPFGNLKFDGKVAIVLIIFSWAWSAGWCAPPIFGWS
>PIN_galGal  VAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRMCELEGFMVSLT GIVGLWSLAILALERYVVVCRPLGDFQFQRRHAVSGCAFTWGWALLWSTPPLLGWS
>PIN_UtaSta  VAVLMGLVVVSAAFVNGLVIVVSIQYKKLRSPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTACEFEGFMVSLT GIVGLWSLAILAFERYLVICKPVGDFRFQQRHAVFGCVFTWMWSLVWTLPPLFGWS
>PIN_pheMad  LAALMGVVVLSASLANGLVIAVSVRFKRLRSPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTACRFEGFMVSLT GIVGLWSLAILAFERYLVICKPVGDFQFQRRHAVIGCLYTWGWSLIWTVPPLFGWS
>PIN_podSic  VAVLMGLVVISATLVNGLVIVVSVQFKKLRSPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQATCEFEGFMVSLT GIVGLWSLAILAFERYLVICKPVGDFRFPARHAVLGCAFTWGWSFVWTVPPLLGWS
>PIN_xenTro  VAAVMCMVVILAFFVNGLVIVVTLKYKKLRSPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTMCEFEGFMVSLT GIVGLWSLAILAFERYLVICKPMGDFRFQQKHAILGCSFTWVWSFIWTSPPLFGWC 
>PIN_bufJap  VAVLMGMVVFLAFFVNGMVIVVSLKYKKLRSPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLVCELEGFVVSLT GIVGLWSLAILAFERYIVICKPMGDFRFQQRHAVMGCAFTWIWAFLWTSPPLIGWC
>VAOP_galGal VAAVMFVVTSLSLAENLAVILVTFKFKQLRQPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWACVLEGFAVTFF GIVALWSLALLAFERYIVICRPVGNMRLRGKHAAQGIAFVWTFSFIWTIPPTMGWS
>VAOP_anoCar ISALMFVVTLFSLSENFTVILVTIKFKQLRQPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWACVLEGFAVTFF GIVALWSLALLAFERYVVICRPLGNMRLNGKHAALGVAFVWIFSFIWTVPPTMGWS
>VAOP_xenTro LAALMFVVTSLSIAENFIVILVTAKFKQLRQPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWACVLEGFAVTFF GIVALWSLSVLAFERYIVICRPLGNLRLQGKHSALAIIFVWVFSFVWTIPPTMGWS
>VAOP_danRer LAALMFVVTALSLSENFTVMLVTFRFQQLRQPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWACVLEGFAVTFF GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLVFVWSFSFIWTVPPVLGWS
>VAOP_rutRut LATLMFVVTAASLSENFAVMLVTFRFTQLRKPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWACVLEGFAVTYF GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLLFVWTFSFIWTIPPVLGWS
>VAOP_Petro  LAALMGTITALSLGENFAVIVVTARFRQLRQPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHACVLEGFAVTYF GVVALWSLALLAFERYFVICRPLGNFRLQSKHAVLGLAVVWVFSLACTLPPVLGWS
>PPIN_anaCar IAIIMATSCTLSVILNTAVIAITIKYRQLRQPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVGCVTEGFAMAFF GIVALCTIAVIAVDRAIVIAKPMGTITFTTRKAMIGVAVSWIWSLVWNTPPLFGWG
>PPIN_Xenop  LALIMAVFCAAALFLNVTVIVVTFKYRQLRHPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVGCVIEGFAVAFF GIAALCTIAVIALDRVFVVCKPMGTLTFTPKQALAGIAASWIWSLIWNTPPLFGWG
>PPIN_Ictal  LSIIMALSSTFGIILNMVVIIVTVRYKQLRQPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVGCVLEGFAVAFF GIAGLCSVAVIAVDRYMVVCRPLGAVMFQTKHALAGVVFSWVWSFIWNTPPLFGWG
>PPIN_Danio  LAVIIGVFSVCGVILNVTVITVTLKYKQLRQPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVGCVLEGFAVAFF GIAALCSVAVIALERCMVVCRPVGSISFQTRHAVFGVAVSWLWSFIWNTPPLFGWG
>PPIN_Oncor  LAVIIGVFSVSGVCMNVLVIMVTMRHRKLRQPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLGCVLEGFAVAFF GIAGLCSVAVIAVDRYVVVCRPMGAVMFQTRHAVGGVVLSWVWSFLWNTPPLFGWG
>PPINa_Ciona LCVYMTFVFLLSCSLNILVIVATLKNKVLRQPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTMCQIEGYFVSNF GVTGLLSIAVMAFERYFVICKPFGPVRFEEKHSIFGIVITWVWSMFWNTPPLIFWD
>PPINb_Ciona LAVYMTFIFLLAVSLNGFVIIATMKNKKLRQPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTVCILEGYIVSVA GVCGLMSISVMAFERYFVVCKPYGPFTLTNTHAALGIGFTWTWSVLWSTPGLIWLD
>PPIN_lamp   LAVIMAVFTIASLVLNSTVVIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGIAWAWLWSFVWNTPPLFGWG
>PARIE_Utast LAFLMFLNALFSIFNNSLVIAVTLKNPQLRNPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKFCIFQGFAVNYF GIVSLWSLTILA YERYNVV--CQPLGTLQMSTKR GYQLLGFIWVFCLFWAVVPLFGWS
>PARIE_Anole LAFLMFINALFSLFNNFLVIAVTLKNPQLRNPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRFCIFQGFAVNYF GIVSLWSLTILA YERYNVV--CQPLGTLQMSTQR AYQLLGFIWVFCLFWAVVPLFGWS
>PARIE_Xenop LSFLMFLNAVFSICNNAIVILVTLKHPQLRNPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQFCIFQGFAVNYF GIVSLWSLTLLA YERYNVV--CEPIGALKLSTKR GYQGLVFIWLFCLFWAIAPLFGWS
>ENCEPH_braB VAGVIAIIGVVGFVSNGAVVVLFLKFPQLRTPFNLLLLNMAVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF GLVSLISLAVIS FLRYRMVVKPKGPGSSYLTYTK VGLAILFIYLYCLLWTTLPIAGWS
>ENCEPH_homS LALLLGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF GIVSIATLTVLA YERYIRVV-----HARVINFSW AWRAITYIWLYSLAWAGAPLLGWN
>ENCEPH_monD LALLIATIGLLGLCNNLLVLVLYYKFQRLRTPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVGCAWDGFSNTLF GIVSIMTLTVLA YERYNRIV-----HAKVINFSW AWRAITYIWLYSLVWTGAPLLGWN
>ENCEPH_galG LALLIATIGTLGVCNNLLVLVLYYKFKRLRTPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAGCVWDGFSNSLF GIVSIMTLTVLA YERYIRVV-----HAKVIDFSW SWRAITYIWLYSLAWTGAPLLGWN
>ENCEPH_anoC LALLVAAIGLLGLCNNLLVLVLYAKFKRLRTPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAGCVWDGFSNSLF GIVSIMTLTVLA YERYIRVV-----HARVIDFSW SWRAITYIWLYSLAWTGAPLLGWN
>ENCEPH_xenT LALIVATVGFLGLVNNLLVLILYCKFKRLQTPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEMCVFHGFSKNLL GIVSFGTLTVVA YERYARVV-----YGKYVNSSW SKRSITFVWVYSLAWTGFPLIGWN
>ENCEPH_braB IATGLALIGLVGSMNNFVVILLIGCHRQLRTPFNLLLLNVSVADLLVSVCGNTLSFASAVQHRWLWGRPGCVWYGFANSLF GIVSLVTLSALA FERYCVVV----RSSEMLTYKS SLGMIAFIWMYSLLWTSLPLLGWS
>ENCEPH_braF IATCLALIGFVGFTNNFVVILLIGCHRQLRTPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANSLF GIVSLVTLSALA FERYCVVV----RSSDMLTYKS SLVVITFIWLYSLLWTSLPLLGWS
>ENCEPH2_Api AIALGFIGFFGFTANLLVAIVIVKDAQILWTPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKMCLWYAWFMSTL GFASIGNLTVMA VERWLLVA----RPMQALSIRH AVILASFVWIYALSLSLPPLFGWG
>ENCEPH1_Ano AAVTLFFIGFFGFFLNIFVIALMYKDVQLWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSICVAYGFFMSLL GIASITTLTVLS YERFCLIS--RPFAAQNRSKQG ACLAVLFIWSYSFALTSPPLFGWG
>ENCEPH2_Ano SAVTLFFIGFFGFFLNLFVIALMCKDMQLWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTLCVAYGFFMSLL GITSITTLTVLS YERYCLIS--RPFSSRNLTRRG AFLAIFFIWGYSFALTSPPLFGWG
>CILL2_Platy AIYLCIVGVIGTLSNGVIMYLYFKDKSLRSPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLG GLASEMNLFIIS VERYLAVV--RPFDVGNLTNRR VIAGGVFVWLYSLVFAGGPLVGWS
>CILL1_Platy AAYLFFIACLGVSLNVLVLVLFIKDRKLRSPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLG GLAALMTLSVIA FVR CLAV-LRLGSFTGLTTRM GVAAMAFIWIYSLAFTLAPLLGWN
>MEL1_homSap LGTVILLVGLTGMLGNLTVIYTFCRSRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGETGCEFYAFCGALF GISSMITLTAIA LDRYLVIT-RPLATFGVASKRR AAFVLLGVWLYALAWSLPPFFGWS
>MEL1_smiCra IGATILVVGFTGVLGNLLVIYTFCRSRSLRTPANMFIINLAISDFFMSFTQAPVFFASSLYERWIFGEKGCEFYAFCGALF GITSMITLMVIA LDRYFVIT-RPLASIGMISKKK TGLILLGVWLYSLAWSLPPFFGWS
>MEL1_galGal IGTVILIVGITGTLGNFLVIYAFCRSRTLQKPANIFIINLAVSDFLMSITQSPVFFTNSLHKRWIFGEKGCELYAFCGALF GITSMITLMVIA LDRYFVIT-KPLASVRVMSKKK ALIILVGVWLYSLAWSLPPFFGWS
>MEL1_xenTro VGAVILAVGITGMLGNFLVIYAFCRSRSLRSPANMFIINLAITDFLMSVTQAPVFFATSLHKRWIFGEKGCELYAFCGALF GITSMITLMVIA VDRYFVIT-RPLTSIGVMSKKR AVLILSGVWLYSLAWSLPPFFGWS
>MEL1a_Bran  VGTAVFCIGCCGMFGNAVVVYSFIKSKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDFACELYGFAGGLF GCLSINTLMAIS MDRYLVIT-KPFLVMRIVTKQR VMFAILLLWIWSLVWALPPLFGWS
>MEL2_anoCar VGSCVLVIGCIGITGNLLVLYAFYSNKRLRTPPNYFIMNLAVSDFLMSATQAPICFLNSMHKEWVLGDIGCNLYAFCGALF GITSMITLLAIS VDRYCVIT-KPLQSIKRTSKKR TCIIIVFVWLYSLGWSVCPLFGWS
>MEL2_xenLae IGSFILIIGSVGIIGNMLVLYAFYRNKKLRTAPNYFIINLAISDFLMSATQAPVCFLSSLHREWILGDIGCNVYAFCGALF GITSMMTLLAIS INRYIVIT-KPLQSIQWSSKKR TSQIIVLVWMYSLMWSLAPLLGWS
>NEUR_homSap AGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFF GCGSLITMTAVS LDRYLKIC--YLSYGVWLKRKH AYICLAAIWAYASFWTTMPLVGLG
>RGR_homSap  VLLVEALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLALADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVT ALASICSSAAIA WGRYHHYC-----TRSQLAWNS AVSLVLFVWLSSAFWAALPLLGWG
>PER_homSap  VATYLIMAGMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQVYAGLNIFF GMASIGLLTVVA VDRYLTIC--LPDVGRRMTTNT YIGLILGAWINGLFWALMPIIGWA
>PERa_Branc  VGLYLFVIGIIGTIENGITLATFSKFRSLRSPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVGCQWYGFNGMFF  

Here's the distal alignment of the opsins:


>RHO1_homSap   RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQ FRNCMLTTIC CGKNPLG DDEASATVS  KTE     TSQVAPA
>RHO1_monDom   RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNFGPIFMTIPAFFAKSSSVYNPVIYIMMNKQ FRTCMITTLC CGKNPLG DDEASATAS  KTE     TSQVAPA
>RHO1_ornAna   RYIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTVPAFFAKSSAIYNPVIYIMMNKQ FRNCMLTTIC CGKNPLG DDEASATAS  KTEQSSVSTSQVSPA
>RHO1_galGal   RYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDFGPIFMTIPAFFAKSSAIYNPVIYIVMNKQ FRNCMITTLC CGKNPLG DEDTSAG    KTETSSVSTSQVSPA
>RHO1_anoCar   RYIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKA AAAQQQESATTQKAER----EVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDFGPVFMTIPAFFAKSSAIYNPVIYILMNKQ FRNCMIMTLC CGKNPLG DEDTSAGT   KTETSTVSTSQVSPA
>RHO1_xenTro   RYIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKE AAAQQQESATTQKAEK----EVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDFGPVFMTVPAFFAKSSAIYNPVIYIVLNKQ FRNCLITTLC CGKNPFG DEEGSSAASS KTEASSVSSSQVSPA
>RHO1_danRer   RYIPEGMQCSCGVDYYTRTPGVNNESFVIYMFIVHFFIPLIVIFFCYGRLVCTVKE AARQQQESETTQRAER----EVTRMVIIMVIAFLICWLPYAGVAWYIFTHQGSEFGPVFMTLPAFFAKTSAVYNPCIYICMNKQ FRHCMITTLC CGKNPFE EEEGASTTAS KTEASSVSSSSVSPA
>RHO1_Raja     RYIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKE AAAQQQESESTQRAER----EVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDFTPFFMTVPAFFAKSSAVYNPLIYILMNKQ FRNCMITTIC LGKNPFE EEESTSASAS KTEASSVSSSQVAPA
>RHO1_lamprey1 RYLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKE AAAAQQESASTQKAEK----EVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTVPAFFAKTSALYNPIIYILMNKQ FRNCMITTLC CGKNPLG DEDSGASTS  KTEVSSVSTSQVSPA
>RHO1_lamprey2 RYLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKE AAAAQQESASTQKAEK----EVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ FRNCMITTLC CGKNPLG DDDSGASTS  KTEVSSVSTSQVAPA
>RHO1_lamprey3 RYIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKE AAAAQQESASTQKAEK----EVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ FRNCMITTLC CGKNPLG DDESGASTS  KTEVSSVSTSQVSPA
>RHO2_galGal   RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVRE AAAQQQESATTQKAEK----EVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAVPAFFSKSSSLYNPIIYVLMNKQ FRNCMITTIC CGKNPFG DEDVSSTVSQSKTEVSSVSSSQVSPA
>RHO2_anoCar   RYIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVRE AAAQQQESASTQKAER----EVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDFSATLMSVPAFFSKSSSLYNPIIYVLMNKQ FRNCMITTIC CGKNPFG DEDVSSSVSQSKTEVSSVSSSQVSPA
>RHO2_Gekko    RFIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVRE AAAQQQESATTQKAEK----EVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAFSVTFMTIPAFFSKSSSIYNPIIYVLLNKQ FRNCMVTTIC CGKNPFG DEDVSSSVSQSKTEVSSVSSSQVAPA
>RHO2_Latime   RYIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKE AAAQQQESASTQKAEK----EVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEFTATLMTVPAFFSKSSCLFNPIIYVLLNKQ FRNCMITTLC CGKNPLG DDDTSSAVSQSKTDVSSVSSSQVSPA
>RHO2_Geot     RYIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKE AAAQQQESASTQKAER----EVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILFSATAMTVPAFFSKSSVLYNPIIYVLLNKQ FRTCMVTTLF CGKNPFG EDDSSMVSTS KTEVSSVSSSQVSPS
>SWS1_homSap   RFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKA VAAQQQESATTQKAER----EVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGLDLRLVTIPSFFSKSACIYNPIIYCFMNKQ FQACIMKMV  CGKAMTDESDTCSSQ    KTEVSTVSSTQVGPN
>SWS1_macDom   RFIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRA VAAQQQESATTQKAER----EVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ FHACIMEMV  CRKPMTDDSDVSSSQ    KTEVSAVSSSQVGPT
>SWS1_galGal   RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ FRACIMETV  CGKPLTDDSDASTSAQ   RTEVSSVSSSQVGPT
>SWS1_Taeni    RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGIDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ FRACIMETV  CGRPMTDDSEVSSSAQ   RTEVSSVSSSQVGPS
>SWS1_Gekko    RFIPEGLQCSCGPDWYTVGTKYYSEYYTWFLFVLCFIVPLSIIVFSYSQLLSALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCLCYVPYAALAMYMVNNRNHGIDLRMVTIPAFFSKSSCVYNPIIYCFMNKQ FRGCILEMV  CGKTMAEESEVSSASQ   KTEVSSVSSSQVGPS
>SWS1_Uta      RFIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSACVYNPIIYCFMNKQ FRACIMETV  CGKPMTDESDVSSSAQ   KTEVSSVSSSQVSPS
>SWS1_Xenop    RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRA VAAQQQESASTQKAER----EVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGLDLRLVTIPAFFSKSSCVYNPIIYSFMNKQ FRGCIMETV  CGRPMSDDSSVSSTSQ   RTEVSTVSSSQVSPA
>SWS1_Danio    RYIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRA VAAQQAESASTQKAEK----EVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENKDYRLVTIPAFFSKSSCVYNPLIYAFMNKQ FNACIMETV  FGKKIDESSEVSS      KTETSSVSA
>SWS1_Oryzia   RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA VAAQQQESASTQKAER----EVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ FNGCIMEMV  FGKKMEEASEVSS      KTEVSTDS
>SWS1_Geotri   RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA VAAQQQESASTQKAER----EVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ FRACILETV  CGKPITDESETSSS     RTEVSSVSTTQMIPG
>SWS2_ornAna   RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRA VAKQQEQSATTQKAER----EVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIPSVFSKASTIYNPIIYVFMNKQ FRSCMLKLVF CGKSPFGDEDEISGSS   QATQVSSVSSSQVSPA
>SWS2_galGal   RYIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRA VARQQEQSATTQKADR----EVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSFEVGLASIPSVFSKSSTVYNPVIYVLMNKQ FRSCMLKLLF CGRSPFGDDEDVSGSS   QATQVSSVSSSHVAPA
>SWS2_Taeni    RYIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRA VAKQQEQSASTQKAER----EVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPFDLGLASIPSVFSKASTVYNPIIYVFMNKQ FRSCMLKLVF CGRSPFGDEDDVSGSS   QATQVSSVSSSQVSPA
>SWS2_Uta      RYIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRA VAKQQEQSATTQKAER----EVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPFDVRLATIPSVFSKASSVYNPVIYVFMNKQ FRSCMLKLVF CGKSPFGDEDDVSGSS   QTTQVSSVSSSQVSPA
>SWS2_Xeno     RYIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHA VAKQQEQSATTQKAER----EVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELFDLRMSSVPSVFSKASTVYNPFIYIFMNRQ FRSCMMKMIF CGKNPLGDDEETSVSG    STQVSSVSSSQIAPS
>SWS2_Danio    RYIPEGLQCSCGPDWYTTNNKFNNESYVMFLFCFCFAVPFSTIVFCYGQLLITLKL AAKAQADSASTQKAER----EVTKMVVVMVFGFLICWGPYAIFAIWVVSNRGAPFDLRLATIPSCLCKASTVYNPVIYVLMNKQ FRSCMMKMVF NKNIEEDEASSSS      QVTQVSSVAPEK
>SWS2_Takif    RYIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLKS AAKAQAESASTQKAER----EVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPFDLRLATIPACFSKASTVYNPIIYVVLNKQ FRSCMKKMLG MSGGDDEESSS          QSVTEVSKVSPS
>SWS2_Geotria  RYIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRA AAAQQQESASTQKAEK----EVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPFDLRLATIPSVFSKASTVYNPVIYIFLNKQ FRSCMMKTIF CGKNPLGDDEDATSTTT    QVSSVSTSQVAPA
>LWS_ornAna    RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRA VAKQQKESESTQKAEK----EVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCIMQL   FGKKVDDGSELSSTS    RTEVSSVSS  VSPA
>LWS_galGal    RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRA VAAQQKESESTQKAEK----EVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCILQL   FGKKVDDGSEVSTS     RTEVSSVSNSSVSPA
>LWS_anoCar    RYWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRA VAAQQKESESTQKAEK----EVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCIMQL   FGKKVDDGSELSSTS    RTEVSSVSNSSVSPA
>LWS_Lithoch   RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMLTCCIFPLAIIILCYLAVWMAIRA VAMQQKESESTQKAER----EVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPIIYVFMNRQ FRTCIMQL   FGKQVDDGSEVSTS     KTEV     SSVAPA
>LWS_Gastero   RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRA VAMQQKESESTQKAER----DVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ FRSCIMQL   FGKEVDDGSEVSTs     KTEV     SSVAPA
>LWS_Petromy   RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHS VAQQQKESETTQKAER----DVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSFHPIAAALPAYFAKGATIYNPIIYVFMNRQ FRNCILQL   FGKKVDDGSEVSSSS    RTEVSSVSNSSVSPA
>LWS_lamprey   RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHS VAQQQKESETTQKAER----DVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAFHPLTAALPAYFAKSATIYNPVIYVFMNRQ FRNCIMQL   FGKKVDDGSEVSSAS    RTEVSSVSNSSISPA
>LWS_Geotria   RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHT VAQQQKESETTQKAER----DVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCIMQL   FGKKVDDGSEVSSSA    RTEVSSVSNSSVSPA
>PIN_galGal    SYVPEGLRTSCGPNWYTGGSNNN--SYILSLFVTCFVLPLSLILFSYTNLLLTLRA AAAQQKEADTTQRAER----EVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIIIQPVLASLPSYFSKTATVYNPIIYVFMNKQ FQSCLLEMLC CGYQPQRTGKASPGTPGPHADVTAAGLRNKVMPAHPV
>PIN_Uta sta   SYVPEGLRTSCGPNWYTGGSGNN--SYIMALFVTCFALPLGMIIFSYASLLLTLRA VATQQKEVETTQQAEK----EVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVIQPALASLPSYFSKTATVYNPIIYVFMNKQ FRSCLLSTMS CGHRPRGAQETTPAMISIPQGPTSALQGSRNKVTPSA
>PIN_Phelsuma  SYVPEGLGTSCGPNWYMGGTNNN--SYIVALFVTCFALPLSMILFSYANLLLTLRA VAAQQKEQETTQRAEK----EVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSIQPGLASLPSYFSKTATVYNPIIYVFMNKQ FRSCLLNTVS CGRIPQTMPGTPATTAVRGGFVLTSEGRGNKVASTEL
>PIN_Podarcis  SYVPEGLRTSCGPNWYSGGSSNN--SYIMTLFVTCFAMPLSTILFSYANLLMTLRT VAAQQKEQETTQRAER----EVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAIRPALASLPSYFSKTATVYNPIIYVFMNKQ FRSCLLYKMS CGHRALSSQDTTPAGISLPGRLTTSASKGSRNQVSPS
>PIN_xenTro    SYVPEGLRTSCGPNWYTGGTNNN--SYIMALFLTCFIMPLSTIIFSYSNLLMALRA VAAQQKDSETTQRAEK----EVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVIEPTVASLPSYFSKTATVYNPIIYVFMNKQ FRNCLMTLLC CGRSFGDDETSSASGRTDVTSVSEAGGNKVTPA
>PIN_Bufo      SYVPEGLGTSCGPNWYTGGTNNN--SYILALFTTCFMMPLTTIIFSYSNLLLALRA VAAQQKESETTQRAER----EVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVIDPTLASMPSYFSKTATVYNPVIYVFMNKQ FRDCLTKLLC CGRNPFGEDETSTTSGRTDVTSVSEGGGNKVTPA
>CILL2_Platyn  SYRPEGLGTWCSISWQDRSMNTM--SYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE AADAQGAGTAGKAEKS-----IFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGLPIYAEVLPSLFAKSSQVYNPIIYVLMNKP 
>ENCEPH_braBel SYQFEGHSVGCSVNWVKHNVNNV--SYIITLMVTCFFVPMVVVCWSYACIWRTVRM SAEMKSEFGNPQNTGR----LVTTMVVVMIVCFLVCWTPYTVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ FREFLLARLRTFCCRQPRMLRVTPMDDNAHARLVGEGPSHAQQVIPSEEN
>ENCEPH_braFlo SYQFEGHNVGCSVNWVQHNPDNV--SYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM SSEAKPECGNSQNAGR----LVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ FREFLLARLQRVCCRQQAVPRVTPMDDNVHVRLGGEGPSQSQQFLPAGEN
>VAOP_galGal   SYTTSKIGTTCEPNWYSGAYNDR--SYIIAFFTTCFIVPLLVILVSYGKLLQKLRK VSNTQGRLRTARKPER----QVTRMVVVMIIAFLICWMPYAVFSILATAYPSIELDPHLAAIPAFFSKTATVYNPIIYVFMNKQ FRMCLIQMFK CSAIETAESNMNPTSERATLTQDKRDSQLSVMAVRST
>VAOP_anoCar   SYTTSKIGTTCEPNWYSGDYNDH--TFIITFFTTCFILPLLVILVSYGKLMRKLRK VSDTQGRLGTTRKPER----QVTGMVVIMILAFLICWSPYAAFSILVTACPSIELDPRLAAIPAFFSKTATVYNPVIYVFMNNQ FRKCLVQLFQ CSSQETMDANVNPISEKDTLTHTKHCGEMSTVAAHVI
>VAOP_xenTro   SYTTSKIGTTCEPNWYSGEMRDH--TYIITFLTTCFVFPLLVIFMSYGKLMRKLRK VSDTQGRLGSTRKPEK----EVTRMVVIMILAFLICWTPYAAFSILITAHPTIDLDPRLAAIPAFFAKTASMYNPIIYVYMNKQ FRRCLYQMFN INDPEAKESNLNPTSERGVLTRNNNGGEMLAIATHIT
>VAOP_Danio    SYTVSRIGTTCEPNWYSGNFHDH--TFIITLFSTCFIFPLGVIIVCYCKLIRKLRK VSNTHGRLGNARKPER----QVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHVDPRLAAIPAFVAKTAAVYNPIIYVFMNKQ FRKCLVQLLS CSKVTVVEGNNNQTTERAGMTSGSNTGEMSAIAARVS
>VAOP_Ruti     SYTVSKIGTTCEPNWYSGNFHDH--TFIIAFFITCFILPLGVIVVCYCKLIKKLRK VSNTHGRLGNARKPER----QVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHLDPRLAAAPAFFSKTAAVYNPVIYVFMNKQ FRKCLVQLLR CRDVTIIEGNINQTSERQGMTNESHTGEMSTIASRIP
>VAOP_Petrom   SYRPSMIGTTCEPNWYSGELHDH--TFILMFFSTCFIFPLAVIFFSYGKLIQKLKK ASETQRGLESTRRAEQ----QVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHLDPLLAAVPAFFSKTATVYNPVIYIFMNKQ FRDCFVQVLP CKGLKKVSATQTAGAQDTEHTASVNTQSPGNRHNIAL
>PPIN_anaCar   GYQMEGVMTSCAPDWANSDPINV--SYIICYFLFCFTIPFITILASYGYLIWTLRQ VAKVGLAQRGSTTKAEA---QVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYINPIIATIPMYMAKSSTFYNPIIYIFMNKQ FRDCLVRCLL CGRNPCASEQTDEDDLEVSTIAPAPSSRRGKVAPV
>PPIN_Xeno     SYELEGVMTSCAPNWYSADPVNM--SYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQ VAKLGVAEGGTTSKAEV---QVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHIDPIIATVPMYLTKTSTVYNPIIYIFMNKQ FQECVIPFLF CGRNPWAAEKSSSMETSISVTSGTPTKRGQVAPA
>PPIN_Icta     SYQLEGVMTSCAPNWYRRDPVNV--SYILCYFMLCFALPFATIIFSYMHLLHTLWQ VAKLQVADSGSTAKVEV---QVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYINPVIGTIPAYLAKSSTVFNPIIYIFMNRQ FRDYALPCLL CGKNPWAAKEGRDSDTNTLTTTVSKNTSVSPL
>PPIN_Danio    RLQLEGVRTSCAPDWYSRDLANV--SFIVCYFLLCFALPFSVIVYSYTRLLWTLRQ VSRLQVCEGGSAARAEA---QVSCMVVVMILAFLLTWLPYASFALCVILIPELYIDPVIATVPMYLTKSSTVFNPIIYIFMNRQ FRDRALPFLL CGRNPWAAEAEEEEEETTVSSVSRSTSVSPA
>PPIN_Oncorhy  SFELEGVRTSCSPNWYSREPGNM--SYIILYFLLCFAIPFSIIMVSYARILFTLHQ VSKLKVLEGNSTTRVEI---QVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHINPLIATVPMYLAKSSTVYNPIIYVFMNRQ FRDCAVPFLL CGLNPWASEPVGSEADTALSSVSKNPRVSPQs
>PPIN_lamp     SYELEGVRTSCAPDWYSRDPANV--SYITSYFAFCFAIPFLVIVVAYGRLMWTLHQ VAKLGMGESGSTAKAEA---QVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ FRDCAVPFLL CGRNPWAEPSSESATAASTSATSVTLASAPGQVSPS
>PARIE_Uta     SYGPEGVQTSCSIGWEERSWSNY--SYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG LNKKVEQLGGKSSPEEEF--RAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKQ FRDCAVEFIT CGQVVLTSPEEDISTSAIPVEGKGPCKINQVTPV
>PARIE_Anole   SYGPEGVQTSCSIGWEERSWNNY--SYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG LNKKVEQLGGKSNPEEEF--RAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKE FRECAVEFIT CGKVVLTSPEEDISTSAISDEGIAPCKINQVTPV
>PARIE_Xenop   SYGPEGVQTSCSIGWEERSWSNY--SYIISYFLTCFIIPVGIIGFSYGSILRSLHQ LNRKIEQQGGKTNPREEK--RVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYISPLAATLPTYFAKTSPVYNPIIYIFLNKQ FRTYAVQCLT CGHINLDSLEEDTESVSAQAENMLTPKTNQVAPA
>ENCEPH_braBel SYQLEGPKIGCSVAWEEHSWSNT--SYIVVLFITCLFAPLLIIVYSYYRLWHKVKQ GSRNLPAAMRKSSQKEQ---KIAMMVIVMITCFMVCWLPYGAMALVVTFGGERLISHTAAVVPSLLAKSSTCYNPVVYFAMNSQ FRRYFQDLLC CGRRLFDVSQSVVTGNTAMPRNNSQGFRKDDSDQKQD
>ENCEPH_homSap RYILDVHGLGCTVDWKSKDANDS--SFVLFLFLGCLVVPLGVIAHCYGHILYSIRM LRCVEDLQTIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRK FRRCLLQLLC FRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVT
>ENCEPH_monDom RYTLEIHGLGCSVDWKSKDPNDS--SFVIFLFFGCLMLPVGVMAYCYGHILYAIRM LRCVEELQTIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLVTPTVAIIASLFAKSSTAYNPIIYIFMSRK FRRCLLQLLC FRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVT
>ENCEPH_galGal RYTLEIHGLGCSMDWKSKDPNDT--SFVLLFFLGCLVAPVVIMAYCYGHILYAVRM LRCVEDFQTSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLVTPTVAIIPSFFAKSSTAYNPVIYIFMSRK FRQCLLQLLC FRLMRFQRIMKEPSGAGNVKPIRPIVMSQKVGDRPKKKVT
>ENCEPH_anoCar HYTLEIHGLGCSVDWQSKEPSDS--SFVLFFFLGCLAAPVGIMAYCYGHILHAIRM LRCVEDLQSIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLITPTVAIIPSFFAKSSTAYNPVIYIFMSRK FRRCLVQLFC VQFLRFKRTLKEQPAIESNKPIRPIVMSQKVGDRPKKKVT
>ENCEPH_xenTro LYTFETHKLDCSFEWTATDPKDT--AFVLLFFLACITLPLSIMAYCYGYILYEIQK LRSVKNIQNFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFITPTITVMPSLLAIASAAYNPVIHIFTIKK FRQCLVQLLP PINFHPPINPPINNFWRLLKNLNGRLAMKKVKPVLGKGRS
>ENCEPH1_Anoph AYVNEAANISCSVNWESQTANAT--SYIIFLFIFGLILPLAVIIYSYINIVLEMRK NSARVGRVNRAERRVT-------SMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFWRIRR SNGVAGQPDSNNTNNSNRDKESARHTAKEGL
>ENCEPH2_Anoph AYVQEAANISCSVNWESQTKNAT--TYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE NSARVGRINRAEQRVT-------SMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFSRVRN KGQQAAADQNTTTMQRELTKSSRDMVECSF
>ENCEPH2_Apis  SYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKK VRKRAGASGRREAKIT-------KMVALMITAFLLAWSPYAALAIAAQYFNAKPSATV-AVLPALLAKSSICYNPIIYAGLNNQ FSRFLKKIFD ARGSRTAVPDSQHTALTALNRQEQRK
>CILL1_Platyn  HYIPEGLATWCSIDWLSDETSDK--SYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK VAKTGGSVAKAEREVL-------RMTLLMVSLFMLAWSPYAVICMLASFGPKDLLHPVATVIPAMFAKSSTMYNPLIYVFMNKQ FRRSLKVLLG MGVEDLNSESERATGGTATNQVAAT
>PER_homSap    SYAPDPTGATCTINWRKNDRSFV--SYTMTVIAINFIVPLTVMFYCYYHVTLSIKH HTTSDCTESLNRDWSD--QIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK FRRAMLAMFK CQTHQTMPVTSILPMDVSQNPLASGRI
>PERa_Bran     EYALEPSGTACTINFQKNDSLYI--SYVTSCFVLGFVVPLAVMAFCYWQASCFVSK VLKGDIAGDLTFPVAAN.QNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADIPAWLTLLPPLIAKSSALYNPIIYIIANRR FRNAICSMMK GQDPDVEDDEHADEHRVRSIEDNDKEIISMVNLNMTV
>PERc_Bran     YTYETPMQITCSLDWNVQHPGEK--AYIAAVLVIVYVLQVLIMCFCYFNIIFKSAN LKFAALASEKTKMAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDILSAE-DLPIIATILPSLFAKSSCMMNPIIYACCNTK FRQAAVKSFR KLCGMCKQKVPLSTPQVVLAMQRNTEFTSTVEPT
>NEUR_homSap   DYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKS SSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYK FACCQTGGLK ATKKKSLEGFRLHTVTTVRKSSAVLEIHEEWE
>MEL_Platy     AYIPEGFQTSCTYDYLTQDMNNY--TYVLGMYLFGFIFPVAIIFFCYLGIVRAIFA HHAEMMATAKRMGAN...EIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIH.AEIPVMMAKASARYNPIIYALSHPK FRAEIDKHFPWLLCCCKPKPKAQLPSSTTKGSIASKTEADTSV
>MEL1_homSap   AYVPEGLLTSCSWDYMSFTPAVR--AYTMLLCCFVFFLPLLIIIYCYIFIFRAIRE TGRALQTFGACKGNG.QSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPK YRVAIAQHLP CLGVLLGVSRRHSRPYPSYRSTHRSTLTSHTSNL

>PPINa_Ciona   YDTEGLGTSCAPNWFVKEKRERL--FIILYFVFCFVIPLAVIMICYGKLILTLRQ IAKESSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQ IDYALGAAPAFFAKTATIYNPLIYIGLNRQ FRDCVVRMIF NGRNPWVDELVGSQVSSTGSQLTAVSSNKVAPA
>PPINb_Ciona   GYVPEGLGTSCAPNWFSKNKSER--IFIFVYFVFCFFIPLLVIIICYGKIVLFLKQVSLY ATRQSSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQ LDYGLGAVPVFFAKTANIYNPLIYIGLNKQ FRDGVIKMVF RGRNPWAEEMSTQQRQRSTEAGQPIVSNEV
>PPIN2_Ciona   SVIWHTPGLFFWNGYEPEGFGTS--CAPNWFSQQKRERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKVSVSKKSMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPD FRDGVIRMLF KGRNPWLDGRNTTSSTSTRAQ


Structural and functional markers along the opsin molecule:

>RHO1_homSap rhodopsin                 <----------TM1--------->    c1     <----------TM2------->    x1      <c--i------TM3--------->           c2       <----------TM4-------->               x2         <----------TM5------c->     c3
MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 
AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0
           c3       <----------TM6------->    x3      <--------b-TM7---gprot>  helix8    palm cyto tail