Opsin evolution: alignment: Difference between revisions

This shows an alignment of 157 opsins, almost all of ciliary type. It needs some trimming of unalignable ends to fit better. Fragmentary sequences of course score low and so fall to the bottom -- that's still useful because these fragments represent two important clades, jawless fish and chondrichthyes. Notice the numerous invariant (blue) and nearly invariant sequences (red) -- these anchor the alignment with near-certainty. Opsins have seven alpha-helical sections traversing the cell membrane, with the intervening sequence alternating between cytoplasmic and extracellular. Among other things, the alignment shows that the opsin classifier has properly named the opsins -- each ortholog classifies together as expected from its name. Deletions and insertions show up clearly on the alignment and are readily resolved as to type using the phylogenetic ordering to establish the ancestral condition.
'''See also:''' [[Opsin_evolution|Curated Sequences]] | [[Opsin_evolution:_ancestral_introns|Ancestral Introns]] | [[Opsin_evolution:_informative_indels|Informative Indels]] | [[Opsin_evolution:_ancestral_sequences|Ancestral Sequences]] | [[Opsin_evolution:_Cytoplasmic_face|Cytoplasmic face]] | [[Opsin_evolution:_update_blog|Update Blog]]


This section provides an alignment of 230 opsins, mostly of ciliary and rhabdomeric type. It could be extended to the full set of 420 curated reference sequences using [http://bioinfo.genopole-toulouse.prd.fr/multalin/ MultAlin] or [http://npsa-pbil.ibcp.fr/cgi-bin/align_multalin.pl similar tools] that allow precise control of formatting and color, but an alignment with that many sequences becomes unwieldy.


The N- and C-termini have been trimmed away because they are generally unalignable and uninformative [[Opsin_evolution:_Cytoplasmic_face|outside]] a narrow gene class. Fragmentary sequences are mostly not shown: they score low and so fall to the bottom. (Keeping them can still be useful, as two important clades, jawless fish and chondrichthyes, are largely represented by fragments.) Notice the numerous invariant (red) and nearly invariant (blue) residues -- these anchor the alignment with near-certainty. Some of them are not specific to opsins but are properties of GPCR signaling proteins generally.
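
Invariant and nearly invariant columns can also be picked out directly from the text form of the alignment. The following is a minimal Python sketch, not the coloring rule actually used for the figure: the function names, the set of gap characters, and the thresholds are assumptions made for illustration, and the three rows are short excerpts copied from the RHO1_homSa, RHO2_galGa and SWS2_ornAn lines.

```python
from collections import Counter

GAP_CHARS = "-. "  # assumed gap/padding characters in the text alignment


def column_conservation(rows):
    """For each column, return (most common residue, fraction of rows carrying it)."""
    width = max(len(row) for row in rows)
    padded = [row.ljust(width) for row in rows]  # pad ragged rows with spaces
    summary = []
    for column in zip(*padded):
        residues = [c for c in column if c not in GAP_CHARS]
        if not residues:
            summary.append((None, 0.0))  # column holds only gaps
            continue
        residue, count = Counter(residues).most_common(1)[0]
        summary.append((residue, count / len(rows)))
    return summary


def conserved_columns(rows, threshold=1.0):
    """Return 0-based column indices whose top residue reaches the threshold."""
    return [i for i, (_, fraction) in enumerate(column_conservation(rows))
            if fraction >= threshold]


# Excerpts of 13 columns each, taken from three rows of the alignment.
example = ["KK-LR-TPLNYIL", "KK-LR-QPLNYIL", "KK-LR-SHLNYIL"]
print(conserved_columns(example))                 # strictly invariant columns
print(conserved_columns(example, threshold=0.6))  # illustrative looser cutoff
```

With the full alignment, a threshold slightly below 1.0 would correspond to "nearly invariant"; the right cutoff depends on how many fragmentary rows are included, since gaps count against a column here.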


Opsins have seven alpha-helical sections traversing the cell membrane, with the intervening sequence alternating between cytoplasmic and extracellular. Certain key residues, such as the lysine to which the retinal is covalently bound, the counterions, and recognition sites diagnostic for binding of other proteins, require markup that will be added shortly.


Among other things, the alignment by [http://bioinfo.genopole-toulouse.prd.fr/multalin/multalin.html MultAlin] shows that the <span style="color: #990099;">Opsin Classifier</span> has properly named the opsins -- each sequence groups just as expected from its name. Deletions and insertions show up clearly on the alignment and are readily resolved as to type using the known phylogenetic topology to establish the ancestral condition. The alignment also exhibits some anomalies where the sequence in question needs re-evaluation at the primary data source (cDNA and/or genome).
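
Gap runs shared by several rows are the raw material for spotting such indels. The sketch below is an illustration only: the function names are hypothetical, and the three excerpts are 13-column windows copied from the RHO1_homSa, PIN_galGal and VAOP_galGa rows, so the offsets are relative to the excerpt and not to the full alignment. Deciding whether a shared gap is an insertion or a deletion still requires the phylogeny, which this code does not attempt.

```python
import re
from collections import defaultdict


def gap_runs(row):
    """Return (start, length) for every run of '-' in one aligned row (0-based)."""
    return [(m.start(), len(m.group())) for m in re.finditer(r"-+", row)]


def shared_gaps(rows):
    """Map each gap run to the sorted names of the rows that carry it."""
    carriers = defaultdict(list)
    for name, row in rows.items():
        for run in gap_runs(row):
            carriers[run].append(name)
    return {run: sorted(names) for run, names in carriers.items()}


# The same 13-column window from three rows of the alignment.
excerpt = {
    "RHO1_homSa": "DYYTLKPEVNNES",
    "PIN_galGal": "NWYTG--GSNNNS",
    "VAOP_galGa": "NWYSG--AYNDRS",
}
print(shared_gaps(excerpt))
```

Here the two-residue gap is carried by the pinopsin and VA opsin rows but not by rhodopsin, which is the kind of pattern the phylogenetic topology is then used to interpret.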


MultAlin is apparently the only alignment software that allows line width to be specified. That is important here because it enables the entire alignment to be seen in a single window. The numbering scheme allows specific residues and regions to be discussed. Colored text output was also an option (it allows copy and paste of specific residues), but the file with color markup is again huge and awkward to display tightly within genomeWiki.
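
If a different line width is wanted after the fact, the text alignment can be re-wrapped locally. This is a minimal sketch under stated assumptions, not a reproduction of MultAlin's own output format: `wrap_alignment`, its `name_width` parameter, and the "columns" header line are all hypothetical names chosen for the example.

```python
def wrap_alignment(rows, width, name_width=12):
    """Split aligned rows into blocks of `width` columns, each with a column range header."""
    length = max(len(seq) for seq in rows.values())
    blocks = []
    for start in range(0, length, width):
        end = min(start + width, length)
        # Header gives the 1-based column range covered by this block.
        lines = [f"{'columns':<{name_width}}{start + 1}-{end}"]
        for name, seq in rows.items():
            lines.append(f"{name:<{name_width}}{seq[start:end]}")
        blocks.append("\n".join(lines))
    return blocks


# Toy rows; real rows would be read from the text alignment.
toy = {"seqA": "ABCDEFGHIJ", "seqB": "KLMNOPQRST"}
for block in wrap_alignment(toy, width=4):
    print(block)
    print()
```

Setting `width` to the full alignment length gives one block per page, which matches the single-window layout described above.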






[[Image:Opsin align.png]]
 
Here are those sequences aligned, after trimming of unalignable regions and removal of anomalous and fragmentary sequences. They are given as text, which can be searched by motif using the web browser's text search, unlike the graphic above (which is, however, more conveniently colored).
<pre>
Consensus  ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P.  ..........a.......W.....w...Pl.  GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... .    .................E.....m...m!..F...W.PYa..............  .p.....P..fAK.s..%NP!IY......%R...................
RHO1_homSa  QFSMLAAYMFLLIVLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLA--GWS-RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_monDo  QFSCLAAYMFMLIVLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIIGVAFTWVMALACAFPPLI--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNF--GPIFMTIPAFFAKSSSVYNPVIYIMMNKQFRTCMITTL--CCGKNPLGDD
RHO1_bosTa  QFSMLAAYMFLLIMLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLV--GWS-RYIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDF--GPIFMTIPAFFAKTSAVYNPVIYIMMNKQFRNCMVTTL--CCGKNPLGDD
RHO1_ornAn  QYSVLAAYMFMLIMLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTG-CNIEGFFATLGGEIALWSLVVLAIERYIVVCKPM---SNFRFGENHAIMGVAFTWIMALACALPPLV--GWS-RYIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTVPAFFAKSSAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_galGa  KFSALAAYMFMLILLGFPVNFLTLYVTIQHKK-LR-TPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTG-CYIEGFFATLGGEIALWSLVVLAVERYVVVCKPM---SNFRFGENHAIMGVAFSWIMAMACAAPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDF--GPIFMTIPAFFAKSSAIYNPVIYIVMNKQFRNCMITTL--CCGKNPLGDE
RHO1_xenTr  KYSALAAYMFLLILLGFPINFMTLYVTIQHKK-LR-TPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTG-CYIEGFFATLGGEMALWSLVVLAIERYVVVCKPM---ANFRFGENHAIMGVVFTWIMALSCAAPPLF--GWS-RYIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDF--GPVFMTVPAFFAKSSAIYNPVIYIVLNKQFRNCLITTL--CCGKNPFGDE
RHO1_neoFo  KYSALAAYMFFLILTGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTMYTAMNGYFVFGVVG-CNLEGFFATFGGIIALWCLVVLAIERYIVVCKPI---SNFRFGENHAIMGVVFTWIMALACAGPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFIVHFTIPLIIIFFCYGRLMCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVYIMVISYLVCWLPYASVSFYIFTHQGSDF--GPVFMTVPAFFAKTASVYNPVIYILMNKQFRNCMITTL--CCGKNPFGDE
RHO1_latCh  KYSALAAYMFFLILVGFPINFLTLFVTIQHKK-LR-TPLNYILLDLAVADLCMVFGGFFVTMYSSMNGYFVLGPTG-CNIEGFFATLGGQVALWALVVLAIERYVVVCKPM---SNFRFGENHAIMGVIFTWIMALSCAVPPLF--GWS-RYIPEGMQSSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKDA--AAQ------------Q-----QESATTQKAEKEVTRMVIVMVISFLVCWVPYASVAAYIFFNQGSEF--GPVFMTAPSFFAKSASFYNPVIYILLNKQFRNCMITTL--CCGKNPFGDE
RHO1_anoCa  QFSALAAYMFLLILLGFPINFLTLFVTIQHKK-LR-TPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVG-CNIEGFFATLGGEMGLWSLVVLAVERYVVICKPM---SNFRFGETHALIGVSCTWIMALACAGPPLL--GWS-RYIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKAA--AAQ------------Q-----QESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDF--GPVFMTIPAFFAKSSAIYNPVIYILMNKQFRNCMIMTL--CCGKNPLGDE
RHO1_petMa  KYSVLAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTM-CNFEGFFATLGGEMSLWSLVVLAIERYIVICKPM---GNFRFGSTHAYMGVAFTWFMALSCAAPPLV--GWS-RYLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTVPAFFAKTSALYNPIIYILMNKQFRNCMITTL--CCGKNPLGDE
RHO1_letJa  KYSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVALWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVAFTWIMALACAAPPLV--GWS-RYIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_geoAu  KFSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVSLWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVALTWVMALSCAAPPLL--GWS-RYLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_leuEr  MFSALAAYMFFLILTGLPVNFLTLFVTIQHKK-LR-QPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAG-CNFEGFFATLGGEVGLWCLVVLAIERYMVVCKPM---ANFRFGSQHAIIGVVFTWIMALSCAGPPLV--GWS-RYIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDF--TPFFMTVPAFFAKSSAVYNPLIYILMNKQFRNCMITTI--CLGKNPFEEE
RHO1_calMi  QFSILAAYMFFLIITCFPVNFLTLYVTFEHKK-LR-QPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTG-CNFEGFFATLGGEIGLWSLVVLAIERYVVVCKPM---SNFRFGTNHAIMGVAFTWVMALACAVPPLM--GWS-RYIPEGLQCSCGVDYYTLKPEINNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVIFFLICWVPYASVAFFIFTNQGSEF--GPIFMAVPAFFAKSSALYNPLIYILLNKQFRNCMITTL--CCGKNPFEED
RHO1_takRu  KYSLVAAYMLFLIITAFPVNFLTLFVTVKHKK-LR-TPLNYVLLNLAVADLFMVIGGFTVTLYTALHAYFVLGVTG-CNIEGFFATLGGEIALWSLVVLAVERYIVVCKPM---TNFRFGEKHAIAGLVFTWIMALTCATPPLL--GWS-RYIPEGMQCSCGIDYYTPKPEINNTSFVIYMFILHFSIPLAIIFFCYSRLLCTVRAA--AAL------------Q-----QESETTQRAEKEVTRMVIVMVISFLVCWVPYASVAWYIFANQGTEF--GPVFMTAPAFFAKSAALYNPVIYILLNRQFRNCMITTV--CCGKNPFGDD
RHO2_galGa  KYRLVCCYIFFLISTGLPINLLTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVG-CAVEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHAMMGIAFTWVMAFSCAAPPLF--GWS-RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADF--TATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_anoCa  KYKVVCCYIFFLIFTGLPINILTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIG-CAIEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHALMGISFTWFMSFSCAAPPLL--GWS-RYIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDF--SATLMSVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_neoFo  KYSIVCAYMFFLIITGLPINLLTLVVTFKHKK-LR-QPLNYILVNLAVADLFMVCFGFTVTFSTAINGYFIFGPRG-CAIEGFMATLGGEVALWSLVVLAIERYIVVCKPM---GNFRFSNNHSIIGIVFTWLAALSCAAPPLF--GWS-RYLPEGMQCSCGPDYYTMNPDYHNESFVIYMFVVHFFIPVIVIFVSYGRLICKVKEA--AAQ------------Q-----QESASTQKAEREVTRMVILMVIGFMTAWTPYATVAFWIFMNKGAEF--GATFMAAPAFFSKSSALYNPIIYVLMNKQFRNCMVTTL--CCGKNPFGDD
RHO2_latCh  KFSVLCAYMFLLIILGFPINFLTLLVTFKHKK-LR-QPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMG-CAMEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFASSHAIMGIAFTWIMALACAAPPLV--GWS-RYIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKEA--AAQ------------Q-----QESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEF--TATLMTVPAFFSKSSCLFNPIIYVLLNKQFRNCMITTL--CCGKNPLGDD
RHO2_gekGe  KFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKK-LR-QPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIG-CAIEGFFATIGGQVALWSLVVLAIERYIVICKPM---GNFRFSATHAIMGIAFTWFMALACAGPPLF--GWS-RFIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAF--SVTFMTIPAFFSKSSSIYNPIIYVLLNKQFRNCMVTTI--CCGKNPFGDE
RHO2_geoAu  MYSAISAYVFTLILIGFPVNFMTLFVTFKLKK-LR-QPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTG-CNIEGFFATLGGEVSLWSLVMLAIERYIVVCKPM---GNFRFATTHAALGVVFTWVMASACAVPPLV--GWS-RYIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKEA--AAQ------------Q-----QESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILF--SATAMTVPAFFSKSSVLYNPIIYVLLNKQFRTCMVTTL--FCGKNPFGED
SWS2_ornAn  IFMSLAAFMFLLITLGFPINLLTVICTIKYKK-LR-SHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTA-CKIEGFAATLGGMVSLWSLAVIAFERFLVICKPL---GNLSFRGTHAIFGCAATWVFGLAASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVF--DLRMASIPSVFSKASTIYNPIIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_utaSt  LFMGMAAFMFLLIILGVPINVLTIFCTFKYKK-LR-SHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFSFRGTHAIIGCIITWVFGLVASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPF--DVRLATIPSVFSKASSVYNPVIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_taeGu  IFKAMAAFMFLLVLLGVPINALTVLCTAKYKK-LR-SHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLA-CKIEGFTATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCAITWIFGLIASLPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPF--DLGLASIPSVFSKASTVYNPIIYVFMNKQFRSCMLKLV--FCGRSPFGDE
SWS2_neoFo  VFMVLSVFMFFLLITGIPINVLTIICTFKYKK-LR-SHLNYILVNLAVANLIVVGFGSTTAFYSFSQMYFAWGPLA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRSTHAIIGCVATWVFGLISSAPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFCFCFGFPLSVIIFSYGRLLMTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYTVFSLWVVTHRGESF--ELALGSIPAVFSKSSTVYNPLIYVFMNKQFRSCMMKLI--FCGKSPFGDE
SWS2_galGa  LFRAMAAFMFLLIALGVPINTLTIFCTARFRK-LR-SHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCVATWVLGFVASAPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRAV--ARQ------------Q-----EQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSF--EVGLASIPSVFSKSSTVYNPVIYVLMNKQFRSCMLKLL--FCGRSPFGDD
SWS2_xenTr  IFMSISAFMLFTIIFGFPLNLLTIICTVKYKK-LR-SHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLA-CKIEGFTATLGGIIGLWSLAVVAFERFLVICKPM---GNFTFRESHAVLGCILTWVIGLVAAIPPLL--GWS-RYIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHAV--AKQ------------Q-----EQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELF--DLRMSSVPSVFSKASTVYNPFIYIFMNRQFRSCMMKMI--FCGKNPLGDD
SWS2_geoAu  IFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKK-LR-SHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLF-CKMEGFTATLGGMLSLWSLAVLAFERCLVICKPF---GNIAFRGTHALIRCGFAWAAAIAASTPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRAA--AAQ------------Q-----QESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPF--DLRLATIPSVFSKASTVYNPVIYIFLNKQFRSCMMKTI--FCGKNPLGDD
SWS2_takRu  VFYGMSAFMFFLFVAGTGINVLTIACTIQYKK-LR-SHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLG-CKIEGFAATLGGMVSLWSLAVVAFERWLVVCKPL---GNFIFKPDHAIVCCIFTWFFALIISAPPLF--GWS-RYIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLK-S--AKA------------Q-----AESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPF--DLRLATIPACFSKASTVYNPIIYVVLNKQFRSCMKKML--GMSGGD   
SWS2_gasAc  TFYSLAFYMFFILIVGTFINALTVACTVQNKK-LR-SHLNYILVNLAVSNLLVSGVGAFTAFLSFAARYFVLGTLA-CKVEGFLATLGGMVSLWSLAVIAFERWLVICKPL---GNFIFKPDHALVCCAFTWVFALAASAPPLV--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVLFLFGFCFAVPFCTICFCYSQLLFTMKMA--AKA------------Q-----AESASTQKAEREVTRMVVLMVMGFLVCWMPYASFALWVVNNRGQTF--DLRFASIPSVFSKSSAVYNPVIYVLLNKQFRSCMMKML--GMGGGD   
SWS1_homSa  AFYLQAAFMGTVFLIGFPLNAMVLVATLRYKK-LR-QPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHV-CALEGFLGTVAGLVTGWSLAFLAFERYIVICKPF---GNFRFSSKHALTVVLATWTIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGL--DLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIMKMV---CGKAMTDES
SWS1_monDo  AFHFQTVFMGFVFCAGTPLNAVVLVATLRYKK-LR-QPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERFIVICKPF---GNFRFNSKHAMMVVLATWVIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFHACIMEMV---CRKPMTDDS
SWS1_anoCa  AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCTFSVFTVFMASSQGYFFFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYASLAMYMVNNRDHGL--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACILETV---CGKPMSDES
SWS1_utaSt  AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHI-CALEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSKHALLVVAATWFIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPMTDES
SWS1_taeGu  AFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKK-LR-QPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHM-CAFEGFAGATGGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGI--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACIMETV---CGRPMTDDS
SWS1_galGa  AFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKR-LR-QPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRV-CELEAFVGTHGGLVTGWSLAFLAFERYIVICKPF---GNFRFSSRHALLVVVATWLIGVGVGLPPFF--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPLTDDS
SWS1_neoFo  AFFLQAAFMGFVLFVGTPLNAIVLFVTIKYKK-LQ-QPLNYILVNISLAGFIFCFFGVFAVFIASCQGYFIFGKTV-CALEGFTGSVAGLVTGWSLAILAFERYLVICKPI---GNFRFGSKHSMIAVVAAWVIGVGVSIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIIPLFIICFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMIIVMVGSFCVCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSSFVYNPIIYCFMNKQFRACIMQTV---FGKPMTDDS
SWS1_xenLa  AFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKK-LR-QPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIA-CSIDAFVGTLTGLVTGWSLAFLAFERYIVICKPM---GNFNFSSSHALAVVICTWIIGIVVSVPPFL--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRAV--AAQ------------Q-----QESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGL--DLRLVTIPAFFSKSSCVYNPIIYSFMNKQFRGCIMETV---CGRPMSDDS
SWS1_geoAu  AFYLQAAFMGFVFICGTPLNAIVLVVTIKYKK-LR-QPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTI-CALEAFFGSLAGLVTGWSLAFLAAERYIVICKPF---GNFRFGSKHALVAVGLTWMLGLSVALPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRAV--AAQ------------Q-----QESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNI--DLRFVTVPAFFSKASCVYNPLIYSFMNKQFRACILETV---CGKPITDES
SWS1_danRe  AFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKK-LR-QPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTL-CSMEAAMGSIAGLVTGWSLAVLAFERYVVICKPF---GSFKFGQGQAVGAVVFTWIIGTACATPPFF--GWS-RYIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRAV--AAQ------------Q-----AESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNK--DYRLVAIPAFFSKSSSVYNPLIYAFMNKQFNACIMETV---FGKKIDESS
SWS1_oryLa  AFYLQAAFMGFVFFVGTPLNFVVLLATAKYKK-LR-VPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTL-CALEAAVGAVAGLVTSWSLAVLSFERYLVICKPF---GAFKFGSNHALAAVIFTWFMGVGCACPPFF--GWS-RYIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRAV--AAQ------------Q-----AESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENK--DYRLVTIPAFFSKSSCVYNPLIYAFMNKQFNGCIMEMV---FGKKMEEAS
LWS_homSap  VYHLTSVWMIFVVIASVFTNGLVLAATMKFKK-LR-HPLNWILVNLAVADLAETVIASTISVVNQVYGYFVLGHPM-CVLEGYTVSLCGITGLWSLAIISWERWMVVCKPF---GNVRFDAKLAIVGIAFSWIWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPF--HPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCILQLF--GKKVDDGS 
LWS_monDom  VYNLTSLWMVFVVIASIFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQIYGYFILGHPL-CVLEGYTVSLCGITGLWSLAIISWERWVVVCKPF---GNVKFDAKLAMVGIIFSWVWAAVWTAPPLF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMATCCIFPLSIILLCYVQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTLFACFAAANPGYSF--HPLTASLPAYFAKSATIYNPIIYVFMNRQFRTCILQLF--GKKVDDGS 
LWS_ornAna  AYNVTSLWMIFVVIASVFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPM-CVLEGYTVSLCGITGLWSLSIISWERWIVVCKPF---GNVKFDAKLAMVGIVFSWVWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS 
LWS_galGal  VYNLTSLWMIFVVAASVFTNGLVLVATWKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPM-CVVEGYTVSACGITALWSLAIISWERWFVVCKPF---GNIKFDGKLAVAGILFSWLWSCAWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS 
LWS_anoCar  VYNITSVWMIFVVIASIFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPM-CVLEGYTVSTCGISALWSLAVISWERWVVVCKPF---GNVKFDAKLAVAGIVFSWVWSAVWTAPPVF--GWS-RYWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS 
LWS_xenTro  VYNISSLWMIFVVLASVFTNGLVLVATLKFKK-LR-HPLNWILVNMAIADLGETVIASTISVCNQIFGYFVLGHPM-CILEGYTVSVCGIAALWSLTVIAWERWFVVCKPF---GNIKFDGKLAATGIIFSWVWAAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMLVLMITCCIIPLAIIVLCYMHVWLTIRQV--AQQ------------Q-----KESESTQKAEREVSRMVVVMIIAYIFCWGPYTFFACFAAFNPGYNF--HPLAAAMPAYFAKSATIYNPIIYVFMNRQFRNCIYQLF--GKKVDDGS 
LWS_takRub  VYNVATVWMFIVVVLSVFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYTVSTCGIAALWSLTIISWERWVVVCKPF---GNVKFDAKWATGGIVFSWVWAAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCIIPLAIIILCYLAVWLAIRSV--AMQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRVCIMKLF--GKEVDDGS 
LWS_gasAcu  VYNLSTLWMFIVVALSVFTNGLVLVATAKFKK-LQ-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYVVSVCGITALWSLTIISWERWIVVCKPF---GNVKFDAKWATAGIVFSWIWSAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRAV--AMQ------------Q-----KESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRSCIMQLF--GKEVDDGS 
LWS_petMar  VFNLTSVWMIIVVVLSLFSNGLVLVATVKFKK-LR-HPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIATILIVFSWVWPASWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSF--HPIAAALPAYFAKGATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS 
LWS_letJap  MFNLTSVWMIIVVVLSLFTNGLVLVATMKFKK-LR-HPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIAIILIVFSWVWPACWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAF--HPLTAALPAYFAKSATIYNPVIYVFMNRQFRNCIMQLF--GKKVDDGS 
LWS_geoAus  MYNLTSFWMIIVVILSLFTNGLVLVATLKFKK-LR-HPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPL-CVFEGFTVSVCGITALWSLAIISFERWMVVCKPF---GNLKFDGKVAIVLIIFSWAWSAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHTV--AQQ------------Q-----KESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS 
LWS_neoFor  VYNLTSLWMIFVVFASCFTNGLVLMATYKFKK-LR-HPLNWILVNLAIADLGETLIASTISVTNQIFGYFILGHPM-CMLEGFTVATCGITGLWSLTIIAWERWVVVCKPF---GNIKFDGKWAAGGIIFSWVWSAFWCAMPLF--GWS-RFWPHGLKTSCGPDVFSGEDKYGTRSFMIALMITCCIIPLGVIILCYIQVWWAIRTV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMIFAYCFCWGPYTFMACFGAAYPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIYQLL--GKKVDDGS 
PIN_galGal  TYVGVAVLMGTVVACASVVNGLVIVVSICYKK-LR-SPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRM-CELEGFMVSLTGIVGLWSLAILALERYVVVCRPL---GDFQFQRRHAVSGCAFTWGWALLWSTPPLL--GWS-SYVPEGLRTSCGPNWYTG--GSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRAA--AAQ------------Q-----KEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIII--QPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLLEML--CCGYQPQRTG
PIN_utaSta  IYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKK-LR-SPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTA-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFQQRHAVFGCVFTWMWSLVWTLPPLF--GWS-SYVPEGLRTSCGPNWYTG--GSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRAV--ATQ------------Q-----KEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVI--QPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLSTM--SCGHRPRGAQ
PIN_podSic  TYISVAVLMGLVVISATLVNGLVIVVSVQFKK-LR-SPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQAT-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFPARHAVLGCAFTWGWSFVWTVPPLL--GWS-SYVPEGLRTSCGPNWYSG--GSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRTV--AAQ------------Q-----KEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAI--RPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLYKM--SCGHRALSSQ
PIN_pheMad  VYTSLAALMGVVVLSASLANGLVIAVSVRFKR-LR-SPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTA-CRFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFQFQRRHAVIGCLYTWGWSLIWTVPPLF--GWS-SYVPEGLGTSCGPNWYMG--GTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRAV--AAQ------------Q-----KEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSI--QPGLASLPSYFSKTATVYNPIIYVFMNKQFRSCLLNTV--SCGRIPQTMP
PIN_xenTro  TFLTVAAVMCMVVILAFFVNGLVIVVTLKYKK-LR-SPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTM-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPM---GDFRFQQKHAILGCSFTWVWSFIWTSPPLF--GWC-SYVPEGLRTSCGPNWYTG--GTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRAV--AAQ------------Q-----KDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVI--EPTVASLPSYFSKTATVYNPIIYVFMNKQFRNCLMTLL--CCGRS-FGDD
PIN_bufJap  TYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKK-LR-SPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLV-CELEGFVVSLTGIVGLWSLAILAFERYIVICKPM---GDFRFQQRHAVMGCAFTWIWAFLWTSPPLI--GWC-SYVPEGLGTSCGPNWYTG--GTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRAV--AAQ------------Q-----KESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVI--DPTLASMPSYFSKTATVYNPVIYVFMNKQFRDCLTKLL--CCGRNPFGED
VAOP_galGa  HFRLVAAVMFVVTSLSLAENLAVILVTFKFKQ-LR-QPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYIVICRPV---GNMRLRGKHAAQGIAFVWTFSFIWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--AYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRKV--SNT------------Q-----GRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIEL--DPHLAAIPAFFSKTATVYNPIIYVFMNKQFRMCLIQMF--KCSAIETAES
VAOP_anoCa  NFHLISALMFVVTLFSLSENFTVILVTIKFKQ-LR-QPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYVVICRPL---GNMRLNGKHAALGVAFVWIFSFIWTVPPTM--GWS-SYTTSKIGTTCEPNWYSG--DYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRKV--SDT------------Q-----GRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIEL--DPRLAAIPAFFSKTATVYNPVIYVFMNNQFRKCLVQLF--QCSSQETMDA
VAOP_xenTr  NFHLLAALMFVVTSLSIAENFIVILVTAKFKQ-LR-QPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWA-CVLEGFAVTFFGIVALWSLSVLAFERYIVICRPL---GNLRLQGKHSALAIIFVWVFSFVWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--EMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRKV--SDT------------Q-----GRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDL--DPRLAAIPAFFAKTASMYNPIIYVYMNKQFRRCLYQMF--NINDPEAKES
VAOP_danRe  NYSVLAALMFVVTALSLSENFTVMLVTFRFQQ-LR-QPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWA-CVLEGFAVTFFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLVFVWSFSFIWTVPPVL--GWS-SYTVSRIGTTCEPNWYSG--NFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHV--DPRLAAIPAFVAKTAAVYNPIIYVFMNKQFRKCLVQLL--SCSKVTVVEG
VAOP_rutRu  NYKVLATLMFVVTAASLSENFAVMLVTFRFTQ-LR-KPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWA-CVLEGFAVTYFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLLFVWTFSFIWTIPPVL--GWS-SYTVSKIGTTCEPNWYSG--NFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHL--DPRLAAAPAFFSKTAAVYNPVIYVFMNKQFRKCLVQLL--RCRDVTIIEG
VAOP_takRu  NFTILAVLMFVVTSLSLCENFLVMFITFKFKQ-LR-QPLNYIIVNLAIADFLVSLTGGLISFLTNARGYFFLGRWA-CVLEGFAVTYFGIVAMWSLAVLSFERFFVICRPL---GNMRLQAKHAAIGLLFVWTFSFVWTFPPVL--GWN-RYTVSKIGTTCEPDWYSN--NMTSHSYIITFFSTCFILPLGIIFFCYGKLLRKLRKV--S--------------H-----GRLATARKPERQVTRMVVVMIVAFMVAWTPYATFAILVTIHPTIEL--DPRLASIPAFFSKTAAVYNPIIYVFMNKQFRKCLIQHF--IGMGVMAES
VAOP_petMa  NFTMLAALMGTITALSLGENFAVIVVTARFRQ-LR-QPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHA-CVLEGFAVTYFGVVALWSLALLAFERYFVICRPL---GNFRLQSKHAVLGLAVVWVFSLACTLPPVL--GWS-SYRPSMIGTTCEPNWYSG--ELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKKA--SET------------Q-----RGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHL--DPLLAAVPAFFSKTATVYNPVIYIFMNKQFRDCFVQVL--PCKGLKKVSA
PPIN_anoCa  GYTIIAIIMATSCTLSVILNTAVIAITIKYRQ-LR-QPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVG-CVTEGFAMAFFGIVALCTIAVIAVDRAIVIAKPM---GTITFTTRKAMIGVAVSWIWSLVWNTPPLF--GWG-GYQMEGVMTSCAPDWANS--DPINVSYIICYFLFCFTIPFITILASYGYLIWTLRQV--AKV------------G----LAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYI--NPIIATIPMYMAKSSTFYNPIIYIFMNKQFRDCLVRCL--LCGRNPCASE
PPIN_xenTr  GYTILALIMAVFCAAALFLNVTVIVVTFKYRQ-LR-HPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIALDRVFVVCKPM---GTLTFTPKQALAGIAASWIWSLIWNTPPLF--GWG-SYELEGVMTSCAPNWYSA--DPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQV--AKL------------G----VAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHI--DPIIATVPMYLTKTSTVYNPIIYIFMNKQFQECVIPFL--FCGRNPWAAE
PPIN_petMa  GFTILAVIMAVFTLASLVLNSTVIIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGITWAWLWSFVWNTPPLF--GWG-SYKLEGVRTSCAPDWYSR--DPANVSYIVSFFSFCFAIPFLVIVVAYGRLLWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVAKPGVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_letJa  GFTILAVIMAVFTIASLVLNSTVVIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGIAWAWLWSFVWNTPPLF--GWG-SYELEGVRTSCAPDWYSR--DPANVSYITSYFAFCFAIPFLVIVVAYGRLMWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_ictPu  GYTILSIIMALSSTFGIILNMVVIIVTVRYKQ-LR-QPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVG-CVLEGFAVAFFGIAGLCSVAVIAVDRYMVVCRPL---GAVMFQTKHALAGVVFSWVWSFIWNTPPLF--GWG-SYQLEGVMTSCAPNWYRR--DPVNVSYILCYFMLCFALPFATIIFSYMHLLHTLWQV--AKL------------Q----VADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYI--NPVIGTIPAYLAKSSTVFNPIIYIFMNRQFRDYALPCL--LCGKNPWAAK
PPIN_oncMy  GFTILAVIIGVFSVSGVCMNVLVIMVTMRHRK-LR-QPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLG-CVLEGFAVAFFGIAGLCSVAVIAVDRYVVVCRPM---GAVMFQTRHAVGGVVLSWVWSFLWNTPPLF--GWG-SFELEGVRTSCSPNWYSR--EPGNMSYIILYFLLCFAIPFSIIMVSYARILFTLHQV--SKL------------K----VLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHI--NPLIATVPMYLAKSSTVYNPIIYVFMNRQFRDCAVPFL--LCGLNPWAS
PPIN_danRe  GYTILAVIIGVFSVCGVILNVTVITVTLKYKQ-LR-QPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVG-CVLEGFAVAFFGIAALCSVAVIALERCMVVCRPV---GSISFQTRHAVFGVAVSWLWSFIWNTPPLF--GWG-RLQLEGVRTSCAPDWYSR--DLANVSFIVCYFLLCFALPFSVIVYSYTRLLWTLRQV--SRL------------Q----VCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYI--DPVIATVPMYLTKSSTVFNPIIYIFMNRQFRDRALPFL--LCGRNPWAA
PPINa_cioI  TYSFLCVYMTFVFLLSCSLNILVIVATLKNKV-LR-QPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTM-CQIEGYFVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHSIFGIVITWVWSMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--EKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINa_cioS  VYSFLAVYMTFICLISCSLNILVITATLKNKV-LR-QPLNYIIVNLAVVDLLSGLVGGVISIFANGAGYFFWGKFM-CQVEGYTVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHAVIGIAVTWIWAMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--GNTERLFIILYFVFCFLIPLAIIVLCYGKLILQLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVICWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINb_cioI  IYTILAVYMTFIFLLAVSLNGFVIIATMKNKK-LR-QPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTV-CILEGYIVSVAGVCGLMSISVMAFERYFVVCKPY---GPFTLTNTHAALGIGFTWTWSVLWSTPGLI--WLD-GYVPEGLGTSCAPNWFSK--NKSERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQA--TRQ------------------SSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQL--DYGLGAVPVFFAKTANIYNPLIYIGLNKQFRDGVIKMV--FRGRNPWAEE
PPINb_cioS  TYSGLCVFMSFVFVLAVPLNLLVIVATYKNKD-LR-RPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSV-CIFEGYVMSVTGVCGILSITVMAFERYFVVCKPF---GQTNLKWSHAITGIVFTWTWSVIWHTPGLF--FWN-GYEPEGFGTSCAPNWFSQ--QKSERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKV--SKK------------------SMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPDNLL--SYGIGSVPAFFAKTATIYNPIIYMGLNKKFRDGVIRML--FKGRNPWLDG
PARIE_utaS  GYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTKRGYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKQFRDCAVEFI--TCGQVVLTSP
PARIE_anoC  GYGVLAFLMFINALFSLFNNFLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTQRAYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKEFRECAVEFI--TCGKVVLTSP
PARIE_xenT  GYSILSFLMFLNAVFSICNNAIVILVTLKHPQ-LR-NPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQF-CIFQGFAVNYFGIVSLWSLTLLAYERYNVVCEPI---GALKLSTKRGYQGLVFIWLFCLFWAIAPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQL--NRK-----------IE----QQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYI--SPLAATLPTYFAKTSPVYNPIIYIFLNKQFRTYAVQCL--TCGHINLDSL
PARIE_takR  GYSILSFLMFINTVLSVFNNSLAIAVMLKNPS-LL-QPINIFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPR---AGLKLTMRRSIIGLLFVWTFCLFWAVTPLL--GWS-SYGPEGVQTSCSLAWEER--SWNNYSYLILYTLLCFIFPVGVIIYCYCKVLTSMNKL--NKS-----------VE----LQGGLSCRRENKHAINMVLAMIIAFFVCWLPYTALSVVVVVDPELHI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDATLEVL--SCSRYIPHAS
PARIE_gasA  GYSILSFLMFINTVLTVFNNVLVITVLVRNPS-LL-QPMNVFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CIFQGFAVNYFGLVSLCTLTLLSYERYNVVCRPR---NALKLSMRRSIHGLLIVWTFCLFWAVAPLF--GWS-GYGPEGVQTSCSLAWEER--SWSNYSYLVLYTLLCFIVPVAVIIYCYAKVLTSMNTL--NRS-----------VE----VQGGRSSQKENDHAVSMVLAMIIAFFSCWLPYTALSVVVVVDPTLYI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDAALEML--SCGRYIAHMP
PARIE_danR  GYSILSYLMFINTTLSVFNNVLVIAVMVKNLH-FL-NAMTVIIFSLAVSDLLIATCGSAIVTVTNYEGSFFLGDAF-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPM---AGFKLNVGRSCQGLLLVWLYCLFWAVAPLL--GWS-SYGPEGVQTSCSLGWEER--SWRNYSYLILYTLMCFILPTVIITYCYSNVLLTMRKI--NKS-----------IE----CQGGKNCAEDNEHAVRMVLAMIIAFFICWLPYTAISVLVVVNPEISI--PPLIATMPMYFAKTSPVYNPIIYFLTNKRFRESSLEVL--SCGRYISRET
CILI2_plaD  SYVITAIYLCIVGVIGTLSNGVIMYLYFKDKS-LR-SPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLGGLASEMNLFIISVERYLAVVRPF---DVGNLTNRRVIAGGVFVWLYSLVFAGGPLV--GWS-SYRPEGLGTWCSISWQ--DRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE----AA-----------DA----QGGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGL--PIYAEVLPSLFAKSSQVYNPIIYVLMNKPYRSALVSLV--CRGRNPFDEA
CILI1_plaD  DYNICAAYLFFIACLGVSLNVLVLVLFIKDRK-LR-SPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLGGLAALMTLSVIAFVRCLAVLRLG---SFTGLTTRMGVAAMAFIWIYSLAFTLAPLL--GWN-HYIPEGLATWCSIDWL--SDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK----VA-----------KT-------GGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLL--HPVATVIPAMFAKSSTMYNPLIYVFMNKQFRRSLKVLL--GMGVEDLNSE
ENCEPH_hom  TYERLALLLGSIGLLGVGNNLLVLVLYYKFQR-LR-TPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVG-CVWDGFSGSLFGIVSIATLTVLAYERYIRVVH------ARVINFSWAWRAITYIWLYSLAWAGAPLL--GWN-RYILDVHGLGCTVDWK--SKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVED-----------LQ----TIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLV--TPTISIVSYLFAKSNTVYNPVIYVFMIRKFRRSLLQLLCL         
ENCEPH_mon  TYELLALLIATIGLLGLCNNLLVLVLYYKFQR-LR-TPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVG-CAWDGFSNTLFGIVSIMTLTVLAYERYNRIVH------AKVINFSWAWRAITYIWLYSLVWTGAPLL--GWN-RYTLEIHGLGCSVDWK--SKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRMLRCVEE-----------LQ----TIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLV--TPTVAIIASLFAKSSTAYNPIIYIFMSRKFRRCLLQLLCF         
ENCEPH_gal  TYELLALLIATIGTLGVCNNLLVLVLYYKFKR-LR-TPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------AKVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-RYTLEIHGLGCSMDWK--SKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRMLRCVED-----------FQ----TSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLV--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRQCLLQLLCF         
ENCEPH_ano  TYELLALLVAAIGLLGLCNNLLVLVLYAKFKR-LR-TPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------ARVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-HYTLEIHGLGCSVDWQ--SKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRMLRCVED-----------LQ----SIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLI--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRRCLVQLFCV         
ENCEPH_gas  TYKLLAFAIGTIGVFGFCNNVVVIVLYCKFKR-LR-TPTNLLVVNISLSDLLVSVIGINFTFVSCIRGGWTWSRAT-CIWDGFSNSLFGIVSIMTLASLAYERYIRVVH------AQVVDFPWAWRAIGHIWLYSLVWTGAPLL--GWN-RYTLEIHRLGCSLDWA--SKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQMLRSIQD-----------LQ----TVQIIKILRYEKKVAVMFLLMISCFLLCWTPYAVVSMMEAFGRKNMV--SPTVAIIPSFFAKSSTAYNPLICVFMSRKFRRCLMQLLCS         
ENCEPH_xen  TYHFLALIVATVGFLGLVNNLLVLILYCKFKR-LQ-TPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEM-CVFHGFSKNLLGIVSFGTLTVVAYERYARVVY------GKYVNSSWSKRSITFVWVYSLAWTGFPLI--GWN-LYTFETHKLDCSFEWT--ATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQKLRSVKN-----------IQ----NFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFI--TPTITVMPSLLAIASAAYNPVIHIFTIKKFRQCLVQLLPPINFHPPIN 
ENCEPH4a_t  GNLVVSVFLGFIGTFGLVNNLLVLVLFCRYKM-LR-SPINLLLMNISISDLLVCVLGTPFSFAASTQGRWLIGEAG-CVWYGFANSLFGVVSLISLAVLSFERYSTMMTPT---EADPSNYCKVCLGITLSWVYSLVWTVPPLF--GWS-SYGPEGPGTTCSVNWT--AKTTNSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ---VSG------------------INASTSRKREQRVLCMVVIMVICYLLCWLPYGVVALLATFGPPDLV--TPEASIIPSVLAKSSTVINPIIYVFMNKQFYRCFLALL--CCQDPRSGSS
ENCEPH4b_t  GHLVVAVCLGFIGTVGFLSNFLVLALFCRYRA-LR-TPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLIGRAG-CVWYGFVNACLGIVSLISLAVLSYERYCTMVSST---IASNRDYRPVLGGICFSWFYSLAWTVPPLL--GWS-RYGPEGPGTTCSVDWR--TQTPNNISYIVCLFTFCLLLPFFVILYSYGKLLHTIRQ---VRR------------------VSSTVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLL--TPEATITPSLLAKFSTVINPFIYIFMNKQFYRCFRAFL--NCSTPKRDST
ENCEPH4_br  GYTAIATCLALIGFVGFTNNFVVILLIGCHRQ-LR-TPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANSLFGIVSLVTLSALAFERYCVVVR-----SSDMLTYKSSLVVITFIWLYSLLWTSLPLL--GWS-SYQFEGHNVGCSVNWV--QHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM---SSE------------A----KPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLV--TPTASVIPSLVAKSSTAYNPIIYVLMNNQFREFLLARLQRVCCRQ   
ENCEPH5_br  GFDTVAVVIAAIGIAGFLSNGAVVLLFLKFRQ-LR-TPFNMLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANHLFGLVSLISLAVISYERYRMVVKPK-GPGSSYLTYNKVGLAIIFIYLYCLLWTTLPIV--GWS-SYQLEGPKISCSVAWE--EHSLSNTSYIVAIFIMCLLLPLLIIIYSYCRLWYKVKK---GSQ-----------NL----PPAIRKSSQKEQKIARMVVVMITCFLVCWLPYGAMALVVSFGGESLI--SPTAAVVPSLLAKSSTCYNPLVYFAMNNQFRRYFQDLL--CCGRRLFDAS
PIN_stoPur  TYNYLTVYTGFLTIFGILNNGIVMILFARFPS-LR-HPINSFLFNVSLSDLIISCLASPFTFASNFAGRWLFGDLG-CTLYAFLVFVAGTEQIVILAALSIQRCMLVVRPF---TAQKMTHRWALFFISLTWIYSLIICVPPLF--GWN-RYTYEGPGTACSVAWN--SPSPGDTSYIIFIFVLVLVIPFGIIIFCYGLLVYAVKK---ISR------------T----QAALSSEAKADRKVSKMIFIMILFFLIAWTPYTGFSLYVTFGKNVVI--TPLAGTFPPFFAKLCTIHNPIIYFLLNKQFKDALIQLF--CCGENPFDRD
ENCEPH_api  MYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILW-TPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKM-CLWYAWFMSTLGFASIGNLTVMAVERWLLVARPM-----QALSIRHAVILASFVWIYALSLSLPPLF--GWG-SYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKK-----------------VR----K-RAGASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFNAK-P--SATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRT   
ENCEPH1_an  AYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSI-CVAYGFFMSLLGIASITTLTVLSYERFCLISRPF---AAQNRSKQGACLAVLFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK-----------------NS----A-RVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFWRIRRSNGVAGQPD 
ENCEPH2_an  AYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTL-CVAYGFFMSLLGITSITTLTVLSYERYCLISRPF---SSRNLTRRGAFLAIFFIWGYSFALTSPPLF--GWG-AYVQEAANISCSVNWE--SQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE-----------------NS----A-RVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFSRVRNKGQQA     
ENCEPH_aed  AYVASAVTLFFIGFFGFFLNLFVIALMCKDVQ-LW-TPINIILFNLVCSDFSVSIIGNPFTLTSAISRHWIFGRTV-CIAYGFFMSLLGITSITTLTVLSYERFCLISHPF---SSRSLSRRGAVFAILFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTLNATSYIIFLFVFGLVVPLVVIVYSYTNIVVNMKR-----------------NA----A-RVGRINRAEKRVTRMVFVMVLAFMIAWTPYAVFALIEQFGPTDII--SPALGVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRNNE       
ENCEPH_cul  AYVATAVVLFFIGFFGFFLNLFVIALMCKEVQVLW-TPMNIILLNLVCSDFSVSIVGNPFTLSSAISHRWLFGRKL-CVAYGFFMSLLGITSITTLTVLSYERFYLISRPF---SSRSLSRRGALGAVLLIWCYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--TQTLNATTYIIYLFVFGLVVPLTVIVYSYTNIIVNMKK-----------------NA----A-RVGRINRAEKRVTTMVAVMVIAFMVAWTPYSVFALMEQFGPPDVI--GPGLAVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRHDP       
ENCEPH_tri  GYIAAAVVLFCIGFFGFSLNLTVIIFMLKERQ-LW-SPLNIILFNLVVSDFLVSVLGNPWTFFSAINYGWIFGETG-CTIYGFIMSLLSITSITTLTVLAFERYLLIARPF---RNNALNFHSAALSVFSIWLYSLSLTIPPLI--GWG-EYVHEAANLSCSVNWE--EKSPNSTSYILYLFAFGLFLPLVIITFSYVNIILTMRR-----------------NA----AFRVGQVSKAENKVAYMIFIMIIAFLTAWSPYAIMALIVQFGDAALV--TPGMAVIPALLAKSSICYNPVIYIGLNAQVKGAKWVSGLIYLFQFQQ 
ENCEPHa_ne  EANIVLGYYIAIFVIGFVTNTIVVIIFISSQR-LH-TTPNLILFSMSVCDWLMATMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVVSPM----TNSFNGRRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICVSVLFFLIPIVTMTFCFASIYHTIRNLSHEAT-----------ARWGSDARATQETIRAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGDTHRI--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRCAGKALLYQEHH     
ENCEPHb_ne  EANIVLGYYIAIFVIGFVTNTIVVITFIFSKR-LH-TTPNLILFSMSVCDWLMAAMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVASPM----TNSLNERRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICISVLFFFVPIVTMTFSFASIYKAIRNISHEAI-----------ARWGSHARATQETIKAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGGTHRN--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRRAAKLLFIKKVIRPTEA 
ENCEPHc_ne    HAITVMYSLLAAGAFVLNGIVLIIFLATRS-LR-TIPNMILLSMAWADWLMACLADAVGAYANANNWPSMVGGL-CVYYGFITTALGLTSMIHLTALSVERFVTVTIPM----TRPITETQMLLVVTFLWAFSFLWAIFPLV--GWS-SYGPEPGYAACSIAWYR--QDLNNMSYILCLFMFFFFLPIVIMIACFSSIYFTVRKLTRDSM-----------RRWGASSDSTQQTLAAERKTAWMSFIMVLAFLFAWVPYAVVSLYASFGGVTTI--PKLMSTLPAMLAKTSACYNPIIYFFMYSKFRKAFQRFFFKNVITPSQT 
MOLL_PERc_  EFRIIGIFISICCIIGVLGNLLIIIVF-AKRRSVR-RPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIG-CKIYAFLCFNSGVISIMTHAALSFCRYIIICQYG--YR-KKITQTTVLRTLFSIWSFAMFWTLSPLF--GWS-SYVIEVVPVSCSVNW--YGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKN-------GIRAQQRY----TPRFIQDIEQRVTFISFLMMAAFMVAWTPYAIMSALAI--GSFNV--ENSFAALPTLFAKASCAYNPFIYAFTNANFRDTVVEIMAPWTTRRVGV 
PER2_strPu  GYLLTAIYLTIVGSIATVGNITVICVL-CRYRTFRKRSINLLLINMAASDLGVSVAGYPLTTVSGYWGRWLFGDVG-CQFYAFCVYTLSCSTISTHAAIAVYRYIYIVKTD--LR-PKLTANFTSGVIVVIWVYAFFWTVTPFV--GWS-SYIYEPFGTSCSVNW--VGRTISDISYMVACTIGVYLLQIFIMLYCYIRVAKKIRGVDPGRT-------EEKDAGVVVFGRLRKREAKIDTHVTKMCFMMMLTFIVVWAPYAVECLRAA--HVHRI--SALSSVLPTMFAKSSCMVNPIIFLTSSSKFRQDLGKLWSRPSSQDSL 
PER1_strPu  GYLLTALYLTLVGIVSTIGNITVLCVL-CRYGTFRKRSVNILLMNMAVSDLGVSVAGYPLTAISGYRGRWVFADIG-CQFSGFCVYALSCSTISTHAVVAIYRYIYIVKPY--HR-PRLSSSTSCLAILCIWTFTLFWTITPFF--GWS-SYTYEPFGTSCSINW--YGKSLGDLTYIICCVVFVFILPIIIMLYCYIGVAKKIKGIDPLRT-------EERDIAVV-FGRLRKHETKIDTRVTKICFMMMASFIVVWTPYAVGSIWAS--KIGKI--SASASVLPTMFAKSSCMINPIIFLTSSSKFRADLGKLWNRPSSLEHTI 
PER_homSap  EHNIVATYLIMAGMISIISNIIVLGIF-IKYKELR-TPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAG-CQVYAGLNIFFGMASIGLLTVVAVDRYLTICLPD--VG-RRMTTNTYIGLILGAWINGLFWALMPII--GWA-SYAPDPTGATCTINW--RKNDRSFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDC-----------TESL------NRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKI--PPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPV 
PER_monDom  EHKIVAAYLITAGVISIVSNVIVLGIF-VKYKALR-TATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDG-CQIYAGLNIFFGMASIGLLTAVAIDRYLTICQPD--LG-R-MTSYNYTLMILTAWVNGFFWALMPIV--GWA-GYAPDPTGATCTINW--RKNDVSFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNC-----------PDHI------NRDWSNQVAVTKMSVVMILMFLLAWSPYSIVCLWASFGDPKEI--PPAMAIVAPLFAKSSTFYNPCIYVAANKKFRRAISAMIRCQTHQSMPI 
PER_galGal  EHNIVAAYLITAGVISIFSNIVVLGIF-VKYKEFR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTG-CQIYAALNIFFGMASIGLLTVVAVDRYLTICRPD--IG-RRMTTRNYAALILAAWINAVFWASMPTV--GWA-GYASDPTGATCTANW--RKNDVPFVSYTMSVIAVNFVVPLTVMFYCYYNVSRTMKQYTSSNC-----------LESI------NMDWSDQVDVTKMSVVMIVMFLVAWSPYSIVCLWSSFGDPKKI--SPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILAMVRCQTRQEITI 
PER_xenTro  EHNIVAAYLITAGVISILSNIIVLGIF-VKYKELR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVG-CQIYAGLNIFFGMASIGLLTVVAIDRYLTICRPD--IGGRRISGRHYTAMILAAWINAVFWSVMPVV--GWS-SYAPDPTGATCTINW--RKNDVSFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSS-----------LGGI------NADWSDQTDVTKMSMVMIVMFLVAWSPYSIVCLWSSFGDPRKI--PPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILSMVQCKSRQEVTL 
PER_gasAcu  EHNIVAGYLITAGVISLFSNIVVLLMF-WKFKELR-TATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAG-CQIYAALNIFFGMASIGLLTVVAIDRYLTICRPD--IGGQKMTMQSYNLLILAAWLNAVFWSSMPVV--GWA-SYAPDPTGATCTINW--RQNDVSFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNC-----------LDSA------NIDWSDQMDVTKMSIVMIIMFLVAWSPYSIVCLWASFGDPKTI--PAPMAIIAPLFAKSSTFYNPCIYVIANKKFRRAIIGMVRCQTRQRITI 
PERa_braFl  DHLIVGLYLFVIGIIGTVENGITLATF-TKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRQD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYSLEPSGTACTINW--QKNDSLYISYVTSCFILGFALPLAVMMFCYWQASCFVNKVLKGDI-----------SGDLTFPVAVNVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFGNPADI--PAWITLLPPLIAKSSALYNPIIYIIANRRFRSAIFSMVKGQNPDVET 
PERa_braBe  DHLIVGLYLFVIGIIGTIENGITLATF-SKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYALEPSGTACTINF--QKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQASCFVSKVLKGDI-----------AGDLTFPVAANVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADI--PAWLTLLPPLIAKSSALYNPIIYIIANRRFRNAICSMMKGQDPDVEDD 
PERc_braFl  GYLASAVYLTITGLIAFVGNIFAIIVFLTE-KEFRKKEHNSFALNLAIADLSVCVFAYPSSTISGYAGEWMLGDVG-CTIYGFLCFTFSLTSMVTLCAISVYRYIVICKPQ--YA-HLLTHRRTNYVILGIWLYALVFSVPPLF--GVN-RYTYEPI-ITCSLDW--NVQHVGETIYTAAVIIIVYVLNVSIMCFCYFNIIFKSANLKFAAL-------ASEKTR--------TAAKKDIWKTSMMCLAMVVSFLIAWTPYAVSSTWDIL-TEEDL--PIIATILPTMFAKSSCMMNPIIYSCCNGKFRQAALKTFSK         
PERc_braBe  GYLASAIYITLTGLIAFFGNVITITVFLTE-KEFRKKQQNGFVLNLAIADLSVCVFAYPSSAIAGYAGRWVLGDVG-CTIYGFLCFTFALVSMVTLCVISIYRYILICKPQ--YA-HLLTHRRTVYVIIGTWLYALVFTVPPLV--GVK-RYTYEPMQITCSLDW--NVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIFKSANLKFAAL-------ASEKTK--------MAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDIL-SAEDL--PIIATILPSLFAKSSCMMNPIIYACCNTKFRQAAVKSFRKLCGMCKQK 
PERb_braFl  SATIMGVYLTIVGLVSTVGNATVVLMFMLKWRQLCRKAPNLLIINLAAVDLCISVFGYPFSASSGFANQWLFSDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMTRTILAVVGAWVYGISVAVPPLF--GIA-GYTYESFGLSCTIDF--HGTTVADMVYLSILIILCYVINVAVMGTCYFKIIRKFSKHRFREV-------RDVRTS--------HQHSFERGVT-LRCILMTLFYLISWTPYTAVAVWTMV-GPPP---PVQLGMVAALTAKTHCAFNPILYMLMSEVYRKLVLRTMCPCCFNKISN 
PERb_braBe  SATIMGVYLTIVGLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFGYPFSASSGYAHQWLFPDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMNRTVLAVIGTWLYAIAVAVPPLF--NIA-RYTYEPSGLSCTIDF--RVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRKFSRHRFRQV-------RDIRTS--------HQRSFEMGVT-MRCILMTLFYLLSWTPYTAVCIWTMV-GPPP---PVVVSMAAALIAKTHCAFNPILYAFMSEVYRKLVFRTMCPCCFNRISC 
NEUR_homSa  ADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLR--PAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLS--YG-VWLKRKHAYICLAAIWAYASFWTTMPLV--GLG-DYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKS-SSKEV-------AHFDSRIHSSHVLEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGRPDSI--PIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEG 
NEUR_monDo  ADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLVIIWAYATFWATMPLA--GLG-NYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQSSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRT 
NEUR_ornAn  ADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLAIIWAYASFWATMPLV--GLG-NYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQNSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKN 
NEUR_galGa  ADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLR--PAEIMTVNLAVCDLGISV-GKPFSIISFFSHRWIFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLA--YG-TWLKRHHAFICLALIWAYATFWATVPFA--GVG-SYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKS-STKEV-------AHYDTRIQNSHILEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGQPDSV--PIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLK 
NEUR_anoCa  ADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLK--PAEIMTVNLAVCDLGISV-GKPFSIIAFFSHRWIFGWSG-CRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLS--YG-TWLKRHHVFICLGIIWSYAAFWATIPFA--GFG-NYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQS-STKEV-------AHYDTRIQNQHVLEMKLTKV-------AMLICAGFMFAWIPYAVVSVWSAFGRPDSV--PIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSRY 
NEUR_xenTr  ADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLR--PAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNA-CRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLR--YG-TWLKRRHAFIALAVIWAYATLWATLPLV--GVG-NYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKS-SAKEV-------AHFDTRNQNNHTLEIKLTK--------AMLICAGFLIAWFPYAVVSVWSAFGQPDSI--PIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKD--KSLQNTTSRY 
NEUR_gasAc  ADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLK--PPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEG-CRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLR--YG-TWLKRQHAFLCLVFVWMYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKS-SAKEI-------SNFDARIKNSHNLEIKLTKTRNCATEDAMLICAGFLIAWIPYAVVSVVSAFGEPDSV--PISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFR 
NEUR_calMi              GLLSTLGNGYVIYLSITQKRKLK--PPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVG-CRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQ--YG-SWLQRRHVFMSLAFIWFYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKS-SAKEV-------AHFDSRIQNHHSLEMNLTK                                                                                         
MEL1_homSa  AHYTLGTVILLVGLTGMLGNLTVIYTFCRSRS-LR-TPANMFIINLAVSDFLMSFTQA-PVFFTSSLYKQWLFGETGCEFYAFCGALFGISSMITLTAIALDRYLVITRPL--ATFGVASKRRAAFVLLGVWLYALAWSLPPFF--GWS-AYVPEGLLTSCSWDYMS--FTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGRALQTFGAC------KGNGESLWQRQ-RLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVL--TPYMSSVPAVIAKASAIHNPIIYAITHPKYRVAIAQHLPCLGVLLGVS 
MEL1_monDo  AHYTIGATILAVGFTGVLGNLLVIYTFCR----LR-TPANMFIINLAISDFFMSFTQA-PVFFASSMYKRWIFGEKACEFYAFCGALFGITSMITLMAIALDRYFVITRPL--ASIGVISKKKTGFILLGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYTT--FTPSVRAYTMLLFCFVFFIPLIVIIYCYIFIFRAIQDTNKAVHSIGSG------ESTA-SPRHCQ-RMKNEWKMAKIALVVILLYVLSWAPYSTVALVAFAGYSHIL--TPYMNSVPAIIAKASAIHNPIIYAISHPKYRMAIAQNFPCLRALLCVR 
MEL1_xenTr  VHYVVGAVILAVGITGMLGNFLVIYAFCRSRS-LR-SPANMFIINLAITDFLMSVTQA-PVFFATSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIAVDRYFVITRPL--TSIGVMSKKRAVLILSGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNRAVQKIGTD------N-NKESHKQYQ-KMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAGYASIL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYIPCLGSLLRVK 
MEL1_galGa  AHYTIGTVILIVGITGTLGNFLVIYAFCRSRT-LQ-KPANIFIINLAVSDFLMSITQS-PVFFTNSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITKPL--ASVRVMSKKKALIILVGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYMT--FTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANKSVQTFGCK------HGNRELQKQYH-RMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAGYSHVL--TPFMNSVPAVIAKASAIHNPIIYAITHPKYRTAIATYVPCLGFLLRVS 
MEL1_calMi  AHYIIGATILAVGVTGMVGNFLVIYAFLRSRS-LR-TPANTFIINLAATDFLMSVTQS-PIFFITSIHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITRPL--ASIGVLSHRRAGLIILSLWLYSLAWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLGVIIYCYIFIFRAIKSTNKKVG----G------STNRESQKQHQ-RMKNEWKMAKIALIVILLFVISWSPYSTVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYVPLLGLLLRVS 
MEL1_danRe  AHYTIGAVILTVGITGMLGNFLVIYAFSRSRT-LR-TPANLFIINLAITDFLMCATQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLMVIAVDRYFVITRPL--ASIGVLSQKRALLILLVAWVYSLGWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFIPLIVIIYCYFFIFRSIRTTNEAVGKINGD-------NKRDSMKRFQ-RLKNEWKMAKIALIVILMYVISWSPYSTVALTAFAGYSDFL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLAIAKYIPCLRLLLCVP 
MEL1_takRu  AHYTIGSVILVIGITGMIGNFLVIYAFCRSRS-LR-TPANMFIINLAVTDLLMCVTQT-PIFFTTSMYKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRAFVILMTVWIYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FSPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRATNKAVGKVNGS--VHSHSRRRESVKNFQ-RLQNEWKMAKIALMVILLYVISWSPYSCVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLALAKYIPCLGFLLCIS 
MEL1_gasAc  AHYTIGSVILAIGITGIIGNVLVIYAFSKSRS-LR-TPANMFIINLAITDLLMCVTQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIALDRYFVITRPL--TSIGMMSRRRALLILMGAWTYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRVTNRAVGKMNGS--IHSHGSGRDSTKNFH-RLQNEWKMAKIALIVILLYVVSWSPYSAVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRIALAKYIPFLGVLLCVP 
MEL1_oryLa  AHYTIGSVILAIGITGIIGNFLVIYAFSRSRS-LR-TPANMFIINLAITDLLMCVTQS-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRALLILSAAWAYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYVFIFRAIRSTNRAVGKINGN--T------RDAVKSFN-RLQNEWKMAKIALIVILLYVISWSPYSTVALTAFAGYADML--TPYMNSIPAVIAKASAIHNPIIYAITHPKYRMALAKYIPGLGVLLCIH 
MEL1D_danR  AHYTIGSVILAVGITGMVGNLLVMYAFCKSRS-LR-TPANMFIINLAVTDFLMCVTQT-PIFFTTSLHKRWIFGEKGCELYAFCGALFGICSMITLMIIAVDRYFVITRPL--ASIGVMSRKRALLILSAAWAYSMGWSLPPFF--GWSGAYVPEGLLTSCSWDYMT--FSPSVRAYTMLLFTFVFFIPLFVIIYCYFFIFKAIRETNRAVGKINGE------GGPRDSIKKIH-RMKNEWKMAKIALIVILLYVISWSPYSCVALTAF--YADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRSAIAKYIPCLGVLLCVP 
MEL2_galGa  VLYTVGTCVLVIGSIGIIGNLLVLYAFYSNKK-LR-TPQNFFIMNLAVSDFLMSASQA-PICFVNSLHREWILGDIGCDLYAFCGALFGITSMMTLLAISVDRYLVITKPL--RSIQWTSKKRTIQIIAAVWLYSLGWSVAPLL--GWS-SYVPEGLMISCTWDYVT--YSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGRDVQKLGSC---------SRKSFLSQ-SMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAGRGNTL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIHNAVPCLRFLIRIS 
MEL2_xenLa  VLYTIGSFILIIGSVGIIGNMLVLYAFYRNKK-LR-TAPNYFIINLAISDFLMSATQA-PVCFLSSLHREWILGDIGCNVYAFCGALFGITSMMTLLAISINRYIVITKPL--QSIQWSSKKRTSQIIVLVWMYSLMWSLAPLL--GWS-SYVPEGLRISCTWDYVT--STMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGRNVQKLGSY---------GRQSFLSQ-SMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAGHGKSL--TPYSKTVPAVIAKASAIYNPIIYGIIHPKYRETIHKTVPCLRFLIREP 
MEL2_anoCa  VLYTVGSCVLVIGCIGITGNLLVLYAFYSNKR-LR-TPPNYFIMNLAVSDFLMSATQA-PICFLNSMHKEWVLGDIGCNLYAFCGALFGITSMITLLAISVDRYCVITKPL--QSIKRTSKKRTCIIIVFVWLYSLGWSVCPLF--GWS-SYIPEGLMISCTWDYVT--YSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR------------------RKSSISH-SIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS 
MEL2_tetNi  VHYIIAFFVFVIGILGITGNVLVIFAFYSNKK-LR-SLPNYFIVNLAVSDLLMASTQS-PIFFIN-LYKEWMFGETACKMYAFCGALFGITSMINLLAISVDRYVVITKPL--QTIRRSSKRRTALAILMVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLAIILCCYLLMFLAIRKTSRR----------------KSTLIQQK-SIRSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS 
MEL2_gasAc  AHYIVAVFVVVIGTLGITGNALVMLAVYSNKK-LR-NLPNYFIMNLAVSDFLMAFTQS-PIFFINCLYKEWAFGETGCKIYAFCGALFGIASMINLLAISIDRYLVITKPL--QAIHWGSKRRTTLAILLVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLGIILYCYLFMFLAIRKTSRR----------------KSTLIKQK-SMKSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---IL--SPYSKAVPAIIAKASAIYNPFIYAIIHNKYRMTLAAKFPCLRFLSPTP 
MEL2_danRe  VHYIIAFLILIIGTLGVSGNALVMFAFYRNKK-LR-SLPNYFIMNLAVSDFLMAITQS-PIFFINCLYKEWMFGELGCKIYAFCGALFGITSMINLLAISIDRYLVITKPL--QTIQWNSKRRTGLAILCIWLYSLAWSLAPLI--GWG-SYIPEGLMTSCTWDYVS--PSPANKSYTMMLCCFVFFIPLSIILYCYLFMFLSVRQASRQ----------------KSSFVKQQ-SMRSEWKLAKIAAVVIVVYVLSWAPYACVTLVAWAG----L--TPYSKTLPAVLAKSSAIYNPFIYAIIHNKYRATLAEKVPGLSCLSRSQ 
MEL1a_braF  AHYIVGTAVFCVGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVPEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAVFAKSSAVYNPIVYAITHPKFRAAVKKHIPCLSGCLPAD 
MEL1a_braB  AHYIVGTAVFCIGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVSEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAMFAKSSAVYSPIVYAITYPKFREAVKKHIPCLSGCLPAS 
MOLL_RHO_l  VYYSLGIFIAICGIIGCAGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPMMTISCFLKHWVFGQAACKVYGLIGGIFGLTSIMTMTMISIDRYNVIRRPM--SASKKMSHRKAFIMIVFVWIWSTIWAIGPIF--GWG-AYQLEGVLCNCSFDYIT--RDASTRSNIVCMYIFAFMFPIVVIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQSLLSWSPYAIVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAIASNFPWILTCCQ   
MOLL_RHO_s  VYYSLGIFIGICGIIGCTGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFIKKWVFGMAACKVYGFIGGIFGLMSIMTMSMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSTLWSIGPIF--GWG-AYVLEGVLCNCSFDYIT--RDSATRSNIVCMYIFAFCFPILIIFFCYFNIVMAVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQFLLSWSPYAVVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPLIYSVSHPKFREAIAENFPWIITCCQ   
MOLL_RHO_t  VYYSLGIFIGICGIIGCGGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFLKKWIFGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPM--AASKKMSHRRAFIMIIFVWLWSVLWAIGPIF--GWG-AYTLEGVLCNCSFDYIS--RDSTTRSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GANAEMRLAKISIVIVSQFLLSWSPYAVVALLAQFGPLEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQ   
MOLL_RHO_e  VYYSVGIFIGVVGIIGILGNGVVIYLFSKTKS-LQ-TPANMFIINLAMSDLSFSAINGFPLKTISAFMKKWIFGKVACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSIVWSVGPVF--NWG-AYVPEGILTSCSFDYLS--TDPSTRSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISMVIITQFMLSWSPYAIIALLAQFGPAEWV--TPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQ   
MOLL_MEL_p  WHYIIGVYITIVGLLGIMGNTTVVYIFSNTKS-LR-SPSNLFVVNLAVSDLIFSAVNGFPLLTVSSFHQKWIFGSLFCQLYGFVGGVFGLMSINTLTAISIDRYVVITKPL--QASQTMTRRKVHLMIVIVWVLSILLSIPPFF--GWG-AYIPEGFQTSCTFDYLT--KTARTRTYIVVLYLFGFLIPLIIIGVCYVLIIRGVRRHDQKMLTIT-R--S-MKTED--ARANNK-RARSELRISKIAMTVTCLFIISWSPYAIIALIAQFGPAHWI--TPLVSELPMMLAKSSSMHNPVVYALSHPKFRKALYQRVPWLFCCCK   
LOPH_RHO_p  WHYAVAAWMTFFGILGVSGNLLVVWTFLKTKS-LR-TAPNMLLVNLAIGDMAFSAINGFPLLTISSINKRWVWGKLWRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPL--GAAQTMTKKRAFIILTIIWANASLWALAPFF--GWG-AYIPEGFQTSCTYDYLT--QDMNNYTYVLGMYLFGFIFPVAIIFFCYLGIVRAIFAHHAEMMATA-K--R-MGAN---TGKADA-DKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIHPLLAEIPVMMAKASARYNPIIYALSHPKFRAEIDKHFPWLLCCCKPK 
RHAB_schMe  YHYLVGVYISIVGISGVLGNLLVLYIFARAKS-LR-TPPNMFIMSLAIGDLTFSAVNGFPLLTISSFNTRWAWGKLTCEIYGFIGGLFGFISINTMALISLDRYFVIAQPF--QTMKSLTIKRAIIMLVFVWLYSLIWSTPPFF--GYG-NYVPEGFQTSCTFDYLT--QSKGNIIFNIGMYIGNFIIPVGIIIFCYYQIVKAVRVHELEMLKMA-Q--K-MNASHPTSMKTGA--KKADVQAAKISVIIVFLYMLSWTPYAIIALMALTGRRDHL--NPYTAELPVLFAKTSAMYNPFIYAINHPKFRIQLEKKFPCLICCCPPK 
MEL1_schMa  YYYLVGIYIGIVGILAVMGNSLVITLFLLCKQ-LR-TPPNMLIVSLAISDFSFALINGFPLKTIAAFNHRWGWGKLACELYGFAGSIFGFISLTTMAFIALDRYLVIVQPF--ETFSRITYGKVIVMIFITWIWSALWSIPPFF--GYG-SYIPEGFHTSCTFDYLS--TDLPNLIFNAGLYILGFLCPVFIIIFSYYQIVKTVRLNELELMKMA-Q--S-LDLQNPSAMKTGG-DKKADIEAAKTSIILVLLYLMSWSPYAIVCLMTLIGSRDSL--TPFHSELPVLFAKTSAVYNPIVYAVKHPKFRMEIEKRFPFLICCCPPK 
MEL1_capCa  IYYGLGLYMAVVGIVGTLGNLVVITLFI--KS-LR-TPPNMFIINLALSDMGFCATNGFPLMTVASFQKLWRWGPVACELYALAGSITGFNSIATLALISMDRYMVIAKPF--YAMKHVSHKRSLIQIILAWTWAFIWSAPPLLRMGYG-RYIPEGFQVSCTFDYLS--RDLKNLIFVWCLFVFGFFIPVLAIACSYVGIIRAVGAQSKEMRKTA-E--K-MGAK---TGKSDK-EKKQDIAMAKVAAGTIGLFLMSWTPYAAVSMIGIAGNRSWI--TPYVSQIPVMFAKASAMWNPILYALSHPKFRAALEDHMPWLLVC     
MEL2_schMa  YQYAIGLFIAVVGITGMCLNLLVIVFFTMFKS-LR-TPSNILVVNLAISDFGFSAVIGFPLKTMAAFNNFWPWGKLACDLYGLAGGLFGFVSLSTIAAVALDRYLVIATPF--ESVFQTTPRRTLLLMLFLWMWSLMWTIPPLFGFG-K-RYVTEGYQTSCTMDYIS--TDLNNRLFNIGLFGFGFLCPLFLSLFCYARIILIVRSRGKDFIEMAAS--S-KGTNQKEKSANVS-SSKSDTFVSKSSAILLGVYLICWTPYSFVCLMALIGYADYI--TPLMVEIPCLCAKTA---NPCIYAFRYPKFRSLLQQRFGFLRLTKNRV 
MEL1_helRo  FYYFLGTFFAVVGFLGVFGNIIVVWVFSRTPS-LR-TPSNVLVINLAICDILFSALIGFPMSALSCFQRHWIWGNF-CQFYSFVAGITGLASINCLAVIAVDRYLVVGQPL--AMLNQSHFRRSFYHVLIIWTWACVWSAMPLI--GWG-EYILEGFGVSCTFDYLT--RTTWNISFNVCLFTFCFGMPVSVIILSYIGIIRSIAKNRKEFSSL--------------TAENSS-RARQEIKIAKVFAVCMTAFILCWVPYATVAQLGIYGYDQMV--SPYTAELPVMLAKTSALWNPIIYAFSHPKYRKCLKELPIF         
MEL1_strPu          MNAVTTALPHGLNKPTIEARWTKS-LR-TPPNMLIVNLAISDFGMVITN-FPLMFASTIYNRWLFGDAGCQFYAFCGALFGIMSIANMTAIALDRYYVICWSL--EAVRSVTHRRSMIIIIIVWCYAIFWSIPPFFGVG---SYVLEGYGLGCTFDFMT--KDLNHYLHVSFLFASSFVVPVTIIIVCFTRIAITVRAHRHELNKMRTKLTEDKDKKHKSSIRRAN-KAKTEFQIAKVGFQVTIFYVLSWMPYSIVAVIGQYFDSDLL--TPLGTVVPVIFAKCSAIWNPIIYCLSHEKFNAALKEKLMGMCGIEIPS 
MEL2_strPu  AFPLIGGYLLVVVLLGTAGNSLVIYTFLRFKK-LH-SPINLLIVNLSASDLLVATT-GTPLSMVSSFYGRWLFGTNACAFYGFVNYYCGCISLNSLAAISVFRYIIVVRGQ--AQNNKLSLRSSIYAILVIHLYTLIFSTPPLY--GWN-RFVLAGYHTSCDIDFHT--KTPLFVSYICYMFFFLFFLPLGLISWSYFKIYQRVSK--HSNSMRTSFTGVTKEINSDEKHANHR-------RTASTLFVTIVVFLFAWFPYCIVSLWVLIGDANSI--SKLSTTIPSLFAKSSVIYNPLIYVVLNSKFRKALIQTLSFLKCLSKHE 
MOLL_MEL_a  VHLSVGVFITLVGVLAVCGNSLVIITCIRFKD-LR-TRSNILIINLAVGDLLMCLI-DFPLLAAASFYGEWPYGRQVCQMYAFLTAIAGLVTINTLAVIAADRYWAVVRRP--TPGQKLPKCVTSIAVASVWAYSISWALCPIL--GWG-AYVLDGIRTTCTFDFLT--RTWENRSFVIGMMIGNFVLPFALMVFSYFRIWVAVRKVKSGNVFCAIRHNYNLALGSTLFVKQHRYRLHCEQKTVKIIMFLLIAFTVSWSPYLAVSIIGLFGDRSQL--TYQNTLTASLIAKTSMVFNPILYSISHPKVRKRIANLACCYSVRRHQQ 
MOLL_MEL_l  CQYTIGIFISTVAVIAVIGNSIVIWAHVRIKS-LS-TTSNMLILNLCVGCLIMCIV-DFPLYATSSFLQKWIFGHKVCEIYATITGTAGLLIMNSYSAIAFDRFITVTRYN--NPNYPRSKSATMCISGFVWIYSLSWSMAPVV--GWS-RYQLDGSGTTCTFDYLS--TTWTNRSFILSIAFFNFVLPLCFILFAYSRILHLISS--HSREMKSYRSAVIISKGKASIPKRFR----SERKTAITLLITVVVFCLSWVPYVIIALIGQFGNQSFI--TPQISVIPQLVAKLSTVTNPILYSLSHPVVRNKLFLRLRHELYRRPSD 
CHEL_LWS_l  WYSILGVAMIILGIICVLGNGMVIYLMMTTKS-LR-TPTNLLVVNLAFSDFCMMAFMMPTMTSNCFAE-TWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGM--AA-APLTHKKATLLLLFVWIWSGGWT-ILPF-FGWS-RYVPEGNLTSCTVDYLT--KDWSSASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQLREQAKK-----MNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIISWAGVFSSGTRL--TPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLACGSGE 
CHEL_LWS_i  WHSLLGFAMVILGVISVVGNSMVIYIMTTSKS-LR-SPTNMLVVNLAFSDWCMMAFMMPTMAANCFAE-TWILGPFMCEVYGMVGSLFGCGSIWSMVMITLDRYNVIVRGV--AA-APLTHKRAALMIFFVWFWALTWT-LLPF-FGWS-RYVPEGNMTSCTIDYLT--KALWSASYVVAYAGGVYWTPLFINIYCYSKIVRAVAQHEKQLRLQARK-----MNVASLRANAEQTKTSAEARLAKIALMTVGLWFMAWTPYLTIAWAGIFSDGSKL--TPLATIWGSVFAKANACYNPIVYGISHPKYRAALARRFPSLVCMPPGG 
INSE_LWS1_  WHKILGLVMIILGIMGWCGNGVVVYVFIMTPS-LR-TPSNLLVVNLAFSDFIMMGFMCPPMVICCFYE-TWVLGSLMCDIYAMVGSLCGCASIWTMTAIALDRYNVIVKGM--SG-TPLTIKRAMLQILGIWLFGLIWT-ILPL-VGWN-RYVPEGNMTACGTDYLS--QDWTFKSYILVYSFFVYYTPLFTIIYSYYFIVSAVAAHEKAMKEQAKK-----MNVTSLRSGDNQNTSA-EAKLAKVALTTISLWFMAWTPYLVINYIGIFNR-SLI--TPLFTIWGSLFAKANAIYNPIVYGISHPKYRAALKEKLPFLVCGSTED 
INSE_LWS2_  WHGILGFVIGMLGFVSAMGNGMVVYIFLSTKS-LR-TPSNLFVINLAISNFLMMFCMSPPMVINCYYE-TWVLGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLSINGALIRIIAIWLFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYFN--RGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINFSGIFNL-VKI--SPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLACAAEPS 
INSE_LWS_c  WHAILGFVIGILGMISVIGNGMVIYIFTTTKS-LR-TPSNLLVINLAISDFLMMLSMSPAMVINCYYE-TWVLGPLVCELYGLTGSLFGCGSIWTMTMIAFDRYNVIVKGL--SA-KPMTINGALLRILGIWFFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYLT--KDLLSRSYILVYSFFCYFLPLFLIIYSYFFIIQAVAAHEKNMREQAKK-----MNVASLRSAENQSTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-VKI--NPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFQRFPSLACSSGPA 
INSE_LWS_p  WHGLLGFTIGVLGFISITGNGMVVYIFTSTKS-LK-TPSNLLVVNLAFSDFLMMLCMAPPMLINCYYE-TWVFGPLACELYACAGSLFGSISIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRILGIWLFSLAWT-IAPM-LGWN-RYVPEGNMTACGTDYLS--KSWLSRSYILVYSIFVYYTPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFET-API--SPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYQKFPSLACQPSA 
INSE_LWS_m  WHALLGFTIGVLGFVSISGNGMVIYIFMSTKS-LK-TPSNLLVVNLAFSDFLMMCAMSPAMVVNCYYE-TWVWGPFACELYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPMTSNGALLRILGIWVFSLAWT-LLPF-FGWN-RYVPEGNMTACGTDYLS--KSWVSRSYILIYSVFVYFLPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFES-API--SPLATIWGSLFAKANAVYNPIVYGISHPKYQAALYAKFPSLQCQSAP 
INSE_LWS_v  WHGLLGFVIGILGFISITGNGMVIYIFTTTKS-LK-TPSNILVVNLAFSDFLMMCVMSPPMVVNCYTE-TWVFGPLACQLYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPLTINGAMLRVLGIWVFSLAWT-VAPL-FGWG-RYVPEGNMTACGTDYLD--KSWFNRSYILIYSIFCYFSPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-ATI--TPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYARFPALACQPSP 
INSE_LWS_t  WHGILGFVIGVLGFVSIVGNGMVIYIFSSTKA-LR-TPSNLLVVNLAFSDFLMMXCMSPAMVINCYNE-TWVLGPLVCELYGMSGSLFGCASIWTMTFIALDRYNVIVKGL--SA-QPLTKKGAMLRILIIWVFSTLWT-IAPF-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYAVWVYFVPLFTIIYSYWFIVQAVAAHEKSMREQAKK-----MNVASLRSSEAAQTSA-ECKLAKIALMTITLWFFAWTPYLVTNFTGIFEG-AKI--SPLATIWCSLFAKANAVYNPIVYGISHPKYRQALQKKFPSLVCAGEP 
INSE_LWS_s  WHGLLGFVIGVLGVISVIGNGMVIYIFSTTKS-LR-TPSNLLVVNLAFSDFLMMFTMSAPMGINCYYE-TWVLGPFMCELYALFGSLFGCGSIWTMTMIALDRYNVIVKGL--SA-KPMTNKTAMLRILFIWAFSVAWT-IMPL-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYSFFVYLLPLGTIIYSYFFILQAVSAHEKQMREQRKK-----MNVASLRSAEASQTSA-ECKLAKVALMTISLWFFGWTPYLIINFTGIFET-MKI--SPLLTIWGSLFAKANAVFNPIVYGISHPKYRAALEKKFPSLACASSS 
INSE_LWS_b  WHGILGFVIGLLGFISVSGNGMVVYIFLSTKS-LR-TPSNMFVINLAISDFLMMFCMSPPMVINCYYE-TWVLGPLFCQVYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLTINGALLRILGIWLFSLIWT-IAPM-FGWN-RYVPEGNMTACGTDYFS--KDIVSVSYILLYSIWVYFFPLFLIIWSYWFIXQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINWSGIFSL-VKI--SPLYTIWGSLFAKANAV                               
INSE_LWS_d  WFGIIGFVIAILGTMSLAGNFIVMYIFTSSKG-LR-TPSNMFVVNLAFSDFMMMFTMFPPVVLNGFYG-TWIMGPFLCELYGMFGSLFGCVSIWSMTLIAYDRYCVIVKGM--AR-KPLTATAAVLRLMVVWTICGAWA-LMPL-FGWN-RYVPEGNMTACGTDYFA--KDWWNRSYIIVYSLWVYLTPLLTIIFSYWHIMKAVAAHEKAMREQAKK-----MNVASLRNSEADKSKAIEIKLAKVALTTISLWFFAWTPYTIINYAGIFES-MHL--SPLSTICGSVFAKANAVCNPIVYGLSHPKYKQVLREKMPCLACGKDDL 
CRUS_LWS_m  WYGILAFVVTVVGLCSICGNFVVIWVFMNTKA-LR-SPANTLVVSLAVSDFIMMACMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGI--SG-TPLSQKNTTLQVLFVWICSIMWC-VFPF-FGWN-RYVPRGDMTACGTDYLT--EDEFSRSYLYVYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-ECRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKANAVYNPIVYAISHPKYRAALYKKLPCLACSTESA 
CRUS_LWS_n  WYGILAFVVTVVGLCSICGNFVVIWVIMNTKA-LR-SPANTLVVSLAVSDYIMMTCMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGV--SG-KPLSQKNATLQVLFVWICSIMWC-VFPF-FGWN-RYVPEGNMTACGTDYLT--EDEFSRSYLYIYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-GCRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKSNAVYNPIVYAISHPKYRAALYKKLPCLACSTESA 
INSE_MWS_d  WAKILTAYMIIIGMISWCGNGVVIYIFATTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLYFE-TWVLGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGM--AG-RPMTIPLALGKIAYIWFMSTIWCCLAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYLVINCMGLFKF-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVFGKVDD 
INSE_MWS_c  WAKFLAAYMVLIATISWCGNGVVIYIFSTTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLFYE-TWVLGPLMCDIYGGLGSAFGCSSILSMCMISLDRYNVIVKGM--AG-QPMTIKLAIMKIALIWFMASIWT-LAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYLPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYTIINTLGLFKY-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYGIALKEKCPCCVFGKVDD 
CRUS_LWS_c  MYPLLLVFMLITGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMITCYYH-TWTLGATFCEVYAFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILTVWVLSFTWC-VAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLAITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS                       
CRUS_LWS_p  MYPLLLIFMLFTGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMVTCYYH-TWTLGPTFCQVYGFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILIVWVLSLAWC-MAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLTITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS                       
INSE_LWS_h  WHGLLGFVIGVLGFISVTGNGMVVYIFTTTKS-LK-TPSNILVVNLAFSDFLMMFMMAPPMVINCYNE-TWVFGPLACQLYACAGSLYGCVSIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRVFGIWAFSLAWT-IAPL-FGWG-RYVPEGNMTACGTDYFD--QSFSNRSYILLYSIACYYAPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFKTMT                                                     
CRUS_LWS_e  WYGLLGFVIFCLGCLSVFGNSVVIWVFTSTKT-LR-SPANMLVVNLALSDFLMMANMSPPTVHSCYHG-TWMLGPTYCEYYALVGSLSGCISIWTMVWITLDRYNVIVKGV--AA-TPLTNKGAFARNIFSWLSALIWC-VSPL-YGWN-RYVPEGNMTACGTDYLT--DDWLSHSYLYAYTFWVYLFPFFIIVYCYTYIVSAVFAHEKGMRDQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALVTVSLWFIAWTPYCVINVTGMWDK-TKI--TPLFTIWGSL                                       
CRUS_LWS_a  WYGLLGFVIFCLGILSVCGNAVVIWVFMNTKS-LR-SPANLLVVNLAFSDFLMMLNMFPPMVHSCYHG-TWMLGAFFCEFYGFTGSLFGCISIWTMVFITMDRYNVIVKGV--AA-EPLTSKGASIRILFVWTVAFAWT-ILPF-FGWN-RYVPEGNLTACGTDYLT--EDSTSHLYLYMYASWAYYTPLLYIIYAYTFIVQAVSAHEKGMREQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALMTVSLWFMAWTPYMIINFTGMNDR-TKL--TPLCTIWGSL                                       
CRUS_LWS_h  WYGLLGFWMTVMGTLSVAGNFVVIWVFMNTKS-LR-TPANLLVVNLAISDFFMMLTMTPPLLANAYWG-TWILGAFFCEVYAFLGSFFGCVSIWSMVFITADRYNVIVKGV--SA-EPLTSGGAMMRIAGTWAFTLAWC-LPPF-FGWN-RYVPEGNMLACGTDYLT--ETELSRSYLYVYSVWVYLFPLAYIIYSYTFIVKAVAAHEKGMREQAKK-----MGVKSLRSEEAQKTSA-ECRLCKVALMTVTLWFMAWTPYFIINWGGMFNK-PMV--TPLFS                                           
CRUS_MWS_h  WHYLLGVVYLFLGVISIAGNGLVIYLYMKSQA-LK-TPANMLIVNLALSDLIMLTTNFPPFCYNCFSGGRWMFSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGF--NG-PKLTQGKATFMCGLAWVISVGWS-LPPF-FGWG-SYTLEGILDSCSYDYFT--RDMNTITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAAMRAQAKK-----MNVTNLRSNEAETQRA-EIRIAKTALVNVSLWFICWTPYAAITIQGLLGNAEGI--TPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCVHEKDP 
INSE_UVV_c  LHYLLAIVYILFTFVALFGNGLVIWIFCSAKS-LR-TPSNLFVVNLAFCDFMMMLKA-PIFIYNSFHT-GFATGHLGCQIFACMGSLSGIGAGMTNAAIAYDRYSTIARPL--DG-K-LSRGQVLLLIMLIWTYTIPWALMPLM-QVWG-RFVPEGFLTSCSFDYLT--DSQEIRYFVPTIFTFSYCVPMLLIIYYYSQIVGHVVSHEKALREQAKK-----MNVESLRSNVNTNAQSAEIRIAKAAITICFLFVLSWTPYGALAMIGAFGNRALL--TPGITMIPACACKFVACLDPYVYAISHPRYRLELQKRLPWLELQEKP 
INSE_UVV_a  LHYLLALLYILFTFLALLGNGLVIWIFCAAKS-LR-TPSNMFVVNLAICDFFMMIKT-PIFIYNSFNT-GFALGNLGCQIFAVIGSLTGIGAAITNAAIAYDRYSTIARPL--DG-K-LSRGQVILFIVLIWTYTIPWALMPVM-GVWG-RFVPEGFLTSCSFDYLT--DTNEIRIFVATIFTFSYCIPMILIIYYYSQIVSHVVNHEKALREQAKK-----MNVDSLRSNANTSSQSAEIRIAKAAITICFLYVLSWTPYGVMSMIGAFGNKALL--TPGVTMIPACTCKAVACLDPYVYAISHPKYRLELQKRLPWLELQEKP 
INSE_UVV_m  AHTALALLYIFFTFAALVGNGMVIFIFSTTKS-LR-TSSNFLVLNLAILDFIMMAKA-PIFIYNSAMR-GFAVGTVGCQIFALMGAYSGIGAGMTNACIAYDRHSTITRPL--DG-R-LSEGKVLLMVAFVWIYSTPWALLPLL-KIWG-RYVPEGYLTSCSFDYLT--NTFDTKLFVACIFTCSYVFPMSLIIYFYSGIVKQVFAHEAALREQAKK-----MNVESLRANQGGSSESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGNQQLL--TPGVTMIPAVACKAVACISPWVYAIRHPMYRQELQRRMPWLQIDEPD 
INSE_UVV_p  AHTMLALVYVFFTAAALIGNGLVIFIFSASKS-LR-TPSNLLVVQLAVLDFLMMLKA-PIFIYNSIKR-GFASGVIGCQIFAFMGSVSGTAAGLTNACIAYDRHSTITRPL--DG-R-LSRGKVLLMMVCVWLYTAPWAILPQL-QIWG-RYVPEGFLTSCTFDYLT--TTFDNKLFVASMFVCVYIFPMIAILYFYSGIVKQVFAHEAALREQAKK-----MNVDSLRSNQNAAAESAEIRIAKAALTVCFLYVASWTPYGVMSLIGAFGDQNLL--TPGVTMIPALACKGVACIDPWVYAISHPKYRQELQKRMPWLQIDEPD 
INSE_UVV_d  MHYMLGVFYIFLFCASTVGNGMVIWIFSTSKS-LR-TPSNMFVLNLAVFDLIMCLKA-PIFIYNSFHR-GFALGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPM--NR-N-MTFTKAVIMNIIIWLYCTPWVVLPLT-QFWD-RFVPEGYLTSCSFDYLS--DNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKALREQAKK-----MNVESLRSNVDKSKETAEIRIAKAAITICFLFFVSWTPYGVMSLIGAFGDKSLL--TPGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGVNEKS 
INSE_BLU_m  WHYVLALIYTMLMVTSLTGNGIVIWIFSTSKS-LR-SASNMFVINLAVFDLMMMLEM-PLLIMNSFYQ-RLVGYQLGCDVYAVLGSLSGIGGAITNAVIAFDRYKTISSPL--DG-R-INTVQAGLLIAFTWFWALPFTILPAF-RIWG-RFVPEGFLTTCSFDYFT--EDQDTEVFVACIFVWSYCIPMALICYFYSQLFGAVRLHERMLQEQAKK-----MNVKSLASNKEDNSRSVEIRIAKVAFTIFFLFICAWTPYAFVTMTGAFGDRTLL--TPIATMIPAVCCKVVSCIDPWVYAINHPRYRAELQKRLPWMGVREQDP 
INSE_BLU_a  FHIGLAIIYSMLLIMSLVGNCCVIWIFSTSKS-LR-TPSNMFIVSLAIFDIIMAFEM-PMLVISSFME-RMIGWEIGCDVYSVFGSISGMGQAMTNAAIAFDRYRTISCPI--DG-R-LNSKQAAVIIAFTWFWVTPFTVLPLL-KVWG-RYTTEGFLTTCSFDFLT--DDEDTKVFVTCIFIWAYVIPLIFIILFYSRLLSSIRNHEKMLREQAKK-----MNVKSLVSN-QDKERSAEVRIAKVAFTIFFLFLLAWTPYATVALIGVYGNRELL--TPVSTMLPAVFAKTVSCIDPWIYAINHPRYRQELQKRCKWMGIHEP   
INSE_BLU_d  YHAGFYIAFIVLMLSSIFGNGLVIWIFSTSKS-LR-TPSNLLILNLAIFDLFMCTNM-PHYLINATVG-YIVGGDLGCDIYALNGGISGMGASITNAFIAFDRYKTISNPI--DG-R-LSYGQIVLLILFTWLWATPFSVLPLF-QIWG-RYQPEGFLTTCSFDYLT--NTDENRLFVRTIFVWSYVIPMTMILVSYYKLFTHVRVHEKMLAEQAKK-----MNVKSLSANANADNMSVELRIAKAALIIYMLFILAWTPYSVVALIGCFGEQQLI--TPFVSMLPCLACKSVSCLDPWVYATSHPKYRLELERRLPWLGIREKHA 
MEL1b_braF  MQLVFGSMMLVFGLIGVVGNAVALYAFCRSRS-LR-RPKNYLIANLCLTDMVVCLVYSPIIVTRSLSH--GLPSKESCIVEGFVVGLGSIVSICSLAGIAVERYVTITQPI--KSLSILTHRALLGAVSAVWVYAFLLAFPPLV--GWG-RYVSEESKISCTFDYLS--TDDATRAHVIVLVIGAFGLPFSVITYCYVRSFATVRKCTKERKQM---------------SPLAKSDSRSEVKAAVNSFVITTSFCLCWCPYAVVATMGVSGFTV----HSHAVFIAALLAKLSVLFNPVAYVLSIPN                   
MEL1b_braB  MQLIFGSMMLVFGLIGVVGNVVALYAFCRTRS-LR-RPKNYVVANLCLTDMFVCLVYCPIVVSRSFSH--GFPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPL--KSLTILTQRKLLVAVLTVWVYSLLLAFPPLV--GWG-RYVREETYISCTFDYLS--TDDATRAYVITLVMGAFGFPLLTIAYCYIRVFTTARKHAEERKFM---------------SPLKRPESRTEIKTAVTACVITTSFCLCWCPYAVVATLGISGVSV----QQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASEDVV 
MEL_nemVec  WVIAQVVLWGCIFVISSLGNSLVLLCIVKSNR--LHSSIYAFYGSLAASDCIAGMLCCPLLLVTALHQLWIMGK-VMCHVYSTLLSTSLNASIATLCLISMDRLNAVRKPFEYRGHNTFTQRWCKWLLVLSWVHSIFWAAAPLG--GWG-EIITDSATYTCKPNWSA--ASIVNRSYSLCLALFPFAFPVFLMVAIYCVIYRHTKKCSNLMS----------GLEDGRNLVAEQERQMRERRLFRTVLIIIGAFAACWAIYTLATTCKLFIGQTP---PTWLVQLGLICAIAGSCVNPVIYTIRDATFARELGRLHPCLAWLLKQS 
LWS_nemVec  TSFTAIALLVIMLLTIIGNLMVCYVVLSNKR--LWTEMNMFLVNLAFGDLAVGLICMVFPLITAIKREWIFGRGILCQLNACCNSVLFCSTIFTHTVISIDRYIVIVHPMK----KIMTRKKAALMIVGVWVFSVFIVLGPVF--GWG-RMEYNASTLQCGFGFPR--DKMASM-YIVIVAIIAFIIPLLIMTYTYIRIYISVLEHTRRMS-------------ETATAMQQQAVFSAQKRIVFTFFIALLAFFACWAPFFSFIAFAVVVKNPHD-IPHGLGLASYVCGFINSACNPFIIGLRSKQFKSGFSRILCCCRGRDP   
Consensus  ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P.  ..........a.......W.....w...Pl.  GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... .    .................E.....m...m!..F...W.PYa..............  .p.....P..fAK.s..%NP!IY......%R...................
 
Structural and functional markers along the opsin molecule:
 
>RHO1_homSap rhodopsin                <----------TM1--------->    c1    <----------TM2------->    x1      <c--i------TM3--------->          c2      <----------TM4-------->              x2        <----------TM5------c->    c3
MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE
AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0
          c3      <----------TM6------->    x3      <--------b-TM7---gprot>  helix8    palm cyto tail
 
</pre>
 
'''See also:''' [[Opsin_evolution|Curated Sequences]] | [[Opsin_evolution:_ancestral_introns|Ancestral Introns]] | [[Opsin_evolution:_informative_indels|Informative Indels]] | [[Opsin_evolution:_ancestral_sequences|Ancestral Sequences]] | [[Opsin_evolution:_Cytoplasmic_face|Cytoplasmic face]] | [[Opsin_evolution:_update_blog|Update Blog]]
 
[[Category:Comparative Genomics]]

Latest revision as of 15:54, 23 March 2010

See also: Curated Sequences | Ancestral Introns | Informative Indels | Ancestral Sequences | Cytoplasmic face | Update Blog

This section provides an alignment of 230 opsins, mostly ciliary and rhabdomeric types. It could be updated to the full 420 curated reference sequences using MultAlin or similar tools that allow precise control of formatting and color, but that many sequences would become unwieldy.

N- and C-termini have been trimmed away because they are generally unalignable and uninformative outside a narrow gene class. Fragmentary sequences are mostly not shown: these score low and so fall to the bottom. (That can still be useful, as two important clades, jawless fish and chondrichthyes, are largely represented by fragments.) Notice the numerous invariant (red) and nearly invariant (blue) residues -- these anchor the alignment with near-certainty. Some of these are not specific to opsins but are rather properties of GPCR signaling proteins generally.
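
The idea of invariant and nearly invariant columns can be sketched in a few lines of Python. This is only an illustration working on plain-text alignment rows, not MultAlin's own consensus algorithm; the function names, the 90% cutoff, and the toy rows are assumptions made for the example.

```python
from collections import Counter

def column_conservation(rows, gap_chars="-. "):
    """For each column, return (most common residue, its fraction among non-gap rows)."""
    width = max(len(r) for r in rows)
    padded = [r.ljust(width) for r in rows]  # pad short (fragmentary) rows with spaces
    result = []
    for col in range(width):
        residues = [r[col] for r in padded if r[col] not in gap_chars]
        if not residues:
            result.append((None, 0.0))  # column contains only gaps
            continue
        residue, count = Counter(residues).most_common(1)[0]
        result.append((residue, count / len(residues)))
    return result

def classify(rows, high=0.9):
    """Mark each column: '*' invariant, '+' nearly invariant, '.' variable, ' ' all gaps."""
    marks = []
    for residue, frac in column_conservation(rows):
        if residue is None:
            marks.append(" ")
        elif frac == 1.0:
            marks.append("*")
        elif frac >= high:  # assumed cutoff for "nearly invariant"
            marks.append("+")
        else:
            marks.append(".")
    return "".join(marks)

# Toy rows (hypothetical, loosely modelled on the K-LR region of the alignment)
rows = ["KK-LR-TPLNY", "KK-LR-QPLNY", "KK-LR-SHLNY", "KS-LR-TPANL"]
print(classify(rows))  # prints "*. ** ...*."
```

With only four rows, a single deviating residue already drops a column below the 90% cutoff; on the full alignment the same cutoff would tolerate a few exceptions per column.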

Opsins have seven alpha-helical sections traversing the cell membrane, with the intervening sequence alternating between cytoplasmic and extracellular. Certain key features, such as the lysine where the retinal is covalently bound, counterions, and recognition sites diagnostic for binding of other proteins, require markups that will be added shortly.

Among other things, the alignment by MultAlin shows that the Opsin Classifier has properly named the opsins -- each classifies just as expected from its name. Deletions and insertions show up clearly on the alignment and are readily resolved as to type using the known phylogenetic topology to establish the ancestral condition. The alignment also exhibits some anomalies where the sequence in question needs re-evaluation at the primary data source (cDNA and/or genome).

MultAlin is apparently the only alignment software that allows line width to be specified. That's important here because it enables the entire alignment to be seen in a single window. The numbering scheme allows specific residues and regions to be discussed. Colored text output was also an option (it allows copy and paste of specific residues), but the file is again huge with color markup and awkward to display tightly within genomeWiki.
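
For readers who want to re-wrap the text alignment to a different line width, a minimal Python sketch follows. It is not part of MultAlin; the function name `format_blocks`, its parameters, and the two toy sequences are hypothetical, and it assumes the alignment is already held as a mapping from sequence name to gapped sequence.

```python
def format_blocks(alignment, width=60, name_width=12):
    """Lay out an alignment in blocks of `width` columns, each headed by its start column."""
    names = list(alignment)
    length = max(len(seq) for seq in alignment.values())
    lines = []
    for start in range(0, length, width):
        end = min(start + width, length)
        # Header line: 1-based alignment column at which this block begins
        lines.append(f"{'':<{name_width}}{start + 1}")
        for name in names:
            lines.append(f"{name:<{name_width}}{alignment[name][start:end]}")
        lines.append("")  # blank line between blocks
    return "\n".join(lines).rstrip("\n")

# Toy input (hypothetical names and sequences)
toy = {"A": "ABCDEFGH", "B": "ABCD-FGH"}
print(format_blocks(toy, width=4, name_width=3))
```

Setting `width` to the full alignment length gives one row per sequence, as used on this page; smaller values give the stacked blocks typical of other alignment viewers.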


Opsin align.png


Here are those sequences aligned (after trimming unalignable regions and removing anomalous and fragmentary sequences). These are in text form, which can be searched by motif using a web browser's text search, unlike the graphic above (which is, however, more conveniently colored).
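
Browser text search misses a motif whenever a gap character falls inside it. As a hedged alternative, the Python sketch below strips gaps before searching and maps each hit back to its alignment column. The function name `find_motif` and the toy row are assumptions for illustration; positions are relative to the trimmed row shown here, not to the full-length protein.

```python
import re

def find_motif(aligned, motif, gap_chars="-"):
    """Return (ungapped position, alignment column) pairs, both 1-based, for each match.

    `motif` is a regular expression applied to the row with gap characters removed.
    """
    # Alignment columns that hold a real residue, in order
    cols = [i for i, ch in enumerate(aligned) if ch not in gap_chars]
    ungapped = "".join(aligned[i] for i in cols)
    return [(m.start() + 1, cols[m.start()] + 1)
            for m in re.finditer(motif, ungapped)]

row = "HKK-LR-TPLNY"  # toy row; gaps interrupt both motifs searched below
print(find_motif(row, "KLR"))  # prints [(3, 3)]
print(find_motif(row, "RTP"))  # prints [(5, 6)]
```

Because the motif is a regular expression, degenerate patterns such as `[ED]RY` can be searched the same way.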

 Consensus  ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P.  ..........a.......W.....w...Pl.  GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... .     .................E.....m...m!..F...W.PYa..............  .p.....P..fAK.s..%NP!IY......%R...................
RHO1_homSa  QFSMLAAYMFLLIVLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLA--GWS-RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_monDo  QFSCLAAYMFMLIVLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIIGVAFTWVMALACAFPPLI--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNF--GPIFMTIPAFFAKSSSVYNPVIYIMMNKQFRTCMITTL--CCGKNPLGDD
RHO1_bosTa  QFSMLAAYMFLLIMLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLV--GWS-RYIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDF--GPIFMTIPAFFAKTSAVYNPVIYIMMNKQFRNCMVTTL--CCGKNPLGDD
RHO1_ornAn  QYSVLAAYMFMLIMLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTG-CNIEGFFATLGGEIALWSLVVLAIERYIVVCKPM---SNFRFGENHAIMGVAFTWIMALACALPPLV--GWS-RYIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTVPAFFAKSSAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_galGa  KFSALAAYMFMLILLGFPVNFLTLYVTIQHKK-LR-TPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTG-CYIEGFFATLGGEIALWSLVVLAVERYVVVCKPM---SNFRFGENHAIMGVAFSWIMAMACAAPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDF--GPIFMTIPAFFAKSSAIYNPVIYIVMNKQFRNCMITTL--CCGKNPLGDE
RHO1_xenTr  KYSALAAYMFLLILLGFPINFMTLYVTIQHKK-LR-TPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTG-CYIEGFFATLGGEMALWSLVVLAIERYVVVCKPM---ANFRFGENHAIMGVVFTWIMALSCAAPPLF--GWS-RYIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDF--GPVFMTVPAFFAKSSAIYNPVIYIVLNKQFRNCLITTL--CCGKNPFGDE
RHO1_neoFo  KYSALAAYMFFLILTGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTMYTAMNGYFVFGVVG-CNLEGFFATFGGIIALWCLVVLAIERYIVVCKPI---SNFRFGENHAIMGVVFTWIMALACAGPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFIVHFTIPLIIIFFCYGRLMCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVYIMVISYLVCWLPYASVSFYIFTHQGSDF--GPVFMTVPAFFAKTASVYNPVIYILMNKQFRNCMITTL--CCGKNPFGDE
RHO1_latCh  KYSALAAYMFFLILVGFPINFLTLFVTIQHKK-LR-TPLNYILLDLAVADLCMVFGGFFVTMYSSMNGYFVLGPTG-CNIEGFFATLGGQVALWALVVLAIERYVVVCKPM---SNFRFGENHAIMGVIFTWIMALSCAVPPLF--GWS-RYIPEGMQSSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKDA--AAQ------------Q-----QESATTQKAEKEVTRMVIVMVISFLVCWVPYASVAAYIFFNQGSEF--GPVFMTAPSFFAKSASFYNPVIYILLNKQFRNCMITTL--CCGKNPFGDE
RHO1_anoCa  QFSALAAYMFLLILLGFPINFLTLFVTIQHKK-LR-TPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVG-CNIEGFFATLGGEMGLWSLVVLAVERYVVICKPM---SNFRFGETHALIGVSCTWIMALACAGPPLL--GWS-RYIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKAA--AAQ------------Q-----QESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDF--GPVFMTIPAFFAKSSAIYNPVIYILMNKQFRNCMIMTL--CCGKNPLGDE
RHO1_petMa  KYSVLAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTM-CNFEGFFATLGGEMSLWSLVVLAIERYIVICKPM---GNFRFGSTHAYMGVAFTWFMALSCAAPPLV--GWS-RYLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTVPAFFAKTSALYNPIIYILMNKQFRNCMITTL--CCGKNPLGDE
RHO1_letJa  KYSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVALWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVAFTWIMALACAAPPLV--GWS-RYIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_geoAu  KFSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVSLWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVALTWVMALSCAAPPLL--GWS-RYLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_leuEr  MFSALAAYMFFLILTGLPVNFLTLFVTIQHKK-LR-QPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAG-CNFEGFFATLGGEVGLWCLVVLAIERYMVVCKPM---ANFRFGSQHAIIGVVFTWIMALSCAGPPLV--GWS-RYIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDF--TPFFMTVPAFFAKSSAVYNPLIYILMNKQFRNCMITTI--CLGKNPFEEE
RHO1_calMi  QFSILAAYMFFLIITCFPVNFLTLYVTFEHKK-LR-QPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTG-CNFEGFFATLGGEIGLWSLVVLAIERYVVVCKPM---SNFRFGTNHAIMGVAFTWVMALACAVPPLM--GWS-RYIPEGLQCSCGVDYYTLKPEINNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVIFFLICWVPYASVAFFIFTNQGSEF--GPIFMAVPAFFAKSSALYNPLIYILLNKQFRNCMITTL--CCGKNPFEED
RHO1_takRu  KYSLVAAYMLFLIITAFPVNFLTLFVTVKHKK-LR-TPLNYVLLNLAVADLFMVIGGFTVTLYTALHAYFVLGVTG-CNIEGFFATLGGEIALWSLVVLAVERYIVVCKPM---TNFRFGEKHAIAGLVFTWIMALTCATPPLL--GWS-RYIPEGMQCSCGIDYYTPKPEINNTSFVIYMFILHFSIPLAIIFFCYSRLLCTVRAA--AAL------------Q-----QESETTQRAEKEVTRMVIVMVISFLVCWVPYASVAWYIFANQGTEF--GPVFMTAPAFFAKSAALYNPVIYILLNRQFRNCMITTV--CCGKNPFGDD
RHO2_galGa  KYRLVCCYIFFLISTGLPINLLTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVG-CAVEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHAMMGIAFTWVMAFSCAAPPLF--GWS-RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADF--TATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_anoCa  KYKVVCCYIFFLIFTGLPINILTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIG-CAIEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHALMGISFTWFMSFSCAAPPLL--GWS-RYIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDF--SATLMSVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_neoFo  KYSIVCAYMFFLIITGLPINLLTLVVTFKHKK-LR-QPLNYILVNLAVADLFMVCFGFTVTFSTAINGYFIFGPRG-CAIEGFMATLGGEVALWSLVVLAIERYIVVCKPM---GNFRFSNNHSIIGIVFTWLAALSCAAPPLF--GWS-RYLPEGMQCSCGPDYYTMNPDYHNESFVIYMFVVHFFIPVIVIFVSYGRLICKVKEA--AAQ------------Q-----QESASTQKAEREVTRMVILMVIGFMTAWTPYATVAFWIFMNKGAEF--GATFMAAPAFFSKSSALYNPIIYVLMNKQFRNCMVTTL--CCGKNPFGDD
RHO2_latCh  KFSVLCAYMFLLIILGFPINFLTLLVTFKHKK-LR-QPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMG-CAMEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFASSHAIMGIAFTWIMALACAAPPLV--GWS-RYIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKEA--AAQ------------Q-----QESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEF--TATLMTVPAFFSKSSCLFNPIIYVLLNKQFRNCMITTL--CCGKNPLGDD
RHO2_gekGe  KFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKK-LR-QPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIG-CAIEGFFATIGGQVALWSLVVLAIERYIVICKPM---GNFRFSATHAIMGIAFTWFMALACAGPPLF--GWS-RFIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAF--SVTFMTIPAFFSKSSSIYNPIIYVLLNKQFRNCMVTTI--CCGKNPFGDE
RHO2_geoAu  MYSAISAYVFTLILIGFPVNFMTLFVTFKLKK-LR-QPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTG-CNIEGFFATLGGEVSLWSLVMLAIERYIVVCKPM---GNFRFATTHAALGVVFTWVMASACAVPPLV--GWS-RYIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKEA--AAQ------------Q-----QESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILF--SATAMTVPAFFSKSSVLYNPIIYVLLNKQFRTCMVTTL--FCGKNPFGED
SWS2_ornAn  IFMSLAAFMFLLITLGFPINLLTVICTIKYKK-LR-SHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTA-CKIEGFAATLGGMVSLWSLAVIAFERFLVICKPL---GNLSFRGTHAIFGCAATWVFGLAASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVF--DLRMASIPSVFSKASTIYNPIIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_utaSt  LFMGMAAFMFLLIILGVPINVLTIFCTFKYKK-LR-SHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFSFRGTHAIIGCIITWVFGLVASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPF--DVRLATIPSVFSKASSVYNPVIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_taeGu  IFKAMAAFMFLLVLLGVPINALTVLCTAKYKK-LR-SHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLA-CKIEGFTATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCAITWIFGLIASLPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPF--DLGLASIPSVFSKASTVYNPIIYVFMNKQFRSCMLKLV--FCGRSPFGDE
SWS2_neoFo  VFMVLSVFMFFLLITGIPINVLTIICTFKYKK-LR-SHLNYILVNLAVANLIVVGFGSTTAFYSFSQMYFAWGPLA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRSTHAIIGCVATWVFGLISSAPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFCFCFGFPLSVIIFSYGRLLMTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYTVFSLWVVTHRGESF--ELALGSIPAVFSKSSTVYNPLIYVFMNKQFRSCMMKLI--FCGKSPFGDE
SWS2_galGa  LFRAMAAFMFLLIALGVPINTLTIFCTARFRK-LR-SHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCVATWVLGFVASAPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRAV--ARQ------------Q-----EQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSF--EVGLASIPSVFSKSSTVYNPVIYVLMNKQFRSCMLKLL--FCGRSPFGDD
SWS2_xenTr  IFMSISAFMLFTIIFGFPLNLLTIICTVKYKK-LR-SHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLA-CKIEGFTATLGGIIGLWSLAVVAFERFLVICKPM---GNFTFRESHAVLGCILTWVIGLVAAIPPLL--GWS-RYIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHAV--AKQ------------Q-----EQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELF--DLRMSSVPSVFSKASTVYNPFIYIFMNRQFRSCMMKMI--FCGKNPLGDD
SWS2_geoAu  IFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKK-LR-SHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLF-CKMEGFTATLGGMLSLWSLAVLAFERCLVICKPF---GNIAFRGTHALIRCGFAWAAAIAASTPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRAA--AAQ------------Q-----QESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPF--DLRLATIPSVFSKASTVYNPVIYIFLNKQFRSCMMKTI--FCGKNPLGDD
SWS2_takRu  VFYGMSAFMFFLFVAGTGINVLTIACTIQYKK-LR-SHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLG-CKIEGFAATLGGMVSLWSLAVVAFERWLVVCKPL---GNFIFKPDHAIVCCIFTWFFALIISAPPLF--GWS-RYIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLK-S--AKA------------Q-----AESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPF--DLRLATIPACFSKASTVYNPIIYVVLNKQFRSCMKKML--GMSGGD    
SWS2_gasAc  TFYSLAFYMFFILIVGTFINALTVACTVQNKK-LR-SHLNYILVNLAVSNLLVSGVGAFTAFLSFAARYFVLGTLA-CKVEGFLATLGGMVSLWSLAVIAFERWLVICKPL---GNFIFKPDHALVCCAFTWVFALAASAPPLV--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVLFLFGFCFAVPFCTICFCYSQLLFTMKMA--AKA------------Q-----AESASTQKAEREVTRMVVLMVMGFLVCWMPYASFALWVVNNRGQTF--DLRFASIPSVFSKSSAVYNPVIYVLLNKQFRSCMMKML--GMGGGD    
SWS1_homSa  AFYLQAAFMGTVFLIGFPLNAMVLVATLRYKK-LR-QPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHV-CALEGFLGTVAGLVTGWSLAFLAFERYIVICKPF---GNFRFSSKHALTVVLATWTIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGL--DLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIMKMV---CGKAMTDES
SWS1_monDo  AFHFQTVFMGFVFCAGTPLNAVVLVATLRYKK-LR-QPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERFIVICKPF---GNFRFNSKHAMMVVLATWVIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFHACIMEMV---CRKPMTDDS
SWS1_anoCa  AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCTFSVFTVFMASSQGYFFFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYASLAMYMVNNRDHGL--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACILETV---CGKPMSDES
SWS1_utaSt  AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHI-CALEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSKHALLVVAATWFIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPMTDES
SWS1_taeGu  AFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKK-LR-QPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHM-CAFEGFAGATGGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGI--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACIMETV---CGRPMTDDS
SWS1_galGa  AFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKR-LR-QPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRV-CELEAFVGTHGGLVTGWSLAFLAFERYIVICKPF---GNFRFSSRHALLVVVATWLIGVGVGLPPFF--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPLTDDS
SWS1_neoFo  AFFLQAAFMGFVLFVGTPLNAIVLFVTIKYKK-LQ-QPLNYILVNISLAGFIFCFFGVFAVFIASCQGYFIFGKTV-CALEGFTGSVAGLVTGWSLAILAFERYLVICKPI---GNFRFGSKHSMIAVVAAWVIGVGVSIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIIPLFIICFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMIIVMVGSFCVCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSSFVYNPIIYCFMNKQFRACIMQTV---FGKPMTDDS
SWS1_xenLa  AFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKK-LR-QPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIA-CSIDAFVGTLTGLVTGWSLAFLAFERYIVICKPM---GNFNFSSSHALAVVICTWIIGIVVSVPPFL--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRAV--AAQ------------Q-----QESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGL--DLRLVTIPAFFSKSSCVYNPIIYSFMNKQFRGCIMETV---CGRPMSDDS
SWS1_geoAu  AFYLQAAFMGFVFICGTPLNAIVLVVTIKYKK-LR-QPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTI-CALEAFFGSLAGLVTGWSLAFLAAERYIVICKPF---GNFRFGSKHALVAVGLTWMLGLSVALPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRAV--AAQ------------Q-----QESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNI--DLRFVTVPAFFSKASCVYNPLIYSFMNKQFRACILETV---CGKPITDES
SWS1_danRe  AFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKK-LR-QPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTL-CSMEAAMGSIAGLVTGWSLAVLAFERYVVICKPF---GSFKFGQGQAVGAVVFTWIIGTACATPPFF--GWS-RYIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRAV--AAQ------------Q-----AESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNK--DYRLVAIPAFFSKSSSVYNPLIYAFMNKQFNACIMETV---FGKKIDESS
SWS1_oryLa  AFYLQAAFMGFVFFVGTPLNFVVLLATAKYKK-LR-VPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTL-CALEAAVGAVAGLVTSWSLAVLSFERYLVICKPF---GAFKFGSNHALAAVIFTWFMGVGCACPPFF--GWS-RYIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRAV--AAQ------------Q-----AESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENK--DYRLVTIPAFFSKSSCVYNPLIYAFMNKQFNGCIMEMV---FGKKMEEAS
LWS_homSap  VYHLTSVWMIFVVIASVFTNGLVLAATMKFKK-LR-HPLNWILVNLAVADLAETVIASTISVVNQVYGYFVLGHPM-CVLEGYTVSLCGITGLWSLAIISWERWMVVCKPF---GNVRFDAKLAIVGIAFSWIWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPF--HPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCILQLF--GKKVDDGS  
LWS_monDom  VYNLTSLWMVFVVIASIFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQIYGYFILGHPL-CVLEGYTVSLCGITGLWSLAIISWERWVVVCKPF---GNVKFDAKLAMVGIIFSWVWAAVWTAPPLF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMATCCIFPLSIILLCYVQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTLFACFAAANPGYSF--HPLTASLPAYFAKSATIYNPIIYVFMNRQFRTCILQLF--GKKVDDGS  
LWS_ornAna  AYNVTSLWMIFVVIASVFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPM-CVLEGYTVSLCGITGLWSLSIISWERWIVVCKPF---GNVKFDAKLAMVGIVFSWVWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_galGal  VYNLTSLWMIFVVAASVFTNGLVLVATWKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPM-CVVEGYTVSACGITALWSLAIISWERWFVVCKPF---GNIKFDGKLAVAGILFSWLWSCAWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS  
LWS_anoCar  VYNITSVWMIFVVIASIFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPM-CVLEGYTVSTCGISALWSLAVISWERWVVVCKPF---GNVKFDAKLAVAGIVFSWVWSAVWTAPPVF--GWS-RYWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_xenTro  VYNISSLWMIFVVLASVFTNGLVLVATLKFKK-LR-HPLNWILVNMAIADLGETVIASTISVCNQIFGYFVLGHPM-CILEGYTVSVCGIAALWSLTVIAWERWFVVCKPF---GNIKFDGKLAATGIIFSWVWAAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMLVLMITCCIIPLAIIVLCYMHVWLTIRQV--AQQ------------Q-----KESESTQKAEREVSRMVVVMIIAYIFCWGPYTFFACFAAFNPGYNF--HPLAAAMPAYFAKSATIYNPIIYVFMNRQFRNCIYQLF--GKKVDDGS  
LWS_takRub  VYNVATVWMFIVVVLSVFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYTVSTCGIAALWSLTIISWERWVVVCKPF---GNVKFDAKWATGGIVFSWVWAAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCIIPLAIIILCYLAVWLAIRSV--AMQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRVCIMKLF--GKEVDDGS  
LWS_gasAcu  VYNLSTLWMFIVVALSVFTNGLVLVATAKFKK-LQ-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYVVSVCGITALWSLTIISWERWIVVCKPF---GNVKFDAKWATAGIVFSWIWSAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRAV--AMQ------------Q-----KESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRSCIMQLF--GKEVDDGS  
LWS_petMar  VFNLTSVWMIIVVVLSLFSNGLVLVATVKFKK-LR-HPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIATILIVFSWVWPASWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSF--HPIAAALPAYFAKGATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS  
LWS_letJap  MFNLTSVWMIIVVVLSLFTNGLVLVATMKFKK-LR-HPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIAIILIVFSWVWPACWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAF--HPLTAALPAYFAKSATIYNPVIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_geoAus  MYNLTSFWMIIVVILSLFTNGLVLVATLKFKK-LR-HPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPL-CVFEGFTVSVCGITALWSLAIISFERWMVVCKPF---GNLKFDGKVAIVLIIFSWAWSAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHTV--AQQ------------Q-----KESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_neoFor  VYNLTSLWMIFVVFASCFTNGLVLMATYKFKK-LR-HPLNWILVNLAIADLGETLIASTISVTNQIFGYFILGHPM-CMLEGFTVATCGITGLWSLTIIAWERWVVVCKPF---GNIKFDGKWAAGGIIFSWVWSAFWCAMPLF--GWS-RFWPHGLKTSCGPDVFSGEDKYGTRSFMIALMITCCIIPLGVIILCYIQVWWAIRTV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMIFAYCFCWGPYTFMACFGAAYPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIYQLL--GKKVDDGS  
PIN_galGal  TYVGVAVLMGTVVACASVVNGLVIVVSICYKK-LR-SPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRM-CELEGFMVSLTGIVGLWSLAILALERYVVVCRPL---GDFQFQRRHAVSGCAFTWGWALLWSTPPLL--GWS-SYVPEGLRTSCGPNWYTG--GSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRAA--AAQ------------Q-----KEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIII--QPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLLEML--CCGYQPQRTG
PIN_utaSta  IYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKK-LR-SPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTA-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFQQRHAVFGCVFTWMWSLVWTLPPLF--GWS-SYVPEGLRTSCGPNWYTG--GSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRAV--ATQ------------Q-----KEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVI--QPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLSTM--SCGHRPRGAQ
PIN_podSic  TYISVAVLMGLVVISATLVNGLVIVVSVQFKK-LR-SPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQAT-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFPARHAVLGCAFTWGWSFVWTVPPLL--GWS-SYVPEGLRTSCGPNWYSG--GSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRTV--AAQ------------Q-----KEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAI--RPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLYKM--SCGHRALSSQ
PIN_pheMad  VYTSLAALMGVVVLSASLANGLVIAVSVRFKR-LR-SPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTA-CRFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFQFQRRHAVIGCLYTWGWSLIWTVPPLF--GWS-SYVPEGLGTSCGPNWYMG--GTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRAV--AAQ------------Q-----KEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSI--QPGLASLPSYFSKTATVYNPIIYVFMNKQFRSCLLNTV--SCGRIPQTMP
PIN_xenTro  TFLTVAAVMCMVVILAFFVNGLVIVVTLKYKK-LR-SPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTM-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPM---GDFRFQQKHAILGCSFTWVWSFIWTSPPLF--GWC-SYVPEGLRTSCGPNWYTG--GTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRAV--AAQ------------Q-----KDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVI--EPTVASLPSYFSKTATVYNPIIYVFMNKQFRNCLMTLL--CCGRS-FGDD
PIN_bufJap  TYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKK-LR-SPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLV-CELEGFVVSLTGIVGLWSLAILAFERYIVICKPM---GDFRFQQRHAVMGCAFTWIWAFLWTSPPLI--GWC-SYVPEGLGTSCGPNWYTG--GTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRAV--AAQ------------Q-----KESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVI--DPTLASMPSYFSKTATVYNPVIYVFMNKQFRDCLTKLL--CCGRNPFGED
VAOP_galGa  HFRLVAAVMFVVTSLSLAENLAVILVTFKFKQ-LR-QPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYIVICRPV---GNMRLRGKHAAQGIAFVWTFSFIWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--AYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRKV--SNT------------Q-----GRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIEL--DPHLAAIPAFFSKTATVYNPIIYVFMNKQFRMCLIQMF--KCSAIETAES
VAOP_anoCa  NFHLISALMFVVTLFSLSENFTVILVTIKFKQ-LR-QPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYVVICRPL---GNMRLNGKHAALGVAFVWIFSFIWTVPPTM--GWS-SYTTSKIGTTCEPNWYSG--DYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRKV--SDT------------Q-----GRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIEL--DPRLAAIPAFFSKTATVYNPVIYVFMNNQFRKCLVQLF--QCSSQETMDA
VAOP_xenTr  NFHLLAALMFVVTSLSIAENFIVILVTAKFKQ-LR-QPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWA-CVLEGFAVTFFGIVALWSLSVLAFERYIVICRPL---GNLRLQGKHSALAIIFVWVFSFVWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--EMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRKV--SDT------------Q-----GRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDL--DPRLAAIPAFFAKTASMYNPIIYVYMNKQFRRCLYQMF--NINDPEAKES
VAOP_danRe  NYSVLAALMFVVTALSLSENFTVMLVTFRFQQ-LR-QPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWA-CVLEGFAVTFFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLVFVWSFSFIWTVPPVL--GWS-SYTVSRIGTTCEPNWYSG--NFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHV--DPRLAAIPAFVAKTAAVYNPIIYVFMNKQFRKCLVQLL--SCSKVTVVEG
VAOP_rutRu  NYKVLATLMFVVTAASLSENFAVMLVTFRFTQ-LR-KPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWA-CVLEGFAVTYFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLLFVWTFSFIWTIPPVL--GWS-SYTVSKIGTTCEPNWYSG--NFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHL--DPRLAAAPAFFSKTAAVYNPVIYVFMNKQFRKCLVQLL--RCRDVTIIEG
VAOP_takRu  NFTILAVLMFVVTSLSLCENFLVMFITFKFKQ-LR-QPLNYIIVNLAIADFLVSLTGGLISFLTNARGYFFLGRWA-CVLEGFAVTYFGIVAMWSLAVLSFERFFVICRPL---GNMRLQAKHAAIGLLFVWTFSFVWTFPPVL--GWN-RYTVSKIGTTCEPDWYSN--NMTSHSYIITFFSTCFILPLGIIFFCYGKLLRKLRKV--S--------------H-----GRLATARKPERQVTRMVVVMIVAFMVAWTPYATFAILVTIHPTIEL--DPRLASIPAFFSKTAAVYNPIIYVFMNKQFRKCLIQHF--IGMGVMAES 
VAOP_petMa  NFTMLAALMGTITALSLGENFAVIVVTARFRQ-LR-QPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHA-CVLEGFAVTYFGVVALWSLALLAFERYFVICRPL---GNFRLQSKHAVLGLAVVWVFSLACTLPPVL--GWS-SYRPSMIGTTCEPNWYSG--ELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKKA--SET------------Q-----RGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHL--DPLLAAVPAFFSKTATVYNPVIYIFMNKQFRDCFVQVL--PCKGLKKVSA
PPIN_anoCa  GYTIIAIIMATSCTLSVILNTAVIAITIKYRQ-LR-QPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVG-CVTEGFAMAFFGIVALCTIAVIAVDRAIVIAKPM---GTITFTTRKAMIGVAVSWIWSLVWNTPPLF--GWG-GYQMEGVMTSCAPDWANS--DPINVSYIICYFLFCFTIPFITILASYGYLIWTLRQV--AKV------------G----LAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYI--NPIIATIPMYMAKSSTFYNPIIYIFMNKQFRDCLVRCL--LCGRNPCASE
PPIN_xenTr  GYTILALIMAVFCAAALFLNVTVIVVTFKYRQ-LR-HPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIALDRVFVVCKPM---GTLTFTPKQALAGIAASWIWSLIWNTPPLF--GWG-SYELEGVMTSCAPNWYSA--DPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQV--AKL------------G----VAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHI--DPIIATVPMYLTKTSTVYNPIIYIFMNKQFQECVIPFL--FCGRNPWAAE
PPIN_petMa  GFTILAVIMAVFTLASLVLNSTVIIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGITWAWLWSFVWNTPPLF--GWG-SYKLEGVRTSCAPDWYSR--DPANVSYIVSFFSFCFAIPFLVIVVAYGRLLWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVAKPGVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_letJa  GFTILAVIMAVFTIASLVLNSTVVIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGIAWAWLWSFVWNTPPLF--GWG-SYELEGVRTSCAPDWYSR--DPANVSYITSYFAFCFAIPFLVIVVAYGRLMWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_ictPu  GYTILSIIMALSSTFGIILNMVVIIVTVRYKQ-LR-QPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVG-CVLEGFAVAFFGIAGLCSVAVIAVDRYMVVCRPL---GAVMFQTKHALAGVVFSWVWSFIWNTPPLF--GWG-SYQLEGVMTSCAPNWYRR--DPVNVSYILCYFMLCFALPFATIIFSYMHLLHTLWQV--AKL------------Q----VADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYI--NPVIGTIPAYLAKSSTVFNPIIYIFMNRQFRDYALPCL--LCGKNPWAAK
PPIN_oncMy  GFTILAVIIGVFSVSGVCMNVLVIMVTMRHRK-LR-QPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLG-CVLEGFAVAFFGIAGLCSVAVIAVDRYVVVCRPM---GAVMFQTRHAVGGVVLSWVWSFLWNTPPLF--GWG-SFELEGVRTSCSPNWYSR--EPGNMSYIILYFLLCFAIPFSIIMVSYARILFTLHQV--SKL------------K----VLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHI--NPLIATVPMYLAKSSTVYNPIIYVFMNRQFRDCAVPFL--LCGLNPWAS 
PPIN_danRe  GYTILAVIIGVFSVCGVILNVTVITVTLKYKQ-LR-QPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVG-CVLEGFAVAFFGIAALCSVAVIALERCMVVCRPV---GSISFQTRHAVFGVAVSWLWSFIWNTPPLF--GWG-RLQLEGVRTSCAPDWYSR--DLANVSFIVCYFLLCFALPFSVIVYSYTRLLWTLRQV--SRL------------Q----VCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYI--DPVIATVPMYLTKSSTVFNPIIYIFMNRQFRDRALPFL--LCGRNPWAA 
PPINa_cioI  TYSFLCVYMTFVFLLSCSLNILVIVATLKNKV-LR-QPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTM-CQIEGYFVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHSIFGIVITWVWSMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--EKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINa_cioS  VYSFLAVYMTFICLISCSLNILVITATLKNKV-LR-QPLNYIIVNLAVVDLLSGLVGGVISIFANGAGYFFWGKFM-CQVEGYTVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHAVIGIAVTWIWAMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--GNTERLFIILYFVFCFLIPLAIIVLCYGKLILQLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVICWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINb_cioI  IYTILAVYMTFIFLLAVSLNGFVIIATMKNKK-LR-QPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTV-CILEGYIVSVAGVCGLMSISVMAFERYFVVCKPY---GPFTLTNTHAALGIGFTWTWSVLWSTPGLI--WLD-GYVPEGLGTSCAPNWFSK--NKSERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQA--TRQ------------------SSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQL--DYGLGAVPVFFAKTANIYNPLIYIGLNKQFRDGVIKMV--FRGRNPWAEE
PPINb_cioS  TYSGLCVFMSFVFVLAVPLNLLVIVATYKNKD-LR-RPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSV-CIFEGYVMSVTGVCGILSITVMAFERYFVVCKPF---GQTNLKWSHAITGIVFTWTWSVIWHTPGLF--FWN-GYEPEGFGTSCAPNWFSQ--QKSERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKV--SKK------------------SMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPDNLL--SYGIGSVPAFFAKTATIYNPIIYMGLNKKFRDGVIRML--FKGRNPWLDG
PARIE_utaS  GYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTKRGYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKQFRDCAVEFI--TCGQVVLTSP
PARIE_anoC  GYGVLAFLMFINALFSLFNNFLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTQRAYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKEFRECAVEFI--TCGKVVLTSP
PARIE_xenT  GYSILSFLMFLNAVFSICNNAIVILVTLKHPQ-LR-NPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQF-CIFQGFAVNYFGIVSLWSLTLLAYERYNVVCEPI---GALKLSTKRGYQGLVFIWLFCLFWAIAPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQL--NRK-----------IE----QQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYI--SPLAATLPTYFAKTSPVYNPIIYIFLNKQFRTYAVQCL--TCGHINLDSL
PARIE_takR  GYSILSFLMFINTVLSVFNNSLAIAVMLKNPS-LL-QPINIFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPR---AGLKLTMRRSIIGLLFVWTFCLFWAVTPLL--GWS-SYGPEGVQTSCSLAWEER--SWNNYSYLILYTLLCFIFPVGVIIYCYCKVLTSMNKL--NKS-----------VE----LQGGLSCRRENKHAINMVLAMIIAFFVCWLPYTALSVVVVVDPELHI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDATLEVL--SCSRYIPHAS
PARIE_gasA  GYSILSFLMFINTVLTVFNNVLVITVLVRNPS-LL-QPMNVFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CIFQGFAVNYFGLVSLCTLTLLSYERYNVVCRPR---NALKLSMRRSIHGLLIVWTFCLFWAVAPLF--GWS-GYGPEGVQTSCSLAWEER--SWSNYSYLVLYTLLCFIVPVAVIIYCYAKVLTSMNTL--NRS-----------VE----VQGGRSSQKENDHAVSMVLAMIIAFFSCWLPYTALSVVVVVDPTLYI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDAALEML--SCGRYIAHMP
PARIE_danR  GYSILSYLMFINTTLSVFNNVLVIAVMVKNLH-FL-NAMTVIIFSLAVSDLLIATCGSAIVTVTNYEGSFFLGDAF-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPM---AGFKLNVGRSCQGLLLVWLYCLFWAVAPLL--GWS-SYGPEGVQTSCSLGWEER--SWRNYSYLILYTLMCFILPTVIITYCYSNVLLTMRKI--NKS-----------IE----CQGGKNCAEDNEHAVRMVLAMIIAFFICWLPYTAISVLVVVNPEISI--PPLIATMPMYFAKTSPVYNPIIYFLTNKRFRESSLEVL--SCGRYISRET
CILI2_plaD  SYVITAIYLCIVGVIGTLSNGVIMYLYFKDKS-LR-SPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLGGLASEMNLFIISVERYLAVVRPF---DVGNLTNRRVIAGGVFVWLYSLVFAGGPLV--GWS-SYRPEGLGTWCSISWQ--DRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE----AA-----------DA----QGGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGL--PIYAEVLPSLFAKSSQVYNPIIYVLMNKPYRSALVSLV--CRGRNPFDEA
CILI1_plaD  DYNICAAYLFFIACLGVSLNVLVLVLFIKDRK-LR-SPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLGGLAALMTLSVIAFVRCLAVLRLG---SFTGLTTRMGVAAMAFIWIYSLAFTLAPLL--GWN-HYIPEGLATWCSIDWL--SDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK----VA-----------KT-------GGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLL--HPVATVIPAMFAKSSTMYNPLIYVFMNKQFRRSLKVLL--GMGVEDLNSE
ENCEPH_hom  TYERLALLLGSIGLLGVGNNLLVLVLYYKFQR-LR-TPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVG-CVWDGFSGSLFGIVSIATLTVLAYERYIRVVH------ARVINFSWAWRAITYIWLYSLAWAGAPLL--GWN-RYILDVHGLGCTVDWK--SKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVED-----------LQ----TIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLV--TPTISIVSYLFAKSNTVYNPVIYVFMIRKFRRSLLQLLCL          
ENCEPH_mon  TYELLALLIATIGLLGLCNNLLVLVLYYKFQR-LR-TPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVG-CAWDGFSNTLFGIVSIMTLTVLAYERYNRIVH------AKVINFSWAWRAITYIWLYSLVWTGAPLL--GWN-RYTLEIHGLGCSVDWK--SKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRMLRCVEE-----------LQ----TIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLV--TPTVAIIASLFAKSSTAYNPIIYIFMSRKFRRCLLQLLCF          
ENCEPH_gal  TYELLALLIATIGTLGVCNNLLVLVLYYKFKR-LR-TPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------AKVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-RYTLEIHGLGCSMDWK--SKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRMLRCVED-----------FQ----TSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLV--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRQCLLQLLCF          
ENCEPH_ano  TYELLALLVAAIGLLGLCNNLLVLVLYAKFKR-LR-TPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------ARVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-HYTLEIHGLGCSVDWQ--SKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRMLRCVED-----------LQ----SIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLI--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRRCLVQLFCV          
ENCEPH_gas  TYKLLAFAIGTIGVFGFCNNVVVIVLYCKFKR-LR-TPTNLLVVNISLSDLLVSVIGINFTFVSCIRGGWTWSRAT-CIWDGFSNSLFGIVSIMTLASLAYERYIRVVH------AQVVDFPWAWRAIGHIWLYSLVWTGAPLL--GWN-RYTLEIHRLGCSLDWA--SKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQMLRSIQD-----------LQ----TVQIIKILRYEKKVAVMFLLMISCFLLCWTPYAVVSMMEAFGRKNMV--SPTVAIIPSFFAKSSTAYNPLICVFMSRKFRRCLMQLLCS          
ENCEPH_xen  TYHFLALIVATVGFLGLVNNLLVLILYCKFKR-LQ-TPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEM-CVFHGFSKNLLGIVSFGTLTVVAYERYARVVY------GKYVNSSWSKRSITFVWVYSLAWTGFPLI--GWN-LYTFETHKLDCSFEWT--ATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQKLRSVKN-----------IQ----NFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFI--TPTITVMPSLLAIASAAYNPVIHIFTIKKFRQCLVQLLPPINFHPPIN  
ENCEPH4a_t  GNLVVSVFLGFIGTFGLVNNLLVLVLFCRYKM-LR-SPINLLLMNISISDLLVCVLGTPFSFAASTQGRWLIGEAG-CVWYGFANSLFGVVSLISLAVLSFERYSTMMTPT---EADPSNYCKVCLGITLSWVYSLVWTVPPLF--GWS-SYGPEGPGTTCSVNWT--AKTTNSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ---VSG------------------INASTSRKREQRVLCMVVIMVICYLLCWLPYGVVALLATFGPPDLV--TPEASIIPSVLAKSSTVINPIIYVFMNKQFYRCFLALL--CCQDPRSGSS
ENCEPH4b_t  GHLVVAVCLGFIGTVGFLSNFLVLALFCRYRA-LR-TPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLIGRAG-CVWYGFVNACLGIVSLISLAVLSYERYCTMVSST---IASNRDYRPVLGGICFSWFYSLAWTVPPLL--GWS-RYGPEGPGTTCSVDWR--TQTPNNISYIVCLFTFCLLLPFFVILYSYGKLLHTIRQ---VRR------------------VSSTVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLL--TPEATITPSLLAKFSTVINPFIYIFMNKQFYRCFRAFL--NCSTPKRDST
ENCEPH4_br  GYTAIATCLALIGFVGFTNNFVVILLIGCHRQ-LR-TPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANSLFGIVSLVTLSALAFERYCVVVR-----SSDMLTYKSSLVVITFIWLYSLLWTSLPLL--GWS-SYQFEGHNVGCSVNWV--QHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM---SSE------------A----KPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLV--TPTASVIPSLVAKSSTAYNPIIYVLMNNQFREFLLARLQRVCCRQ     
ENCEPH5_br  GFDTVAVVIAAIGIAGFLSNGAVVLLFLKFRQ-LR-TPFNMLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANHLFGLVSLISLAVISYERYRMVVKPK-GPGSSYLTYNKVGLAIIFIYLYCLLWTTLPIV--GWS-SYQLEGPKISCSVAWE--EHSLSNTSYIVAIFIMCLLLPLLIIIYSYCRLWYKVKK---GSQ-----------NL----PPAIRKSSQKEQKIARMVVVMITCFLVCWLPYGAMALVVSFGGESLI--SPTAAVVPSLLAKSSTCYNPLVYFAMNNQFRRYFQDLL--CCGRRLFDAS
PIN_stoPur  TYNYLTVYTGFLTIFGILNNGIVMILFARFPS-LR-HPINSFLFNVSLSDLIISCLASPFTFASNFAGRWLFGDLG-CTLYAFLVFVAGTEQIVILAALSIQRCMLVVRPF---TAQKMTHRWALFFISLTWIYSLIICVPPLF--GWN-RYTYEGPGTACSVAWN--SPSPGDTSYIIFIFVLVLVIPFGIIIFCYGLLVYAVKK---ISR------------T----QAALSSEAKADRKVSKMIFIMILFFLIAWTPYTGFSLYVTFGKNVVI--TPLAGTFPPFFAKLCTIHNPIIYFLLNKQFKDALIQLF--CCGENPFDRD
ENCEPH_api  MYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILW-TPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKM-CLWYAWFMSTLGFASIGNLTVMAVERWLLVARPM-----QALSIRHAVILASFVWIYALSLSLPPLF--GWG-SYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKK-----------------VR----K-RAGASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFNAK-P--SATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRT     
ENCEPH1_an  AYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSI-CVAYGFFMSLLGIASITTLTVLSYERFCLISRPF---AAQNRSKQGACLAVLFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK-----------------NS----A-RVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFWRIRRSNGVAGQPD  
ENCEPH2_an  AYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTL-CVAYGFFMSLLGITSITTLTVLSYERYCLISRPF---SSRNLTRRGAFLAIFFIWGYSFALTSPPLF--GWG-AYVQEAANISCSVNWE--SQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE-----------------NS----A-RVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFSRVRNKGQQA      
ENCEPH_aed  AYVASAVTLFFIGFFGFFLNLFVIALMCKDVQ-LW-TPINIILFNLVCSDFSVSIIGNPFTLTSAISRHWIFGRTV-CIAYGFFMSLLGITSITTLTVLSYERFCLISHPF---SSRSLSRRGAVFAILFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTLNATSYIIFLFVFGLVVPLVVIVYSYTNIVVNMKR-----------------NA----A-RVGRINRAEKRVTRMVFVMVLAFMIAWTPYAVFALIEQFGPTDII--SPALGVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRNNE         
ENCEPH_cul  AYVATAVVLFFIGFFGFFLNLFVIALMCKEVQVLW-TPMNIILLNLVCSDFSVSIVGNPFTLSSAISHRWLFGRKL-CVAYGFFMSLLGITSITTLTVLSYERFYLISRPF---SSRSLSRRGALGAVLLIWCYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--TQTLNATTYIIYLFVFGLVVPLTVIVYSYTNIIVNMKK-----------------NA----A-RVGRINRAEKRVTTMVAVMVIAFMVAWTPYSVFALMEQFGPPDVI--GPGLAVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRHDP         
ENCEPH_tri  GYIAAAVVLFCIGFFGFSLNLTVIIFMLKERQ-LW-SPLNIILFNLVVSDFLVSVLGNPWTFFSAINYGWIFGETG-CTIYGFIMSLLSITSITTLTVLAFERYLLIARPF---RNNALNFHSAALSVFSIWLYSLSLTIPPLI--GWG-EYVHEAANLSCSVNWE--EKSPNSTSYILYLFAFGLFLPLVIITFSYVNIILTMRR-----------------NA----AFRVGQVSKAENKVAYMIFIMIIAFLTAWSPYAIMALIVQFGDAALV--TPGMAVIPALLAKSSICYNPVIYIGLNAQVKGAKWVSGLIYLFQFQQ   
ENCEPHa_ne  EANIVLGYYIAIFVIGFVTNTIVVIIFISSQR-LH-TTPNLILFSMSVCDWLMATMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVVSPM----TNSFNGRRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICVSVLFFLIPIVTMTFCFASIYHTIRNLSHEAT-----------ARWGSDARATQETIRAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGDTHRI--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRCAGKALLYQEHH       
ENCEPHb_ne  EANIVLGYYIAIFVIGFVTNTIVVITFIFSKR-LH-TTPNLILFSMSVCDWLMAAMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVASPM----TNSLNERRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICISVLFFFVPIVTMTFSFASIYKAIRNISHEAI-----------ARWGSHARATQETIKAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGGTHRN--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRRAAKLLFIKKVIRPTEA  
ENCEPHc_ne    HAITVMYSLLAAGAFVLNGIVLIIFLATRS-LR-TIPNMILLSMAWADWLMACLADAVGAYANANNWPSMVGGL-CVYYGFITTALGLTSMIHLTALSVERFVTVTIPM----TRPITETQMLLVVTFLWAFSFLWAIFPLV--GWS-SYGPEPGYAACSIAWYR--QDLNNMSYILCLFMFFFFLPIVIMIACFSSIYFTVRKLTRDSM-----------RRWGASSDSTQQTLAAERKTAWMSFIMVLAFLFAWVPYAVVSLYASFGGVTTI--PKLMSTLPAMLAKTSACYNPIIYFFMYSKFRKAFQRFFFKNVITPSQT  
MOLL_PERc_  EFRIIGIFISICCIIGVLGNLLIIIVF-AKRRSVR-RPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIG-CKIYAFLCFNSGVISIMTHAALSFCRYIIICQYG--YR-KKITQTTVLRTLFSIWSFAMFWTLSPLF--GWS-SYVIEVVPVSCSVNW--YGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKN-------GIRAQQRY----TPRFIQDIEQRVTFISFLMMAAFMVAWTPYAIMSALAI--GSFNV--ENSFAALPTLFAKASCAYNPFIYAFTNANFRDTVVEIMAPWTTRRVGV  
PER2_strPu  GYLLTAIYLTIVGSIATVGNITVICVL-CRYRTFRKRSINLLLINMAASDLGVSVAGYPLTTVSGYWGRWLFGDVG-CQFYAFCVYTLSCSTISTHAAIAVYRYIYIVKTD--LR-PKLTANFTSGVIVVIWVYAFFWTVTPFV--GWS-SYIYEPFGTSCSVNW--VGRTISDISYMVACTIGVYLLQIFIMLYCYIRVAKKIRGVDPGRT-------EEKDAGVVVFGRLRKREAKIDTHVTKMCFMMMLTFIVVWAPYAVECLRAA--HVHRI--SALSSVLPTMFAKSSCMVNPIIFLTSSSKFRQDLGKLWSRPSSQDSL   
PER1_strPu  GYLLTALYLTLVGIVSTIGNITVLCVL-CRYGTFRKRSVNILLMNMAVSDLGVSVAGYPLTAISGYRGRWVFADIG-CQFSGFCVYALSCSTISTHAVVAIYRYIYIVKPY--HR-PRLSSSTSCLAILCIWTFTLFWTITPFF--GWS-SYTYEPFGTSCSINW--YGKSLGDLTYIICCVVFVFILPIIIMLYCYIGVAKKIKGIDPLRT-------EERDIAVV-FGRLRKHETKIDTRVTKICFMMMASFIVVWTPYAVGSIWAS--KIGKI--SASASVLPTMFAKSSCMINPIIFLTSSSKFRADLGKLWNRPSSLEHTI  
PER_homSap  EHNIVATYLIMAGMISIISNIIVLGIF-IKYKELR-TPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAG-CQVYAGLNIFFGMASIGLLTVVAVDRYLTICLPD--VG-RRMTTNTYIGLILGAWINGLFWALMPII--GWA-SYAPDPTGATCTINW--RKNDRSFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDC-----------TESL------NRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKI--PPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPV  
PER_monDom  EHKIVAAYLITAGVISIVSNVIVLGIF-VKYKALR-TATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDG-CQIYAGLNIFFGMASIGLLTAVAIDRYLTICQPD--LG-R-MTSYNYTLMILTAWVNGFFWALMPIV--GWA-GYAPDPTGATCTINW--RKNDVSFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNC-----------PDHI------NRDWSNQVAVTKMSVVMILMFLLAWSPYSIVCLWASFGDPKEI--PPAMAIVAPLFAKSSTFYNPCIYVAANKKFRRAISAMIRCQTHQSMPI  
PER_galGal  EHNIVAAYLITAGVISIFSNIVVLGIF-VKYKEFR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTG-CQIYAALNIFFGMASIGLLTVVAVDRYLTICRPD--IG-RRMTTRNYAALILAAWINAVFWASMPTV--GWA-GYASDPTGATCTANW--RKNDVPFVSYTMSVIAVNFVVPLTVMFYCYYNVSRTMKQYTSSNC-----------LESI------NMDWSDQVDVTKMSVVMIVMFLVAWSPYSIVCLWSSFGDPKKI--SPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILAMVRCQTRQEITI  
PER_xenTro  EHNIVAAYLITAGVISILSNIIVLGIF-VKYKELR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVG-CQIYAGLNIFFGMASIGLLTVVAIDRYLTICRPD--IGGRRISGRHYTAMILAAWINAVFWSVMPVV--GWS-SYAPDPTGATCTINW--RKNDVSFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSS-----------LGGI------NADWSDQTDVTKMSMVMIVMFLVAWSPYSIVCLWSSFGDPRKI--PPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILSMVQCKSRQEVTL  
PER_gasAcu  EHNIVAGYLITAGVISLFSNIVVLLMF-WKFKELR-TATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAG-CQIYAALNIFFGMASIGLLTVVAIDRYLTICRPD--IGGQKMTMQSYNLLILAAWLNAVFWSSMPVV--GWA-SYAPDPTGATCTINW--RQNDVSFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNC-----------LDSA------NIDWSDQMDVTKMSIVMIIMFLVAWSPYSIVCLWASFGDPKTI--PAPMAIIAPLFAKSSTFYNPCIYVIANKKFRRAIIGMVRCQTRQRITI  
PERa_braFl  DHLIVGLYLFVIGIIGTVENGITLATF-TKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRQD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYSLEPSGTACTINW--QKNDSLYISYVTSCFILGFALPLAVMMFCYWQASCFVNKVLKGDI-----------SGDLTFPVAVNVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFGNPADI--PAWITLLPPLIAKSSALYNPIIYIIANRRFRSAIFSMVKGQNPDVET   
PERa_braBe  DHLIVGLYLFVIGIIGTIENGITLATF-SKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYALEPSGTACTINF--QKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQASCFVSKVLKGDI-----------AGDLTFPVAANVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADI--PAWLTLLPPLIAKSSALYNPIIYIIANRRFRNAICSMMKGQDPDVEDD  
PERc_braFl  GYLASAVYLTITGLIAFVGNIFAIIVFLTE-KEFRKKEHNSFALNLAIADLSVCVFAYPSSTISGYAGEWMLGDVG-CTIYGFLCFTFSLTSMVTLCAISVYRYIVICKPQ--YA-HLLTHRRTNYVILGIWLYALVFSVPPLF--GVN-RYTYEPI-ITCSLDW--NVQHVGETIYTAAVIIIVYVLNVSIMCFCYFNIIFKSANLKFAAL-------ASEKTR--------TAAKKDIWKTSMMCLAMVVSFLIAWTPYAVSSTWDIL-TEEDL--PIIATILPTMFAKSSCMMNPIIYSCCNGKFRQAALKTFSK          
PERc_braBe  GYLASAIYITLTGLIAFFGNVITITVFLTE-KEFRKKQQNGFVLNLAIADLSVCVFAYPSSAIAGYAGRWVLGDVG-CTIYGFLCFTFALVSMVTLCVISIYRYILICKPQ--YA-HLLTHRRTVYVIIGTWLYALVFTVPPLV--GVK-RYTYEPMQITCSLDW--NVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIFKSANLKFAAL-------ASEKTK--------MAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDIL-SAEDL--PIIATILPSLFAKSSCMMNPIIYACCNTKFRQAAVKSFRKLCGMCKQK  
PERb_braFl  SATIMGVYLTIVGLVSTVGNATVVLMFMLKWRQLCRKAPNLLIINLAAVDLCISVFGYPFSASSGFANQWLFSDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMTRTILAVVGAWVYGISVAVPPLF--GIA-GYTYESFGLSCTIDF--HGTTVADMVYLSILIILCYVINVAVMGTCYFKIIRKFSKHRFREV-------RDVRTS--------HQHSFERGVT-LRCILMTLFYLISWTPYTAVAVWTMV-GPPP---PVQLGMVAALTAKTHCAFNPILYMLMSEVYRKLVLRTMCPCCFNKISN  
PERb_braBe  SATIMGVYLTIVGLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFGYPFSASSGYAHQWLFPDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMNRTVLAVIGTWLYAIAVAVPPLF--NIA-RYTYEPSGLSCTIDF--RVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRKFSRHRFRQV-------RDIRTS--------HQRSFEMGVT-MRCILMTLFYLLSWTPYTAVCIWTMV-GPPP---PVVVSMAAALIAKTHCAFNPILYAFMSEVYRKLVFRTMCPCCFNRISC  
NEUR_homSa  ADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLR--PAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLS--YG-VWLKRKHAYICLAAIWAYASFWTTMPLV--GLG-DYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKS-SSKEV-------AHFDSRIHSSHVLEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGRPDSI--PIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEG  
NEUR_monDo  ADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLVIIWAYATFWATMPLA--GLG-NYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQSSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRT  
NEUR_ornAn  ADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLAIIWAYASFWATMPLV--GLG-NYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQNSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKN  
NEUR_galGa  ADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLR--PAEIMTVNLAVCDLGISV-GKPFSIISFFSHRWIFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLA--YG-TWLKRHHAFICLALIWAYATFWATVPFA--GVG-SYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKS-STKEV-------AHYDTRIQNSHILEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGQPDSV--PIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLK  
NEUR_anoCa  ADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLK--PAEIMTVNLAVCDLGISV-GKPFSIIAFFSHRWIFGWSG-CRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLS--YG-TWLKRHHVFICLGIIWSYAAFWATIPFA--GFG-NYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQS-STKEV-------AHYDTRIQNQHVLEMKLTKV-------AMLICAGFMFAWIPYAVVSVWSAFGRPDSV--PIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSRY  
NEUR_xenTr  ADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLR--PAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNA-CRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLR--YG-TWLKRRHAFIALAVIWAYATLWATLPLV--GVG-NYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKS-SAKEV-------AHFDTRNQNNHTLEIKLTK--------AMLICAGFLIAWFPYAVVSVWSAFGQPDSI--PIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKD--KSLQNTTSRY  
NEUR_gasAc  ADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLK--PPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEG-CRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLR--YG-TWLKRQHAFLCLVFVWMYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKS-SAKEI-------SNFDARIKNSHNLEIKLTKTRNCATEDAMLICAGFLIAWIPYAVVSVVSAFGEPDSV--PISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFR  
NEUR_calMi              GLLSTLGNGYVIYLSITQKRKLK--PPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVG-CRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQ--YG-SWLQRRHVFMSLAFIWFYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKS-SAKEV-------AHFDSRIQNHHSLEMNLTK                                                                                          
MEL1_homSa  AHYTLGTVILLVGLTGMLGNLTVIYTFCRSRS-LR-TPANMFIINLAVSDFLMSFTQA-PVFFTSSLYKQWLFGETGCEFYAFCGALFGISSMITLTAIALDRYLVITRPL--ATFGVASKRRAAFVLLGVWLYALAWSLPPFF--GWS-AYVPEGLLTSCSWDYMS--FTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGRALQTFGAC------KGNGESLWQRQ-RLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVL--TPYMSSVPAVIAKASAIHNPIIYAITHPKYRVAIAQHLPCLGVLLGVS  
MEL1_monDo  AHYTIGATILAVGFTGVLGNLLVIYTFCR----LR-TPANMFIINLAISDFFMSFTQA-PVFFASSMYKRWIFGEKACEFYAFCGALFGITSMITLMAIALDRYFVITRPL--ASIGVISKKKTGFILLGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYTT--FTPSVRAYTMLLFCFVFFIPLIVIIYCYIFIFRAIQDTNKAVHSIGSG------ESTA-SPRHCQ-RMKNEWKMAKIALVVILLYVLSWAPYSTVALVAFAGYSHIL--TPYMNSVPAIIAKASAIHNPIIYAISHPKYRMAIAQNFPCLRALLCVR  
MEL1_xenTr  VHYVVGAVILAVGITGMLGNFLVIYAFCRSRS-LR-SPANMFIINLAITDFLMSVTQA-PVFFATSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIAVDRYFVITRPL--TSIGVMSKKRAVLILSGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNRAVQKIGTD------N-NKESHKQYQ-KMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAGYASIL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYIPCLGSLLRVK  
MEL1_galGa  AHYTIGTVILIVGITGTLGNFLVIYAFCRSRT-LQ-KPANIFIINLAVSDFLMSITQS-PVFFTNSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITKPL--ASVRVMSKKKALIILVGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYMT--FTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANKSVQTFGCK------HGNRELQKQYH-RMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAGYSHVL--TPFMNSVPAVIAKASAIHNPIIYAITHPKYRTAIATYVPCLGFLLRVS  
MEL1_calMi  AHYIIGATILAVGVTGMVGNFLVIYAFLRSRS-LR-TPANTFIINLAATDFLMSVTQS-PIFFITSIHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITRPL--ASIGVLSHRRAGLIILSLWLYSLAWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLGVIIYCYIFIFRAIKSTNKKVG----G------STNRESQKQHQ-RMKNEWKMAKIALIVILLFVISWSPYSTVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYVPLLGLLLRVS  
MEL1_danRe  AHYTIGAVILTVGITGMLGNFLVIYAFSRSRT-LR-TPANLFIINLAITDFLMCATQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLMVIAVDRYFVITRPL--ASIGVLSQKRALLILLVAWVYSLGWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFIPLIVIIYCYFFIFRSIRTTNEAVGKINGD-------NKRDSMKRFQ-RLKNEWKMAKIALIVILMYVISWSPYSTVALTAFAGYSDFL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLAIAKYIPCLRLLLCVP  
MEL1_takRu  AHYTIGSVILVIGITGMIGNFLVIYAFCRSRS-LR-TPANMFIINLAVTDLLMCVTQT-PIFFTTSMYKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRAFVILMTVWIYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FSPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRATNKAVGKVNGS--VHSHSRRRESVKNFQ-RLQNEWKMAKIALMVILLYVISWSPYSCVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLALAKYIPCLGFLLCIS  
MEL1_gasAc  AHYTIGSVILAIGITGIIGNVLVIYAFSKSRS-LR-TPANMFIINLAITDLLMCVTQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIALDRYFVITRPL--TSIGMMSRRRALLILMGAWTYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRVTNRAVGKMNGS--IHSHGSGRDSTKNFH-RLQNEWKMAKIALIVILLYVVSWSPYSAVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRIALAKYIPFLGVLLCVP  
MEL1_oryLa  AHYTIGSVILAIGITGIIGNFLVIYAFSRSRS-LR-TPANMFIINLAITDLLMCVTQS-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRALLILSAAWAYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYVFIFRAIRSTNRAVGKINGN--T------RDAVKSFN-RLQNEWKMAKIALIVILLYVISWSPYSTVALTAFAGYADML--TPYMNSIPAVIAKASAIHNPIIYAITHPKYRMALAKYIPGLGVLLCIH  
MEL1D_danR  AHYTIGSVILAVGITGMVGNLLVMYAFCKSRS-LR-TPANMFIINLAVTDFLMCVTQT-PIFFTTSLHKRWIFGEKGCELYAFCGALFGICSMITLMIIAVDRYFVITRPL--ASIGVMSRKRALLILSAAWAYSMGWSLPPFF--GWSGAYVPEGLLTSCSWDYMT--FSPSVRAYTMLLFTFVFFIPLFVIIYCYFFIFKAIRETNRAVGKINGE------GGPRDSIKKIH-RMKNEWKMAKIALIVILLYVISWSPYSCVALTAF--YADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRSAIAKYIPCLGVLLCVP  
MEL2_galGa  VLYTVGTCVLVIGSIGIIGNLLVLYAFYSNKK-LR-TPQNFFIMNLAVSDFLMSASQA-PICFVNSLHREWILGDIGCDLYAFCGALFGITSMMTLLAISVDRYLVITKPL--RSIQWTSKKRTIQIIAAVWLYSLGWSVAPLL--GWS-SYVPEGLMISCTWDYVT--YSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGRDVQKLGSC---------SRKSFLSQ-SMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAGRGNTL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIHNAVPCLRFLIRIS  
MEL2_xenLa  VLYTIGSFILIIGSVGIIGNMLVLYAFYRNKK-LR-TAPNYFIINLAISDFLMSATQA-PVCFLSSLHREWILGDIGCNVYAFCGALFGITSMMTLLAISINRYIVITKPL--QSIQWSSKKRTSQIIVLVWMYSLMWSLAPLL--GWS-SYVPEGLRISCTWDYVT--STMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGRNVQKLGSY---------GRQSFLSQ-SMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAGHGKSL--TPYSKTVPAVIAKASAIYNPIIYGIIHPKYRETIHKTVPCLRFLIREP  
MEL2_anoCa  VLYTVGSCVLVIGCIGITGNLLVLYAFYSNKR-LR-TPPNYFIMNLAVSDFLMSATQA-PICFLNSMHKEWVLGDIGCNLYAFCGALFGITSMITLLAISVDRYCVITKPL--QSIKRTSKKRTCIIIVFVWLYSLGWSVCPLF--GWS-SYIPEGLMISCTWDYVT--YSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR------------------RKSSISH-SIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS  
MEL2_tetNi  VHYIIAFFVFVIGILGITGNVLVIFAFYSNKK-LR-SLPNYFIVNLAVSDLLMASTQS-PIFFIN-LYKEWMFGETACKMYAFCGALFGITSMINLLAISVDRYVVITKPL--QTIRRSSKRRTALAILMVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLAIILCCYLLMFLAIRKTSRR----------------KSTLIQQK-SIRSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS  
MEL2_gasAc  AHYIVAVFVVVIGTLGITGNALVMLAVYSNKK-LR-NLPNYFIMNLAVSDFLMAFTQS-PIFFINCLYKEWAFGETGCKIYAFCGALFGIASMINLLAISIDRYLVITKPL--QAIHWGSKRRTTLAILLVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLGIILYCYLFMFLAIRKTSRR----------------KSTLIKQK-SMKSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---IL--SPYSKAVPAIIAKASAIYNPFIYAIIHNKYRMTLAAKFPCLRFLSPTP  
MEL2_danRe  VHYIIAFLILIIGTLGVSGNALVMFAFYRNKK-LR-SLPNYFIMNLAVSDFLMAITQS-PIFFINCLYKEWMFGELGCKIYAFCGALFGITSMINLLAISIDRYLVITKPL--QTIQWNSKRRTGLAILCIWLYSLAWSLAPLI--GWG-SYIPEGLMTSCTWDYVS--PSPANKSYTMMLCCFVFFIPLSIILYCYLFMFLSVRQASRQ----------------KSSFVKQQ-SMRSEWKLAKIAAVVIVVYVLSWAPYACVTLVAWAG----L--TPYSKTLPAVLAKSSAIYNPFIYAIIHNKYRATLAEKVPGLSCLSRSQ  
MEL1a_braF  AHYIVGTAVFCVGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVPEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAVFAKSSAVYNPIVYAITHPKFRAAVKKHIPCLSGCLPAD  
MEL1a_braB  AHYIVGTAVFCIGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVSEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAMFAKSSAVYSPIVYAITYPKFREAVKKHIPCLSGCLPAS  
MOLL_RHO_l  VYYSLGIFIAICGIIGCAGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPMMTISCFLKHWVFGQAACKVYGLIGGIFGLTSIMTMTMISIDRYNVIRRPM--SASKKMSHRKAFIMIVFVWIWSTIWAIGPIF--GWG-AYQLEGVLCNCSFDYIT--RDASTRSNIVCMYIFAFMFPIVVIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQSLLSWSPYAIVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAIASNFPWILTCCQ    
MOLL_RHO_s  VYYSLGIFIGICGIIGCTGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFIKKWVFGMAACKVYGFIGGIFGLMSIMTMSMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSTLWSIGPIF--GWG-AYVLEGVLCNCSFDYIT--RDSATRSNIVCMYIFAFCFPILIIFFCYFNIVMAVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQFLLSWSPYAVVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPLIYSVSHPKFREAIAENFPWIITCCQ    
MOLL_RHO_t  VYYSLGIFIGICGIIGCGGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFLKKWIFGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPM--AASKKMSHRRAFIMIIFVWLWSVLWAIGPIF--GWG-AYTLEGVLCNCSFDYIS--RDSTTRSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GANAEMRLAKISIVIVSQFLLSWSPYAVVALLAQFGPLEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQ    
MOLL_RHO_e  VYYSVGIFIGVVGIIGILGNGVVIYLFSKTKS-LQ-TPANMFIINLAMSDLSFSAINGFPLKTISAFMKKWIFGKVACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSIVWSVGPVF--NWG-AYVPEGILTSCSFDYLS--TDPSTRSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISMVIITQFMLSWSPYAIIALLAQFGPAEWV--TPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQ    
MOLL_MEL_p  WHYIIGVYITIVGLLGIMGNTTVVYIFSNTKS-LR-SPSNLFVVNLAVSDLIFSAVNGFPLLTVSSFHQKWIFGSLFCQLYGFVGGVFGLMSINTLTAISIDRYVVITKPL--QASQTMTRRKVHLMIVIVWVLSILLSIPPFF--GWG-AYIPEGFQTSCTFDYLT--KTARTRTYIVVLYLFGFLIPLIIIGVCYVLIIRGVRRHDQKMLTIT-R--S-MKTED--ARANNK-RARSELRISKIAMTVTCLFIISWSPYAIIALIAQFGPAHWI--TPLVSELPMMLAKSSSMHNPVVYALSHPKFRKALYQRVPWLFCCCK    
LOPH_RHO_p  WHYAVAAWMTFFGILGVSGNLLVVWTFLKTKS-LR-TAPNMLLVNLAIGDMAFSAINGFPLLTISSINKRWVWGKLWRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPL--GAAQTMTKKRAFIILTIIWANASLWALAPFF--GWG-AYIPEGFQTSCTYDYLT--QDMNNYTYVLGMYLFGFIFPVAIIFFCYLGIVRAIFAHHAEMMATA-K--R-MGAN---TGKADA-DKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIHPLLAEIPVMMAKASARYNPIIYALSHPKFRAEIDKHFPWLLCCCKPK  
RHAB_schMe  YHYLVGVYISIVGISGVLGNLLVLYIFARAKS-LR-TPPNMFIMSLAIGDLTFSAVNGFPLLTISSFNTRWAWGKLTCEIYGFIGGLFGFISINTMALISLDRYFVIAQPF--QTMKSLTIKRAIIMLVFVWLYSLIWSTPPFF--GYG-NYVPEGFQTSCTFDYLT--QSKGNIIFNIGMYIGNFIIPVGIIIFCYYQIVKAVRVHELEMLKMA-Q--K-MNASHPTSMKTGA--KKADVQAAKISVIIVFLYMLSWTPYAIIALMALTGRRDHL--NPYTAELPVLFAKTSAMYNPFIYAINHPKFRIQLEKKFPCLICCCPPK  
MEL1_schMa  YYYLVGIYIGIVGILAVMGNSLVITLFLLCKQ-LR-TPPNMLIVSLAISDFSFALINGFPLKTIAAFNHRWGWGKLACELYGFAGSIFGFISLTTMAFIALDRYLVIVQPF--ETFSRITYGKVIVMIFITWIWSALWSIPPFF--GYG-SYIPEGFHTSCTFDYLS--TDLPNLIFNAGLYILGFLCPVFIIIFSYYQIVKTVRLNELELMKMA-Q--S-LDLQNPSAMKTGG-DKKADIEAAKTSIILVLLYLMSWSPYAIVCLMTLIGSRDSL--TPFHSELPVLFAKTSAVYNPIVYAVKHPKFRMEIEKRFPFLICCCPPK  
MEL1_capCa  IYYGLGLYMAVVGIVGTLGNLVVITLFI--KS-LR-TPPNMFIINLALSDMGFCATNGFPLMTVASFQKLWRWGPVACELYALAGSITGFNSIATLALISMDRYMVIAKPF--YAMKHVSHKRSLIQIILAWTWAFIWSAPPLLRMGYG-RYIPEGFQVSCTFDYLS--RDLKNLIFVWCLFVFGFFIPVLAIACSYVGIIRAVGAQSKEMRKTA-E--K-MGAK---TGKSDK-EKKQDIAMAKVAAGTIGLFLMSWTPYAAVSMIGIAGNRSWI--TPYVSQIPVMFAKASAMWNPILYALSHPKFRAALEDHMPWLLVC      
MEL2_schMa  YQYAIGLFIAVVGITGMCLNLLVIVFFTMFKS-LR-TPSNILVVNLAISDFGFSAVIGFPLKTMAAFNNFWPWGKLACDLYGLAGGLFGFVSLSTIAAVALDRYLVIATPF--ESVFQTTPRRTLLLMLFLWMWSLMWTIPPLFGFG-K-RYVTEGYQTSCTMDYIS--TDLNNRLFNIGLFGFGFLCPLFLSLFCYARIILIVRSRGKDFIEMAAS--S-KGTNQKEKSANVS-SSKSDTFVSKSSAILLGVYLICWTPYSFVCLMALIGYADYI--TPLMVEIPCLCAKTA---NPCIYAFRYPKFRSLLQQRFGFLRLTKNRV  
MEL1_helRo  FYYFLGTFFAVVGFLGVFGNIIVVWVFSRTPS-LR-TPSNVLVINLAICDILFSALIGFPMSALSCFQRHWIWGNF-CQFYSFVAGITGLASINCLAVIAVDRYLVVGQPL--AMLNQSHFRRSFYHVLIIWTWACVWSAMPLI--GWG-EYILEGFGVSCTFDYLT--RTTWNISFNVCLFTFCFGMPVSVIILSYIGIIRSIAKNRKEFSSL--------------TAENSS-RARQEIKIAKVFAVCMTAFILCWVPYATVAQLGIYGYDQMV--SPYTAELPVMLAKTSALWNPIIYAFSHPKYRKCLKELPIF          
MEL1_strPu          MNAVTTALPHGLNKPTIEARWTKS-LR-TPPNMLIVNLAISDFGMVITN-FPLMFASTIYNRWLFGDAGCQFYAFCGALFGIMSIANMTAIALDRYYVICWSL--EAVRSVTHRRSMIIIIIVWCYAIFWSIPPFFGVG---SYVLEGYGLGCTFDFMT--KDLNHYLHVSFLFASSFVVPVTIIIVCFTRIAITVRAHRHELNKMRTKLTEDKDKKHKSSIRRAN-KAKTEFQIAKVGFQVTIFYVLSWMPYSIVAVIGQYFDSDLL--TPLGTVVPVIFAKCSAIWNPIIYCLSHEKFNAALKEKLMGMCGIEIPS  
MEL2_strPu  AFPLIGGYLLVVVLLGTAGNSLVIYTFLRFKK-LH-SPINLLIVNLSASDLLVATT-GTPLSMVSSFYGRWLFGTNACAFYGFVNYYCGCISLNSLAAISVFRYIIVVRGQ--AQNNKLSLRSSIYAILVIHLYTLIFSTPPLY--GWN-RFVLAGYHTSCDIDFHT--KTPLFVSYICYMFFFLFFLPLGLISWSYFKIYQRVSK--HSNSMRTSFTGVTKEINSDEKHANHR-------RTASTLFVTIVVFLFAWFPYCIVSLWVLIGDANSI--SKLSTTIPSLFAKSSVIYNPLIYVVLNSKFRKALIQTLSFLKCLSKHE  
MOLL_MEL_a  VHLSVGVFITLVGVLAVCGNSLVIITCIRFKD-LR-TRSNILIINLAVGDLLMCLI-DFPLLAAASFYGEWPYGRQVCQMYAFLTAIAGLVTINTLAVIAADRYWAVVRRP--TPGQKLPKCVTSIAVASVWAYSISWALCPIL--GWG-AYVLDGIRTTCTFDFLT--RTWENRSFVIGMMIGNFVLPFALMVFSYFRIWVAVRKVKSGNVFCAIRHNYNLALGSTLFVKQHRYRLHCEQKTVKIIMFLLIAFTVSWSPYLAVSIIGLFGDRSQL--TYQNTLTASLIAKTSMVFNPILYSISHPKVRKRIANLACCYSVRRHQQ  
MOLL_MEL_l  CQYTIGIFISTVAVIAVIGNSIVIWAHVRIKS-LS-TTSNMLILNLCVGCLIMCIV-DFPLYATSSFLQKWIFGHKVCEIYATITGTAGLLIMNSYSAIAFDRFITVTRYN--NPNYPRSKSATMCISGFVWIYSLSWSMAPVV--GWS-RYQLDGSGTTCTFDYLS--TTWTNRSFILSIAFFNFVLPLCFILFAYSRILHLISS--HSREMKSYRSAVIISKGKASIPKRFR----SERKTAITLLITVVVFCLSWVPYVIIALIGQFGNQSFI--TPQISVIPQLVAKLSTVTNPILYSLSHPVVRNKLFLRLRHELYRRPSD  
CHEL_LWS_l  WYSILGVAMIILGIICVLGNGMVIYLMMTTKS-LR-TPTNLLVVNLAFSDFCMMAFMMPTMTSNCFAE-TWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGM--AA-APLTHKKATLLLLFVWIWSGGWT-ILPF-FGWS-RYVPEGNLTSCTVDYLT--KDWSSASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQLREQAKK-----MNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIISWAGVFSSGTRL--TPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLACGSGE   
CHEL_LWS_i  WHSLLGFAMVILGVISVVGNSMVIYIMTTSKS-LR-SPTNMLVVNLAFSDWCMMAFMMPTMAANCFAE-TWILGPFMCEVYGMVGSLFGCGSIWSMVMITLDRYNVIVRGV--AA-APLTHKRAALMIFFVWFWALTWT-LLPF-FGWS-RYVPEGNMTSCTIDYLT--KALWSASYVVAYAGGVYWTPLFINIYCYSKIVRAVAQHEKQLRLQARK-----MNVASLRANAEQTKTSAEARLAKIALMTVGLWFMAWTPYLTIAWAGIFSDGSKL--TPLATIWGSVFAKANACYNPIVYGISHPKYRAALARRFPSLVCMPPGG  
INSE_LWS1_  WHKILGLVMIILGIMGWCGNGVVVYVFIMTPS-LR-TPSNLLVVNLAFSDFIMMGFMCPPMVICCFYE-TWVLGSLMCDIYAMVGSLCGCASIWTMTAIALDRYNVIVKGM--SG-TPLTIKRAMLQILGIWLFGLIWT-ILPL-VGWN-RYVPEGNMTACGTDYLS--QDWTFKSYILVYSFFVYYTPLFTIIYSYYFIVSAVAAHEKAMKEQAKK-----MNVTSLRSGDNQNTSA-EAKLAKVALTTISLWFMAWTPYLVINYIGIFNR-SLI--TPLFTIWGSLFAKANAIYNPIVYGISHPKYRAALKEKLPFLVCGSTED  
INSE_LWS2_  WHGILGFVIGMLGFVSAMGNGMVVYIFLSTKS-LR-TPSNLFVINLAISNFLMMFCMSPPMVINCYYE-TWVLGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLSINGALIRIIAIWLFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYFN--RGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINFSGIFNL-VKI--SPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLACAAEPS  
INSE_LWS_c  WHAILGFVIGILGMISVIGNGMVIYIFTTTKS-LR-TPSNLLVINLAISDFLMMLSMSPAMVINCYYE-TWVLGPLVCELYGLTGSLFGCGSIWTMTMIAFDRYNVIVKGL--SA-KPMTINGALLRILGIWFFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYLT--KDLLSRSYILVYSFFCYFLPLFLIIYSYFFIIQAVAAHEKNMREQAKK-----MNVASLRSAENQSTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-VKI--NPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFQRFPSLACSSGPA  
INSE_LWS_p  WHGLLGFTIGVLGFISITGNGMVVYIFTSTKS-LK-TPSNLLVVNLAFSDFLMMLCMAPPMLINCYYE-TWVFGPLACELYACAGSLFGSISIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRILGIWLFSLAWT-IAPM-LGWN-RYVPEGNMTACGTDYLS--KSWLSRSYILVYSIFVYYTPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFET-API--SPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYQKFPSLACQPSA   
INSE_LWS_m  WHALLGFTIGVLGFVSISGNGMVIYIFMSTKS-LK-TPSNLLVVNLAFSDFLMMCAMSPAMVVNCYYE-TWVWGPFACELYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPMTSNGALLRILGIWVFSLAWT-LLPF-FGWN-RYVPEGNMTACGTDYLS--KSWVSRSYILIYSVFVYFLPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFES-API--SPLATIWGSLFAKANAVYNPIVYGISHPKYQAALYAKFPSLQCQSAP   
INSE_LWS_v  WHGLLGFVIGILGFISITGNGMVIYIFTTTKS-LK-TPSNILVVNLAFSDFLMMCVMSPPMVVNCYTE-TWVFGPLACQLYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPLTINGAMLRVLGIWVFSLAWT-VAPL-FGWG-RYVPEGNMTACGTDYLD--KSWFNRSYILIYSIFCYFSPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-ATI--TPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYARFPALACQPSP   
INSE_LWS_t  WHGILGFVIGVLGFVSIVGNGMVIYIFSSTKA-LR-TPSNLLVVNLAFSDFLMMXCMSPAMVINCYNE-TWVLGPLVCELYGMSGSLFGCASIWTMTFIALDRYNVIVKGL--SA-QPLTKKGAMLRILIIWVFSTLWT-IAPF-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYAVWVYFVPLFTIIYSYWFIVQAVAAHEKSMREQAKK-----MNVASLRSSEAAQTSA-ECKLAKIALMTITLWFFAWTPYLVTNFTGIFEG-AKI--SPLATIWCSLFAKANAVYNPIVYGISHPKYRQALQKKFPSLVCAGEP   
INSE_LWS_s  WHGLLGFVIGVLGVISVIGNGMVIYIFSTTKS-LR-TPSNLLVVNLAFSDFLMMFTMSAPMGINCYYE-TWVLGPFMCELYALFGSLFGCGSIWTMTMIALDRYNVIVKGL--SA-KPMTNKTAMLRILFIWAFSVAWT-IMPL-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYSFFVYLLPLGTIIYSYFFILQAVSAHEKQMREQRKK-----MNVASLRSAEASQTSA-ECKLAKVALMTISLWFFGWTPYLIINFTGIFET-MKI--SPLLTIWGSLFAKANAVFNPIVYGISHPKYRAALEKKFPSLACASSS   
INSE_LWS_b  WHGILGFVIGLLGFISVSGNGMVVYIFLSTKS-LR-TPSNMFVINLAISDFLMMFCMSPPMVINCYYE-TWVLGPLFCQVYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLTINGALLRILGIWLFSLIWT-IAPM-FGWN-RYVPEGNMTACGTDYFS--KDIVSVSYILLYSIWVYFFPLFLIIWSYWFIXQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINWSGIFSL-VKI--SPLYTIWGSLFAKANAV                                 
INSE_LWS_d  WFGIIGFVIAILGTMSLAGNFIVMYIFTSSKG-LR-TPSNMFVVNLAFSDFMMMFTMFPPVVLNGFYG-TWIMGPFLCELYGMFGSLFGCVSIWSMTLIAYDRYCVIVKGM--AR-KPLTATAAVLRLMVVWTICGAWA-LMPL-FGWN-RYVPEGNMTACGTDYFA--KDWWNRSYIIVYSLWVYLTPLLTIIFSYWHIMKAVAAHEKAMREQAKK-----MNVASLRNSEADKSKAIEIKLAKVALTTISLWFFAWTPYTIINYAGIFES-MHL--SPLSTICGSVFAKANAVCNPIVYGLSHPKYKQVLREKMPCLACGKDDL  
CRUS_LWS_m  WYGILAFVVTVVGLCSICGNFVVIWVFMNTKA-LR-SPANTLVVSLAVSDFIMMACMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGI--SG-TPLSQKNTTLQVLFVWICSIMWC-VFPF-FGWN-RYVPRGDMTACGTDYLT--EDEFSRSYLYVYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-ECRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKANAVYNPIVYAISHPKYRAALYKKLPCLACSTESA  
CRUS_LWS_n  WYGILAFVVTVVGLCSICGNFVVIWVIMNTKA-LR-SPANTLVVSLAVSDYIMMTCMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGV--SG-KPLSQKNATLQVLFVWICSIMWC-VFPF-FGWN-RYVPEGNMTACGTDYLT--EDEFSRSYLYIYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-GCRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKSNAVYNPIVYAISHPKYRAALYKKLPCLACSTESA  
INSE_MWS_d  WAKILTAYMIIIGMISWCGNGVVIYIFATTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLYFE-TWVLGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGM--AG-RPMTIPLALGKIAYIWFMSTIWCCLAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYLVINCMGLFKF-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVFGKVDD  
INSE_MWS_c  WAKFLAAYMVLIATISWCGNGVVIYIFSTTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLFYE-TWVLGPLMCDIYGGLGSAFGCSSILSMCMISLDRYNVIVKGM--AG-QPMTIKLAIMKIALIWFMASIWT-LAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYLPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYTIINTLGLFKY-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYGIALKEKCPCCVFGKVDD  
CRUS_LWS_c  MYPLLLVFMLITGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMITCYYH-TWTLGATFCEVYAFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILTVWVLSFTWC-VAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLAITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS                        
CRUS_LWS_p  MYPLLLIFMLFTGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMVTCYYH-TWTLGPTFCQVYGFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILIVWVLSLAWC-MAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLTITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS                        
INSE_LWS_h  WHGLLGFVIGVLGFISVTGNGMVVYIFTTTKS-LK-TPSNILVVNLAFSDFLMMFMMAPPMVINCYNE-TWVFGPLACQLYACAGSLYGCVSIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRVFGIWAFSLAWT-IAPL-FGWG-RYVPEGNMTACGTDYFD--QSFSNRSYILLYSIACYYAPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFKTMT                                                      
CRUS_LWS_e  WYGLLGFVIFCLGCLSVFGNSVVIWVFTSTKT-LR-SPANMLVVNLALSDFLMMANMSPPTVHSCYHG-TWMLGPTYCEYYALVGSLSGCISIWTMVWITLDRYNVIVKGV--AA-TPLTNKGAFARNIFSWLSALIWC-VSPL-YGWN-RYVPEGNMTACGTDYLT--DDWLSHSYLYAYTFWVYLFPFFIIVYCYTYIVSAVFAHEKGMRDQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALVTVSLWFIAWTPYCVINVTGMWDK-TKI--TPLFTIWGSL                                        
CRUS_LWS_a  WYGLLGFVIFCLGILSVCGNAVVIWVFMNTKS-LR-SPANLLVVNLAFSDFLMMLNMFPPMVHSCYHG-TWMLGAFFCEFYGFTGSLFGCISIWTMVFITMDRYNVIVKGV--AA-EPLTSKGASIRILFVWTVAFAWT-ILPF-FGWN-RYVPEGNLTACGTDYLT--EDSTSHLYLYMYASWAYYTPLLYIIYAYTFIVQAVSAHEKGMREQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALMTVSLWFMAWTPYMIINFTGMNDR-TKL--TPLCTIWGSL                                        
CRUS_LWS_h  WYGLLGFWMTVMGTLSVAGNFVVIWVFMNTKS-LR-TPANLLVVNLAISDFFMMLTMTPPLLANAYWG-TWILGAFFCEVYAFLGSFFGCVSIWSMVFITADRYNVIVKGV--SA-EPLTSGGAMMRIAGTWAFTLAWC-LPPF-FGWN-RYVPEGNMLACGTDYLT--ETELSRSYLYVYSVWVYLFPLAYIIYSYTFIVKAVAAHEKGMREQAKK-----MGVKSLRSEEAQKTSA-ECRLCKVALMTVTLWFMAWTPYFIINWGGMFNK-PMV--TPLFS                                             
CRUS_MWS_h  WHYLLGVVYLFLGVISIAGNGLVIYLYMKSQA-LK-TPANMLIVNLALSDLIMLTTNFPPFCYNCFSGGRWMFSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGF--NG-PKLTQGKATFMCGLAWVISVGWS-LPPF-FGWG-SYTLEGILDSCSYDYFT--RDMNTITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAAMRAQAKK-----MNVTNLRSNEAETQRA-EIRIAKTALVNVSLWFICWTPYAAITIQGLLGNAEGI--TPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCVHEKDP  
INSE_UVV_c  LHYLLAIVYILFTFVALFGNGLVIWIFCSAKS-LR-TPSNLFVVNLAFCDFMMMLKA-PIFIYNSFHT-GFATGHLGCQIFACMGSLSGIGAGMTNAAIAYDRYSTIARPL--DG-K-LSRGQVLLLIMLIWTYTIPWALMPLM-QVWG-RFVPEGFLTSCSFDYLT--DSQEIRYFVPTIFTFSYCVPMLLIIYYYSQIVGHVVSHEKALREQAKK-----MNVESLRSNVNTNAQSAEIRIAKAAITICFLFVLSWTPYGALAMIGAFGNRALL--TPGITMIPACACKFVACLDPYVYAISHPRYRLELQKRLPWLELQEKP   
INSE_UVV_a  LHYLLALLYILFTFLALLGNGLVIWIFCAAKS-LR-TPSNMFVVNLAICDFFMMIKT-PIFIYNSFNT-GFALGNLGCQIFAVIGSLTGIGAAITNAAIAYDRYSTIARPL--DG-K-LSRGQVILFIVLIWTYTIPWALMPVM-GVWG-RFVPEGFLTSCSFDYLT--DTNEIRIFVATIFTFSYCIPMILIIYYYSQIVSHVVNHEKALREQAKK-----MNVDSLRSNANTSSQSAEIRIAKAAITICFLYVLSWTPYGVMSMIGAFGNKALL--TPGVTMIPACTCKAVACLDPYVYAISHPKYRLELQKRLPWLELQEKP   
INSE_UVV_m  AHTALALLYIFFTFAALVGNGMVIFIFSTTKS-LR-TSSNFLVLNLAILDFIMMAKA-PIFIYNSAMR-GFAVGTVGCQIFALMGAYSGIGAGMTNACIAYDRHSTITRPL--DG-R-LSEGKVLLMVAFVWIYSTPWALLPLL-KIWG-RYVPEGYLTSCSFDYLT--NTFDTKLFVACIFTCSYVFPMSLIIYFYSGIVKQVFAHEAALREQAKK-----MNVESLRANQGGSSESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGNQQLL--TPGVTMIPAVACKAVACISPWVYAIRHPMYRQELQRRMPWLQIDEPD   
INSE_UVV_p  AHTMLALVYVFFTAAALIGNGLVIFIFSASKS-LR-TPSNLLVVQLAVLDFLMMLKA-PIFIYNSIKR-GFASGVIGCQIFAFMGSVSGTAAGLTNACIAYDRHSTITRPL--DG-R-LSRGKVLLMMVCVWLYTAPWAILPQL-QIWG-RYVPEGFLTSCTFDYLT--TTFDNKLFVASMFVCVYIFPMIAILYFYSGIVKQVFAHEAALREQAKK-----MNVDSLRSNQNAAAESAEIRIAKAALTVCFLYVASWTPYGVMSLIGAFGDQNLL--TPGVTMIPALACKGVACIDPWVYAISHPKYRQELQKRMPWLQIDEPD   
INSE_UVV_d  MHYMLGVFYIFLFCASTVGNGMVIWIFSTSKS-LR-TPSNMFVLNLAVFDLIMCLKA-PIFIYNSFHR-GFALGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPM--NR-N-MTFTKAVIMNIIIWLYCTPWVVLPLT-QFWD-RFVPEGYLTSCSFDYLS--DNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKALREQAKK-----MNVESLRSNVDKSKETAEIRIAKAAITICFLFFVSWTPYGVMSLIGAFGDKSLL--TPGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGVNEKS   
INSE_BLU_m  WHYVLALIYTMLMVTSLTGNGIVIWIFSTSKS-LR-SASNMFVINLAVFDLMMMLEM-PLLIMNSFYQ-RLVGYQLGCDVYAVLGSLSGIGGAITNAVIAFDRYKTISSPL--DG-R-INTVQAGLLIAFTWFWALPFTILPAF-RIWG-RFVPEGFLTTCSFDYFT--EDQDTEVFVACIFVWSYCIPMALICYFYSQLFGAVRLHERMLQEQAKK-----MNVKSLASNKEDNSRSVEIRIAKVAFTIFFLFICAWTPYAFVTMTGAFGDRTLL--TPIATMIPAVCCKVVSCIDPWVYAINHPRYRAELQKRLPWMGVREQDP  
INSE_BLU_a  FHIGLAIIYSMLLIMSLVGNCCVIWIFSTSKS-LR-TPSNMFIVSLAIFDIIMAFEM-PMLVISSFME-RMIGWEIGCDVYSVFGSISGMGQAMTNAAIAFDRYRTISCPI--DG-R-LNSKQAAVIIAFTWFWVTPFTVLPLL-KVWG-RYTTEGFLTTCSFDFLT--DDEDTKVFVTCIFIWAYVIPLIFIILFYSRLLSSIRNHEKMLREQAKK-----MNVKSLVSN-QDKERSAEVRIAKVAFTIFFLFLLAWTPYATVALIGVYGNRELL--TPVSTMLPAVFAKTVSCIDPWIYAINHPRYRQELQKRCKWMGIHEP    
INSE_BLU_d  YHAGFYIAFIVLMLSSIFGNGLVIWIFSTSKS-LR-TPSNLLILNLAIFDLFMCTNM-PHYLINATVG-YIVGGDLGCDIYALNGGISGMGASITNAFIAFDRYKTISNPI--DG-R-LSYGQIVLLILFTWLWATPFSVLPLF-QIWG-RYQPEGFLTTCSFDYLT--NTDENRLFVRTIFVWSYVIPMTMILVSYYKLFTHVRVHEKMLAEQAKK-----MNVKSLSANANADNMSVELRIAKAALIIYMLFILAWTPYSVVALIGCFGEQQLI--TPFVSMLPCLACKSVSCLDPWVYATSHPKYRLELERRLPWLGIREKHA  
MEL1b_braF  MQLVFGSMMLVFGLIGVVGNAVALYAFCRSRS-LR-RPKNYLIANLCLTDMVVCLVYSPIIVTRSLSH--GLPSKESCIVEGFVVGLGSIVSICSLAGIAVERYVTITQPI--KSLSILTHRALLGAVSAVWVYAFLLAFPPLV--GWG-RYVSEESKISCTFDYLS--TDDATRAHVIVLVIGAFGLPFSVITYCYVRSFATVRKCTKERKQM---------------SPLAKSDSRSEVKAAVNSFVITTSFCLCWCPYAVVATMGVSGFTV----HSHAVFIAALLAKLSVLFNPVAYVLSIPN                     
MEL1b_braB  MQLIFGSMMLVFGLIGVVGNVVALYAFCRTRS-LR-RPKNYVVANLCLTDMFVCLVYCPIVVSRSFSH--GFPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPL--KSLTILTQRKLLVAVLTVWVYSLLLAFPPLV--GWG-RYVREETYISCTFDYLS--TDDATRAYVITLVMGAFGFPLLTIAYCYIRVFTTARKHAEERKFM---------------SPLKRPESRTEIKTAVTACVITTSFCLCWCPYAVVATLGISGVSV----QQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASEDVV  
MEL_nemVec  WVIAQVVLWGCIFVISSLGNSLVLLCIVKSNR--LHSSIYAFYGSLAASDCIAGMLCCPLLLVTALHQLWIMGK-VMCHVYSTLLSTSLNASIATLCLISMDRLNAVRKPFEYRGHNTFTQRWCKWLLVLSWVHSIFWAAAPLG--GWG-EIITDSATYTCKPNWSA--ASIVNRSYSLCLALFPFAFPVFLMVAIYCVIYRHTKKCSNLMS----------GLEDGRNLVAEQERQMRERRLFRTVLIIIGAFAACWAIYTLATTCKLFIGQTP---PTWLVQLGLICAIAGSCVNPVIYTIRDATFARELGRLHPCLAWLLKQS  
LWS_nemVec   TSFTAIALLVIMLLTIIGNLMVCYVVLSNKR--LWTEMNMFLVNLAFGDLAVGLICMVFPLITAIKREWIFGRGILCQLNACCNSVLFCSTIFTHTVISIDRYIVIVHPMK----KIMTRKKAALMIVGVWVFSVFIVLGPVF--GWG-RMEYNASTLQCGFGFPR--DKMASM-YIVIVAIIAFIIPLLIMTYTYIRIYISVLEHTRRMS-------------ETATAMQQQAVFSAQKRIVFTFFIALLAFFACWAPFFSFIAFAVVVKNPHD-IPHGLGLASYVCGFINSACNPFIIGLRSKQFKSGFSRILCCCRGRDP    
 Consensus  ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P.  ..........a.......W.....w...Pl.  GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... .     .................E.....m...m!..F...W.PYa..............  .p.....P..fAK.s..%NP!IY......%R...................
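
The invariant positions that anchor the alignment can also be recovered programmatically from the rows above. The following is a minimal sketch, not the procedure used to build this page. It assumes each row is a fixed-width record (a name field of 10 characters, 2 spaces of padding, then the aligned residues) and that blanks inside the residue field stand for terminal gaps. The function names `parse_rows` and `invariant_columns` are hypothetical, and the Consensus row should be left out before calling them.

```python
# Sketch only: the fixed-width layout (10-character name, 2 spaces, residues)
# is an assumption read off the alignment rows, not a documented format.

def parse_rows(lines, name_width=10, pad=2):
    """Split fixed-width alignment rows into (name, aligned_sequence) pairs."""
    rows = []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        name = line[:name_width].strip()
        seq = line[name_width + pad:].rstrip("\n")
        # Blanks at the ends of fragmentary sequences are treated as gaps.
        rows.append((name, seq.replace(" ", "-")))
    return rows


def invariant_columns(rows):
    """Return (column_index, residue) for columns identical in every row."""
    seqs = [seq for _, seq in rows]
    width = max(len(s) for s in seqs)
    padded = [s.ljust(width, "-") for s in seqs]  # pad short rows with gaps
    hits = []
    for i in range(width):
        column = {s[i] for s in padded}
        if len(column) == 1 and "-" not in column:
            hits.append((i, column.pop()))
    return hits


if __name__ == "__main__":
    # Toy rows in the same layout; the names are made up for illustration.
    toy = [
        "SEQ_aaaaaa  ACN-LR",
        "SEQ_bbbbbb  GCNKLR",
        "SEQ_cccccc    N-LR",
    ]
    print(invariant_columns(parse_rows(toy)))
```

Rows are kept as a list of pairs rather than a dictionary because the names are truncated to 10 characters and could collide. Fragmentary sequences will suppress otherwise invariant columns, so they may need to be filtered out first.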

Structural and functional markers along the opsin molecule:

>RHO1_homSap rhodopsin                 <----------TM1--------->    c1     <----------TM2------->    x1      <c--i------TM3--------->           c2       <----------TM4-------->               x2         <----------TM5------c->     c3
MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 
AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0
           c3       <----------TM6------->    x3      <--------b-TM7---gprot>  helix8    palm cyto tail
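
The marker rulers above can be turned into residue ranges. This is a rough sketch under one loudly stated assumption: each ruler line is column-aligned with the sequence line it annotates, so that a `<...>` bracket covers exactly the residues in the same columns. If whitespace was altered when the page was copied, the ranges will be off. The helper names `ruler_spans` and `slice_by_ruler` are hypothetical.

```python
import re

# Sketch only: assumes the ruler and the sequence line share column positions.

def ruler_spans(ruler):
    """Return (label, start, end) for every <...> bracket in a ruler line."""
    spans = []
    for match in re.finditer(r"<[^<>]*>", ruler):
        # Use the TMn tag inside the bracket as the label when there is one.
        tag = re.search(r"TM\d+", match.group())
        label = tag.group() if tag else match.group()
        spans.append((label, match.start(), match.end()))
    return spans


def slice_by_ruler(ruler, sequence):
    """Cut the sequence at the columns covered by each bracket."""
    return {label: sequence[start:end]
            for label, start, end in ruler_spans(ruler)}


if __name__ == "__main__":
    # Toy ruler and sequence, made up for illustration.
    ruler = "  <-TM1->  c1 <--TM2-->"
    sequence = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    print(slice_by_ruler(ruler, sequence))
```

Unbracketed labels such as c1 or x2 are ignored here. Loop coordinates could be taken as the columns between consecutive brackets.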
