Opsin evolution: alignment

From genomewiki
Revision as of 15:54, 23 March 2010 by Tomemerald (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

See also: Curated Sequences | Ancestral Introns | Informative Indels | Ancestral Sequences | Cytoplasmic face | Update Blog

This section provides an alignment of 230 opsins, mostly ciliary and rhabdomeric types. It could be updated to the full 420 curated reference sequences available using MultAlin or similar tools that allow precise control of formatting and color but too many sequence becomes unwieldy.

N- and C-terminals have been trimmed away because they are generally unalignable and uninformative outside a narrow gene class. Fragmentary sequences are mostly not shown: these score low and so fall to the bottom. (That can still be useful as two important clades, jawless fish and chondrichthyes, are largely represented by fragments.) Notice the numerous invariant (red) and nearly invariant sequences (blue) -- these anchor the alignment with near-certainty. Some of these are not specific to opsins but are rather properties of GPCR signaling proteins generally.

Opsins have seven alpha helical sections traversing the cell membrane with the intervening sequence alternating as cytoplasmic and extra-cellular. Certain key residues such as the lysine where the retinal is covalently bound, counterions, and recognition sites diagnostic for binding of other proteins require markups that will be added shortly.

Among other things, the alignment by MultAlign shows that the Opsin Classifier has properly named the opsins -- each classifies just as expected from its name. Deletions and insertions show up clearly on the alignment, readily resolved as to type using the known phylogenetic topology to establish ancestral condition. The alignment also exhibits some anomalies where the sequence in question needs re-evaluation at the primary data source (cDNA and/or genome).

MultAlign is apparently the only alignment software that allows line width to be specified. That's important here because it enables the entire alignment to be seen in a single window. The numbering schemes allows specific residues and regions to be discussed. Colored text output was also an option (it allows copy and paste of specific residues) but the file again is huge with color markups and awkward to display tightly within genomeWiki.


Opsin align.png


Here are those sequences aligned (after some trimming of unalignable regions, and anomalous and fragmentary sequences). These are in text form which can be searched by motif using web browser text search, unlike the graphic above (which is more conveniently colored however).

 Consensus  ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P.  ..........a.......W.....w...Pl.  GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... .     .................E.....m...m!..F...W.PYa..............  .p.....P..fAK.s..%NP!IY......%R...................
RHO1_homSa  QFSMLAAYMFLLIVLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLA--GWS-RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_monDo  QFSCLAAYMFMLIVLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIIGVAFTWVMALACAFPPLI--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNF--GPIFMTIPAFFAKSSSVYNPVIYIMMNKQFRTCMITTL--CCGKNPLGDD
RHO1_bosTa  QFSMLAAYMFLLIMLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLV--GWS-RYIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDF--GPIFMTIPAFFAKTSAVYNPVIYIMMNKQFRNCMVTTL--CCGKNPLGDD
RHO1_ornAn  QYSVLAAYMFMLIMLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTG-CNIEGFFATLGGEIALWSLVVLAIERYIVVCKPM---SNFRFGENHAIMGVAFTWIMALACALPPLV--GWS-RYIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTVPAFFAKSSAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_galGa  KFSALAAYMFMLILLGFPVNFLTLYVTIQHKK-LR-TPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTG-CYIEGFFATLGGEIALWSLVVLAVERYVVVCKPM---SNFRFGENHAIMGVAFSWIMAMACAAPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDF--GPIFMTIPAFFAKSSAIYNPVIYIVMNKQFRNCMITTL--CCGKNPLGDE
RHO1_xenTr  KYSALAAYMFLLILLGFPINFMTLYVTIQHKK-LR-TPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTG-CYIEGFFATLGGEMALWSLVVLAIERYVVVCKPM---ANFRFGENHAIMGVVFTWIMALSCAAPPLF--GWS-RYIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDF--GPVFMTVPAFFAKSSAIYNPVIYIVLNKQFRNCLITTL--CCGKNPFGDE
RHO1_neoFo  KYSALAAYMFFLILTGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTMYTAMNGYFVFGVVG-CNLEGFFATFGGIIALWCLVVLAIERYIVVCKPI---SNFRFGENHAIMGVVFTWIMALACAGPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFIVHFTIPLIIIFFCYGRLMCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVYIMVISYLVCWLPYASVSFYIFTHQGSDF--GPVFMTVPAFFAKTASVYNPVIYILMNKQFRNCMITTL--CCGKNPFGDE
RHO1_latCh  KYSALAAYMFFLILVGFPINFLTLFVTIQHKK-LR-TPLNYILLDLAVADLCMVFGGFFVTMYSSMNGYFVLGPTG-CNIEGFFATLGGQVALWALVVLAIERYVVVCKPM---SNFRFGENHAIMGVIFTWIMALSCAVPPLF--GWS-RYIPEGMQSSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKDA--AAQ------------Q-----QESATTQKAEKEVTRMVIVMVISFLVCWVPYASVAAYIFFNQGSEF--GPVFMTAPSFFAKSASFYNPVIYILLNKQFRNCMITTL--CCGKNPFGDE
RHO1_anoCa  QFSALAAYMFLLILLGFPINFLTLFVTIQHKK-LR-TPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVG-CNIEGFFATLGGEMGLWSLVVLAVERYVVICKPM---SNFRFGETHALIGVSCTWIMALACAGPPLL--GWS-RYIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKAA--AAQ------------Q-----QESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDF--GPVFMTIPAFFAKSSAIYNPVIYILMNKQFRNCMIMTL--CCGKNPLGDE
RHO1_petMa  KYSVLAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTM-CNFEGFFATLGGEMSLWSLVVLAIERYIVICKPM---GNFRFGSTHAYMGVAFTWFMALSCAAPPLV--GWS-RYLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTVPAFFAKTSALYNPIIYILMNKQFRNCMITTL--CCGKNPLGDE
RHO1_letJa  KYSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVALWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVAFTWIMALACAAPPLV--GWS-RYIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_geoAu  KFSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVSLWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVALTWVMALSCAAPPLL--GWS-RYLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_leuEr  MFSALAAYMFFLILTGLPVNFLTLFVTIQHKK-LR-QPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAG-CNFEGFFATLGGEVGLWCLVVLAIERYMVVCKPM---ANFRFGSQHAIIGVVFTWIMALSCAGPPLV--GWS-RYIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDF--TPFFMTVPAFFAKSSAVYNPLIYILMNKQFRNCMITTI--CLGKNPFEEE
RHO1_calMi  QFSILAAYMFFLIITCFPVNFLTLYVTFEHKK-LR-QPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTG-CNFEGFFATLGGEIGLWSLVVLAIERYVVVCKPM---SNFRFGTNHAIMGVAFTWVMALACAVPPLM--GWS-RYIPEGLQCSCGVDYYTLKPEINNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVIFFLICWVPYASVAFFIFTNQGSEF--GPIFMAVPAFFAKSSALYNPLIYILLNKQFRNCMITTL--CCGKNPFEED
RHO1_takRu  KYSLVAAYMLFLIITAFPVNFLTLFVTVKHKK-LR-TPLNYVLLNLAVADLFMVIGGFTVTLYTALHAYFVLGVTG-CNIEGFFATLGGEIALWSLVVLAVERYIVVCKPM---TNFRFGEKHAIAGLVFTWIMALTCATPPLL--GWS-RYIPEGMQCSCGIDYYTPKPEINNTSFVIYMFILHFSIPLAIIFFCYSRLLCTVRAA--AAL------------Q-----QESETTQRAEKEVTRMVIVMVISFLVCWVPYASVAWYIFANQGTEF--GPVFMTAPAFFAKSAALYNPVIYILLNRQFRNCMITTV--CCGKNPFGDD
RHO2_galGa  KYRLVCCYIFFLISTGLPINLLTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVG-CAVEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHAMMGIAFTWVMAFSCAAPPLF--GWS-RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADF--TATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_anoCa  KYKVVCCYIFFLIFTGLPINILTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIG-CAIEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHALMGISFTWFMSFSCAAPPLL--GWS-RYIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDF--SATLMSVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_neoFo  KYSIVCAYMFFLIITGLPINLLTLVVTFKHKK-LR-QPLNYILVNLAVADLFMVCFGFTVTFSTAINGYFIFGPRG-CAIEGFMATLGGEVALWSLVVLAIERYIVVCKPM---GNFRFSNNHSIIGIVFTWLAALSCAAPPLF--GWS-RYLPEGMQCSCGPDYYTMNPDYHNESFVIYMFVVHFFIPVIVIFVSYGRLICKVKEA--AAQ------------Q-----QESASTQKAEREVTRMVILMVIGFMTAWTPYATVAFWIFMNKGAEF--GATFMAAPAFFSKSSALYNPIIYVLMNKQFRNCMVTTL--CCGKNPFGDD
RHO2_latCh  KFSVLCAYMFLLIILGFPINFLTLLVTFKHKK-LR-QPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMG-CAMEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFASSHAIMGIAFTWIMALACAAPPLV--GWS-RYIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKEA--AAQ------------Q-----QESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEF--TATLMTVPAFFSKSSCLFNPIIYVLLNKQFRNCMITTL--CCGKNPLGDD
RHO2_gekGe  KFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKK-LR-QPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIG-CAIEGFFATIGGQVALWSLVVLAIERYIVICKPM---GNFRFSATHAIMGIAFTWFMALACAGPPLF--GWS-RFIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAF--SVTFMTIPAFFSKSSSIYNPIIYVLLNKQFRNCMVTTI--CCGKNPFGDE
RHO2_geoAu  MYSAISAYVFTLILIGFPVNFMTLFVTFKLKK-LR-QPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTG-CNIEGFFATLGGEVSLWSLVMLAIERYIVVCKPM---GNFRFATTHAALGVVFTWVMASACAVPPLV--GWS-RYIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKEA--AAQ------------Q-----QESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILF--SATAMTVPAFFSKSSVLYNPIIYVLLNKQFRTCMVTTL--FCGKNPFGED
SWS2_ornAn  IFMSLAAFMFLLITLGFPINLLTVICTIKYKK-LR-SHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTA-CKIEGFAATLGGMVSLWSLAVIAFERFLVICKPL---GNLSFRGTHAIFGCAATWVFGLAASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVF--DLRMASIPSVFSKASTIYNPIIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_utaSt  LFMGMAAFMFLLIILGVPINVLTIFCTFKYKK-LR-SHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFSFRGTHAIIGCIITWVFGLVASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPF--DVRLATIPSVFSKASSVYNPVIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_taeGu  IFKAMAAFMFLLVLLGVPINALTVLCTAKYKK-LR-SHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLA-CKIEGFTATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCAITWIFGLIASLPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPF--DLGLASIPSVFSKASTVYNPIIYVFMNKQFRSCMLKLV--FCGRSPFGDE
SWS2_neoFo  VFMVLSVFMFFLLITGIPINVLTIICTFKYKK-LR-SHLNYILVNLAVANLIVVGFGSTTAFYSFSQMYFAWGPLA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRSTHAIIGCVATWVFGLISSAPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFCFCFGFPLSVIIFSYGRLLMTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYTVFSLWVVTHRGESF--ELALGSIPAVFSKSSTVYNPLIYVFMNKQFRSCMMKLI--FCGKSPFGDE
SWS2_galGa  LFRAMAAFMFLLIALGVPINTLTIFCTARFRK-LR-SHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCVATWVLGFVASAPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRAV--ARQ------------Q-----EQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSF--EVGLASIPSVFSKSSTVYNPVIYVLMNKQFRSCMLKLL--FCGRSPFGDD
SWS2_xenTr  IFMSISAFMLFTIIFGFPLNLLTIICTVKYKK-LR-SHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLA-CKIEGFTATLGGIIGLWSLAVVAFERFLVICKPM---GNFTFRESHAVLGCILTWVIGLVAAIPPLL--GWS-RYIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHAV--AKQ------------Q-----EQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELF--DLRMSSVPSVFSKASTVYNPFIYIFMNRQFRSCMMKMI--FCGKNPLGDD
SWS2_geoAu  IFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKK-LR-SHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLF-CKMEGFTATLGGMLSLWSLAVLAFERCLVICKPF---GNIAFRGTHALIRCGFAWAAAIAASTPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRAA--AAQ------------Q-----QESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPF--DLRLATIPSVFSKASTVYNPVIYIFLNKQFRSCMMKTI--FCGKNPLGDD
SWS2_takRu  VFYGMSAFMFFLFVAGTGINVLTIACTIQYKK-LR-SHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLG-CKIEGFAATLGGMVSLWSLAVVAFERWLVVCKPL---GNFIFKPDHAIVCCIFTWFFALIISAPPLF--GWS-RYIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLK-S--AKA------------Q-----AESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPF--DLRLATIPACFSKASTVYNPIIYVVLNKQFRSCMKKML--GMSGGD    
SWS2_gasAc  TFYSLAFYMFFILIVGTFINALTVACTVQNKK-LR-SHLNYILVNLAVSNLLVSGVGAFTAFLSFAARYFVLGTLA-CKVEGFLATLGGMVSLWSLAVIAFERWLVICKPL---GNFIFKPDHALVCCAFTWVFALAASAPPLV--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVLFLFGFCFAVPFCTICFCYSQLLFTMKMA--AKA------------Q-----AESASTQKAEREVTRMVVLMVMGFLVCWMPYASFALWVVNNRGQTF--DLRFASIPSVFSKSSAVYNPVIYVLLNKQFRSCMMKML--GMGGGD    
SWS1_homSa  AFYLQAAFMGTVFLIGFPLNAMVLVATLRYKK-LR-QPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHV-CALEGFLGTVAGLVTGWSLAFLAFERYIVICKPF---GNFRFSSKHALTVVLATWTIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGL--DLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIMKMV---CGKAMTDES
SWS1_monDo  AFHFQTVFMGFVFCAGTPLNAVVLVATLRYKK-LR-QPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERFIVICKPF---GNFRFNSKHAMMVVLATWVIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFHACIMEMV---CRKPMTDDS
SWS1_anoCa  AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCTFSVFTVFMASSQGYFFFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYASLAMYMVNNRDHGL--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACILETV---CGKPMSDES
SWS1_utaSt  AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHI-CALEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSKHALLVVAATWFIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPMTDES
SWS1_taeGu  AFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKK-LR-QPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHM-CAFEGFAGATGGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGI--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACIMETV---CGRPMTDDS
SWS1_galGa  AFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKR-LR-QPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRV-CELEAFVGTHGGLVTGWSLAFLAFERYIVICKPF---GNFRFSSRHALLVVVATWLIGVGVGLPPFF--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPLTDDS
SWS1_neoFo  AFFLQAAFMGFVLFVGTPLNAIVLFVTIKYKK-LQ-QPLNYILVNISLAGFIFCFFGVFAVFIASCQGYFIFGKTV-CALEGFTGSVAGLVTGWSLAILAFERYLVICKPI---GNFRFGSKHSMIAVVAAWVIGVGVSIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIIPLFIICFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMIIVMVGSFCVCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSSFVYNPIIYCFMNKQFRACIMQTV---FGKPMTDDS
SWS1_xenLa  AFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKK-LR-QPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIA-CSIDAFVGTLTGLVTGWSLAFLAFERYIVICKPM---GNFNFSSSHALAVVICTWIIGIVVSVPPFL--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRAV--AAQ------------Q-----QESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGL--DLRLVTIPAFFSKSSCVYNPIIYSFMNKQFRGCIMETV---CGRPMSDDS
SWS1_geoAu  AFYLQAAFMGFVFICGTPLNAIVLVVTIKYKK-LR-QPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTI-CALEAFFGSLAGLVTGWSLAFLAAERYIVICKPF---GNFRFGSKHALVAVGLTWMLGLSVALPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRAV--AAQ------------Q-----QESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNI--DLRFVTVPAFFSKASCVYNPLIYSFMNKQFRACILETV---CGKPITDES
SWS1_danRe  AFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKK-LR-QPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTL-CSMEAAMGSIAGLVTGWSLAVLAFERYVVICKPF---GSFKFGQGQAVGAVVFTWIIGTACATPPFF--GWS-RYIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRAV--AAQ------------Q-----AESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNK--DYRLVAIPAFFSKSSSVYNPLIYAFMNKQFNACIMETV---FGKKIDESS
SWS1_oryLa  AFYLQAAFMGFVFFVGTPLNFVVLLATAKYKK-LR-VPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTL-CALEAAVGAVAGLVTSWSLAVLSFERYLVICKPF---GAFKFGSNHALAAVIFTWFMGVGCACPPFF--GWS-RYIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRAV--AAQ------------Q-----AESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENK--DYRLVTIPAFFSKSSCVYNPLIYAFMNKQFNGCIMEMV---FGKKMEEAS
LWS_homSap  VYHLTSVWMIFVVIASVFTNGLVLAATMKFKK-LR-HPLNWILVNLAVADLAETVIASTISVVNQVYGYFVLGHPM-CVLEGYTVSLCGITGLWSLAIISWERWMVVCKPF---GNVRFDAKLAIVGIAFSWIWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPF--HPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCILQLF--GKKVDDGS  
LWS_monDom  VYNLTSLWMVFVVIASIFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQIYGYFILGHPL-CVLEGYTVSLCGITGLWSLAIISWERWVVVCKPF---GNVKFDAKLAMVGIIFSWVWAAVWTAPPLF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMATCCIFPLSIILLCYVQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTLFACFAAANPGYSF--HPLTASLPAYFAKSATIYNPIIYVFMNRQFRTCILQLF--GKKVDDGS  
LWS_ornAna  AYNVTSLWMIFVVIASVFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPM-CVLEGYTVSLCGITGLWSLSIISWERWIVVCKPF---GNVKFDAKLAMVGIVFSWVWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_galGal  VYNLTSLWMIFVVAASVFTNGLVLVATWKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPM-CVVEGYTVSACGITALWSLAIISWERWFVVCKPF---GNIKFDGKLAVAGILFSWLWSCAWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS  
LWS_anoCar  VYNITSVWMIFVVIASIFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPM-CVLEGYTVSTCGISALWSLAVISWERWVVVCKPF---GNVKFDAKLAVAGIVFSWVWSAVWTAPPVF--GWS-RYWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_xenTro  VYNISSLWMIFVVLASVFTNGLVLVATLKFKK-LR-HPLNWILVNMAIADLGETVIASTISVCNQIFGYFVLGHPM-CILEGYTVSVCGIAALWSLTVIAWERWFVVCKPF---GNIKFDGKLAATGIIFSWVWAAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMLVLMITCCIIPLAIIVLCYMHVWLTIRQV--AQQ------------Q-----KESESTQKAEREVSRMVVVMIIAYIFCWGPYTFFACFAAFNPGYNF--HPLAAAMPAYFAKSATIYNPIIYVFMNRQFRNCIYQLF--GKKVDDGS  
LWS_takRub  VYNVATVWMFIVVVLSVFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYTVSTCGIAALWSLTIISWERWVVVCKPF---GNVKFDAKWATGGIVFSWVWAAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCIIPLAIIILCYLAVWLAIRSV--AMQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRVCIMKLF--GKEVDDGS  
LWS_gasAcu  VYNLSTLWMFIVVALSVFTNGLVLVATAKFKK-LQ-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYVVSVCGITALWSLTIISWERWIVVCKPF---GNVKFDAKWATAGIVFSWIWSAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRAV--AMQ------------Q-----KESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRSCIMQLF--GKEVDDGS  
LWS_petMar  VFNLTSVWMIIVVVLSLFSNGLVLVATVKFKK-LR-HPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIATILIVFSWVWPASWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSF--HPIAAALPAYFAKGATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS  
LWS_letJap  MFNLTSVWMIIVVVLSLFTNGLVLVATMKFKK-LR-HPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIAIILIVFSWVWPACWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAF--HPLTAALPAYFAKSATIYNPVIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_geoAus  MYNLTSFWMIIVVILSLFTNGLVLVATLKFKK-LR-HPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPL-CVFEGFTVSVCGITALWSLAIISFERWMVVCKPF---GNLKFDGKVAIVLIIFSWAWSAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHTV--AQQ------------Q-----KESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS  
LWS_neoFor  VYNLTSLWMIFVVFASCFTNGLVLMATYKFKK-LR-HPLNWILVNLAIADLGETLIASTISVTNQIFGYFILGHPM-CMLEGFTVATCGITGLWSLTIIAWERWVVVCKPF---GNIKFDGKWAAGGIIFSWVWSAFWCAMPLF--GWS-RFWPHGLKTSCGPDVFSGEDKYGTRSFMIALMITCCIIPLGVIILCYIQVWWAIRTV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMIFAYCFCWGPYTFMACFGAAYPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIYQLL--GKKVDDGS  
PIN_galGal  TYVGVAVLMGTVVACASVVNGLVIVVSICYKK-LR-SPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRM-CELEGFMVSLTGIVGLWSLAILALERYVVVCRPL---GDFQFQRRHAVSGCAFTWGWALLWSTPPLL--GWS-SYVPEGLRTSCGPNWYTG--GSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRAA--AAQ------------Q-----KEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIII--QPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLLEML--CCGYQPQRTG
PIN_utaSta  IYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKK-LR-SPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTA-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFQQRHAVFGCVFTWMWSLVWTLPPLF--GWS-SYVPEGLRTSCGPNWYTG--GSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRAV--ATQ------------Q-----KEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVI--QPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLSTM--SCGHRPRGAQ
PIN_podSic  TYISVAVLMGLVVISATLVNGLVIVVSVQFKK-LR-SPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQAT-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFPARHAVLGCAFTWGWSFVWTVPPLL--GWS-SYVPEGLRTSCGPNWYSG--GSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRTV--AAQ------------Q-----KEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAI--RPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLYKM--SCGHRALSSQ
PIN_pheMad  VYTSLAALMGVVVLSASLANGLVIAVSVRFKR-LR-SPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTA-CRFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFQFQRRHAVIGCLYTWGWSLIWTVPPLF--GWS-SYVPEGLGTSCGPNWYMG--GTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRAV--AAQ------------Q-----KEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSI--QPGLASLPSYFSKTATVYNPIIYVFMNKQFRSCLLNTV--SCGRIPQTMP
PIN_xenTro  TFLTVAAVMCMVVILAFFVNGLVIVVTLKYKK-LR-SPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTM-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPM---GDFRFQQKHAILGCSFTWVWSFIWTSPPLF--GWC-SYVPEGLRTSCGPNWYTG--GTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRAV--AAQ------------Q-----KDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVI--EPTVASLPSYFSKTATVYNPIIYVFMNKQFRNCLMTLL--CCGRS-FGDD
PIN_bufJap  TYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKK-LR-SPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLV-CELEGFVVSLTGIVGLWSLAILAFERYIVICKPM---GDFRFQQRHAVMGCAFTWIWAFLWTSPPLI--GWC-SYVPEGLGTSCGPNWYTG--GTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRAV--AAQ------------Q-----KESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVI--DPTLASMPSYFSKTATVYNPVIYVFMNKQFRDCLTKLL--CCGRNPFGED
VAOP_galGa  HFRLVAAVMFVVTSLSLAENLAVILVTFKFKQ-LR-QPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYIVICRPV---GNMRLRGKHAAQGIAFVWTFSFIWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--AYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRKV--SNT------------Q-----GRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIEL--DPHLAAIPAFFSKTATVYNPIIYVFMNKQFRMCLIQMF--KCSAIETAES
VAOP_anoCa  NFHLISALMFVVTLFSLSENFTVILVTIKFKQ-LR-QPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYVVICRPL---GNMRLNGKHAALGVAFVWIFSFIWTVPPTM--GWS-SYTTSKIGTTCEPNWYSG--DYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRKV--SDT------------Q-----GRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIEL--DPRLAAIPAFFSKTATVYNPVIYVFMNNQFRKCLVQLF--QCSSQETMDA
VAOP_xenTr  NFHLLAALMFVVTSLSIAENFIVILVTAKFKQ-LR-QPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWA-CVLEGFAVTFFGIVALWSLSVLAFERYIVICRPL---GNLRLQGKHSALAIIFVWVFSFVWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--EMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRKV--SDT------------Q-----GRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDL--DPRLAAIPAFFAKTASMYNPIIYVYMNKQFRRCLYQMF--NINDPEAKES
VAOP_danRe  NYSVLAALMFVVTALSLSENFTVMLVTFRFQQ-LR-QPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWA-CVLEGFAVTFFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLVFVWSFSFIWTVPPVL--GWS-SYTVSRIGTTCEPNWYSG--NFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHV--DPRLAAIPAFVAKTAAVYNPIIYVFMNKQFRKCLVQLL--SCSKVTVVEG
VAOP_rutRu  NYKVLATLMFVVTAASLSENFAVMLVTFRFTQ-LR-KPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWA-CVLEGFAVTYFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLLFVWTFSFIWTIPPVL--GWS-SYTVSKIGTTCEPNWYSG--NFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHL--DPRLAAAPAFFSKTAAVYNPVIYVFMNKQFRKCLVQLL--RCRDVTIIEG
VAOP_takRu  NFTILAVLMFVVTSLSLCENFLVMFITFKFKQ-LR-QPLNYIIVNLAIADFLVSLTGGLISFLTNARGYFFLGRWA-CVLEGFAVTYFGIVAMWSLAVLSFERFFVICRPL---GNMRLQAKHAAIGLLFVWTFSFVWTFPPVL--GWN-RYTVSKIGTTCEPDWYSN--NMTSHSYIITFFSTCFILPLGIIFFCYGKLLRKLRKV--S--------------H-----GRLATARKPERQVTRMVVVMIVAFMVAWTPYATFAILVTIHPTIEL--DPRLASIPAFFSKTAAVYNPIIYVFMNKQFRKCLIQHF--IGMGVMAES 
VAOP_petMa  NFTMLAALMGTITALSLGENFAVIVVTARFRQ-LR-QPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHA-CVLEGFAVTYFGVVALWSLALLAFERYFVICRPL---GNFRLQSKHAVLGLAVVWVFSLACTLPPVL--GWS-SYRPSMIGTTCEPNWYSG--ELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKKA--SET------------Q-----RGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHL--DPLLAAVPAFFSKTATVYNPVIYIFMNKQFRDCFVQVL--PCKGLKKVSA
PPIN_anoCa  GYTIIAIIMATSCTLSVILNTAVIAITIKYRQ-LR-QPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVG-CVTEGFAMAFFGIVALCTIAVIAVDRAIVIAKPM---GTITFTTRKAMIGVAVSWIWSLVWNTPPLF--GWG-GYQMEGVMTSCAPDWANS--DPINVSYIICYFLFCFTIPFITILASYGYLIWTLRQV--AKV------------G----LAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYI--NPIIATIPMYMAKSSTFYNPIIYIFMNKQFRDCLVRCL--LCGRNPCASE
PPIN_xenTr  GYTILALIMAVFCAAALFLNVTVIVVTFKYRQ-LR-HPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIALDRVFVVCKPM---GTLTFTPKQALAGIAASWIWSLIWNTPPLF--GWG-SYELEGVMTSCAPNWYSA--DPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQV--AKL------------G----VAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHI--DPIIATVPMYLTKTSTVYNPIIYIFMNKQFQECVIPFL--FCGRNPWAAE
PPIN_petMa  GFTILAVIMAVFTLASLVLNSTVIIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGITWAWLWSFVWNTPPLF--GWG-SYKLEGVRTSCAPDWYSR--DPANVSYIVSFFSFCFAIPFLVIVVAYGRLLWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVAKPGVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_letJa  GFTILAVIMAVFTIASLVLNSTVVIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGIAWAWLWSFVWNTPPLF--GWG-SYELEGVRTSCAPDWYSR--DPANVSYITSYFAFCFAIPFLVIVVAYGRLMWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_ictPu  GYTILSIIMALSSTFGIILNMVVIIVTVRYKQ-LR-QPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVG-CVLEGFAVAFFGIAGLCSVAVIAVDRYMVVCRPL---GAVMFQTKHALAGVVFSWVWSFIWNTPPLF--GWG-SYQLEGVMTSCAPNWYRR--DPVNVSYILCYFMLCFALPFATIIFSYMHLLHTLWQV--AKL------------Q----VADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYI--NPVIGTIPAYLAKSSTVFNPIIYIFMNRQFRDYALPCL--LCGKNPWAAK
PPIN_oncMy  GFTILAVIIGVFSVSGVCMNVLVIMVTMRHRK-LR-QPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLG-CVLEGFAVAFFGIAGLCSVAVIAVDRYVVVCRPM---GAVMFQTRHAVGGVVLSWVWSFLWNTPPLF--GWG-SFELEGVRTSCSPNWYSR--EPGNMSYIILYFLLCFAIPFSIIMVSYARILFTLHQV--SKL------------K----VLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHI--NPLIATVPMYLAKSSTVYNPIIYVFMNRQFRDCAVPFL--LCGLNPWAS 
PPIN_danRe  GYTILAVIIGVFSVCGVILNVTVITVTLKYKQ-LR-QPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVG-CVLEGFAVAFFGIAALCSVAVIALERCMVVCRPV---GSISFQTRHAVFGVAVSWLWSFIWNTPPLF--GWG-RLQLEGVRTSCAPDWYSR--DLANVSFIVCYFLLCFALPFSVIVYSYTRLLWTLRQV--SRL------------Q----VCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYI--DPVIATVPMYLTKSSTVFNPIIYIFMNRQFRDRALPFL--LCGRNPWAA 
PPINa_cioI  TYSFLCVYMTFVFLLSCSLNILVIVATLKNKV-LR-QPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTM-CQIEGYFVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHSIFGIVITWVWSMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--EKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINa_cioS  VYSFLAVYMTFICLISCSLNILVITATLKNKV-LR-QPLNYIIVNLAVVDLLSGLVGGVISIFANGAGYFFWGKFM-CQVEGYTVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHAVIGIAVTWIWAMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--GNTERLFIILYFVFCFLIPLAIIVLCYGKLILQLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVICWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINb_cioI  IYTILAVYMTFIFLLAVSLNGFVIIATMKNKK-LR-QPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTV-CILEGYIVSVAGVCGLMSISVMAFERYFVVCKPY---GPFTLTNTHAALGIGFTWTWSVLWSTPGLI--WLD-GYVPEGLGTSCAPNWFSK--NKSERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQA--TRQ------------------SSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQL--DYGLGAVPVFFAKTANIYNPLIYIGLNKQFRDGVIKMV--FRGRNPWAEE
PPINb_cioS  TYSGLCVFMSFVFVLAVPLNLLVIVATYKNKD-LR-RPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSV-CIFEGYVMSVTGVCGILSITVMAFERYFVVCKPF---GQTNLKWSHAITGIVFTWTWSVIWHTPGLF--FWN-GYEPEGFGTSCAPNWFSQ--QKSERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKV--SKK------------------SMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPDNLL--SYGIGSVPAFFAKTATIYNPIIYMGLNKKFRDGVIRML--FKGRNPWLDG
PARIE_utaS  GYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTKRGYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKQFRDCAVEFI--TCGQVVLTSP
PARIE_anoC  GYGVLAFLMFINALFSLFNNFLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTQRAYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKEFRECAVEFI--TCGKVVLTSP
PARIE_xenT  GYSILSFLMFLNAVFSICNNAIVILVTLKHPQ-LR-NPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQF-CIFQGFAVNYFGIVSLWSLTLLAYERYNVVCEPI---GALKLSTKRGYQGLVFIWLFCLFWAIAPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQL--NRK-----------IE----QQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYI--SPLAATLPTYFAKTSPVYNPIIYIFLNKQFRTYAVQCL--TCGHINLDSL
PARIE_takR  GYSILSFLMFINTVLSVFNNSLAIAVMLKNPS-LL-QPINIFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPR---AGLKLTMRRSIIGLLFVWTFCLFWAVTPLL--GWS-SYGPEGVQTSCSLAWEER--SWNNYSYLILYTLLCFIFPVGVIIYCYCKVLTSMNKL--NKS-----------VE----LQGGLSCRRENKHAINMVLAMIIAFFVCWLPYTALSVVVVVDPELHI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDATLEVL--SCSRYIPHAS
PARIE_gasA  GYSILSFLMFINTVLTVFNNVLVITVLVRNPS-LL-QPMNVFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CIFQGFAVNYFGLVSLCTLTLLSYERYNVVCRPR---NALKLSMRRSIHGLLIVWTFCLFWAVAPLF--GWS-GYGPEGVQTSCSLAWEER--SWSNYSYLVLYTLLCFIVPVAVIIYCYAKVLTSMNTL--NRS-----------VE----VQGGRSSQKENDHAVSMVLAMIIAFFSCWLPYTALSVVVVVDPTLYI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDAALEML--SCGRYIAHMP
PARIE_danR  GYSILSYLMFINTTLSVFNNVLVIAVMVKNLH-FL-NAMTVIIFSLAVSDLLIATCGSAIVTVTNYEGSFFLGDAF-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPM---AGFKLNVGRSCQGLLLVWLYCLFWAVAPLL--GWS-SYGPEGVQTSCSLGWEER--SWRNYSYLILYTLMCFILPTVIITYCYSNVLLTMRKI--NKS-----------IE----CQGGKNCAEDNEHAVRMVLAMIIAFFICWLPYTAISVLVVVNPEISI--PPLIATMPMYFAKTSPVYNPIIYFLTNKRFRESSLEVL--SCGRYISRET
CILI2_plaD  SYVITAIYLCIVGVIGTLSNGVIMYLYFKDKS-LR-SPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLGGLASEMNLFIISVERYLAVVRPF---DVGNLTNRRVIAGGVFVWLYSLVFAGGPLV--GWS-SYRPEGLGTWCSISWQ--DRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE----AA-----------DA----QGGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGL--PIYAEVLPSLFAKSSQVYNPIIYVLMNKPYRSALVSLV--CRGRNPFDEA
CILI1_plaD  DYNICAAYLFFIACLGVSLNVLVLVLFIKDRK-LR-SPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLGGLAALMTLSVIAFVRCLAVLRLG---SFTGLTTRMGVAAMAFIWIYSLAFTLAPLL--GWN-HYIPEGLATWCSIDWL--SDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK----VA-----------KT-------GGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLL--HPVATVIPAMFAKSSTMYNPLIYVFMNKQFRRSLKVLL--GMGVEDLNSE
ENCEPH_hom  TYERLALLLGSIGLLGVGNNLLVLVLYYKFQR-LR-TPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVG-CVWDGFSGSLFGIVSIATLTVLAYERYIRVVH------ARVINFSWAWRAITYIWLYSLAWAGAPLL--GWN-RYILDVHGLGCTVDWK--SKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVED-----------LQ----TIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLV--TPTISIVSYLFAKSNTVYNPVIYVFMIRKFRRSLLQLLCL          
ENCEPH_mon  TYELLALLIATIGLLGLCNNLLVLVLYYKFQR-LR-TPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVG-CAWDGFSNTLFGIVSIMTLTVLAYERYNRIVH------AKVINFSWAWRAITYIWLYSLVWTGAPLL--GWN-RYTLEIHGLGCSVDWK--SKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRMLRCVEE-----------LQ----TIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLV--TPTVAIIASLFAKSSTAYNPIIYIFMSRKFRRCLLQLLCF          
ENCEPH_gal  TYELLALLIATIGTLGVCNNLLVLVLYYKFKR-LR-TPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------AKVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-RYTLEIHGLGCSMDWK--SKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRMLRCVED-----------FQ----TSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLV--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRQCLLQLLCF          
ENCEPH_ano  TYELLALLVAAIGLLGLCNNLLVLVLYAKFKR-LR-TPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------ARVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-HYTLEIHGLGCSVDWQ--SKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRMLRCVED-----------LQ----SIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLI--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRRCLVQLFCV          
ENCEPH_gas  TYKLLAFAIGTIGVFGFCNNVVVIVLYCKFKR-LR-TPTNLLVVNISLSDLLVSVIGINFTFVSCIRGGWTWSRAT-CIWDGFSNSLFGIVSIMTLASLAYERYIRVVH------AQVVDFPWAWRAIGHIWLYSLVWTGAPLL--GWN-RYTLEIHRLGCSLDWA--SKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQMLRSIQD-----------LQ----TVQIIKILRYEKKVAVMFLLMISCFLLCWTPYAVVSMMEAFGRKNMV--SPTVAIIPSFFAKSSTAYNPLICVFMSRKFRRCLMQLLCS          
ENCEPH_xen  TYHFLALIVATVGFLGLVNNLLVLILYCKFKR-LQ-TPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEM-CVFHGFSKNLLGIVSFGTLTVVAYERYARVVY------GKYVNSSWSKRSITFVWVYSLAWTGFPLI--GWN-LYTFETHKLDCSFEWT--ATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQKLRSVKN-----------IQ----NFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFI--TPTITVMPSLLAIASAAYNPVIHIFTIKKFRQCLVQLLPPINFHPPIN  
ENCEPH4a_t  GNLVVSVFLGFIGTFGLVNNLLVLVLFCRYKM-LR-SPINLLLMNISISDLLVCVLGTPFSFAASTQGRWLIGEAG-CVWYGFANSLFGVVSLISLAVLSFERYSTMMTPT---EADPSNYCKVCLGITLSWVYSLVWTVPPLF--GWS-SYGPEGPGTTCSVNWT--AKTTNSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ---VSG------------------INASTSRKREQRVLCMVVIMVICYLLCWLPYGVVALLATFGPPDLV--TPEASIIPSVLAKSSTVINPIIYVFMNKQFYRCFLALL--CCQDPRSGSS
ENCEPH4b_t  GHLVVAVCLGFIGTVGFLSNFLVLALFCRYRA-LR-TPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLIGRAG-CVWYGFVNACLGIVSLISLAVLSYERYCTMVSST---IASNRDYRPVLGGICFSWFYSLAWTVPPLL--GWS-RYGPEGPGTTCSVDWR--TQTPNNISYIVCLFTFCLLLPFFVILYSYGKLLHTIRQ---VRR------------------VSSTVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLL--TPEATITPSLLAKFSTVINPFIYIFMNKQFYRCFRAFL--NCSTPKRDST
ENCEPH4_br  GYTAIATCLALIGFVGFTNNFVVILLIGCHRQ-LR-TPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANSLFGIVSLVTLSALAFERYCVVVR-----SSDMLTYKSSLVVITFIWLYSLLWTSLPLL--GWS-SYQFEGHNVGCSVNWV--QHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM---SSE------------A----KPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLV--TPTASVIPSLVAKSSTAYNPIIYVLMNNQFREFLLARLQRVCCRQ     
ENCEPH5_br  GFDTVAVVIAAIGIAGFLSNGAVVLLFLKFRQ-LR-TPFNMLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANHLFGLVSLISLAVISYERYRMVVKPK-GPGSSYLTYNKVGLAIIFIYLYCLLWTTLPIV--GWS-SYQLEGPKISCSVAWE--EHSLSNTSYIVAIFIMCLLLPLLIIIYSYCRLWYKVKK---GSQ-----------NL----PPAIRKSSQKEQKIARMVVVMITCFLVCWLPYGAMALVVSFGGESLI--SPTAAVVPSLLAKSSTCYNPLVYFAMNNQFRRYFQDLL--CCGRRLFDAS
PIN_stoPur  TYNYLTVYTGFLTIFGILNNGIVMILFARFPS-LR-HPINSFLFNVSLSDLIISCLASPFTFASNFAGRWLFGDLG-CTLYAFLVFVAGTEQIVILAALSIQRCMLVVRPF---TAQKMTHRWALFFISLTWIYSLIICVPPLF--GWN-RYTYEGPGTACSVAWN--SPSPGDTSYIIFIFVLVLVIPFGIIIFCYGLLVYAVKK---ISR------------T----QAALSSEAKADRKVSKMIFIMILFFLIAWTPYTGFSLYVTFGKNVVI--TPLAGTFPPFFAKLCTIHNPIIYFLLNKQFKDALIQLF--CCGENPFDRD
ENCEPH_api  MYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILW-TPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKM-CLWYAWFMSTLGFASIGNLTVMAVERWLLVARPM-----QALSIRHAVILASFVWIYALSLSLPPLF--GWG-SYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKK-----------------VR----K-RAGASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFNAK-P--SATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRT     
ENCEPH1_an  AYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSI-CVAYGFFMSLLGIASITTLTVLSYERFCLISRPF---AAQNRSKQGACLAVLFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK-----------------NS----A-RVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFWRIRRSNGVAGQPD  
ENCEPH2_an  AYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTL-CVAYGFFMSLLGITSITTLTVLSYERYCLISRPF---SSRNLTRRGAFLAIFFIWGYSFALTSPPLF--GWG-AYVQEAANISCSVNWE--SQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE-----------------NS----A-RVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFSRVRNKGQQA      
ENCEPH_aed  AYVASAVTLFFIGFFGFFLNLFVIALMCKDVQ-LW-TPINIILFNLVCSDFSVSIIGNPFTLTSAISRHWIFGRTV-CIAYGFFMSLLGITSITTLTVLSYERFCLISHPF---SSRSLSRRGAVFAILFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTLNATSYIIFLFVFGLVVPLVVIVYSYTNIVVNMKR-----------------NA----A-RVGRINRAEKRVTRMVFVMVLAFMIAWTPYAVFALIEQFGPTDII--SPALGVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRNNE         
ENCEPH_cul  AYVATAVVLFFIGFFGFFLNLFVIALMCKEVQVLW-TPMNIILLNLVCSDFSVSIVGNPFTLSSAISHRWLFGRKL-CVAYGFFMSLLGITSITTLTVLSYERFYLISRPF---SSRSLSRRGALGAVLLIWCYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--TQTLNATTYIIYLFVFGLVVPLTVIVYSYTNIIVNMKK-----------------NA----A-RVGRINRAEKRVTTMVAVMVIAFMVAWTPYSVFALMEQFGPPDVI--GPGLAVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRHDP         
ENCEPH_tri  GYIAAAVVLFCIGFFGFSLNLTVIIFMLKERQ-LW-SPLNIILFNLVVSDFLVSVLGNPWTFFSAINYGWIFGETG-CTIYGFIMSLLSITSITTLTVLAFERYLLIARPF---RNNALNFHSAALSVFSIWLYSLSLTIPPLI--GWG-EYVHEAANLSCSVNWE--EKSPNSTSYILYLFAFGLFLPLVIITFSYVNIILTMRR-----------------NA----AFRVGQVSKAENKVAYMIFIMIIAFLTAWSPYAIMALIVQFGDAALV--TPGMAVIPALLAKSSICYNPVIYIGLNAQVKGAKWVSGLIYLFQFQQ   
ENCEPHa_ne  EANIVLGYYIAIFVIGFVTNTIVVIIFISSQR-LH-TTPNLILFSMSVCDWLMATMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVVSPM----TNSFNGRRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICVSVLFFLIPIVTMTFCFASIYHTIRNLSHEAT-----------ARWGSDARATQETIRAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGDTHRI--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRCAGKALLYQEHH       
ENCEPHb_ne  EANIVLGYYIAIFVIGFVTNTIVVITFIFSKR-LH-TTPNLILFSMSVCDWLMAAMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVASPM----TNSLNERRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICISVLFFFVPIVTMTFSFASIYKAIRNISHEAI-----------ARWGSHARATQETIKAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGGTHRN--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRRAAKLLFIKKVIRPTEA  
ENCEPHc_ne    HAITVMYSLLAAGAFVLNGIVLIIFLATRS-LR-TIPNMILLSMAWADWLMACLADAVGAYANANNWPSMVGGL-CVYYGFITTALGLTSMIHLTALSVERFVTVTIPM----TRPITETQMLLVVTFLWAFSFLWAIFPLV--GWS-SYGPEPGYAACSIAWYR--QDLNNMSYILCLFMFFFFLPIVIMIACFSSIYFTVRKLTRDSM-----------RRWGASSDSTQQTLAAERKTAWMSFIMVLAFLFAWVPYAVVSLYASFGGVTTI--PKLMSTLPAMLAKTSACYNPIIYFFMYSKFRKAFQRFFFKNVITPSQT  
MOLL_PERc_  EFRIIGIFISICCIIGVLGNLLIIIVF-AKRRSVR-RPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIG-CKIYAFLCFNSGVISIMTHAALSFCRYIIICQYG--YR-KKITQTTVLRTLFSIWSFAMFWTLSPLF--GWS-SYVIEVVPVSCSVNW--YGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKN-------GIRAQQRY----TPRFIQDIEQRVTFISFLMMAAFMVAWTPYAIMSALAI--GSFNV--ENSFAALPTLFAKASCAYNPFIYAFTNANFRDTVVEIMAPWTTRRVGV  
PER2_strPu  GYLLTAIYLTIVGSIATVGNITVICVL-CRYRTFRKRSINLLLINMAASDLGVSVAGYPLTTVSGYWGRWLFGDVG-CQFYAFCVYTLSCSTISTHAAIAVYRYIYIVKTD--LR-PKLTANFTSGVIVVIWVYAFFWTVTPFV--GWS-SYIYEPFGTSCSVNW--VGRTISDISYMVACTIGVYLLQIFIMLYCYIRVAKKIRGVDPGRT-------EEKDAGVVVFGRLRKREAKIDTHVTKMCFMMMLTFIVVWAPYAVECLRAA--HVHRI--SALSSVLPTMFAKSSCMVNPIIFLTSSSKFRQDLGKLWSRPSSQDSL   
PER1_strPu  GYLLTALYLTLVGIVSTIGNITVLCVL-CRYGTFRKRSVNILLMNMAVSDLGVSVAGYPLTAISGYRGRWVFADIG-CQFSGFCVYALSCSTISTHAVVAIYRYIYIVKPY--HR-PRLSSSTSCLAILCIWTFTLFWTITPFF--GWS-SYTYEPFGTSCSINW--YGKSLGDLTYIICCVVFVFILPIIIMLYCYIGVAKKIKGIDPLRT-------EERDIAVV-FGRLRKHETKIDTRVTKICFMMMASFIVVWTPYAVGSIWAS--KIGKI--SASASVLPTMFAKSSCMINPIIFLTSSSKFRADLGKLWNRPSSLEHTI  
PER_homSap  EHNIVATYLIMAGMISIISNIIVLGIF-IKYKELR-TPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAG-CQVYAGLNIFFGMASIGLLTVVAVDRYLTICLPD--VG-RRMTTNTYIGLILGAWINGLFWALMPII--GWA-SYAPDPTGATCTINW--RKNDRSFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDC-----------TESL------NRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKI--PPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPV  
PER_monDom  EHKIVAAYLITAGVISIVSNVIVLGIF-VKYKALR-TATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDG-CQIYAGLNIFFGMASIGLLTAVAIDRYLTICQPD--LG-R-MTSYNYTLMILTAWVNGFFWALMPIV--GWA-GYAPDPTGATCTINW--RKNDVSFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNC-----------PDHI------NRDWSNQVAVTKMSVVMILMFLLAWSPYSIVCLWASFGDPKEI--PPAMAIVAPLFAKSSTFYNPCIYVAANKKFRRAISAMIRCQTHQSMPI  
PER_galGal  EHNIVAAYLITAGVISIFSNIVVLGIF-VKYKEFR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTG-CQIYAALNIFFGMASIGLLTVVAVDRYLTICRPD--IG-RRMTTRNYAALILAAWINAVFWASMPTV--GWA-GYASDPTGATCTANW--RKNDVPFVSYTMSVIAVNFVVPLTVMFYCYYNVSRTMKQYTSSNC-----------LESI------NMDWSDQVDVTKMSVVMIVMFLVAWSPYSIVCLWSSFGDPKKI--SPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILAMVRCQTRQEITI  
PER_xenTro  EHNIVAAYLITAGVISILSNIIVLGIF-VKYKELR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVG-CQIYAGLNIFFGMASIGLLTVVAIDRYLTICRPD--IGGRRISGRHYTAMILAAWINAVFWSVMPVV--GWS-SYAPDPTGATCTINW--RKNDVSFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSS-----------LGGI------NADWSDQTDVTKMSMVMIVMFLVAWSPYSIVCLWSSFGDPRKI--PPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILSMVQCKSRQEVTL  
PER_gasAcu  EHNIVAGYLITAGVISLFSNIVVLLMF-WKFKELR-TATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAG-CQIYAALNIFFGMASIGLLTVVAIDRYLTICRPD--IGGQKMTMQSYNLLILAAWLNAVFWSSMPVV--GWA-SYAPDPTGATCTINW--RQNDVSFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNC-----------LDSA------NIDWSDQMDVTKMSIVMIIMFLVAWSPYSIVCLWASFGDPKTI--PAPMAIIAPLFAKSSTFYNPCIYVIANKKFRRAIIGMVRCQTRQRITI  
PERa_braFl  DHLIVGLYLFVIGIIGTVENGITLATF-TKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRQD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYSLEPSGTACTINW--QKNDSLYISYVTSCFILGFALPLAVMMFCYWQASCFVNKVLKGDI-----------SGDLTFPVAVNVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFGNPADI--PAWITLLPPLIAKSSALYNPIIYIIANRRFRSAIFSMVKGQNPDVET   
PERa_braBe  DHLIVGLYLFVIGIIGTIENGITLATF-SKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYALEPSGTACTINF--QKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQASCFVSKVLKGDI-----------AGDLTFPVAANVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADI--PAWLTLLPPLIAKSSALYNPIIYIIANRRFRNAICSMMKGQDPDVEDD  
PERc_braFl  GYLASAVYLTITGLIAFVGNIFAIIVFLTE-KEFRKKEHNSFALNLAIADLSVCVFAYPSSTISGYAGEWMLGDVG-CTIYGFLCFTFSLTSMVTLCAISVYRYIVICKPQ--YA-HLLTHRRTNYVILGIWLYALVFSVPPLF--GVN-RYTYEPI-ITCSLDW--NVQHVGETIYTAAVIIIVYVLNVSIMCFCYFNIIFKSANLKFAAL-------ASEKTR--------TAAKKDIWKTSMMCLAMVVSFLIAWTPYAVSSTWDIL-TEEDL--PIIATILPTMFAKSSCMMNPIIYSCCNGKFRQAALKTFSK          
PERc_braBe  GYLASAIYITLTGLIAFFGNVITITVFLTE-KEFRKKQQNGFVLNLAIADLSVCVFAYPSSAIAGYAGRWVLGDVG-CTIYGFLCFTFALVSMVTLCVISIYRYILICKPQ--YA-HLLTHRRTVYVIIGTWLYALVFTVPPLV--GVK-RYTYEPMQITCSLDW--NVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIFKSANLKFAAL-------ASEKTK--------MAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDIL-SAEDL--PIIATILPSLFAKSSCMMNPIIYACCNTKFRQAAVKSFRKLCGMCKQK  
PERb_braFl  SATIMGVYLTIVGLVSTVGNATVVLMFMLKWRQLCRKAPNLLIINLAAVDLCISVFGYPFSASSGFANQWLFSDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMTRTILAVVGAWVYGISVAVPPLF--GIA-GYTYESFGLSCTIDF--HGTTVADMVYLSILIILCYVINVAVMGTCYFKIIRKFSKHRFREV-------RDVRTS--------HQHSFERGVT-LRCILMTLFYLISWTPYTAVAVWTMV-GPPP---PVQLGMVAALTAKTHCAFNPILYMLMSEVYRKLVLRTMCPCCFNKISN  
PERb_braBe  SATIMGVYLTIVGLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFGYPFSASSGYAHQWLFPDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMNRTVLAVIGTWLYAIAVAVPPLF--NIA-RYTYEPSGLSCTIDF--RVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRKFSRHRFRQV-------RDIRTS--------HQRSFEMGVT-MRCILMTLFYLLSWTPYTAVCIWTMV-GPPP---PVVVSMAAALIAKTHCAFNPILYAFMSEVYRKLVFRTMCPCCFNRISC  
NEUR_homSa  ADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLR--PAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLS--YG-VWLKRKHAYICLAAIWAYASFWTTMPLV--GLG-DYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKS-SSKEV-------AHFDSRIHSSHVLEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGRPDSI--PIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEG  
NEUR_monDo  ADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLVIIWAYATFWATMPLA--GLG-NYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQSSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRT  
NEUR_ornAn  ADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLAIIWAYASFWATMPLV--GLG-NYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQNSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKN  
NEUR_galGa  ADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLR--PAEIMTVNLAVCDLGISV-GKPFSIISFFSHRWIFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLA--YG-TWLKRHHAFICLALIWAYATFWATVPFA--GVG-SYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKS-STKEV-------AHYDTRIQNSHILEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGQPDSV--PIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLK  
NEUR_anoCa  ADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLK--PAEIMTVNLAVCDLGISV-GKPFSIIAFFSHRWIFGWSG-CRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLS--YG-TWLKRHHVFICLGIIWSYAAFWATIPFA--GFG-NYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQS-STKEV-------AHYDTRIQNQHVLEMKLTKV-------AMLICAGFMFAWIPYAVVSVWSAFGRPDSV--PIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSRY  
NEUR_xenTr  ADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLR--PAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNA-CRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLR--YG-TWLKRRHAFIALAVIWAYATLWATLPLV--GVG-NYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKS-SAKEV-------AHFDTRNQNNHTLEIKLTK--------AMLICAGFLIAWFPYAVVSVWSAFGQPDSI--PIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKD--KSLQNTTSRY  
NEUR_gasAc  ADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLK--PPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEG-CRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLR--YG-TWLKRQHAFLCLVFVWMYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKS-SAKEI-------SNFDARIKNSHNLEIKLTKTRNCATEDAMLICAGFLIAWIPYAVVSVVSAFGEPDSV--PISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFR  
NEUR_calMi              GLLSTLGNGYVIYLSITQKRKLK--PPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVG-CRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQ--YG-SWLQRRHVFMSLAFIWFYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKS-SAKEV-------AHFDSRIQNHHSLEMNLTK                                                                                          
MEL1_homSa  AHYTLGTVILLVGLTGMLGNLTVIYTFCRSRS-LR-TPANMFIINLAVSDFLMSFTQA-PVFFTSSLYKQWLFGETGCEFYAFCGALFGISSMITLTAIALDRYLVITRPL--ATFGVASKRRAAFVLLGVWLYALAWSLPPFF--GWS-AYVPEGLLTSCSWDYMS--FTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGRALQTFGAC------KGNGESLWQRQ-RLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVL--TPYMSSVPAVIAKASAIHNPIIYAITHPKYRVAIAQHLPCLGVLLGVS  
MEL1_monDo  AHYTIGATILAVGFTGVLGNLLVIYTFCR----LR-TPANMFIINLAISDFFMSFTQA-PVFFASSMYKRWIFGEKACEFYAFCGALFGITSMITLMAIALDRYFVITRPL--ASIGVISKKKTGFILLGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYTT--FTPSVRAYTMLLFCFVFFIPLIVIIYCYIFIFRAIQDTNKAVHSIGSG------ESTA-SPRHCQ-RMKNEWKMAKIALVVILLYVLSWAPYSTVALVAFAGYSHIL--TPYMNSVPAIIAKASAIHNPIIYAISHPKYRMAIAQNFPCLRALLCVR  
MEL1_xenTr  VHYVVGAVILAVGITGMLGNFLVIYAFCRSRS-LR-SPANMFIINLAITDFLMSVTQA-PVFFATSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIAVDRYFVITRPL--TSIGVMSKKRAVLILSGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNRAVQKIGTD------N-NKESHKQYQ-KMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAGYASIL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYIPCLGSLLRVK  
MEL1_galGa  AHYTIGTVILIVGITGTLGNFLVIYAFCRSRT-LQ-KPANIFIINLAVSDFLMSITQS-PVFFTNSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITKPL--ASVRVMSKKKALIILVGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYMT--FTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANKSVQTFGCK------HGNRELQKQYH-RMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAGYSHVL--TPFMNSVPAVIAKASAIHNPIIYAITHPKYRTAIATYVPCLGFLLRVS  
MEL1_calMi  AHYIIGATILAVGVTGMVGNFLVIYAFLRSRS-LR-TPANTFIINLAATDFLMSVTQS-PIFFITSIHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITRPL--ASIGVLSHRRAGLIILSLWLYSLAWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLGVIIYCYIFIFRAIKSTNKKVG----G------STNRESQKQHQ-RMKNEWKMAKIALIVILLFVISWSPYSTVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYVPLLGLLLRVS  
MEL1_danRe  AHYTIGAVILTVGITGMLGNFLVIYAFSRSRT-LR-TPANLFIINLAITDFLMCATQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLMVIAVDRYFVITRPL--ASIGVLSQKRALLILLVAWVYSLGWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFIPLIVIIYCYFFIFRSIRTTNEAVGKINGD-------NKRDSMKRFQ-RLKNEWKMAKIALIVILMYVISWSPYSTVALTAFAGYSDFL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLAIAKYIPCLRLLLCVP  
MEL1_takRu  AHYTIGSVILVIGITGMIGNFLVIYAFCRSRS-LR-TPANMFIINLAVTDLLMCVTQT-PIFFTTSMYKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRAFVILMTVWIYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FSPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRATNKAVGKVNGS--VHSHSRRRESVKNFQ-RLQNEWKMAKIALMVILLYVISWSPYSCVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLALAKYIPCLGFLLCIS  
MEL1_gasAc  AHYTIGSVILAIGITGIIGNVLVIYAFSKSRS-LR-TPANMFIINLAITDLLMCVTQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIALDRYFVITRPL--TSIGMMSRRRALLILMGAWTYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRVTNRAVGKMNGS--IHSHGSGRDSTKNFH-RLQNEWKMAKIALIVILLYVVSWSPYSAVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRIALAKYIPFLGVLLCVP  
MEL1_oryLa  AHYTIGSVILAIGITGIIGNFLVIYAFSRSRS-LR-TPANMFIINLAITDLLMCVTQS-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRALLILSAAWAYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYVFIFRAIRSTNRAVGKINGN--T------RDAVKSFN-RLQNEWKMAKIALIVILLYVISWSPYSTVALTAFAGYADML--TPYMNSIPAVIAKASAIHNPIIYAITHPKYRMALAKYIPGLGVLLCIH  
MEL1D_danR  AHYTIGSVILAVGITGMVGNLLVMYAFCKSRS-LR-TPANMFIINLAVTDFLMCVTQT-PIFFTTSLHKRWIFGEKGCELYAFCGALFGICSMITLMIIAVDRYFVITRPL--ASIGVMSRKRALLILSAAWAYSMGWSLPPFF--GWSGAYVPEGLLTSCSWDYMT--FSPSVRAYTMLLFTFVFFIPLFVIIYCYFFIFKAIRETNRAVGKINGE------GGPRDSIKKIH-RMKNEWKMAKIALIVILLYVISWSPYSCVALTAF--YADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRSAIAKYIPCLGVLLCVP  
MEL2_galGa  VLYTVGTCVLVIGSIGIIGNLLVLYAFYSNKK-LR-TPQNFFIMNLAVSDFLMSASQA-PICFVNSLHREWILGDIGCDLYAFCGALFGITSMMTLLAISVDRYLVITKPL--RSIQWTSKKRTIQIIAAVWLYSLGWSVAPLL--GWS-SYVPEGLMISCTWDYVT--YSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGRDVQKLGSC---------SRKSFLSQ-SMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAGRGNTL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIHNAVPCLRFLIRIS  
MEL2_xenLa  VLYTIGSFILIIGSVGIIGNMLVLYAFYRNKK-LR-TAPNYFIINLAISDFLMSATQA-PVCFLSSLHREWILGDIGCNVYAFCGALFGITSMMTLLAISINRYIVITKPL--QSIQWSSKKRTSQIIVLVWMYSLMWSLAPLL--GWS-SYVPEGLRISCTWDYVT--STMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGRNVQKLGSY---------GRQSFLSQ-SMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAGHGKSL--TPYSKTVPAVIAKASAIYNPIIYGIIHPKYRETIHKTVPCLRFLIREP  
MEL2_anoCa  VLYTVGSCVLVIGCIGITGNLLVLYAFYSNKR-LR-TPPNYFIMNLAVSDFLMSATQA-PICFLNSMHKEWVLGDIGCNLYAFCGALFGITSMITLLAISVDRYCVITKPL--QSIKRTSKKRTCIIIVFVWLYSLGWSVCPLF--GWS-SYIPEGLMISCTWDYVT--YSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR------------------RKSSISH-SIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS  
MEL2_tetNi  VHYIIAFFVFVIGILGITGNVLVIFAFYSNKK-LR-SLPNYFIVNLAVSDLLMASTQS-PIFFIN-LYKEWMFGETACKMYAFCGALFGITSMINLLAISVDRYVVITKPL--QTIRRSSKRRTALAILMVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLAIILCCYLLMFLAIRKTSRR----------------KSTLIQQK-SIRSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS  
MEL2_gasAc  AHYIVAVFVVVIGTLGITGNALVMLAVYSNKK-LR-NLPNYFIMNLAVSDFLMAFTQS-PIFFINCLYKEWAFGETGCKIYAFCGALFGIASMINLLAISIDRYLVITKPL--QAIHWGSKRRTTLAILLVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLGIILYCYLFMFLAIRKTSRR----------------KSTLIKQK-SMKSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---IL--SPYSKAVPAIIAKASAIYNPFIYAIIHNKYRMTLAAKFPCLRFLSPTP  
MEL2_danRe  VHYIIAFLILIIGTLGVSGNALVMFAFYRNKK-LR-SLPNYFIMNLAVSDFLMAITQS-PIFFINCLYKEWMFGELGCKIYAFCGALFGITSMINLLAISIDRYLVITKPL--QTIQWNSKRRTGLAILCIWLYSLAWSLAPLI--GWG-SYIPEGLMTSCTWDYVS--PSPANKSYTMMLCCFVFFIPLSIILYCYLFMFLSVRQASRQ----------------KSSFVKQQ-SMRSEWKLAKIAAVVIVVYVLSWAPYACVTLVAWAG----L--TPYSKTLPAVLAKSSAIYNPFIYAIIHNKYRATLAEKVPGLSCLSRSQ  
MEL1a_braF  AHYIVGTAVFCVGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVPEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAVFAKSSAVYNPIVYAITHPKFRAAVKKHIPCLSGCLPAD  
MEL1a_braB  AHYIVGTAVFCIGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVSEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAMFAKSSAVYSPIVYAITYPKFREAVKKHIPCLSGCLPAS  
MOLL_RHO_l  VYYSLGIFIAICGIIGCAGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPMMTISCFLKHWVFGQAACKVYGLIGGIFGLTSIMTMTMISIDRYNVIRRPM--SASKKMSHRKAFIMIVFVWIWSTIWAIGPIF--GWG-AYQLEGVLCNCSFDYIT--RDASTRSNIVCMYIFAFMFPIVVIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQSLLSWSPYAIVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAIASNFPWILTCCQ    
MOLL_RHO_s  VYYSLGIFIGICGIIGCTGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFIKKWVFGMAACKVYGFIGGIFGLMSIMTMSMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSTLWSIGPIF--GWG-AYVLEGVLCNCSFDYIT--RDSATRSNIVCMYIFAFCFPILIIFFCYFNIVMAVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQFLLSWSPYAVVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPLIYSVSHPKFREAIAENFPWIITCCQ    
MOLL_RHO_t  VYYSLGIFIGICGIIGCGGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFLKKWIFGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPM--AASKKMSHRRAFIMIIFVWLWSVLWAIGPIF--GWG-AYTLEGVLCNCSFDYIS--RDSTTRSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GANAEMRLAKISIVIVSQFLLSWSPYAVVALLAQFGPLEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQ    
MOLL_RHO_e  VYYSVGIFIGVVGIIGILGNGVVIYLFSKTKS-LQ-TPANMFIINLAMSDLSFSAINGFPLKTISAFMKKWIFGKVACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSIVWSVGPVF--NWG-AYVPEGILTSCSFDYLS--TDPSTRSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISMVIITQFMLSWSPYAIIALLAQFGPAEWV--TPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQ    
MOLL_MEL_p  WHYIIGVYITIVGLLGIMGNTTVVYIFSNTKS-LR-SPSNLFVVNLAVSDLIFSAVNGFPLLTVSSFHQKWIFGSLFCQLYGFVGGVFGLMSINTLTAISIDRYVVITKPL--QASQTMTRRKVHLMIVIVWVLSILLSIPPFF--GWG-AYIPEGFQTSCTFDYLT--KTARTRTYIVVLYLFGFLIPLIIIGVCYVLIIRGVRRHDQKMLTIT-R--S-MKTED--ARANNK-RARSELRISKIAMTVTCLFIISWSPYAIIALIAQFGPAHWI--TPLVSELPMMLAKSSSMHNPVVYALSHPKFRKALYQRVPWLFCCCK    
LOPH_RHO_p  WHYAVAAWMTFFGILGVSGNLLVVWTFLKTKS-LR-TAPNMLLVNLAIGDMAFSAINGFPLLTISSINKRWVWGKLWRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPL--GAAQTMTKKRAFIILTIIWANASLWALAPFF--GWG-AYIPEGFQTSCTYDYLT--QDMNNYTYVLGMYLFGFIFPVAIIFFCYLGIVRAIFAHHAEMMATA-K--R-MGAN---TGKADA-DKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIHPLLAEIPVMMAKASARYNPIIYALSHPKFRAEIDKHFPWLLCCCKPK  
RHAB_schMe  YHYLVGVYISIVGISGVLGNLLVLYIFARAKS-LR-TPPNMFIMSLAIGDLTFSAVNGFPLLTISSFNTRWAWGKLTCEIYGFIGGLFGFISINTMALISLDRYFVIAQPF--QTMKSLTIKRAIIMLVFVWLYSLIWSTPPFF--GYG-NYVPEGFQTSCTFDYLT--QSKGNIIFNIGMYIGNFIIPVGIIIFCYYQIVKAVRVHELEMLKMA-Q--K-MNASHPTSMKTGA--KKADVQAAKISVIIVFLYMLSWTPYAIIALMALTGRRDHL--NPYTAELPVLFAKTSAMYNPFIYAINHPKFRIQLEKKFPCLICCCPPK  
MEL1_schMa  YYYLVGIYIGIVGILAVMGNSLVITLFLLCKQ-LR-TPPNMLIVSLAISDFSFALINGFPLKTIAAFNHRWGWGKLACELYGFAGSIFGFISLTTMAFIALDRYLVIVQPF--ETFSRITYGKVIVMIFITWIWSALWSIPPFF--GYG-SYIPEGFHTSCTFDYLS--TDLPNLIFNAGLYILGFLCPVFIIIFSYYQIVKTVRLNELELMKMA-Q--S-LDLQNPSAMKTGG-DKKADIEAAKTSIILVLLYLMSWSPYAIVCLMTLIGSRDSL--TPFHSELPVLFAKTSAVYNPIVYAVKHPKFRMEIEKRFPFLICCCPPK  
MEL1_capCa  IYYGLGLYMAVVGIVGTLGNLVVITLFI--KS-LR-TPPNMFIINLALSDMGFCATNGFPLMTVASFQKLWRWGPVACELYALAGSITGFNSIATLALISMDRYMVIAKPF--YAMKHVSHKRSLIQIILAWTWAFIWSAPPLLRMGYG-RYIPEGFQVSCTFDYLS--RDLKNLIFVWCLFVFGFFIPVLAIACSYVGIIRAVGAQSKEMRKTA-E--K-MGAK---TGKSDK-EKKQDIAMAKVAAGTIGLFLMSWTPYAAVSMIGIAGNRSWI--TPYVSQIPVMFAKASAMWNPILYALSHPKFRAALEDHMPWLLVC      
MEL2_schMa  YQYAIGLFIAVVGITGMCLNLLVIVFFTMFKS-LR-TPSNILVVNLAISDFGFSAVIGFPLKTMAAFNNFWPWGKLACDLYGLAGGLFGFVSLSTIAAVALDRYLVIATPF--ESVFQTTPRRTLLLMLFLWMWSLMWTIPPLFGFG-K-RYVTEGYQTSCTMDYIS--TDLNNRLFNIGLFGFGFLCPLFLSLFCYARIILIVRSRGKDFIEMAAS--S-KGTNQKEKSANVS-SSKSDTFVSKSSAILLGVYLICWTPYSFVCLMALIGYADYI--TPLMVEIPCLCAKTA---NPCIYAFRYPKFRSLLQQRFGFLRLTKNRV  
MEL1_helRo  FYYFLGTFFAVVGFLGVFGNIIVVWVFSRTPS-LR-TPSNVLVINLAICDILFSALIGFPMSALSCFQRHWIWGNF-CQFYSFVAGITGLASINCLAVIAVDRYLVVGQPL--AMLNQSHFRRSFYHVLIIWTWACVWSAMPLI--GWG-EYILEGFGVSCTFDYLT--RTTWNISFNVCLFTFCFGMPVSVIILSYIGIIRSIAKNRKEFSSL--------------TAENSS-RARQEIKIAKVFAVCMTAFILCWVPYATVAQLGIYGYDQMV--SPYTAELPVMLAKTSALWNPIIYAFSHPKYRKCLKELPIF          
MEL1_strPu          MNAVTTALPHGLNKPTIEARWTKS-LR-TPPNMLIVNLAISDFGMVITN-FPLMFASTIYNRWLFGDAGCQFYAFCGALFGIMSIANMTAIALDRYYVICWSL--EAVRSVTHRRSMIIIIIVWCYAIFWSIPPFFGVG---SYVLEGYGLGCTFDFMT--KDLNHYLHVSFLFASSFVVPVTIIIVCFTRIAITVRAHRHELNKMRTKLTEDKDKKHKSSIRRAN-KAKTEFQIAKVGFQVTIFYVLSWMPYSIVAVIGQYFDSDLL--TPLGTVVPVIFAKCSAIWNPIIYCLSHEKFNAALKEKLMGMCGIEIPS  
MEL2_strPu  AFPLIGGYLLVVVLLGTAGNSLVIYTFLRFKK-LH-SPINLLIVNLSASDLLVATT-GTPLSMVSSFYGRWLFGTNACAFYGFVNYYCGCISLNSLAAISVFRYIIVVRGQ--AQNNKLSLRSSIYAILVIHLYTLIFSTPPLY--GWN-RFVLAGYHTSCDIDFHT--KTPLFVSYICYMFFFLFFLPLGLISWSYFKIYQRVSK--HSNSMRTSFTGVTKEINSDEKHANHR-------RTASTLFVTIVVFLFAWFPYCIVSLWVLIGDANSI--SKLSTTIPSLFAKSSVIYNPLIYVVLNSKFRKALIQTLSFLKCLSKHE  
MOLL_MEL_a  VHLSVGVFITLVGVLAVCGNSLVIITCIRFKD-LR-TRSNILIINLAVGDLLMCLI-DFPLLAAASFYGEWPYGRQVCQMYAFLTAIAGLVTINTLAVIAADRYWAVVRRP--TPGQKLPKCVTSIAVASVWAYSISWALCPIL--GWG-AYVLDGIRTTCTFDFLT--RTWENRSFVIGMMIGNFVLPFALMVFSYFRIWVAVRKVKSGNVFCAIRHNYNLALGSTLFVKQHRYRLHCEQKTVKIIMFLLIAFTVSWSPYLAVSIIGLFGDRSQL--TYQNTLTASLIAKTSMVFNPILYSISHPKVRKRIANLACCYSVRRHQQ  
MOLL_MEL_l  CQYTIGIFISTVAVIAVIGNSIVIWAHVRIKS-LS-TTSNMLILNLCVGCLIMCIV-DFPLYATSSFLQKWIFGHKVCEIYATITGTAGLLIMNSYSAIAFDRFITVTRYN--NPNYPRSKSATMCISGFVWIYSLSWSMAPVV--GWS-RYQLDGSGTTCTFDYLS--TTWTNRSFILSIAFFNFVLPLCFILFAYSRILHLISS--HSREMKSYRSAVIISKGKASIPKRFR----SERKTAITLLITVVVFCLSWVPYVIIALIGQFGNQSFI--TPQISVIPQLVAKLSTVTNPILYSLSHPVVRNKLFLRLRHELYRRPSD  
CHEL_LWS_l  WYSILGVAMIILGIICVLGNGMVIYLMMTTKS-LR-TPTNLLVVNLAFSDFCMMAFMMPTMTSNCFAE-TWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGM--AA-APLTHKKATLLLLFVWIWSGGWT-ILPF-FGWS-RYVPEGNLTSCTVDYLT--KDWSSASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQLREQAKK-----MNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIISWAGVFSSGTRL--TPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLACGSGE   
CHEL_LWS_i  WHSLLGFAMVILGVISVVGNSMVIYIMTTSKS-LR-SPTNMLVVNLAFSDWCMMAFMMPTMAANCFAE-TWILGPFMCEVYGMVGSLFGCGSIWSMVMITLDRYNVIVRGV--AA-APLTHKRAALMIFFVWFWALTWT-LLPF-FGWS-RYVPEGNMTSCTIDYLT--KALWSASYVVAYAGGVYWTPLFINIYCYSKIVRAVAQHEKQLRLQARK-----MNVASLRANAEQTKTSAEARLAKIALMTVGLWFMAWTPYLTIAWAGIFSDGSKL--TPLATIWGSVFAKANACYNPIVYGISHPKYRAALARRFPSLVCMPPGG  
INSE_LWS1_  WHKILGLVMIILGIMGWCGNGVVVYVFIMTPS-LR-TPSNLLVVNLAFSDFIMMGFMCPPMVICCFYE-TWVLGSLMCDIYAMVGSLCGCASIWTMTAIALDRYNVIVKGM--SG-TPLTIKRAMLQILGIWLFGLIWT-ILPL-VGWN-RYVPEGNMTACGTDYLS--QDWTFKSYILVYSFFVYYTPLFTIIYSYYFIVSAVAAHEKAMKEQAKK-----MNVTSLRSGDNQNTSA-EAKLAKVALTTISLWFMAWTPYLVINYIGIFNR-SLI--TPLFTIWGSLFAKANAIYNPIVYGISHPKYRAALKEKLPFLVCGSTED  
INSE_LWS2_  WHGILGFVIGMLGFVSAMGNGMVVYIFLSTKS-LR-TPSNLFVINLAISNFLMMFCMSPPMVINCYYE-TWVLGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLSINGALIRIIAIWLFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYFN--RGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINFSGIFNL-VKI--SPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLACAAEPS  
INSE_LWS_c  WHAILGFVIGILGMISVIGNGMVIYIFTTTKS-LR-TPSNLLVINLAISDFLMMLSMSPAMVINCYYE-TWVLGPLVCELYGLTGSLFGCGSIWTMTMIAFDRYNVIVKGL--SA-KPMTINGALLRILGIWFFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYLT--KDLLSRSYILVYSFFCYFLPLFLIIYSYFFIIQAVAAHEKNMREQAKK-----MNVASLRSAENQSTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-VKI--NPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFQRFPSLACSSGPA  
INSE_LWS_p  WHGLLGFTIGVLGFISITGNGMVVYIFTSTKS-LK-TPSNLLVVNLAFSDFLMMLCMAPPMLINCYYE-TWVFGPLACELYACAGSLFGSISIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRILGIWLFSLAWT-IAPM-LGWN-RYVPEGNMTACGTDYLS--KSWLSRSYILVYSIFVYYTPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFET-API--SPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYQKFPSLACQPSA   
INSE_LWS_m  WHALLGFTIGVLGFVSISGNGMVIYIFMSTKS-LK-TPSNLLVVNLAFSDFLMMCAMSPAMVVNCYYE-TWVWGPFACELYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPMTSNGALLRILGIWVFSLAWT-LLPF-FGWN-RYVPEGNMTACGTDYLS--KSWVSRSYILIYSVFVYFLPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFES-API--SPLATIWGSLFAKANAVYNPIVYGISHPKYQAALYAKFPSLQCQSAP   
INSE_LWS_v  WHGLLGFVIGILGFISITGNGMVIYIFTTTKS-LK-TPSNILVVNLAFSDFLMMCVMSPPMVVNCYTE-TWVFGPLACQLYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPLTINGAMLRVLGIWVFSLAWT-VAPL-FGWG-RYVPEGNMTACGTDYLD--KSWFNRSYILIYSIFCYFSPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-ATI--TPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYARFPALACQPSP   
INSE_LWS_t  WHGILGFVIGVLGFVSIVGNGMVIYIFSSTKA-LR-TPSNLLVVNLAFSDFLMMXCMSPAMVINCYNE-TWVLGPLVCELYGMSGSLFGCASIWTMTFIALDRYNVIVKGL--SA-QPLTKKGAMLRILIIWVFSTLWT-IAPF-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYAVWVYFVPLFTIIYSYWFIVQAVAAHEKSMREQAKK-----MNVASLRSSEAAQTSA-ECKLAKIALMTITLWFFAWTPYLVTNFTGIFEG-AKI--SPLATIWCSLFAKANAVYNPIVYGISHPKYRQALQKKFPSLVCAGEP   
INSE_LWS_s  WHGLLGFVIGVLGVISVIGNGMVIYIFSTTKS-LR-TPSNLLVVNLAFSDFLMMFTMSAPMGINCYYE-TWVLGPFMCELYALFGSLFGCGSIWTMTMIALDRYNVIVKGL--SA-KPMTNKTAMLRILFIWAFSVAWT-IMPL-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYSFFVYLLPLGTIIYSYFFILQAVSAHEKQMREQRKK-----MNVASLRSAEASQTSA-ECKLAKVALMTISLWFFGWTPYLIINFTGIFET-MKI--SPLLTIWGSLFAKANAVFNPIVYGISHPKYRAALEKKFPSLACASSS   
INSE_LWS_b  WHGILGFVIGLLGFISVSGNGMVVYIFLSTKS-LR-TPSNMFVINLAISDFLMMFCMSPPMVINCYYE-TWVLGPLFCQVYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLTINGALLRILGIWLFSLIWT-IAPM-FGWN-RYVPEGNMTACGTDYFS--KDIVSVSYILLYSIWVYFFPLFLIIWSYWFIXQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINWSGIFSL-VKI--SPLYTIWGSLFAKANAV                                 
INSE_LWS_d  WFGIIGFVIAILGTMSLAGNFIVMYIFTSSKG-LR-TPSNMFVVNLAFSDFMMMFTMFPPVVLNGFYG-TWIMGPFLCELYGMFGSLFGCVSIWSMTLIAYDRYCVIVKGM--AR-KPLTATAAVLRLMVVWTICGAWA-LMPL-FGWN-RYVPEGNMTACGTDYFA--KDWWNRSYIIVYSLWVYLTPLLTIIFSYWHIMKAVAAHEKAMREQAKK-----MNVASLRNSEADKSKAIEIKLAKVALTTISLWFFAWTPYTIINYAGIFES-MHL--SPLSTICGSVFAKANAVCNPIVYGLSHPKYKQVLREKMPCLACGKDDL  
CRUS_LWS_m  WYGILAFVVTVVGLCSICGNFVVIWVFMNTKA-LR-SPANTLVVSLAVSDFIMMACMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGI--SG-TPLSQKNTTLQVLFVWICSIMWC-VFPF-FGWN-RYVPRGDMTACGTDYLT--EDEFSRSYLYVYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-ECRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKANAVYNPIVYAISHPKYRAALYKKLPCLACSTESA  
CRUS_LWS_n  WYGILAFVVTVVGLCSICGNFVVIWVIMNTKA-LR-SPANTLVVSLAVSDYIMMTCMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGV--SG-KPLSQKNATLQVLFVWICSIMWC-VFPF-FGWN-RYVPEGNMTACGTDYLT--EDEFSRSYLYIYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-GCRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKSNAVYNPIVYAISHPKYRAALYKKLPCLACSTESA  
INSE_MWS_d  WAKILTAYMIIIGMISWCGNGVVIYIFATTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLYFE-TWVLGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGM--AG-RPMTIPLALGKIAYIWFMSTIWCCLAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYLVINCMGLFKF-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVFGKVDD  
INSE_MWS_c  WAKFLAAYMVLIATISWCGNGVVIYIFSTTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLFYE-TWVLGPLMCDIYGGLGSAFGCSSILSMCMISLDRYNVIVKGM--AG-QPMTIKLAIMKIALIWFMASIWT-LAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYLPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYTIINTLGLFKY-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYGIALKEKCPCCVFGKVDD  
CRUS_LWS_c  MYPLLLVFMLITGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMITCYYH-TWTLGATFCEVYAFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILTVWVLSFTWC-VAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLAITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS                        
CRUS_LWS_p  MYPLLLIFMLFTGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMVTCYYH-TWTLGPTFCQVYGFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILIVWVLSLAWC-MAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLTITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS                        
INSE_LWS_h  WHGLLGFVIGVLGFISVTGNGMVVYIFTTTKS-LK-TPSNILVVNLAFSDFLMMFMMAPPMVINCYNE-TWVFGPLACQLYACAGSLYGCVSIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRVFGIWAFSLAWT-IAPL-FGWG-RYVPEGNMTACGTDYFD--QSFSNRSYILLYSIACYYAPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFKTMT                                                      
CRUS_LWS_e  WYGLLGFVIFCLGCLSVFGNSVVIWVFTSTKT-LR-SPANMLVVNLALSDFLMMANMSPPTVHSCYHG-TWMLGPTYCEYYALVGSLSGCISIWTMVWITLDRYNVIVKGV--AA-TPLTNKGAFARNIFSWLSALIWC-VSPL-YGWN-RYVPEGNMTACGTDYLT--DDWLSHSYLYAYTFWVYLFPFFIIVYCYTYIVSAVFAHEKGMRDQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALVTVSLWFIAWTPYCVINVTGMWDK-TKI--TPLFTIWGSL                                        
CRUS_LWS_a  WYGLLGFVIFCLGILSVCGNAVVIWVFMNTKS-LR-SPANLLVVNLAFSDFLMMLNMFPPMVHSCYHG-TWMLGAFFCEFYGFTGSLFGCISIWTMVFITMDRYNVIVKGV--AA-EPLTSKGASIRILFVWTVAFAWT-ILPF-FGWN-RYVPEGNLTACGTDYLT--EDSTSHLYLYMYASWAYYTPLLYIIYAYTFIVQAVSAHEKGMREQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALMTVSLWFMAWTPYMIINFTGMNDR-TKL--TPLCTIWGSL                                        
CRUS_LWS_h  WYGLLGFWMTVMGTLSVAGNFVVIWVFMNTKS-LR-TPANLLVVNLAISDFFMMLTMTPPLLANAYWG-TWILGAFFCEVYAFLGSFFGCVSIWSMVFITADRYNVIVKGV--SA-EPLTSGGAMMRIAGTWAFTLAWC-LPPF-FGWN-RYVPEGNMLACGTDYLT--ETELSRSYLYVYSVWVYLFPLAYIIYSYTFIVKAVAAHEKGMREQAKK-----MGVKSLRSEEAQKTSA-ECRLCKVALMTVTLWFMAWTPYFIINWGGMFNK-PMV--TPLFS                                             
CRUS_MWS_h  WHYLLGVVYLFLGVISIAGNGLVIYLYMKSQA-LK-TPANMLIVNLALSDLIMLTTNFPPFCYNCFSGGRWMFSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGF--NG-PKLTQGKATFMCGLAWVISVGWS-LPPF-FGWG-SYTLEGILDSCSYDYFT--RDMNTITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAAMRAQAKK-----MNVTNLRSNEAETQRA-EIRIAKTALVNVSLWFICWTPYAAITIQGLLGNAEGI--TPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCVHEKDP  
INSE_UVV_c  LHYLLAIVYILFTFVALFGNGLVIWIFCSAKS-LR-TPSNLFVVNLAFCDFMMMLKA-PIFIYNSFHT-GFATGHLGCQIFACMGSLSGIGAGMTNAAIAYDRYSTIARPL--DG-K-LSRGQVLLLIMLIWTYTIPWALMPLM-QVWG-RFVPEGFLTSCSFDYLT--DSQEIRYFVPTIFTFSYCVPMLLIIYYYSQIVGHVVSHEKALREQAKK-----MNVESLRSNVNTNAQSAEIRIAKAAITICFLFVLSWTPYGALAMIGAFGNRALL--TPGITMIPACACKFVACLDPYVYAISHPRYRLELQKRLPWLELQEKP   
INSE_UVV_a  LHYLLALLYILFTFLALLGNGLVIWIFCAAKS-LR-TPSNMFVVNLAICDFFMMIKT-PIFIYNSFNT-GFALGNLGCQIFAVIGSLTGIGAAITNAAIAYDRYSTIARPL--DG-K-LSRGQVILFIVLIWTYTIPWALMPVM-GVWG-RFVPEGFLTSCSFDYLT--DTNEIRIFVATIFTFSYCIPMILIIYYYSQIVSHVVNHEKALREQAKK-----MNVDSLRSNANTSSQSAEIRIAKAAITICFLYVLSWTPYGVMSMIGAFGNKALL--TPGVTMIPACTCKAVACLDPYVYAISHPKYRLELQKRLPWLELQEKP   
INSE_UVV_m  AHTALALLYIFFTFAALVGNGMVIFIFSTTKS-LR-TSSNFLVLNLAILDFIMMAKA-PIFIYNSAMR-GFAVGTVGCQIFALMGAYSGIGAGMTNACIAYDRHSTITRPL--DG-R-LSEGKVLLMVAFVWIYSTPWALLPLL-KIWG-RYVPEGYLTSCSFDYLT--NTFDTKLFVACIFTCSYVFPMSLIIYFYSGIVKQVFAHEAALREQAKK-----MNVESLRANQGGSSESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGNQQLL--TPGVTMIPAVACKAVACISPWVYAIRHPMYRQELQRRMPWLQIDEPD   
INSE_UVV_p  AHTMLALVYVFFTAAALIGNGLVIFIFSASKS-LR-TPSNLLVVQLAVLDFLMMLKA-PIFIYNSIKR-GFASGVIGCQIFAFMGSVSGTAAGLTNACIAYDRHSTITRPL--DG-R-LSRGKVLLMMVCVWLYTAPWAILPQL-QIWG-RYVPEGFLTSCTFDYLT--TTFDNKLFVASMFVCVYIFPMIAILYFYSGIVKQVFAHEAALREQAKK-----MNVDSLRSNQNAAAESAEIRIAKAALTVCFLYVASWTPYGVMSLIGAFGDQNLL--TPGVTMIPALACKGVACIDPWVYAISHPKYRQELQKRMPWLQIDEPD   
INSE_UVV_d  MHYMLGVFYIFLFCASTVGNGMVIWIFSTSKS-LR-TPSNMFVLNLAVFDLIMCLKA-PIFIYNSFHR-GFALGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPM--NR-N-MTFTKAVIMNIIIWLYCTPWVVLPLT-QFWD-RFVPEGYLTSCSFDYLS--DNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKALREQAKK-----MNVESLRSNVDKSKETAEIRIAKAAITICFLFFVSWTPYGVMSLIGAFGDKSLL--TPGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGVNEKS   
INSE_BLU_m  WHYVLALIYTMLMVTSLTGNGIVIWIFSTSKS-LR-SASNMFVINLAVFDLMMMLEM-PLLIMNSFYQ-RLVGYQLGCDVYAVLGSLSGIGGAITNAVIAFDRYKTISSPL--DG-R-INTVQAGLLIAFTWFWALPFTILPAF-RIWG-RFVPEGFLTTCSFDYFT--EDQDTEVFVACIFVWSYCIPMALICYFYSQLFGAVRLHERMLQEQAKK-----MNVKSLASNKEDNSRSVEIRIAKVAFTIFFLFICAWTPYAFVTMTGAFGDRTLL--TPIATMIPAVCCKVVSCIDPWVYAINHPRYRAELQKRLPWMGVREQDP  
INSE_BLU_a  FHIGLAIIYSMLLIMSLVGNCCVIWIFSTSKS-LR-TPSNMFIVSLAIFDIIMAFEM-PMLVISSFME-RMIGWEIGCDVYSVFGSISGMGQAMTNAAIAFDRYRTISCPI--DG-R-LNSKQAAVIIAFTWFWVTPFTVLPLL-KVWG-RYTTEGFLTTCSFDFLT--DDEDTKVFVTCIFIWAYVIPLIFIILFYSRLLSSIRNHEKMLREQAKK-----MNVKSLVSN-QDKERSAEVRIAKVAFTIFFLFLLAWTPYATVALIGVYGNRELL--TPVSTMLPAVFAKTVSCIDPWIYAINHPRYRQELQKRCKWMGIHEP    
INSE_BLU_d  YHAGFYIAFIVLMLSSIFGNGLVIWIFSTSKS-LR-TPSNLLILNLAIFDLFMCTNM-PHYLINATVG-YIVGGDLGCDIYALNGGISGMGASITNAFIAFDRYKTISNPI--DG-R-LSYGQIVLLILFTWLWATPFSVLPLF-QIWG-RYQPEGFLTTCSFDYLT--NTDENRLFVRTIFVWSYVIPMTMILVSYYKLFTHVRVHEKMLAEQAKK-----MNVKSLSANANADNMSVELRIAKAALIIYMLFILAWTPYSVVALIGCFGEQQLI--TPFVSMLPCLACKSVSCLDPWVYATSHPKYRLELERRLPWLGIREKHA  
MEL1b_braF  MQLVFGSMMLVFGLIGVVGNAVALYAFCRSRS-LR-RPKNYLIANLCLTDMVVCLVYSPIIVTRSLSH--GLPSKESCIVEGFVVGLGSIVSICSLAGIAVERYVTITQPI--KSLSILTHRALLGAVSAVWVYAFLLAFPPLV--GWG-RYVSEESKISCTFDYLS--TDDATRAHVIVLVIGAFGLPFSVITYCYVRSFATVRKCTKERKQM---------------SPLAKSDSRSEVKAAVNSFVITTSFCLCWCPYAVVATMGVSGFTV----HSHAVFIAALLAKLSVLFNPVAYVLSIPN                     
MEL1b_braB  MQLIFGSMMLVFGLIGVVGNVVALYAFCRTRS-LR-RPKNYVVANLCLTDMFVCLVYCPIVVSRSFSH--GFPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPL--KSLTILTQRKLLVAVLTVWVYSLLLAFPPLV--GWG-RYVREETYISCTFDYLS--TDDATRAYVITLVMGAFGFPLLTIAYCYIRVFTTARKHAEERKFM---------------SPLKRPESRTEIKTAVTACVITTSFCLCWCPYAVVATLGISGVSV----QQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASEDVV  
MEL_nemVec  WVIAQVVLWGCIFVISSLGNSLVLLCIVKSNR--LHSSIYAFYGSLAASDCIAGMLCCPLLLVTALHQLWIMGK-VMCHVYSTLLSTSLNASIATLCLISMDRLNAVRKPFEYRGHNTFTQRWCKWLLVLSWVHSIFWAAAPLG--GWG-EIITDSATYTCKPNWSA--ASIVNRSYSLCLALFPFAFPVFLMVAIYCVIYRHTKKCSNLMS----------GLEDGRNLVAEQERQMRERRLFRTVLIIIGAFAACWAIYTLATTCKLFIGQTP---PTWLVQLGLICAIAGSCVNPVIYTIRDATFARELGRLHPCLAWLLKQS  
LWS_nemVec   TSFTAIALLVIMLLTIIGNLMVCYVVLSNKR--LWTEMNMFLVNLAFGDLAVGLICMVFPLITAIKREWIFGRGILCQLNACCNSVLFCSTIFTHTVISIDRYIVIVHPMK----KIMTRKKAALMIVGVWVFSVFIVLGPVF--GWG-RMEYNASTLQCGFGFPR--DKMASM-YIVIVAIIAFIIPLLIMTYTYIRIYISVLEHTRRMS-------------ETATAMQQQAVFSAQKRIVFTFFIALLAFFACWAPFFSFIAFAVVVKNPHD-IPHGLGLASYVCGFINSACNPFIIGLRSKQFKSGFSRILCCCRGRDP    
 Consensus  ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P.  ..........a.......W.....w...Pl.  GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... .     .................E.....m...m!..F...W.PYa..............  .p.....P..fAK.s..%NP!IY......%R...................

Structural and functional markers along the opsin molecule:

>RHO1_homSap rhodopsin                 <----------TM1--------->    c1     <----------TM2------->    x1      <c--i------TM3--------->           c2       <----------TM4-------->               x2         <----------TM5------c->     c3
MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 
AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0
           c3       <----------TM6------->    x3      <--------b-TM7---gprot>  helix8    palm cyto tail

See also: Curated Sequences | Ancestral Introns | Informative Indels | Ancestral Sequences | Cytoplasmic face | Update Blog