Opsin evolution: alignment
See also: Curated Sequences | Ancestral Introns | Informative Indels | Ancestral Sequences | Cytoplasmic face | Update Blog
This section provides an alignment of 230 opsins, mostly ciliary and rhabdomeric types. It could be updated to the full 420 curated reference sequences available using MultAlin or similar tools that allow precise control of formatting and color but too many sequence becomes unwieldy.
N- and C-terminals have been trimmed away because they are generally unalignable and uninformative outside a narrow gene class. Fragmentary sequences are mostly not shown: these score low and so fall to the bottom. (That can still be useful as two important clades, jawless fish and chondrichthyes, are largely represented by fragments.) Notice the numerous invariant (red) and nearly invariant sequences (blue) -- these anchor the alignment with near-certainty. Some of these are not specific to opsins but are rather properties of GPCR signaling proteins generally.
Opsins have seven alpha helical sections traversing the cell membrane with the intervening sequence alternating as cytoplasmic and extra-cellular. Certain key residues such as the lysine where the retinal is covalently bound, counterions, and recognition sites diagnostic for binding of other proteins require markups that will be added shortly.
Among other things, the alignment by MultAlign shows that the Opsin Classifier has properly named the opsins -- each classifies just as expected from its name. Deletions and insertions show up clearly on the alignment, readily resolved as to type using the known phylogenetic topology to establish ancestral condition. The alignment also exhibits some anomalies where the sequence in question needs re-evaluation at the primary data source (cDNA and/or genome).
MultAlign is apparently the only alignment software that allows line width to be specified. That's important here because it enables the entire alignment to be seen in a single window. The numbering schemes allows specific residues and regions to be discussed. Colored text output was also an option (it allows copy and paste of specific residues) but the file again is huge with color markups and awkward to display tightly within genomeWiki.
Here are those sequences aligned (after some trimming of unalignable regions, and anomalous and fragmentary sequences). These are in text form which can be searched by motif using web browser text search, unlike the graphic above (which is more conveniently colored however).
Consensus ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P. ..........a.......W.....w...Pl. GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... . .................E.....m...m!..F...W.PYa.............. .p.....P..fAK.s..%NP!IY......%R...................
RHO1_homSa QFSMLAAYMFLLIVLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLA--GWS-RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_monDo QFSCLAAYMFMLIVLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIIGVAFTWVMALACAFPPLI--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNF--GPIFMTIPAFFAKSSSVYNPVIYIMMNKQFRTCMITTL--CCGKNPLGDD
RHO1_bosTa QFSMLAAYMFLLIMLGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTG-CNLEGFFATLGGEIALWSLVVLAIERYVVVCKPM---SNFRFGENHAIMGVAFTWVMALACAAPPLV--GWS-RYIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDF--GPIFMTIPAFFAKTSAVYNPVIYIMMNKQFRNCMVTTL--CCGKNPLGDD
RHO1_ornAn QYSVLAAYMFMLIMLGFPINFLTLYVTIQHKK-LR-TPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTG-CNIEGFFATLGGEIALWSLVVLAIERYIVVCKPM---SNFRFGENHAIMGVAFTWIMALACALPPLV--GWS-RYIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNF--GPIFMTVPAFFAKSSAIYNPVIYIMMNKQFRNCMLTTI--CCGKNPLGDD
RHO1_galGa KFSALAAYMFMLILLGFPVNFLTLYVTIQHKK-LR-TPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTG-CYIEGFFATLGGEIALWSLVVLAVERYVVVCKPM---SNFRFGENHAIMGVAFSWIMAMACAAPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDF--GPIFMTIPAFFAKSSAIYNPVIYIVMNKQFRNCMITTL--CCGKNPLGDE
RHO1_xenTr KYSALAAYMFLLILLGFPINFMTLYVTIQHKK-LR-TPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTG-CYIEGFFATLGGEMALWSLVVLAIERYVVVCKPM---ANFRFGENHAIMGVVFTWIMALSCAAPPLF--GWS-RYIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDF--GPVFMTVPAFFAKSSAIYNPVIYIVLNKQFRNCLITTL--CCGKNPFGDE
RHO1_neoFo KYSALAAYMFFLILTGFPINFLTLYVTVQHKK-LR-TPLNYILLNLAVADLFMVFGGFTTTMYTAMNGYFVFGVVG-CNLEGFFATFGGIIALWCLVVLAIERYIVVCKPI---SNFRFGENHAIMGVVFTWIMALACAGPPLF--GWS-RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFIVHFTIPLIIIFFCYGRLMCTVKEA--AAQ------------Q-----QESATTQKAEKEVTRMVYIMVISYLVCWLPYASVSFYIFTHQGSDF--GPVFMTVPAFFAKTASVYNPVIYILMNKQFRNCMITTL--CCGKNPFGDE
RHO1_latCh KYSALAAYMFFLILVGFPINFLTLFVTIQHKK-LR-TPLNYILLDLAVADLCMVFGGFFVTMYSSMNGYFVLGPTG-CNIEGFFATLGGQVALWALVVLAIERYVVVCKPM---SNFRFGENHAIMGVIFTWIMALSCAVPPLF--GWS-RYIPEGMQSSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKDA--AAQ------------Q-----QESATTQKAEKEVTRMVIVMVISFLVCWVPYASVAAYIFFNQGSEF--GPVFMTAPSFFAKSASFYNPVIYILLNKQFRNCMITTL--CCGKNPFGDE
RHO1_anoCa QFSALAAYMFLLILLGFPINFLTLFVTIQHKK-LR-TPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVG-CNIEGFFATLGGEMGLWSLVVLAVERYVVICKPM---SNFRFGETHALIGVSCTWIMALACAGPPLL--GWS-RYIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKAA--AAQ------------Q-----QESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDF--GPVFMTIPAFFAKSSAIYNPVIYILMNKQFRNCMIMTL--CCGKNPLGDE
RHO1_petMa KYSVLAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTM-CNFEGFFATLGGEMSLWSLVVLAIERYIVICKPM---GNFRFGSTHAYMGVAFTWFMALSCAAPPLV--GWS-RYLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTVPAFFAKTSALYNPIIYILMNKQFRNCMITTL--CCGKNPLGDE
RHO1_letJa KYSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVALWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVAFTWIMALACAAPPLV--GWS-RYIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_geoAu KFSALAAYMFFLILVGFPVNFLTLFVTVQHKK-LR-TPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTM-CSIEGFFATLGGEVSLWSLVVLAIERYIVICKPM---GNFRFGNTHAIMGVALTWVMALSCAAPPLL--GWS-RYLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKEA--AAA------------Q-----QESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDF--GATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTL--CCGKNPLGDD
RHO1_leuEr MFSALAAYMFFLILTGLPVNFLTLFVTIQHKK-LR-QPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAG-CNFEGFFATLGGEVGLWCLVVLAIERYMVVCKPM---ANFRFGSQHAIIGVVFTWIMALSCAGPPLV--GWS-RYIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDF--TPFFMTVPAFFAKSSAVYNPLIYILMNKQFRNCMITTI--CLGKNPFEEE
RHO1_calMi QFSILAAYMFFLIITCFPVNFLTLYVTFEHKK-LR-QPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTG-CNFEGFFATLGGEIGLWSLVVLAIERYVVVCKPM---SNFRFGTNHAIMGVAFTWVMALACAVPPLM--GWS-RYIPEGLQCSCGVDYYTLKPEINNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKEA--AAQ------------Q-----QESESTQRAEREVTRMVIIMVIFFLICWVPYASVAFFIFTNQGSEF--GPIFMAVPAFFAKSSALYNPLIYILLNKQFRNCMITTL--CCGKNPFEED
RHO1_takRu KYSLVAAYMLFLIITAFPVNFLTLFVTVKHKK-LR-TPLNYVLLNLAVADLFMVIGGFTVTLYTALHAYFVLGVTG-CNIEGFFATLGGEIALWSLVVLAVERYIVVCKPM---TNFRFGEKHAIAGLVFTWIMALTCATPPLL--GWS-RYIPEGMQCSCGIDYYTPKPEINNTSFVIYMFILHFSIPLAIIFFCYSRLLCTVRAA--AAL------------Q-----QESETTQRAEKEVTRMVIVMVISFLVCWVPYASVAWYIFANQGTEF--GPVFMTAPAFFAKSAALYNPVIYILLNRQFRNCMITTV--CCGKNPFGDD
RHO2_galGa KYRLVCCYIFFLISTGLPINLLTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVG-CAVEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHAMMGIAFTWVMAFSCAAPPLF--GWS-RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADF--TATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_anoCa KYKVVCCYIFFLIFTGLPINILTLLVTFKHKK-LR-QPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIG-CAIEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFSATHALMGISFTWFMSFSCAAPPLL--GWS-RYIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVREA--AAQ------------Q-----QESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDF--SATLMSVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTI--CCGKNPFGDE
RHO2_neoFo KYSIVCAYMFFLIITGLPINLLTLVVTFKHKK-LR-QPLNYILVNLAVADLFMVCFGFTVTFSTAINGYFIFGPRG-CAIEGFMATLGGEVALWSLVVLAIERYIVVCKPM---GNFRFSNNHSIIGIVFTWLAALSCAAPPLF--GWS-RYLPEGMQCSCGPDYYTMNPDYHNESFVIYMFVVHFFIPVIVIFVSYGRLICKVKEA--AAQ------------Q-----QESASTQKAEREVTRMVILMVIGFMTAWTPYATVAFWIFMNKGAEF--GATFMAAPAFFSKSSALYNPIIYVLMNKQFRNCMVTTL--CCGKNPFGDD
RHO2_latCh KFSVLCAYMFLLIILGFPINFLTLLVTFKHKK-LR-QPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMG-CAMEGFFATLGGQVALWSLVVLAIERYIVVCKPM---GNFRFASSHAIMGIAFTWIMALACAAPPLV--GWS-RYIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKEA--AAQ------------Q-----QESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEF--TATLMTVPAFFSKSSCLFNPIIYVLLNKQFRNCMITTL--CCGKNPLGDD
RHO2_gekGe KFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKK-LR-QPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIG-CAIEGFFATIGGQVALWSLVVLAIERYIVICKPM---GNFRFSATHAIMGIAFTWFMALACAGPPLF--GWS-RFIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVREA--AAQ------------Q-----QESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAF--SVTFMTIPAFFSKSSSIYNPIIYVLLNKQFRNCMVTTI--CCGKNPFGDE
RHO2_geoAu MYSAISAYVFTLILIGFPVNFMTLFVTFKLKK-LR-QPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTG-CNIEGFFATLGGEVSLWSLVMLAIERYIVVCKPM---GNFRFATTHAALGVVFTWVMASACAVPPLV--GWS-RYIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKEA--AAQ------------Q-----QESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILF--SATAMTVPAFFSKSSVLYNPIIYVLLNKQFRTCMVTTL--FCGKNPFGED
SWS2_ornAn IFMSLAAFMFLLITLGFPINLLTVICTIKYKK-LR-SHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTA-CKIEGFAATLGGMVSLWSLAVIAFERFLVICKPL---GNLSFRGTHAIFGCAATWVFGLAASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVF--DLRMASIPSVFSKASTIYNPIIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_utaSt LFMGMAAFMFLLIILGVPINVLTIFCTFKYKK-LR-SHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFSFRGTHAIIGCIITWVFGLVASLPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRAV--AKQ------------Q-----EQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPF--DVRLATIPSVFSKASSVYNPVIYVFMNKQFRSCMLKLV--FCGKSPFGDE
SWS2_taeGu IFKAMAAFMFLLVLLGVPINALTVLCTAKYKK-LR-SHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLA-CKIEGFTATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCAITWIFGLIASLPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPF--DLGLASIPSVFSKASTVYNPIIYVFMNKQFRSCMLKLV--FCGRSPFGDE
SWS2_neoFo VFMVLSVFMFFLLITGIPINVLTIICTFKYKK-LR-SHLNYILVNLAVANLIVVGFGSTTAFYSFSQMYFAWGPLA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRSTHAIIGCVATWVFGLISSAPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFCFCFGFPLSVIIFSYGRLLMTLRAV--AKQ------------Q-----EQSASTQKAEREVTKMVVVMVLGFLVCWLPYTVFSLWVVTHRGESF--ELALGSIPAVFSKSSTVYNPLIYVFMNKQFRSCMMKLI--FCGKSPFGDE
SWS2_galGa LFRAMAAFMFLLIALGVPINTLTIFCTARFRK-LR-SHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTA-CKIEGFAATLGGMVSLWSLAVVAFERFLVICKPL---GNFTFRGSHAVLGCVATWVLGFVASAPPLF--GWS-RYIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRAV--ARQ------------Q-----EQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSF--EVGLASIPSVFSKSSTVYNPVIYVLMNKQFRSCMLKLL--FCGRSPFGDD
SWS2_xenTr IFMSISAFMLFTIIFGFPLNLLTIICTVKYKK-LR-SHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLA-CKIEGFTATLGGIIGLWSLAVVAFERFLVICKPM---GNFTFRESHAVLGCILTWVIGLVAAIPPLL--GWS-RYIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHAV--AKQ------------Q-----EQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELF--DLRMSSVPSVFSKASTVYNPFIYIFMNRQFRSCMMKMI--FCGKNPLGDD
SWS2_geoAu IFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKK-LR-SHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLF-CKMEGFTATLGGMLSLWSLAVLAFERCLVICKPF---GNIAFRGTHALIRCGFAWAAAIAASTPPLF--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRAA--AAQ------------Q-----QESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPF--DLRLATIPSVFSKASTVYNPVIYIFLNKQFRSCMMKTI--FCGKNPLGDD
SWS2_takRu VFYGMSAFMFFLFVAGTGINVLTIACTIQYKK-LR-SHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLG-CKIEGFAATLGGMVSLWSLAVVAFERWLVVCKPL---GNFIFKPDHAIVCCIFTWFFALIISAPPLF--GWS-RYIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLK-S--AKA------------Q-----AESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPF--DLRLATIPACFSKASTVYNPIIYVVLNKQFRSCMKKML--GMSGGD
SWS2_gasAc TFYSLAFYMFFILIVGTFINALTVACTVQNKK-LR-SHLNYILVNLAVSNLLVSGVGAFTAFLSFAARYFVLGTLA-CKVEGFLATLGGMVSLWSLAVIAFERWLVICKPL---GNFIFKPDHALVCCAFTWVFALAASAPPLV--GWS-RYIPEGLQCSCGPDWYTTNNKYNNESYVLFLFGFCFAVPFCTICFCYSQLLFTMKMA--AKA------------Q-----AESASTQKAEREVTRMVVLMVMGFLVCWMPYASFALWVVNNRGQTF--DLRFASIPSVFSKSSAVYNPVIYVLLNKQFRSCMMKML--GMGGGD
SWS1_homSa AFYLQAAFMGTVFLIGFPLNAMVLVATLRYKK-LR-QPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHV-CALEGFLGTVAGLVTGWSLAFLAFERYIVICKPF---GNFRFSSKHALTVVLATWTIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGL--DLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIMKMV---CGKAMTDES
SWS1_monDo AFHFQTVFMGFVFCAGTPLNAVVLVATLRYKK-LR-QPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERFIVICKPF---GNFRFNSKHAMMVVLATWVIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFHACIMEMV---CRKPMTDDS
SWS1_anoCa AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCTFSVFTVFMASSQGYFFFGRHV-CAMEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYASLAMYMVNNRDHGL--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACILETV---CGKPMSDES
SWS1_utaSt AFYFQTAFMGFVFFAGTPLNAIILIVTVKYKK-LR-QPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHI-CALEAFLGSVAGLVTGWSLAFLAFERYIVICKPF---GNFRFNSKHALLVVAATWFIGIGVSIPPFF--GWS-RFIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPMTDES
SWS1_taeGu AFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKK-LR-QPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHM-CAFEGFAGATGGLVTGWSLAFLAFERYIVICKPF---GNFRFNSRHALLVVAATWIIGVGVAIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGI--DLRLVTIPAFFSKSSCVYNPIIYCFMNKQFRACIMETV---CGRPMTDDS
SWS1_galGa AFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKR-LR-QPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRV-CELEAFVGTHGGLVTGWSLAFLAFERYIVICKPF---GNFRFSSRHALLVVVATWLIGVGVGLPPFF--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRAV--AAQ------------Q-----QESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGL--DLRLVTIPAFFSKSACVYNPIIYCFMNKQFRACIMETV---CGKPLTDDS
SWS1_neoFo AFFLQAAFMGFVLFVGTPLNAIVLFVTIKYKK-LQ-QPLNYILVNISLAGFIFCFFGVFAVFIASCQGYFIFGKTV-CALEGFTGSVAGLVTGWSLAILAFERYLVICKPI---GNFRFGSKHSMIAVVAAWVIGVGVSIPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIIPLFIICFSYSQLLGALRAV--AAQ------------Q-----QESATTQKAEREVSRMIIVMVGSFCVCYVPYAALAMYMVNNRDHGI--DLRLVTIPAFFSKSSFVYNPIIYCFMNKQFRACIMQTV---FGKPMTDDS
SWS1_xenLa AFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKK-LR-QPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIA-CSIDAFVGTLTGLVTGWSLAFLAFERYIVICKPM---GNFNFSSSHALAVVICTWIIGIVVSVPPFL--GWS-RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRAV--AAQ------------Q-----QESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGL--DLRLVTIPAFFSKSSCVYNPIIYSFMNKQFRGCIMETV---CGRPMSDDS
SWS1_geoAu AFYLQAAFMGFVFICGTPLNAIVLVVTIKYKK-LR-QPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTI-CALEAFFGSLAGLVTGWSLAFLAAERYIVICKPF---GNFRFGSKHALVAVGLTWMLGLSVALPPFF--GWS-RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRAV--AAQ------------Q-----QESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNI--DLRFVTVPAFFSKASCVYNPLIYSFMNKQFRACILETV---CGKPITDES
SWS1_danRe AFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKK-LR-QPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTL-CSMEAAMGSIAGLVTGWSLAVLAFERYVVICKPF---GSFKFGQGQAVGAVVFTWIIGTACATPPFF--GWS-RYIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRAV--AAQ------------Q-----AESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNK--DYRLVAIPAFFSKSSSVYNPLIYAFMNKQFNACIMETV---FGKKIDESS
SWS1_oryLa AFYLQAAFMGFVFFVGTPLNFVVLLATAKYKK-LR-VPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTL-CALEAAVGAVAGLVTSWSLAVLSFERYLVICKPF---GAFKFGSNHALAAVIFTWFMGVGCACPPFF--GWS-RYIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRAV--AAQ------------Q-----AESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENK--DYRLVTIPAFFSKSSCVYNPLIYAFMNKQFNGCIMEMV---FGKKMEEAS
LWS_homSap VYHLTSVWMIFVVIASVFTNGLVLAATMKFKK-LR-HPLNWILVNLAVADLAETVIASTISVVNQVYGYFVLGHPM-CVLEGYTVSLCGITGLWSLAIISWERWMVVCKPF---GNVRFDAKLAIVGIAFSWIWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPF--HPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCILQLF--GKKVDDGS
LWS_monDom VYNLTSLWMVFVVIASIFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQIYGYFILGHPL-CVLEGYTVSLCGITGLWSLAIISWERWVVVCKPF---GNVKFDAKLAMVGIIFSWVWAAVWTAPPLF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMATCCIFPLSIILLCYVQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTLFACFAAANPGYSF--HPLTASLPAYFAKSATIYNPIIYVFMNRQFRTCILQLF--GKKVDDGS
LWS_ornAna AYNVTSLWMIFVVIASVFTNGLVLVATMKFKK-LR-HPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPM-CVLEGYTVSLCGITGLWSLSIISWERWIVVCKPF---GNVKFDAKLAMVGIVFSWVWAAVWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRAV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS
LWS_galGal VYNLTSLWMIFVVAASVFTNGLVLVATWKFKK-LR-HPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPM-CVVEGYTVSACGITALWSLAIISWERWFVVCKPF---GNIKFDGKLAVAGILFSWLWSCAWTAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS
LWS_anoCar VYNITSVWMIFVVIASIFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPM-CVLEGYTVSTCGISALWSLAVISWERWVVVCKPF---GNVKFDAKLAVAGIVFSWVWSAVWTAPPVF--GWS-RYWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRAV--AAQ------------Q-----KESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS
LWS_xenTro VYNISSLWMIFVVLASVFTNGLVLVATLKFKK-LR-HPLNWILVNMAIADLGETVIASTISVCNQIFGYFVLGHPM-CILEGYTVSVCGIAALWSLTVIAWERWFVVCKPF---GNIKFDGKLAATGIIFSWVWAAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMLVLMITCCIIPLAIIVLCYMHVWLTIRQV--AQQ------------Q-----KESESTQKAEREVSRMVVVMIIAYIFCWGPYTFFACFAAFNPGYNF--HPLAAAMPAYFAKSATIYNPIIYVFMNRQFRNCIYQLF--GKKVDDGS
LWS_takRub VYNVATVWMFIVVVLSVFTNGLVLVATAKFKK-LR-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYTVSTCGIAALWSLTIISWERWVVVCKPF---GNVKFDAKWATGGIVFSWVWAAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCIIPLAIIILCYLAVWLAIRSV--AMQ------------Q-----KESESTQKAEKEVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRVCIMKLF--GKEVDDGS
LWS_gasAcu VYNLSTLWMFIVVALSVFTNGLVLVATAKFKK-LQ-HPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPM-CVFEGYVVSVCGITALWSLTIISWERWIVVCKPF---GNVKFDAKWATAGIVFSWIWSAVWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRAV--AMQ------------Q-----KESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAF--HPLAAAMPAYFAKSATIYNPVIYVFMNRQFRSCIMQLF--GKEVDDGS
LWS_petMar VFNLTSVWMIIVVVLSLFSNGLVLVATVKFKK-LR-HPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIATILIVFSWVWPASWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSF--HPIAAALPAYFAKGATIYNPIIYVFMNRQFRNCILQLF--GKKVDDGS
LWS_letJap MFNLTSVWMIIVVVLSLFTNGLVLVATMKFKK-LR-HPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPM-CVFEGYVVSTCGIAGLWSLAIISWERWMVVCKPF---GNIKFDGKIAIILIVFSWVWPACWCSLPIF--GWS-RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHSV--AQQ------------Q-----KESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAF--HPLTAALPAYFAKSATIYNPVIYVFMNRQFRNCIMQLF--GKKVDDGS
LWS_geoAus MYNLTSFWMIIVVILSLFTNGLVLVATLKFKK-LR-HPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPL-CVFEGFTVSVCGITALWSLAIISFERWMVVCKPF---GNLKFDGKVAIVLIIFSWAWSAGWCAPPIF--GWS-RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHTV--AQQ------------Q-----KESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLF--GKKVDDGS
LWS_neoFor VYNLTSLWMIFVVFASCFTNGLVLMATYKFKK-LR-HPLNWILVNLAIADLGETLIASTISVTNQIFGYFILGHPM-CMLEGFTVATCGITGLWSLTIIAWERWVVVCKPF---GNIKFDGKWAAGGIIFSWVWSAFWCAMPLF--GWS-RFWPHGLKTSCGPDVFSGEDKYGTRSFMIALMITCCIIPLGVIILCYIQVWWAIRTV--AKQ------------Q-----KESESTQKAEKEVSRMVVVMIFAYCFCWGPYTFMACFGAAYPGYAF--HPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIYQLL--GKKVDDGS
PIN_galGal TYVGVAVLMGTVVACASVVNGLVIVVSICYKK-LR-SPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRM-CELEGFMVSLTGIVGLWSLAILALERYVVVCRPL---GDFQFQRRHAVSGCAFTWGWALLWSTPPLL--GWS-SYVPEGLRTSCGPNWYTG--GSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRAA--AAQ------------Q-----KEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIII--QPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLLEML--CCGYQPQRTG
PIN_utaSta IYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKK-LR-SPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTA-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFQQRHAVFGCVFTWMWSLVWTLPPLF--GWS-SYVPEGLRTSCGPNWYTG--GSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRAV--ATQ------------Q-----KEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVI--QPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLSTM--SCGHRPRGAQ
PIN_podSic TYISVAVLMGLVVISATLVNGLVIVVSVQFKK-LR-SPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQAT-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFRFPARHAVLGCAFTWGWSFVWTVPPLL--GWS-SYVPEGLRTSCGPNWYSG--GSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRTV--AAQ------------Q-----KEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAI--RPALASLPSYFSKTATVYNPIIYVFMNKQFRSCLLYKM--SCGHRALSSQ
PIN_pheMad VYTSLAALMGVVVLSASLANGLVIAVSVRFKR-LR-SPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTA-CRFEGFMVSLTGIVGLWSLAILAFERYLVICKPV---GDFQFQRRHAVIGCLYTWGWSLIWTVPPLF--GWS-SYVPEGLGTSCGPNWYMG--GTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRAV--AAQ------------Q-----KEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSI--QPGLASLPSYFSKTATVYNPIIYVFMNKQFRSCLLNTV--SCGRIPQTMP
PIN_xenTro TFLTVAAVMCMVVILAFFVNGLVIVVTLKYKK-LR-SPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTM-CEFEGFMVSLTGIVGLWSLAILAFERYLVICKPM---GDFRFQQKHAILGCSFTWVWSFIWTSPPLF--GWC-SYVPEGLRTSCGPNWYTG--GTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRAV--AAQ------------Q-----KDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVI--EPTVASLPSYFSKTATVYNPIIYVFMNKQFRNCLMTLL--CCGRS-FGDD
PIN_bufJap TYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKK-LR-SPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLV-CELEGFVVSLTGIVGLWSLAILAFERYIVICKPM---GDFRFQQRHAVMGCAFTWIWAFLWTSPPLI--GWC-SYVPEGLGTSCGPNWYTG--GTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRAV--AAQ------------Q-----KESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVI--DPTLASMPSYFSKTATVYNPVIYVFMNKQFRDCLTKLL--CCGRNPFGED
VAOP_galGa HFRLVAAVMFVVTSLSLAENLAVILVTFKFKQ-LR-QPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYIVICRPV---GNMRLRGKHAAQGIAFVWTFSFIWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--AYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRKV--SNT------------Q-----GRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIEL--DPHLAAIPAFFSKTATVYNPIIYVFMNKQFRMCLIQMF--KCSAIETAES
VAOP_anoCa NFHLISALMFVVTLFSLSENFTVILVTIKFKQ-LR-QPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWA-CVLEGFAVTFFGIVALWSLALLAFERYVVICRPL---GNMRLNGKHAALGVAFVWIFSFIWTVPPTM--GWS-SYTTSKIGTTCEPNWYSG--DYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRKV--SDT------------Q-----GRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIEL--DPRLAAIPAFFSKTATVYNPVIYVFMNNQFRKCLVQLF--QCSSQETMDA
VAOP_xenTr NFHLLAALMFVVTSLSIAENFIVILVTAKFKQ-LR-QPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWA-CVLEGFAVTFFGIVALWSLSVLAFERYIVICRPL---GNLRLQGKHSALAIIFVWVFSFVWTIPPTM--GWS-SYTTSKIGTTCEPNWYSG--EMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRKV--SDT------------Q-----GRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDL--DPRLAAIPAFFAKTASMYNPIIYVYMNKQFRRCLYQMF--NINDPEAKES
VAOP_danRe NYSVLAALMFVVTALSLSENFTVMLVTFRFQQ-LR-QPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWA-CVLEGFAVTFFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLVFVWSFSFIWTVPPVL--GWS-SYTVSRIGTTCEPNWYSG--NFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHV--DPRLAAIPAFVAKTAAVYNPIIYVFMNKQFRKCLVQLL--SCSKVTVVEG
VAOP_rutRu NYKVLATLMFVVTAASLSENFAVMLVTFRFTQ-LR-KPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWA-CVLEGFAVTYFGIVALWSLAVLAFERFFVICRPL---GNIRLRGKHAALGLLFVWTFSFIWTIPPVL--GWS-SYTVSKIGTTCEPNWYSG--NFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRKV--SNT------------H-----GRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHL--DPRLAAAPAFFSKTAAVYNPVIYVFMNKQFRKCLVQLL--RCRDVTIIEG
VAOP_takRu NFTILAVLMFVVTSLSLCENFLVMFITFKFKQ-LR-QPLNYIIVNLAIADFLVSLTGGLISFLTNARGYFFLGRWA-CVLEGFAVTYFGIVAMWSLAVLSFERFFVICRPL---GNMRLQAKHAAIGLLFVWTFSFVWTFPPVL--GWN-RYTVSKIGTTCEPDWYSN--NMTSHSYIITFFSTCFILPLGIIFFCYGKLLRKLRKV--S--------------H-----GRLATARKPERQVTRMVVVMIVAFMVAWTPYATFAILVTIHPTIEL--DPRLASIPAFFSKTAAVYNPIIYVFMNKQFRKCLIQHF--IGMGVMAES
VAOP_petMa NFTMLAALMGTITALSLGENFAVIVVTARFRQ-LR-QPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHA-CVLEGFAVTYFGVVALWSLALLAFERYFVICRPL---GNFRLQSKHAVLGLAVVWVFSLACTLPPVL--GWS-SYRPSMIGTTCEPNWYSG--ELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKKA--SET------------Q-----RGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHL--DPLLAAVPAFFSKTATVYNPVIYIFMNKQFRDCFVQVL--PCKGLKKVSA
PPIN_anoCa GYTIIAIIMATSCTLSVILNTAVIAITIKYRQ-LR-QPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVG-CVTEGFAMAFFGIVALCTIAVIAVDRAIVIAKPM---GTITFTTRKAMIGVAVSWIWSLVWNTPPLF--GWG-GYQMEGVMTSCAPDWANS--DPINVSYIICYFLFCFTIPFITILASYGYLIWTLRQV--AKV------------G----LAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYI--NPIIATIPMYMAKSSTFYNPIIYIFMNKQFRDCLVRCL--LCGRNPCASE
PPIN_xenTr GYTILALIMAVFCAAALFLNVTVIVVTFKYRQ-LR-HPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIALDRVFVVCKPM---GTLTFTPKQALAGIAASWIWSLIWNTPPLF--GWG-SYELEGVMTSCAPNWYSA--DPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQV--AKL------------G----VAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHI--DPIIATVPMYLTKTSTVYNPIIYIFMNKQFQECVIPFL--FCGRNPWAAE
PPIN_petMa GFTILAVIMAVFTLASLVLNSTVIIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGITWAWLWSFVWNTPPLF--GWG-SYKLEGVRTSCAPDWYSR--DPANVSYIVSFFSFCFAIPFLVIVVAYGRLLWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVAKPGVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_letJa GFTILAVIMAVFTIASLVLNSTVVIVTLRHRQ-LR-HPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVG-CVIEGFAVAFFGIAALCTIAVIAVDRFVVVCKPL---GTLMFTRRHALLGIAWAWLWSFVWNTPPLF--GWG-SYELEGVRTSCAPDWYSR--DPANVSYITSYFAFCFAIPFLVIVVAYGRLMWTLHQV--AKL------------G----MGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYI--DPVIATLPMYLTKTSTVYNPIIYIFMNRQFRDCAVPFL--LCGRNPWAEP
PPIN_ictPu GYTILSIIMALSSTFGIILNMVVIIVTVRYKQ-LR-QPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVG-CVLEGFAVAFFGIAGLCSVAVIAVDRYMVVCRPL---GAVMFQTKHALAGVVFSWVWSFIWNTPPLF--GWG-SYQLEGVMTSCAPNWYRR--DPVNVSYILCYFMLCFALPFATIIFSYMHLLHTLWQV--AKL------------Q----VADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYI--NPVIGTIPAYLAKSSTVFNPIIYIFMNRQFRDYALPCL--LCGKNPWAAK
PPIN_oncMy GFTILAVIIGVFSVSGVCMNVLVIMVTMRHRK-LR-QPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLG-CVLEGFAVAFFGIAGLCSVAVIAVDRYVVVCRPM---GAVMFQTRHAVGGVVLSWVWSFLWNTPPLF--GWG-SFELEGVRTSCSPNWYSR--EPGNMSYIILYFLLCFAIPFSIIMVSYARILFTLHQV--SKL------------K----VLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHI--NPLIATVPMYLAKSSTVYNPIIYVFMNRQFRDCAVPFL--LCGLNPWAS
PPIN_danRe GYTILAVIIGVFSVCGVILNVTVITVTLKYKQ-LR-QPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVG-CVLEGFAVAFFGIAALCSVAVIALERCMVVCRPV---GSISFQTRHAVFGVAVSWLWSFIWNTPPLF--GWG-RLQLEGVRTSCAPDWYSR--DLANVSFIVCYFLLCFALPFSVIVYSYTRLLWTLRQV--SRL------------Q----VCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYI--DPVIATVPMYLTKSSTVFNPIIYIFMNRQFRDRALPFL--LCGRNPWAA
PPINa_cioI TYSFLCVYMTFVFLLSCSLNILVIVATLKNKV-LR-QPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTM-CQIEGYFVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHSIFGIVITWVWSMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--EKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINa_cioS VYSFLAVYMTFICLISCSLNILVITATLKNKV-LR-QPLNYIIVNLAVVDLLSGLVGGVISIFANGAGYFFWGKFM-CQVEGYTVSNFGVTGLLSIAVMAFERYFVICKPF---GPVRFEEKHAVIGIAVTWIWAMFWNTPPLI--FWD-GYDTEGLGTSCAPNWFVK--GNTERLFIILYFVFCFLIPLAIIVLCYGKLILQLRQI--AKE------------------SSLSGGTSPEGEVTKMVVVMVTAFVICWLPYAAFAMYNVVNPEAQI--DYALGAAPAFFAKTATIYNPLIYIGLNRQFRDCVVRMI--FNGRNPWVDE
PPINb_cioI IYTILAVYMTFIFLLAVSLNGFVIIATMKNKK-LR-QPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTV-CILEGYIVSVAGVCGLMSISVMAFERYFVVCKPY---GPFTLTNTHAALGIGFTWTWSVLWSTPGLI--WLD-GYVPEGLGTSCAPNWFSK--NKSERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQA--TRQ------------------SSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQL--DYGLGAVPVFFAKTANIYNPLIYIGLNKQFRDGVIKMV--FRGRNPWAEE
PPINb_cioS TYSGLCVFMSFVFVLAVPLNLLVIVATYKNKD-LR-RPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSV-CIFEGYVMSVTGVCGILSITVMAFERYFVVCKPF---GQTNLKWSHAITGIVFTWTWSVIWHTPGLF--FWN-GYEPEGFGTSCAPNWFSQ--QKSERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKV--SKK------------------SMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPDNLL--SYGIGSVPAFFAKTATIYNPIIYMGLNKKFRDGVIRML--FKGRNPWLDG
PARIE_utaS GYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTKRGYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKQFRDCAVEFI--TCGQVVLTSP
PARIE_anoC GYGVLAFLMFINALFSLFNNFLVIAVTLKNPQ-LR-NPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRF-CIFQGFAVNYFGIVSLWSLTILAYERYNVVCQPL---GTLQMSTQRAYQLLGFIWVFCLFWAVVPLF--GWS-SYGPEGVQTSCSIGWEER--SWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHGL--NKK-----------VE----QLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNI--SPLAATIPTYLSKTSPVYNPIIYIFLNKEFRECAVEFI--TCGKVVLTSP
PARIE_xenT GYSILSFLMFLNAVFSICNNAIVILVTLKHPQ-LR-NPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQF-CIFQGFAVNYFGIVSLWSLTLLAYERYNVVCEPI---GALKLSTKRGYQGLVFIWLFCLFWAIAPLF--GWS-SYGPEGVQTSCSIGWEER--SWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQL--NRK-----------IE----QQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYI--SPLAATLPTYFAKTSPVYNPIIYIFLNKQFRTYAVQCL--TCGHINLDSL
PARIE_takR GYSILSFLMFINTVLSVFNNSLAIAVMLKNPS-LL-QPINIFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPR---AGLKLTMRRSIIGLLFVWTFCLFWAVTPLL--GWS-SYGPEGVQTSCSLAWEER--SWNNYSYLILYTLLCFIFPVGVIIYCYCKVLTSMNKL--NKS-----------VE----LQGGLSCRRENKHAINMVLAMIIAFFVCWLPYTALSVVVVVDPELHI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDATLEVL--SCSRYIPHAS
PARIE_gasA GYSILSFLMFINTVLTVFNNVLVITVLVRNPS-LL-QPMNVFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTA-CIFQGFAVNYFGLVSLCTLTLLSYERYNVVCRPR---NALKLSMRRSIHGLLIVWTFCLFWAVAPLF--GWS-GYGPEGVQTSCSLAWEER--SWSNYSYLVLYTLLCFIVPVAVIIYCYAKVLTSMNTL--NRS-----------VE----VQGGRSSQKENDHAVSMVLAMIIAFFSCWLPYTALSVVVVVDPTLYI--PPLVATMPMYFAKTSPVYNPIIYFLSNKQFRDAALEML--SCGRYIAHMP
PARIE_danR GYSILSYLMFINTTLSVFNNVLVIAVMVKNLH-FL-NAMTVIIFSLAVSDLLIATCGSAIVTVTNYEGSFFLGDAF-CVFQGFAVNYFGLVSLCTLTLLAYERYNVVCKPM---AGFKLNVGRSCQGLLLVWLYCLFWAVAPLL--GWS-SYGPEGVQTSCSLGWEER--SWRNYSYLILYTLMCFILPTVIITYCYSNVLLTMRKI--NKS-----------IE----CQGGKNCAEDNEHAVRMVLAMIIAFFICWLPYTAISVLVVVNPEISI--PPLIATMPMYFAKTSPVYNPIIYFLTNKRFRESSLEVL--SCGRYISRET
CILI2_plaD SYVITAIYLCIVGVIGTLSNGVIMYLYFKDKS-LR-SPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLGGLASEMNLFIISVERYLAVVRPF---DVGNLTNRRVIAGGVFVWLYSLVFAGGPLV--GWS-SYRPEGLGTWCSISWQ--DRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE----AA-----------DA----QGGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGL--PIYAEVLPSLFAKSSQVYNPIIYVLMNKPYRSALVSLV--CRGRNPFDEA
CILI1_plaD DYNICAAYLFFIACLGVSLNVLVLVLFIKDRK-LR-SPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLGGLAALMTLSVIAFVRCLAVLRLG---SFTGLTTRMGVAAMAFIWIYSLAFTLAPLL--GWN-HYIPEGLATWCSIDWL--SDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK----VA-----------KT-------GGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLL--HPVATVIPAMFAKSSTMYNPLIYVFMNKQFRRSLKVLL--GMGVEDLNSE
ENCEPH_hom TYERLALLLGSIGLLGVGNNLLVLVLYYKFQR-LR-TPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVG-CVWDGFSGSLFGIVSIATLTVLAYERYIRVVH------ARVINFSWAWRAITYIWLYSLAWAGAPLL--GWN-RYILDVHGLGCTVDWK--SKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVED-----------LQ----TIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLV--TPTISIVSYLFAKSNTVYNPVIYVFMIRKFRRSLLQLLCL
ENCEPH_mon TYELLALLIATIGLLGLCNNLLVLVLYYKFQR-LR-TPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVG-CAWDGFSNTLFGIVSIMTLTVLAYERYNRIVH------AKVINFSWAWRAITYIWLYSLVWTGAPLL--GWN-RYTLEIHGLGCSVDWK--SKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRMLRCVEE-----------LQ----TIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLV--TPTVAIIASLFAKSSTAYNPIIYIFMSRKFRRCLLQLLCF
ENCEPH_gal TYELLALLIATIGTLGVCNNLLVLVLYYKFKR-LR-TPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------AKVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-RYTLEIHGLGCSMDWK--SKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRMLRCVED-----------FQ----TSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLV--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRQCLLQLLCF
ENCEPH_ano TYELLALLVAAIGLLGLCNNLLVLVLYAKFKR-LR-TPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAG-CVWDGFSNSLFGIVSIMTLTVLAYERYIRVVH------ARVIDFSWSWRAITYIWLYSLAWTGAPLL--GWN-HYTLEIHGLGCSVDWQ--SKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRMLRCVED-----------LQ----SIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLI--TPTVAIIPSFFAKSSTAYNPVIYIFMSRKFRRCLVQLFCV
ENCEPH_gas TYKLLAFAIGTIGVFGFCNNVVVIVLYCKFKR-LR-TPTNLLVVNISLSDLLVSVIGINFTFVSCIRGGWTWSRAT-CIWDGFSNSLFGIVSIMTLASLAYERYIRVVH------AQVVDFPWAWRAIGHIWLYSLVWTGAPLL--GWN-RYTLEIHRLGCSLDWA--SKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQMLRSIQD-----------LQ----TVQIIKILRYEKKVAVMFLLMISCFLLCWTPYAVVSMMEAFGRKNMV--SPTVAIIPSFFAKSSTAYNPLICVFMSRKFRRCLMQLLCS
ENCEPH_xen TYHFLALIVATVGFLGLVNNLLVLILYCKFKR-LQ-TPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEM-CVFHGFSKNLLGIVSFGTLTVVAYERYARVVY------GKYVNSSWSKRSITFVWVYSLAWTGFPLI--GWN-LYTFETHKLDCSFEWT--ATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQKLRSVKN-----------IQ----NFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFI--TPTITVMPSLLAIASAAYNPVIHIFTIKKFRQCLVQLLPPINFHPPIN
ENCEPH4a_t GNLVVSVFLGFIGTFGLVNNLLVLVLFCRYKM-LR-SPINLLLMNISISDLLVCVLGTPFSFAASTQGRWLIGEAG-CVWYGFANSLFGVVSLISLAVLSFERYSTMMTPT---EADPSNYCKVCLGITLSWVYSLVWTVPPLF--GWS-SYGPEGPGTTCSVNWT--AKTTNSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ---VSG------------------INASTSRKREQRVLCMVVIMVICYLLCWLPYGVVALLATFGPPDLV--TPEASIIPSVLAKSSTVINPIIYVFMNKQFYRCFLALL--CCQDPRSGSS
ENCEPH4b_t GHLVVAVCLGFIGTVGFLSNFLVLALFCRYRA-LR-TPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLIGRAG-CVWYGFVNACLGIVSLISLAVLSYERYCTMVSST---IASNRDYRPVLGGICFSWFYSLAWTVPPLL--GWS-RYGPEGPGTTCSVDWR--TQTPNNISYIVCLFTFCLLLPFFVILYSYGKLLHTIRQ---VRR------------------VSSTVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLL--TPEATITPSLLAKFSTVINPFIYIFMNKQFYRCFRAFL--NCSTPKRDST
ENCEPH4_br GYTAIATCLALIGFVGFTNNFVVILLIGCHRQ-LR-TPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANSLFGIVSLVTLSALAFERYCVVVR-----SSDMLTYKSSLVVITFIWLYSLLWTSLPLL--GWS-SYQFEGHNVGCSVNWV--QHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM---SSE------------A----KPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLV--TPTASVIPSLVAKSSTAYNPIIYVLMNNQFREFLLARLQRVCCRQ
ENCEPH5_br GFDTVAVVIAAIGIAGFLSNGAVVLLFLKFRQ-LR-TPFNMLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPG-CVWYGFANHLFGLVSLISLAVISYERYRMVVKPK-GPGSSYLTYNKVGLAIIFIYLYCLLWTTLPIV--GWS-SYQLEGPKISCSVAWE--EHSLSNTSYIVAIFIMCLLLPLLIIIYSYCRLWYKVKK---GSQ-----------NL----PPAIRKSSQKEQKIARMVVVMITCFLVCWLPYGAMALVVSFGGESLI--SPTAAVVPSLLAKSSTCYNPLVYFAMNNQFRRYFQDLL--CCGRRLFDAS
PIN_stoPur TYNYLTVYTGFLTIFGILNNGIVMILFARFPS-LR-HPINSFLFNVSLSDLIISCLASPFTFASNFAGRWLFGDLG-CTLYAFLVFVAGTEQIVILAALSIQRCMLVVRPF---TAQKMTHRWALFFISLTWIYSLIICVPPLF--GWN-RYTYEGPGTACSVAWN--SPSPGDTSYIIFIFVLVLVIPFGIIIFCYGLLVYAVKK---ISR------------T----QAALSSEAKADRKVSKMIFIMILFFLIAWTPYTGFSLYVTFGKNVVI--TPLAGTFPPFFAKLCTIHNPIIYFLLNKQFKDALIQLF--CCGENPFDRD
ENCEPH_api MYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILW-TPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKM-CLWYAWFMSTLGFASIGNLTVMAVERWLLVARPM-----QALSIRHAVILASFVWIYALSLSLPPLF--GWG-SYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKK-----------------VR----K-RAGASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFNAK-P--SATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRT
ENCEPH1_an AYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSI-CVAYGFFMSLLGIASITTLTVLSYERFCLISRPF---AAQNRSKQGACLAVLFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK-----------------NS----A-RVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFWRIRRSNGVAGQPD
ENCEPH2_an AYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ-LW-TPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTL-CVAYGFFMSLLGITSITTLTVLSYERYCLISRPF---SSRNLTRRGAFLAIFFIWGYSFALTSPPLF--GWG-AYVQEAANISCSVNWE--SQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE-----------------NS----A-RVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELI--GPGLAVLPALVAKSSICYNPIIYVGMNTQFRAAFSRVRNKGQQA
ENCEPH_aed AYVASAVTLFFIGFFGFFLNLFVIALMCKDVQ-LW-TPINIILFNLVCSDFSVSIIGNPFTLTSAISRHWIFGRTV-CIAYGFFMSLLGITSITTLTVLSYERFCLISHPF---SSRSLSRRGAVFAILFIWSYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--SQTLNATSYIIFLFVFGLVVPLVVIVYSYTNIVVNMKR-----------------NA----A-RVGRINRAEKRVTRMVFVMVLAFMIAWTPYAVFALIEQFGPTDII--SPALGVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRNNE
ENCEPH_cul AYVATAVVLFFIGFFGFFLNLFVIALMCKEVQVLW-TPMNIILLNLVCSDFSVSIVGNPFTLSSAISHRWLFGRKL-CVAYGFFMSLLGITSITTLTVLSYERFYLISRPF---SSRSLSRRGALGAVLLIWCYSFALTSPPLF--GWG-AYVNEAANISCSVNWE--TQTLNATTYIIYLFVFGLVVPLTVIVYSYTNIIVNMKK-----------------NA----A-RVGRINRAEKRVTTMVAVMVIAFMVAWTPYSVFALMEQFGPPDVI--GPGLAVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRHDP
ENCEPH_tri GYIAAAVVLFCIGFFGFSLNLTVIIFMLKERQ-LW-SPLNIILFNLVVSDFLVSVLGNPWTFFSAINYGWIFGETG-CTIYGFIMSLLSITSITTLTVLAFERYLLIARPF---RNNALNFHSAALSVFSIWLYSLSLTIPPLI--GWG-EYVHEAANLSCSVNWE--EKSPNSTSYILYLFAFGLFLPLVIITFSYVNIILTMRR-----------------NA----AFRVGQVSKAENKVAYMIFIMIIAFLTAWSPYAIMALIVQFGDAALV--TPGMAVIPALLAKSSICYNPVIYIGLNAQVKGAKWVSGLIYLFQFQQ
ENCEPHa_ne EANIVLGYYIAIFVIGFVTNTIVVIIFISSQR-LH-TTPNLILFSMSVCDWLMATMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVVSPM----TNSFNGRRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICVSVLFFLIPIVTMTFCFASIYHTIRNLSHEAT-----------ARWGSDARATQETIRAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGDTHRI--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRCAGKALLYQEHH
ENCEPHb_ne EANIVLGYYIAIFVIGFVTNTIVVITFIFSKR-LH-TTPNLILFSMSVCDWLMAAMAKSVGIYGNARYWPTVGKVT-CDYYAFATSAIGYASILHLAALAVEKRMAVASPM----TNSLNERRMLVIIATLWGFAILWAVFPLI--GWS-SYGPEPGYVSCSITWYT--TDHNNVSYIICISVLFFFVPIVTMTFSFASIYKAIRNISHEAI-----------ARWGSHARATQETIKAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGGTHRN--PALLGVLPSLFAKLSSCYNPIIYFFMYTKFRRAAKLLFIKKVIRPTEA
ENCEPHc_ne HAITVMYSLLAAGAFVLNGIVLIIFLATRS-LR-TIPNMILLSMAWADWLMACLADAVGAYANANNWPSMVGGL-CVYYGFITTALGLTSMIHLTALSVERFVTVTIPM----TRPITETQMLLVVTFLWAFSFLWAIFPLV--GWS-SYGPEPGYAACSIAWYR--QDLNNMSYILCLFMFFFFLPIVIMIACFSSIYFTVRKLTRDSM-----------RRWGASSDSTQQTLAAERKTAWMSFIMVLAFLFAWVPYAVVSLYASFGGVTTI--PKLMSTLPAMLAKTSACYNPIIYFFMYSKFRKAFQRFFFKNVITPSQT
MOLL_PERc_ EFRIIGIFISICCIIGVLGNLLIIIVF-AKRRSVR-RPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIG-CKIYAFLCFNSGVISIMTHAALSFCRYIIICQYG--YR-KKITQTTVLRTLFSIWSFAMFWTLSPLF--GWS-SYVIEVVPVSCSVNW--YGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKN-------GIRAQQRY----TPRFIQDIEQRVTFISFLMMAAFMVAWTPYAIMSALAI--GSFNV--ENSFAALPTLFAKASCAYNPFIYAFTNANFRDTVVEIMAPWTTRRVGV
PER2_strPu GYLLTAIYLTIVGSIATVGNITVICVL-CRYRTFRKRSINLLLINMAASDLGVSVAGYPLTTVSGYWGRWLFGDVG-CQFYAFCVYTLSCSTISTHAAIAVYRYIYIVKTD--LR-PKLTANFTSGVIVVIWVYAFFWTVTPFV--GWS-SYIYEPFGTSCSVNW--VGRTISDISYMVACTIGVYLLQIFIMLYCYIRVAKKIRGVDPGRT-------EEKDAGVVVFGRLRKREAKIDTHVTKMCFMMMLTFIVVWAPYAVECLRAA--HVHRI--SALSSVLPTMFAKSSCMVNPIIFLTSSSKFRQDLGKLWSRPSSQDSL
PER1_strPu GYLLTALYLTLVGIVSTIGNITVLCVL-CRYGTFRKRSVNILLMNMAVSDLGVSVAGYPLTAISGYRGRWVFADIG-CQFSGFCVYALSCSTISTHAVVAIYRYIYIVKPY--HR-PRLSSSTSCLAILCIWTFTLFWTITPFF--GWS-SYTYEPFGTSCSINW--YGKSLGDLTYIICCVVFVFILPIIIMLYCYIGVAKKIKGIDPLRT-------EERDIAVV-FGRLRKHETKIDTRVTKICFMMMASFIVVWTPYAVGSIWAS--KIGKI--SASASVLPTMFAKSSCMINPIIFLTSSSKFRADLGKLWNRPSSLEHTI
PER_homSap EHNIVATYLIMAGMISIISNIIVLGIF-IKYKELR-TPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAG-CQVYAGLNIFFGMASIGLLTVVAVDRYLTICLPD--VG-RRMTTNTYIGLILGAWINGLFWALMPII--GWA-SYAPDPTGATCTINW--RKNDRSFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDC-----------TESL------NRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKI--PPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPV
PER_monDom EHKIVAAYLITAGVISIVSNVIVLGIF-VKYKALR-TATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDG-CQIYAGLNIFFGMASIGLLTAVAIDRYLTICQPD--LG-R-MTSYNYTLMILTAWVNGFFWALMPIV--GWA-GYAPDPTGATCTINW--RKNDVSFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNC-----------PDHI------NRDWSNQVAVTKMSVVMILMFLLAWSPYSIVCLWASFGDPKEI--PPAMAIVAPLFAKSSTFYNPCIYVAANKKFRRAISAMIRCQTHQSMPI
PER_galGal EHNIVAAYLITAGVISIFSNIVVLGIF-VKYKEFR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTG-CQIYAALNIFFGMASIGLLTVVAVDRYLTICRPD--IG-RRMTTRNYAALILAAWINAVFWASMPTV--GWA-GYASDPTGATCTANW--RKNDVPFVSYTMSVIAVNFVVPLTVMFYCYYNVSRTMKQYTSSNC-----------LESI------NMDWSDQVDVTKMSVVMIVMFLVAWSPYSIVCLWSSFGDPKKI--SPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILAMVRCQTRQEITI
PER_xenTro EHNIVAAYLITAGVISILSNIIVLGIF-VKYKELR-TATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVG-CQIYAGLNIFFGMASIGLLTVVAIDRYLTICRPD--IGGRRISGRHYTAMILAAWINAVFWSVMPVV--GWS-SYAPDPTGATCTINW--RKNDVSFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSS-----------LGGI------NADWSDQTDVTKMSMVMIVMFLVAWSPYSIVCLWSSFGDPRKI--PPAMAIIAPLFAKSSTFYNPCIYVIANKKFRRAILSMVQCKSRQEVTL
PER_gasAcu EHNIVAGYLITAGVISLFSNIVVLLMF-WKFKELR-TATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAG-CQIYAALNIFFGMASIGLLTVVAIDRYLTICRPD--IGGQKMTMQSYNLLILAAWLNAVFWSSMPVV--GWA-SYAPDPTGATCTINW--RQNDVSFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNC-----------LDSA------NIDWSDQMDVTKMSIVMIIMFLVAWSPYSIVCLWASFGDPKTI--PAPMAIIAPLFAKSSTFYNPCIYVIANKKFRRAIIGMVRCQTRQRITI
PERa_braFl DHLIVGLYLFVIGIIGTVENGITLATF-TKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRQD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYSLEPSGTACTINW--QKNDSLYISYVTSCFILGFALPLAVMMFCYWQASCFVNKVLKGDI-----------SGDLTFPVAVNVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFGNPADI--PAWITLLPPLIAKSSALYNPIIYIIANRRFRSAIFSMVKGQNPDVET
PERa_braBe DHLIVGLYLFVIGIIGTIENGITLATF-SKFRSLR-SPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVG-CQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHD--LV-DKVNYNTYGVMAALGWLFAAFWAALPLV--GWA-EYALEPSGTACTINF--QKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQASCFVSKVLKGDI-----------AGDLTFPVAANVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADI--PAWLTLLPPLIAKSSALYNPIIYIIANRRFRNAICSMMKGQDPDVEDD
PERc_braFl GYLASAVYLTITGLIAFVGNIFAIIVFLTE-KEFRKKEHNSFALNLAIADLSVCVFAYPSSTISGYAGEWMLGDVG-CTIYGFLCFTFSLTSMVTLCAISVYRYIVICKPQ--YA-HLLTHRRTNYVILGIWLYALVFSVPPLF--GVN-RYTYEPI-ITCSLDW--NVQHVGETIYTAAVIIIVYVLNVSIMCFCYFNIIFKSANLKFAAL-------ASEKTR--------TAAKKDIWKTSMMCLAMVVSFLIAWTPYAVSSTWDIL-TEEDL--PIIATILPTMFAKSSCMMNPIIYSCCNGKFRQAALKTFSK
PERc_braBe GYLASAIYITLTGLIAFFGNVITITVFLTE-KEFRKKQQNGFVLNLAIADLSVCVFAYPSSAIAGYAGRWVLGDVG-CTIYGFLCFTFALVSMVTLCVISIYRYILICKPQ--YA-HLLTHRRTVYVIIGTWLYALVFTVPPLV--GVK-RYTYEPMQITCSLDW--NVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIFKSANLKFAAL-------ASEKTK--------MAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDIL-SAEDL--PIIATILPSLFAKSSCMMNPIIYACCNTKFRQAAVKSFRKLCGMCKQK
PERb_braFl SATIMGVYLTIVGLVSTVGNATVVLMFMLKWRQLCRKAPNLLIINLAAVDLCISVFGYPFSASSGFANQWLFSDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMTRTILAVVGAWVYGISVAVPPLF--GIA-GYTYESFGLSCTIDF--HGTTVADMVYLSILIILCYVINVAVMGTCYFKIIRKFSKHRFREV-------RDVRTS--------HQHSFERGVT-LRCILMTLFYLISWTPYTAVAVWTMV-GPPP---PVQLGMVAALTAKTHCAFNPILYMLMSEVYRKLVLRTMCPCCFNKISN
PERb_braBe SATIMGVYLTIVGLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFGYPFSASSGYAHQWLFPDAI-CTLYGFSCFLLSMVSMHTLCLISAHRYITICRPE--HA-SKLTMNRTVLAVIGTWLYAIAVAVPPLF--NIA-RYTYEPSGLSCTIDF--RVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRKFSRHRFRQV-------RDIRTS--------HQRSFEMGVT-MRCILMTLFYLLSWTPYTAVCIWTMV-GPPP---PVVVSMAAALIAKTHCAFNPILYAFMSEVYRKLVFRTMCPCCFNRISC
NEUR_homSa ADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLR--PAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLS--YG-VWLKRKHAYICLAAIWAYASFWTTMPLV--GLG-DYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKS-SSKEV-------AHFDSRIHSSHVLEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGRPDSI--PIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEG
NEUR_monDo ADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLVIIWAYATFWATMPLA--GLG-NYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQSSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRT
NEUR_ornAn ADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLR--PAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLS--YG-TWLKRHHAYICLAIIWAYASFWATMPLV--GLG-NYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKS-STKEV-------AHFDSRIQNSHVLEMKLTK--------AMLICAGFLIAWIPYAVVSVWSAFGQPDSI--PIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKN
NEUR_galGa ADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLR--PAEIMTVNLAVCDLGISV-GKPFSIISFFSHRWIFGWMG-CRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLA--YG-TWLKRHHAFICLALIWAYATFWATVPFA--GVG-SYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKS-STKEV-------AHYDTRIQNSHILEMKLTKV-------AMLICAGFLIAWIPYAVVSVWSAFGQPDSV--PIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLK
NEUR_anoCa ADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLK--PAEIMTVNLAVCDLGISV-GKPFSIIAFFSHRWIFGWSG-CRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLS--YG-TWLKRHHVFICLGIIWSYAAFWATIPFA--GFG-NYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQS-STKEV-------AHYDTRIQNQHVLEMKLTKV-------AMLICAGFMFAWIPYAVVSVWSAFGRPDSV--PIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSRY
NEUR_xenTr ADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLR--PAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNA-CRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLR--YG-TWLKRRHAFIALAVIWAYATLWATLPLV--GVG-NYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKS-SAKEV-------AHFDTRNQNNHTLEIKLTK--------AMLICAGFLIAWFPYAVVSVWSAFGQPDSI--PIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKD--KSLQNTTSRY
NEUR_gasAc ADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLK--PPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEG-CRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLR--YG-TWLKRQHAFLCLVFVWMYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKS-SAKEI-------SNFDARIKNSHNLEIKLTKTRNCATEDAMLICAGFLIAWIPYAVVSVVSAFGEPDSV--PISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFR
NEUR_calMi GLLSTLGNGYVIYLSITQKRKLK--PPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVG-CRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQ--YG-SWLQRRHVFMSLAFIWFYAAFWATMPLV--GWG-NYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKS-SAKEV-------AHFDSRIQNHHSLEMNLTK
MEL1_homSa AHYTLGTVILLVGLTGMLGNLTVIYTFCRSRS-LR-TPANMFIINLAVSDFLMSFTQA-PVFFTSSLYKQWLFGETGCEFYAFCGALFGISSMITLTAIALDRYLVITRPL--ATFGVASKRRAAFVLLGVWLYALAWSLPPFF--GWS-AYVPEGLLTSCSWDYMS--FTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGRALQTFGAC------KGNGESLWQRQ-RLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVL--TPYMSSVPAVIAKASAIHNPIIYAITHPKYRVAIAQHLPCLGVLLGVS
MEL1_monDo AHYTIGATILAVGFTGVLGNLLVIYTFCR----LR-TPANMFIINLAISDFFMSFTQA-PVFFASSMYKRWIFGEKACEFYAFCGALFGITSMITLMAIALDRYFVITRPL--ASIGVISKKKTGFILLGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYTT--FTPSVRAYTMLLFCFVFFIPLIVIIYCYIFIFRAIQDTNKAVHSIGSG------ESTA-SPRHCQ-RMKNEWKMAKIALVVILLYVLSWAPYSTVALVAFAGYSHIL--TPYMNSVPAIIAKASAIHNPIIYAISHPKYRMAIAQNFPCLRALLCVR
MEL1_xenTr VHYVVGAVILAVGITGMLGNFLVIYAFCRSRS-LR-SPANMFIINLAITDFLMSVTQA-PVFFATSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIAVDRYFVITRPL--TSIGVMSKKRAVLILSGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNRAVQKIGTD------N-NKESHKQYQ-KMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAGYASIL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYIPCLGSLLRVK
MEL1_galGa AHYTIGTVILIVGITGTLGNFLVIYAFCRSRT-LQ-KPANIFIINLAVSDFLMSITQS-PVFFTNSLHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITKPL--ASVRVMSKKKALIILVGVWLYSLAWSLPPFF--GWS-AYVPEGLLTSCSWDYMT--FTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANKSVQTFGCK------HGNRELQKQYH-RMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAGYSHVL--TPFMNSVPAVIAKASAIHNPIIYAITHPKYRTAIATYVPCLGFLLRVS
MEL1_calMi AHYIIGATILAVGVTGMVGNFLVIYAFLRSRS-LR-TPANTFIINLAATDFLMSVTQS-PIFFITSIHKRWIFGEKGCELYAFCGALFGITSMITLMVIALDRYFVITRPL--ASIGVLSHRRAGLIILSLWLYSLAWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFCFVFFIPLGVIIYCYIFIFRAIKSTNKKVG----G------STNRESQKQHQ-RMKNEWKMAKIALIVILLFVISWSPYSTVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRMAIAKYVPLLGLLLRVS
MEL1_danRe AHYTIGAVILTVGITGMLGNFLVIYAFSRSRT-LR-TPANLFIINLAITDFLMCATQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLMVIAVDRYFVITRPL--ASIGVLSQKRALLILLVAWVYSLGWSLPPFF--GWS-AYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFIPLIVIIYCYFFIFRSIRTTNEAVGKINGD-------NKRDSMKRFQ-RLKNEWKMAKIALIVILMYVISWSPYSTVALTAFAGYSDFL--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLAIAKYIPCLRLLLCVP
MEL1_takRu AHYTIGSVILVIGITGMIGNFLVIYAFCRSRS-LR-TPANMFIINLAVTDLLMCVTQT-PIFFTTSMYKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRAFVILMTVWIYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FSPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRATNKAVGKVNGS--VHSHSRRRESVKNFQ-RLQNEWKMAKIALMVILLYVISWSPYSCVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRLALAKYIPCLGFLLCIS
MEL1_gasAc AHYTIGSVILAIGITGIIGNVLVIYAFSKSRS-LR-TPANMFIINLAITDLLMCVTQA-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIALDRYFVITRPL--TSIGMMSRRRALLILMGAWTYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRVTNRAVGKMNGS--IHSHGSGRDSTKNFH-RLQNEWKMAKIALIVILLYVVSWSPYSAVALTAFAGYADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRIALAKYIPFLGVLLCVP
MEL1_oryLa AHYTIGSVILAIGITGIIGNFLVIYAFSRSRS-LR-TPANMFIINLAITDLLMCVTQS-PIFFTTSMHKRWIFGEKGCELYAFCGALFGICSMITLTVIAIDRYFVITRPL--TSIGVLSRKRALLILSAAWAYSLGWSLPPFF--GWSGAYVPEGLLTSCTWDYMT--FTPSVRAYTMLLFIFVFFLPLFIIIYCYVFIFRAIRSTNRAVGKINGN--T------RDAVKSFN-RLQNEWKMAKIALIVILLYVISWSPYSTVALTAFAGYADML--TPYMNSIPAVIAKASAIHNPIIYAITHPKYRMALAKYIPGLGVLLCIH
MEL1D_danR AHYTIGSVILAVGITGMVGNLLVMYAFCKSRS-LR-TPANMFIINLAVTDFLMCVTQT-PIFFTTSLHKRWIFGEKGCELYAFCGALFGICSMITLMIIAVDRYFVITRPL--ASIGVMSRKRALLILSAAWAYSMGWSLPPFF--GWSGAYVPEGLLTSCSWDYMT--FSPSVRAYTMLLFTFVFFIPLFVIIYCYFFIFKAIRETNRAVGKINGE------GGPRDSIKKIH-RMKNEWKMAKIALIVILLYVISWSPYSCVALTAF--YADML--TPYMNSVPAVIAKASAIHNPIIYAITHPKYRSAIAKYIPCLGVLLCVP
MEL2_galGa VLYTVGTCVLVIGSIGIIGNLLVLYAFYSNKK-LR-TPQNFFIMNLAVSDFLMSASQA-PICFVNSLHREWILGDIGCDLYAFCGALFGITSMMTLLAISVDRYLVITKPL--RSIQWTSKKRTIQIIAAVWLYSLGWSVAPLL--GWS-SYVPEGLMISCTWDYVT--YSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGRDVQKLGSC---------SRKSFLSQ-SMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAGRGNTL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIHNAVPCLRFLIRIS
MEL2_xenLa VLYTIGSFILIIGSVGIIGNMLVLYAFYRNKK-LR-TAPNYFIINLAISDFLMSATQA-PVCFLSSLHREWILGDIGCNVYAFCGALFGITSMMTLLAISINRYIVITKPL--QSIQWSSKKRTSQIIVLVWMYSLMWSLAPLL--GWS-SYVPEGLRISCTWDYVT--STMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGRNVQKLGSY---------GRQSFLSQ-SMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAGHGKSL--TPYSKTVPAVIAKASAIYNPIIYGIIHPKYRETIHKTVPCLRFLIREP
MEL2_anoCa VLYTVGSCVLVIGCIGITGNLLVLYAFYSNKR-LR-TPPNYFIMNLAVSDFLMSATQA-PICFLNSMHKEWVLGDIGCNLYAFCGALFGITSMITLLAISVDRYCVITKPL--QSIKRTSKKRTCIIIVFVWLYSLGWSVCPLF--GWS-SYIPEGLMISCTWDYVT--YSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR------------------RKSSISH-SIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS
MEL2_tetNi VHYIIAFFVFVIGILGITGNVLVIFAFYSNKK-LR-SLPNYFIVNLAVSDLLMASTQS-PIFFIN-LYKEWMFGETACKMYAFCGALFGITSMINLLAISVDRYVVITKPL--QTIRRSSKRRTALAILMVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLAIILCCYLLMFLAIRKTSRR----------------KSTLIQQK-SIRSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---TL--TPYSKSVPAVIAKASAIYNPIIYAIIHPRYRKTIRSAVPCLRFLIPIS
MEL2_gasAc AHYIVAVFVVVIGTLGITGNALVMLAVYSNKK-LR-NLPNYFIMNLAVSDFLMAFTQS-PIFFINCLYKEWAFGETGCKIYAFCGALFGIASMINLLAISIDRYLVITKPL--QAIHWGSKRRTTLAILLVWLYSLAWSLAPLV--GWG-SYIPEGLMTSCTWDYVT--YTLANRSYTMMLCCFVFFIPLGIILYCYLFMFLAIRKTSRR----------------KSTLIKQK-SMKSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG---IL--SPYSKAVPAIIAKASAIYNPFIYAIIHNKYRMTLAAKFPCLRFLSPTP
MEL2_danRe VHYIIAFLILIIGTLGVSGNALVMFAFYRNKK-LR-SLPNYFIMNLAVSDFLMAITQS-PIFFINCLYKEWMFGELGCKIYAFCGALFGITSMINLLAISIDRYLVITKPL--QTIQWNSKRRTGLAILCIWLYSLAWSLAPLI--GWG-SYIPEGLMTSCTWDYVS--PSPANKSYTMMLCCFVFFIPLSIILYCYLFMFLSVRQASRQ----------------KSSFVKQQ-SMRSEWKLAKIAAVVIVVYVLSWAPYACVTLVAWAG----L--TPYSKTLPAVLAKSSAIYNPFIYAIIHNKYRATLAEKVPGLSCLSRSQ
MEL1a_braF AHYIVGTAVFCVGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVPEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAVFAKSSAVYNPIVYAITHPKFRAAVKKHIPCLSGCLPAD
MEL1a_braB AHYIVGTAVFCIGCCGMFGNAVVVYSFIKSKG-LR-TPANFFIINLALSDFLMNLTNM-PIFAVNSAFQRWLLSDFACELYGFAGGLFGCLSINTLMAISMDRYLVITKPF--LVMRIVTKQRVMFAILLLWIWSLVWALPPLF--GWS-AYVSEGFGTSCTFDYMT--PKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKE--M-AHEDVKNKAQQER-QRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLV--TPYLQSIPAMFAKSSAVYSPIVYAITYPKFREAVKKHIPCLSGCLPAS
MOLL_RHO_l VYYSLGIFIAICGIIGCAGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPMMTISCFLKHWVFGQAACKVYGLIGGIFGLTSIMTMTMISIDRYNVIRRPM--SASKKMSHRKAFIMIVFVWIWSTIWAIGPIF--GWG-AYQLEGVLCNCSFDYIT--RDASTRSNIVCMYIFAFMFPIVVIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQSLLSWSPYAIVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAIASNFPWILTCCQ
MOLL_RHO_s VYYSLGIFIGICGIIGCTGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFIKKWVFGMAACKVYGFIGGIFGLMSIMTMSMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSTLWSIGPIF--GWG-AYVLEGVLCNCSFDYIT--RDSATRSNIVCMYIFAFCFPILIIFFCYFNIVMAVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISIVIVTQFLLSWSPYAVVALLAQFGPIEWV--TPYAAQLPVMFAKASAIHNPLIYSVSHPKFREAIAENFPWIITCCQ
MOLL_RHO_t VYYSLGIFIGICGIIGCGGNGIVIYLFTKTKS-LQ-TPANMFIINLAFSDFTFSLVNGFPLMTISCFLKKWIFGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPM--AASKKMSHRRAFIMIIFVWLWSVLWAIGPIF--GWG-AYTLEGVLCNCSFDYIS--RDSTTRSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GANAEMRLAKISIVIVSQFLLSWSPYAVVALLAQFGPLEWV--TPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQ
MOLL_RHO_e VYYSVGIFIGVVGIIGILGNGVVIYLFSKTKS-LQ-TPANMFIINLAMSDLSFSAINGFPLKTISAFMKKWIFGKVACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPM--AASKKMSHRRAFLMIIFVWMWSIVWSVGPVF--NWG-AYVPEGILTSCSFDYLS--TDPSTRSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKEMAAMA-K--R-LNAKE--LRKAQA-GASAEMKLAKISMVIITQFMLSWSPYAIIALLAQFGPAEWV--TPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQ
MOLL_MEL_p WHYIIGVYITIVGLLGIMGNTTVVYIFSNTKS-LR-SPSNLFVVNLAVSDLIFSAVNGFPLLTVSSFHQKWIFGSLFCQLYGFVGGVFGLMSINTLTAISIDRYVVITKPL--QASQTMTRRKVHLMIVIVWVLSILLSIPPFF--GWG-AYIPEGFQTSCTFDYLT--KTARTRTYIVVLYLFGFLIPLIIIGVCYVLIIRGVRRHDQKMLTIT-R--S-MKTED--ARANNK-RARSELRISKIAMTVTCLFIISWSPYAIIALIAQFGPAHWI--TPLVSELPMMLAKSSSMHNPVVYALSHPKFRKALYQRVPWLFCCCK
LOPH_RHO_p WHYAVAAWMTFFGILGVSGNLLVVWTFLKTKS-LR-TAPNMLLVNLAIGDMAFSAINGFPLLTISSINKRWVWGKLWRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPL--GAAQTMTKKRAFIILTIIWANASLWALAPFF--GWG-AYIPEGFQTSCTYDYLT--QDMNNYTYVLGMYLFGFIFPVAIIFFCYLGIVRAIFAHHAEMMATA-K--R-MGAN---TGKADA-DKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIHPLLAEIPVMMAKASARYNPIIYALSHPKFRAEIDKHFPWLLCCCKPK
RHAB_schMe YHYLVGVYISIVGISGVLGNLLVLYIFARAKS-LR-TPPNMFIMSLAIGDLTFSAVNGFPLLTISSFNTRWAWGKLTCEIYGFIGGLFGFISINTMALISLDRYFVIAQPF--QTMKSLTIKRAIIMLVFVWLYSLIWSTPPFF--GYG-NYVPEGFQTSCTFDYLT--QSKGNIIFNIGMYIGNFIIPVGIIIFCYYQIVKAVRVHELEMLKMA-Q--K-MNASHPTSMKTGA--KKADVQAAKISVIIVFLYMLSWTPYAIIALMALTGRRDHL--NPYTAELPVLFAKTSAMYNPFIYAINHPKFRIQLEKKFPCLICCCPPK
MEL1_schMa YYYLVGIYIGIVGILAVMGNSLVITLFLLCKQ-LR-TPPNMLIVSLAISDFSFALINGFPLKTIAAFNHRWGWGKLACELYGFAGSIFGFISLTTMAFIALDRYLVIVQPF--ETFSRITYGKVIVMIFITWIWSALWSIPPFF--GYG-SYIPEGFHTSCTFDYLS--TDLPNLIFNAGLYILGFLCPVFIIIFSYYQIVKTVRLNELELMKMA-Q--S-LDLQNPSAMKTGG-DKKADIEAAKTSIILVLLYLMSWSPYAIVCLMTLIGSRDSL--TPFHSELPVLFAKTSAVYNPIVYAVKHPKFRMEIEKRFPFLICCCPPK
MEL1_capCa IYYGLGLYMAVVGIVGTLGNLVVITLFI--KS-LR-TPPNMFIINLALSDMGFCATNGFPLMTVASFQKLWRWGPVACELYALAGSITGFNSIATLALISMDRYMVIAKPF--YAMKHVSHKRSLIQIILAWTWAFIWSAPPLLRMGYG-RYIPEGFQVSCTFDYLS--RDLKNLIFVWCLFVFGFFIPVLAIACSYVGIIRAVGAQSKEMRKTA-E--K-MGAK---TGKSDK-EKKQDIAMAKVAAGTIGLFLMSWTPYAAVSMIGIAGNRSWI--TPYVSQIPVMFAKASAMWNPILYALSHPKFRAALEDHMPWLLVC
MEL2_schMa YQYAIGLFIAVVGITGMCLNLLVIVFFTMFKS-LR-TPSNILVVNLAISDFGFSAVIGFPLKTMAAFNNFWPWGKLACDLYGLAGGLFGFVSLSTIAAVALDRYLVIATPF--ESVFQTTPRRTLLLMLFLWMWSLMWTIPPLFGFG-K-RYVTEGYQTSCTMDYIS--TDLNNRLFNIGLFGFGFLCPLFLSLFCYARIILIVRSRGKDFIEMAAS--S-KGTNQKEKSANVS-SSKSDTFVSKSSAILLGVYLICWTPYSFVCLMALIGYADYI--TPLMVEIPCLCAKTA---NPCIYAFRYPKFRSLLQQRFGFLRLTKNRV
MEL1_helRo FYYFLGTFFAVVGFLGVFGNIIVVWVFSRTPS-LR-TPSNVLVINLAICDILFSALIGFPMSALSCFQRHWIWGNF-CQFYSFVAGITGLASINCLAVIAVDRYLVVGQPL--AMLNQSHFRRSFYHVLIIWTWACVWSAMPLI--GWG-EYILEGFGVSCTFDYLT--RTTWNISFNVCLFTFCFGMPVSVIILSYIGIIRSIAKNRKEFSSL--------------TAENSS-RARQEIKIAKVFAVCMTAFILCWVPYATVAQLGIYGYDQMV--SPYTAELPVMLAKTSALWNPIIYAFSHPKYRKCLKELPIF
MEL1_strPu MNAVTTALPHGLNKPTIEARWTKS-LR-TPPNMLIVNLAISDFGMVITN-FPLMFASTIYNRWLFGDAGCQFYAFCGALFGIMSIANMTAIALDRYYVICWSL--EAVRSVTHRRSMIIIIIVWCYAIFWSIPPFFGVG---SYVLEGYGLGCTFDFMT--KDLNHYLHVSFLFASSFVVPVTIIIVCFTRIAITVRAHRHELNKMRTKLTEDKDKKHKSSIRRAN-KAKTEFQIAKVGFQVTIFYVLSWMPYSIVAVIGQYFDSDLL--TPLGTVVPVIFAKCSAIWNPIIYCLSHEKFNAALKEKLMGMCGIEIPS
MEL2_strPu AFPLIGGYLLVVVLLGTAGNSLVIYTFLRFKK-LH-SPINLLIVNLSASDLLVATT-GTPLSMVSSFYGRWLFGTNACAFYGFVNYYCGCISLNSLAAISVFRYIIVVRGQ--AQNNKLSLRSSIYAILVIHLYTLIFSTPPLY--GWN-RFVLAGYHTSCDIDFHT--KTPLFVSYICYMFFFLFFLPLGLISWSYFKIYQRVSK--HSNSMRTSFTGVTKEINSDEKHANHR-------RTASTLFVTIVVFLFAWFPYCIVSLWVLIGDANSI--SKLSTTIPSLFAKSSVIYNPLIYVVLNSKFRKALIQTLSFLKCLSKHE
MOLL_MEL_a VHLSVGVFITLVGVLAVCGNSLVIITCIRFKD-LR-TRSNILIINLAVGDLLMCLI-DFPLLAAASFYGEWPYGRQVCQMYAFLTAIAGLVTINTLAVIAADRYWAVVRRP--TPGQKLPKCVTSIAVASVWAYSISWALCPIL--GWG-AYVLDGIRTTCTFDFLT--RTWENRSFVIGMMIGNFVLPFALMVFSYFRIWVAVRKVKSGNVFCAIRHNYNLALGSTLFVKQHRYRLHCEQKTVKIIMFLLIAFTVSWSPYLAVSIIGLFGDRSQL--TYQNTLTASLIAKTSMVFNPILYSISHPKVRKRIANLACCYSVRRHQQ
MOLL_MEL_l CQYTIGIFISTVAVIAVIGNSIVIWAHVRIKS-LS-TTSNMLILNLCVGCLIMCIV-DFPLYATSSFLQKWIFGHKVCEIYATITGTAGLLIMNSYSAIAFDRFITVTRYN--NPNYPRSKSATMCISGFVWIYSLSWSMAPVV--GWS-RYQLDGSGTTCTFDYLS--TTWTNRSFILSIAFFNFVLPLCFILFAYSRILHLISS--HSREMKSYRSAVIISKGKASIPKRFR----SERKTAITLLITVVVFCLSWVPYVIIALIGQFGNQSFI--TPQISVIPQLVAKLSTVTNPILYSLSHPVVRNKLFLRLRHELYRRPSD
CHEL_LWS_l WYSILGVAMIILGIICVLGNGMVIYLMMTTKS-LR-TPTNLLVVNLAFSDFCMMAFMMPTMTSNCFAE-TWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGM--AA-APLTHKKATLLLLFVWIWSGGWT-ILPF-FGWS-RYVPEGNLTSCTVDYLT--KDWSSASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQLREQAKK-----MNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIISWAGVFSSGTRL--TPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLACGSGE
CHEL_LWS_i WHSLLGFAMVILGVISVVGNSMVIYIMTTSKS-LR-SPTNMLVVNLAFSDWCMMAFMMPTMAANCFAE-TWILGPFMCEVYGMVGSLFGCGSIWSMVMITLDRYNVIVRGV--AA-APLTHKRAALMIFFVWFWALTWT-LLPF-FGWS-RYVPEGNMTSCTIDYLT--KALWSASYVVAYAGGVYWTPLFINIYCYSKIVRAVAQHEKQLRLQARK-----MNVASLRANAEQTKTSAEARLAKIALMTVGLWFMAWTPYLTIAWAGIFSDGSKL--TPLATIWGSVFAKANACYNPIVYGISHPKYRAALARRFPSLVCMPPGG
INSE_LWS1_ WHKILGLVMIILGIMGWCGNGVVVYVFIMTPS-LR-TPSNLLVVNLAFSDFIMMGFMCPPMVICCFYE-TWVLGSLMCDIYAMVGSLCGCASIWTMTAIALDRYNVIVKGM--SG-TPLTIKRAMLQILGIWLFGLIWT-ILPL-VGWN-RYVPEGNMTACGTDYLS--QDWTFKSYILVYSFFVYYTPLFTIIYSYYFIVSAVAAHEKAMKEQAKK-----MNVTSLRSGDNQNTSA-EAKLAKVALTTISLWFMAWTPYLVINYIGIFNR-SLI--TPLFTIWGSLFAKANAIYNPIVYGISHPKYRAALKEKLPFLVCGSTED
INSE_LWS2_ WHGILGFVIGMLGFVSAMGNGMVVYIFLSTKS-LR-TPSNLFVINLAISNFLMMFCMSPPMVINCYYE-TWVLGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLSINGALIRIIAIWLFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYFN--RGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINFSGIFNL-VKI--SPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLACAAEPS
INSE_LWS_c WHAILGFVIGILGMISVIGNGMVIYIFTTTKS-LR-TPSNLLVINLAISDFLMMLSMSPAMVINCYYE-TWVLGPLVCELYGLTGSLFGCGSIWTMTMIAFDRYNVIVKGL--SA-KPMTINGALLRILGIWFFSLGWT-IAPM-FGWN-RYVPEGNMTACGTDYLT--KDLLSRSYILVYSFFCYFLPLFLIIYSYFFIIQAVAAHEKNMREQAKK-----MNVASLRSAENQSTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-VKI--NPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFQRFPSLACSSGPA
INSE_LWS_p WHGLLGFTIGVLGFISITGNGMVVYIFTSTKS-LK-TPSNLLVVNLAFSDFLMMLCMAPPMLINCYYE-TWVFGPLACELYACAGSLFGSISIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRILGIWLFSLAWT-IAPM-LGWN-RYVPEGNMTACGTDYLS--KSWLSRSYILVYSIFVYYTPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFET-API--SPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYQKFPSLACQPSA
INSE_LWS_m WHALLGFTIGVLGFVSISGNGMVIYIFMSTKS-LK-TPSNLLVVNLAFSDFLMMCAMSPAMVVNCYYE-TWVWGPFACELYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPMTSNGALLRILGIWVFSLAWT-LLPF-FGWN-RYVPEGNMTACGTDYLS--KSWVSRSYILIYSVFVYFLPLLLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSEAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYTGVFES-API--SPLATIWGSLFAKANAVYNPIVYGISHPKYQAALYAKFPSLQCQSAP
INSE_LWS_v WHGLLGFVIGILGFISITGNGMVIYIFTTTKS-LK-TPSNILVVNLAFSDFLMMCVMSPPMVVNCYTE-TWVFGPLACQLYACAGSLFGCASIWTMTMIAFDRYNVIVKGI--AA-KPLTINGAMLRVLGIWVFSLAWT-VAPL-FGWG-RYVPEGNMTACGTDYLD--KSWFNRSYILIYSIFCYFSPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFET-ATI--TPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYARFPALACQPSP
INSE_LWS_t WHGILGFVIGVLGFVSIVGNGMVIYIFSSTKA-LR-TPSNLLVVNLAFSDFLMMXCMSPAMVINCYNE-TWVLGPLVCELYGMSGSLFGCASIWTMTFIALDRYNVIVKGL--SA-QPLTKKGAMLRILIIWVFSTLWT-IAPF-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYAVWVYFVPLFTIIYSYWFIVQAVAAHEKSMREQAKK-----MNVASLRSSEAAQTSA-ECKLAKIALMTITLWFFAWTPYLVTNFTGIFEG-AKI--SPLATIWCSLFAKANAVYNPIVYGISHPKYRQALQKKFPSLVCAGEP
INSE_LWS_s WHGLLGFVIGVLGVISVIGNGMVIYIFSTTKS-LR-TPSNLLVVNLAFSDFLMMFTMSAPMGINCYYE-TWVLGPFMCELYALFGSLFGCGSIWTMTMIALDRYNVIVKGL--SA-KPMTNKTAMLRILFIWAFSVAWT-IMPL-FGWN-RYVPEGNMTACGTDYLT--KDWVSRSYILVYSFFVYLLPLGTIIYSYFFILQAVSAHEKQMREQRKK-----MNVASLRSAEASQTSA-ECKLAKVALMTISLWFFGWTPYLIINFTGIFET-MKI--SPLLTIWGSLFAKANAVFNPIVYGISHPKYRAALEKKFPSLACASSS
INSE_LWS_b WHGILGFVIGLLGFISVSGNGMVVYIFLSTKS-LR-TPSNMFVINLAISDFLMMFCMSPPMVINCYYE-TWVLGPLFCQVYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGL--SG-KPLTINGALLRILGIWLFSLIWT-IAPM-FGWN-RYVPEGNMTACGTDYFS--KDIVSVSYILLYSIWVYFFPLFLIIWSYWFIXQAVAAHEKNMREQAKK-----MNVASLRSSENQNTSA-ECKLAKVALMTISLWFMAWTPYLVINWSGIFSL-VKI--SPLYTIWGSLFAKANAV
INSE_LWS_d WFGIIGFVIAILGTMSLAGNFIVMYIFTSSKG-LR-TPSNMFVVNLAFSDFMMMFTMFPPVVLNGFYG-TWIMGPFLCELYGMFGSLFGCVSIWSMTLIAYDRYCVIVKGM--AR-KPLTATAAVLRLMVVWTICGAWA-LMPL-FGWN-RYVPEGNMTACGTDYFA--KDWWNRSYIIVYSLWVYLTPLLTIIFSYWHIMKAVAAHEKAMREQAKK-----MNVASLRNSEADKSKAIEIKLAKVALTTISLWFFAWTPYTIINYAGIFES-MHL--SPLSTICGSVFAKANAVCNPIVYGLSHPKYKQVLREKMPCLACGKDDL
CRUS_LWS_m WYGILAFVVTVVGLCSICGNFVVIWVFMNTKA-LR-SPANTLVVSLAVSDFIMMACMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGI--SG-TPLSQKNTTLQVLFVWICSIMWC-VFPF-FGWN-RYVPRGDMTACGTDYLT--EDEFSRSYLYVYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-ECRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKANAVYNPIVYAISHPKYRAALYKKLPCLACSTESA
CRUS_LWS_n WYGILAFVVTVVGLCSICGNFVVIWVIMNTKA-LR-SPANTLVVSLAVSDYIMMTCMFPPLVLNCYWG-TWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGV--SG-KPLSQKNATLQVLFVWICSIMWC-VFPF-FGWN-RYVPEGNMTACGTDYLT--EDEFSRSYLYIYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKK-----MGVKSLRTEEAKKTSA-GCRLAKVALTTVSLWFMAWTPYLIINWAGMFYP-SVV--SPLFSIWGSVFAKSNAVYNPIVYAISHPKYRAALYKKLPCLACSTESA
INSE_MWS_d WAKILTAYMIIIGMISWCGNGVVIYIFATTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLYFE-TWVLGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGM--AG-RPMTIPLALGKIAYIWFMSTIWCCLAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYLVINCMGLFKF-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVFGKVDD
INSE_MWS_c WAKFLAAYMVLIATISWCGNGVVIYIFSTTKS-LR-TPANLLVINLAISDFGIMITNTPMMGINLFYE-TWVLGPLMCDIYGGLGSAFGCSSILSMCMISLDRYNVIVKGM--AG-QPMTIKLAIMKIALIWFMASIWT-LAPV-FGWS-RYVPEGNLTSCGIDYLE--RDWNPRSYLIFYSIFVYYLPLFLICYSYWFIIAAVSAHEKAMREQAKK-----MNVKSLRSSEDADKSA-EGKLAKVALVTISLWFMAWTPYTIINTLGLFKY-EGL--TPLNTIWGACFAKSAACYNPIVYGISHPKYGIALKEKCPCCVFGKVDD
CRUS_LWS_c MYPLLLVFMLITGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMITCYYH-TWTLGATFCEVYAFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILTVWVLSFTWC-VAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLAITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS
CRUS_LWS_p MYPLLLIFMLFTGILCLAGNFVTIWVFMNTKS-LR-TPANLLVVNLAMSDFLMMFTMFPPMMVTCYYH-TWTLGPTFCQVYGFLGNLCGCASIWTMVFITFDRYNVIVKGV--AG-EPLSTKKASLWILIVWVLSLAWC-MAPF-FGWN-RYVPEGNLTGCGTDYLS--EDILSRSYLYIYSTWVYFLPLTITIYCYVFIIKAVAAHEKGMRDQAKK-----MGIKSLRNEEAQKTSA-ECRLAKIAMTTVALWFIAWTPYLLINWVGMFAR-SYL--SPVYTIWGYVFAKANAVYNPIVYAIS
INSE_LWS_h WHGLLGFVIGVLGFISVTGNGMVVYIFTTTKS-LK-TPSNILVVNLAFSDFLMMFMMAPPMVINCYNE-TWVFGPLACQLYACAGSLYGCVSIWTMTMIAFDRYNVIVKGI--AA-KPMTINGALLRVFGIWAFSLAWT-IAPL-FGWG-RYVPEGNMTACGTDYFD--QSFSNRSYILLYSIACYYAPLFLIIYSYFFIVQAVAAHEKAMREQAKK-----MNVASLRSSDAANTSA-ECKLAKVALMTISLWFMAWTPYLVINYAGIFKTMT
CRUS_LWS_e WYGLLGFVIFCLGCLSVFGNSVVIWVFTSTKT-LR-SPANMLVVNLALSDFLMMANMSPPTVHSCYHG-TWMLGPTYCEYYALVGSLSGCISIWTMVWITLDRYNVIVKGV--AA-TPLTNKGAFARNIFSWLSALIWC-VSPL-YGWN-RYVPEGNMTACGTDYLT--DDWLSHSYLYAYTFWVYLFPFFIIVYCYTYIVSAVFAHEKGMRDQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALVTVSLWFIAWTPYCVINVTGMWDK-TKI--TPLFTIWGSL
CRUS_LWS_a WYGLLGFVIFCLGILSVCGNAVVIWVFMNTKS-LR-SPANLLVVNLAFSDFLMMLNMFPPMVHSCYHG-TWMLGAFFCEFYGFTGSLFGCISIWTMVFITMDRYNVIVKGV--AA-EPLTSKGASIRILFVWTVAFAWT-ILPF-FGWN-RYVPEGNLTACGTDYLT--EDSTSHLYLYMYASWAYYTPLLYIIYAYTFIVQAVSAHEKGMREQAKK-----MGVKSLRNEEAQKTSA-ECRLAKVALMTVSLWFMAWTPYMIINFTGMNDR-TKL--TPLCTIWGSL
CRUS_LWS_h WYGLLGFWMTVMGTLSVAGNFVVIWVFMNTKS-LR-TPANLLVVNLAISDFFMMLTMTPPLLANAYWG-TWILGAFFCEVYAFLGSFFGCVSIWSMVFITADRYNVIVKGV--SA-EPLTSGGAMMRIAGTWAFTLAWC-LPPF-FGWN-RYVPEGNMLACGTDYLT--ETELSRSYLYVYSVWVYLFPLAYIIYSYTFIVKAVAAHEKGMREQAKK-----MGVKSLRSEEAQKTSA-ECRLCKVALMTVTLWFMAWTPYFIINWGGMFNK-PMV--TPLFS
CRUS_MWS_h WHYLLGVVYLFLGVISIAGNGLVIYLYMKSQA-LK-TPANMLIVNLALSDLIMLTTNFPPFCYNCFSGGRWMFSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGF--NG-PKLTQGKATFMCGLAWVISVGWS-LPPF-FGWG-SYTLEGILDSCSYDYFT--RDMNTITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAAMRAQAKK-----MNVTNLRSNEAETQRA-EIRIAKTALVNVSLWFICWTPYAAITIQGLLGNAEGI--TPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCVHEKDP
INSE_UVV_c LHYLLAIVYILFTFVALFGNGLVIWIFCSAKS-LR-TPSNLFVVNLAFCDFMMMLKA-PIFIYNSFHT-GFATGHLGCQIFACMGSLSGIGAGMTNAAIAYDRYSTIARPL--DG-K-LSRGQVLLLIMLIWTYTIPWALMPLM-QVWG-RFVPEGFLTSCSFDYLT--DSQEIRYFVPTIFTFSYCVPMLLIIYYYSQIVGHVVSHEKALREQAKK-----MNVESLRSNVNTNAQSAEIRIAKAAITICFLFVLSWTPYGALAMIGAFGNRALL--TPGITMIPACACKFVACLDPYVYAISHPRYRLELQKRLPWLELQEKP
INSE_UVV_a LHYLLALLYILFTFLALLGNGLVIWIFCAAKS-LR-TPSNMFVVNLAICDFFMMIKT-PIFIYNSFNT-GFALGNLGCQIFAVIGSLTGIGAAITNAAIAYDRYSTIARPL--DG-K-LSRGQVILFIVLIWTYTIPWALMPVM-GVWG-RFVPEGFLTSCSFDYLT--DTNEIRIFVATIFTFSYCIPMILIIYYYSQIVSHVVNHEKALREQAKK-----MNVDSLRSNANTSSQSAEIRIAKAAITICFLYVLSWTPYGVMSMIGAFGNKALL--TPGVTMIPACTCKAVACLDPYVYAISHPKYRLELQKRLPWLELQEKP
INSE_UVV_m AHTALALLYIFFTFAALVGNGMVIFIFSTTKS-LR-TSSNFLVLNLAILDFIMMAKA-PIFIYNSAMR-GFAVGTVGCQIFALMGAYSGIGAGMTNACIAYDRHSTITRPL--DG-R-LSEGKVLLMVAFVWIYSTPWALLPLL-KIWG-RYVPEGYLTSCSFDYLT--NTFDTKLFVACIFTCSYVFPMSLIIYFYSGIVKQVFAHEAALREQAKK-----MNVESLRANQGGSSESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGNQQLL--TPGVTMIPAVACKAVACISPWVYAIRHPMYRQELQRRMPWLQIDEPD
INSE_UVV_p AHTMLALVYVFFTAAALIGNGLVIFIFSASKS-LR-TPSNLLVVQLAVLDFLMMLKA-PIFIYNSIKR-GFASGVIGCQIFAFMGSVSGTAAGLTNACIAYDRHSTITRPL--DG-R-LSRGKVLLMMVCVWLYTAPWAILPQL-QIWG-RYVPEGFLTSCTFDYLT--TTFDNKLFVASMFVCVYIFPMIAILYFYSGIVKQVFAHEAALREQAKK-----MNVDSLRSNQNAAAESAEIRIAKAALTVCFLYVASWTPYGVMSLIGAFGDQNLL--TPGVTMIPALACKGVACIDPWVYAISHPKYRQELQKRMPWLQIDEPD
INSE_UVV_d MHYMLGVFYIFLFCASTVGNGMVIWIFSTSKS-LR-TPSNMFVLNLAVFDLIMCLKA-PIFIYNSFHR-GFALGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPM--NR-N-MTFTKAVIMNIIIWLYCTPWVVLPLT-QFWD-RFVPEGYLTSCSFDYLS--DNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKALREQAKK-----MNVESLRSNVDKSKETAEIRIAKAAITICFLFFVSWTPYGVMSLIGAFGDKSLL--TPGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGVNEKS
INSE_BLU_m WHYVLALIYTMLMVTSLTGNGIVIWIFSTSKS-LR-SASNMFVINLAVFDLMMMLEM-PLLIMNSFYQ-RLVGYQLGCDVYAVLGSLSGIGGAITNAVIAFDRYKTISSPL--DG-R-INTVQAGLLIAFTWFWALPFTILPAF-RIWG-RFVPEGFLTTCSFDYFT--EDQDTEVFVACIFVWSYCIPMALICYFYSQLFGAVRLHERMLQEQAKK-----MNVKSLASNKEDNSRSVEIRIAKVAFTIFFLFICAWTPYAFVTMTGAFGDRTLL--TPIATMIPAVCCKVVSCIDPWVYAINHPRYRAELQKRLPWMGVREQDP
INSE_BLU_a FHIGLAIIYSMLLIMSLVGNCCVIWIFSTSKS-LR-TPSNMFIVSLAIFDIIMAFEM-PMLVISSFME-RMIGWEIGCDVYSVFGSISGMGQAMTNAAIAFDRYRTISCPI--DG-R-LNSKQAAVIIAFTWFWVTPFTVLPLL-KVWG-RYTTEGFLTTCSFDFLT--DDEDTKVFVTCIFIWAYVIPLIFIILFYSRLLSSIRNHEKMLREQAKK-----MNVKSLVSN-QDKERSAEVRIAKVAFTIFFLFLLAWTPYATVALIGVYGNRELL--TPVSTMLPAVFAKTVSCIDPWIYAINHPRYRQELQKRCKWMGIHEP
INSE_BLU_d YHAGFYIAFIVLMLSSIFGNGLVIWIFSTSKS-LR-TPSNLLILNLAIFDLFMCTNM-PHYLINATVG-YIVGGDLGCDIYALNGGISGMGASITNAFIAFDRYKTISNPI--DG-R-LSYGQIVLLILFTWLWATPFSVLPLF-QIWG-RYQPEGFLTTCSFDYLT--NTDENRLFVRTIFVWSYVIPMTMILVSYYKLFTHVRVHEKMLAEQAKK-----MNVKSLSANANADNMSVELRIAKAALIIYMLFILAWTPYSVVALIGCFGEQQLI--TPFVSMLPCLACKSVSCLDPWVYATSHPKYRLELERRLPWLGIREKHA
MEL1b_braF MQLVFGSMMLVFGLIGVVGNAVALYAFCRSRS-LR-RPKNYLIANLCLTDMVVCLVYSPIIVTRSLSH--GLPSKESCIVEGFVVGLGSIVSICSLAGIAVERYVTITQPI--KSLSILTHRALLGAVSAVWVYAFLLAFPPLV--GWG-RYVSEESKISCTFDYLS--TDDATRAHVIVLVIGAFGLPFSVITYCYVRSFATVRKCTKERKQM---------------SPLAKSDSRSEVKAAVNSFVITTSFCLCWCPYAVVATMGVSGFTV----HSHAVFIAALLAKLSVLFNPVAYVLSIPN
MEL1b_braB MQLIFGSMMLVFGLIGVVGNVVALYAFCRTRS-LR-RPKNYVVANLCLTDMFVCLVYCPIVVSRSFSH--GFPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPL--KSLTILTQRKLLVAVLTVWVYSLLLAFPPLV--GWG-RYVREETYISCTFDYLS--TDDATRAYVITLVMGAFGFPLLTIAYCYIRVFTTARKHAEERKFM---------------SPLKRPESRTEIKTAVTACVITTSFCLCWCPYAVVATLGISGVSV----QQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASEDVV
MEL_nemVec WVIAQVVLWGCIFVISSLGNSLVLLCIVKSNR--LHSSIYAFYGSLAASDCIAGMLCCPLLLVTALHQLWIMGK-VMCHVYSTLLSTSLNASIATLCLISMDRLNAVRKPFEYRGHNTFTQRWCKWLLVLSWVHSIFWAAAPLG--GWG-EIITDSATYTCKPNWSA--ASIVNRSYSLCLALFPFAFPVFLMVAIYCVIYRHTKKCSNLMS----------GLEDGRNLVAEQERQMRERRLFRTVLIIIGAFAACWAIYTLATTCKLFIGQTP---PTWLVQLGLICAIAGSCVNPVIYTIRDATFARELGRLHPCLAWLLKQS
LWS_nemVec TSFTAIALLVIMLLTIIGNLMVCYVVLSNKR--LWTEMNMFLVNLAFGDLAVGLICMVFPLITAIKREWIFGRGILCQLNACCNSVLFCSTIFTHTVISIDRYIVIVHPMK----KIMTRKKAALMIVGVWVFSVFIVLGPVF--GWG-RMEYNASTLQCGFGFPR--DKMASM-YIVIVAIIAFIIPLLIMTYTYIRIYISVLEHTRRMS-------------ETATAMQQQAVFSAQKRIVFTFFIALLAFFACWAPFFSFIAFAVVVKNPHD-IPHGLGLASYVCGFINSACNPFIIGLRSKQFKSGFSRILCCCRGRDP
Consensus ............g......N..V.......k. LR .P.N...vNLA..Dl.....................g....C..yg%.....G..s...$..ia.#RY.v!..P. ..........a.......W.....w...Pl. GW. .Y.pEg..tsC..#w..........s%....f...%..Pl.!I.%.Y..i............... . .................E.....m...m!..F...W.PYa.............. .p.....P..fAK.s..%NP!IY......%R...................
Structural and functional markers along the opsin molecule:
>RHO1_homSap rhodopsin <----------TM1---------> c1 <----------TM2-------> x1 <c--i------TM3---------> c2 <----------TM4--------> x2 <----------TM5------c-> c3
MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE
AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0
c3 <----------TM6-------> x3 <--------b-TM7---gprot> helix8 palm cyto tail
See also: Curated Sequences | Ancestral Introns | Informative Indels | Ancestral Sequences | Cytoplasmic face | Update Blog
