Opsin evolution: Difference between revisions

From genomewiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 663: Line 663:
0 ISLVELGTVDRSSP...IMANKHLNGLGQRKGDSFTRRSPSSRIPSIVVTHSNHQGSPAAVRHNSTLPGIKVSNSQDREKELKRQIEKVKQYVPIVTITSDTENSTGGFSNELLPANTS* 0
0 ISLVELGTVDRSSP...IMANKHLNGLGQRKGDSFTRRSPSSRIPSIVVTHSNHQGSPAAVRHNSTLPGIKVSNSQDREKELKRQIEKVKQYVPIVTITSDTENSTGGFSNELLPANTS* 0


>MEL1a_Branchiostoma belcheri melanopsin
>MEL_braFlo Branchiostom floridae 12 exons + chrUn:47589027-47597447 size 8421 +tandem dup assembly error
0 MTELPSFQPPTNSTEEENAVFPTALTEWISE 0
0 VGNQVGEAALKLLSGEGDGMEVTPTPGCTGNASVCNGTDSGGGVVWDIPPLAHYIVGTAVFCVGCCGMFGNAVVVYSFIK 2
1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1
2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0
0 VMFAILLLWIWSLVWALPPLFGWSAYVPEGF 1
2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0
0 AQQERQRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLVTPYLQSIPAVFAKSSAVYNPI 1
2 VYAITHPKFRAAVKKHIPCLSGCLPADEEETKTKTRGATTTASMSMTQTTAPTV 0
0 HDPQASVHSGSSVSVDDSSGVSRQDTMMVK 0
0 VEVDNRMEKAGGGAADTAPKDGTSVPTVSAQIEVRPSGNVNTKAEVIPSPQSAAVAHGASASPVPK 0
0 VAELSSSVSLESAAIPGKIPTPLPSQPIAAPIERHMAAMADDPPPKPRGVATTVNVRRSESGYERSQDSLRKK 0
0 AVSETRSRSFNSTKDHFASERQTSTTLNQPRDMYSGDMVKKTRQSPEKQEYDNPAFDAGIAEIDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDMSINLGKASLMLTEAHDETVL* 0
 
>MEL_braBel Branchiostoma belcheri (Japanese lancelet) AB205400 Amphi-mop
0 MTEIPSFQPPINATEVEEENAVFPTALTEWFSE 0
0 MTEIPSFQPPINATEVEEENAVFPTALTEWFSE 0
VGNQVGEVALKLLSGEGDGMEVTPTPGCTGNGSVCNGTDSGGVVWDI
0 VGNQVGEVALKLLSGEGDGMEVTPTPGCTGNGSVCNGTDSGGVVWDIPPLAHYIVGTAVFCIGCCGMFGNAVVVYSFIK 2
PPLAHYIVGTAVFCIGCCGMFGNAVVVYSFIKSKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDFACEL
1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1
YGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQRVMFAILLLWIWSLVWALPPLFGWSAYVSEGFGTSCTFDY
2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0
MTPKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNKAQQERQRKNEIKTAKIAFIVI
0 VMFAILLLWIWSLVWALPPLFGWSAYVSEGF 1
SLFMSAWTPYAVVSALGTLGYQDLVTPYLQSIPAMFAKSSAVYSPIVYAITYPKFREAVKKHIPCLSGCLPASEEETKTK
2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0
TRGQSSASASMSMTQTTAPVHDPQASVDSGSSVSVDDSSGVSRQDTMMVKVEVDKRMEKAGGGAADAAPQEGASVSTVSA
0 AQQERQRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLVTPYLQSIPAMFAKSSAVYSPI 1
QIEVRPSGKVTTKADVISTPQTAHGLSASPVPKVAELGSSATLESAAIPGKIPTPLPSQPIAAPIERHMAAMADEPPPKP
2 VYAITYPKFREAVKKHIPCLSGCLPASEEETKTKTRGQSSASASMSMTQTTAPV 0
RGVATTVNVRRTESGYDRSQDSQRKKVVGDTHRSRSFNTTKDHFASEQPAALIQPKELYSDDTTKKMARQSSEKHEYDNP
0 HDPQASVDSGSSVSVDDSSGVSRQDTMMVK 0
AFDEGITEVDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDLAINLGKASLMLSEAHDETVL*  
0 VEVDKRMEKAGGGAADAAPQEGASVSTVSAQIEVRPSGKVTTKADVISTPQTAHGLSASPVPK 0
0 VAELGSSATLESAAIPGKIPTPLPSQPIAAPIERHMAAMADEPPPKPRGVATTVNVRRTESGYDRSQDSQRKK 0
0 VVGDTHRSRSFNTTKDHFASEQPAALIQPKELYSDDTTKKMARQSSEKHEYDNPAFDEGITEVDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDLAINLGKASLMLSEAHDETVL* 0


>MEL1b_Branchiostoma belcheri Amphiop6 melanopsin
>MEL1b_Branchiostoma belcheri Amphiop6 melanopsin

Revision as of 22:20, 6 November 2007

This is a set of hand-curated opsins that serves as a gene family classifier ... just blast unknown sequence against the database below and look for the top hits ...

>RHO1_homSap rhodopsin ciliary all GT-AG chr3:130,730,172 3q22.1 + 6706 5  12 21 00 00
0 MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG 1
2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSR 2
1 YIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQ 0
0 FRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0

>RHO1_monDom Monodelphis opossum 349
0 MNGTEGPNFYVPFSNKTGTVRSPFEEPQYYLADPWQFSCLAAYMFMLIVLGFPINFLTLYVTIQHKKLRTPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTGCNLEGFFATLG 1
2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIIGVAFTWVMALACAFPPLIGWSR 2
1 YIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNFGPIFMTIPAFFAKSSSVYNPVIYIMMNKQ 0
0 FRTCMITTLCCGKNPLGDDEASATASKTETSQVAPA* 0

>RHO1_ornAna rhodopsin rod pigment Ultra20:114,479-118,391
0 MNGTEGQDFYIPMSNKTGVVRSPFEYPQYYLAEPWQYSVLAAYMFMLIMLGFPINFLTLYVTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG 1
2 GEIALWSLVVLAIERYIVVCKPMSNFRFGENHAIMGVAFTWIMALACALPPLVGWSR 2
1 YIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTVPAFFAKSSAIYNPVIYIMMNKQ 0
0 FRNCMLTTICCGKNPLGDDEASATASKTEQSSVSTSQVSPA* 0

>RHO1_galGal RH1 RDP1 rhodopsin rod galGal+ ornAna+ homSap+
0 MNGTEGQDFYVPMSNKTGVVRSPFEYPQYYLAEPWKFSALAAYMFMLILLGFPVNFLTLYVTIQHKKLRTPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTGCYIEGFFATLG 1
2 GEIALWSLVVLAVERYVVVCKPMSNFRFGENHAIMGVAFSWIMAMACAAPPLFGWSR 2
1 YIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKE 0
0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDFGPIFMTIPAFFAKSSAIYNPVIYIVMNKQ 0
0 FRNCMITTLCCGKNPLGDEDTSAGKTETSSVSTSQVSPA* 0

>RHO1_anoCar Anolis lizard
0 MNGTEGQNFYVPMSNKTGVVRNPFEYPQYYLADPWQFSALAAYMFLLILLGFPINFLTLFVTIQHKKLRTPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVGR 2
2 GEMGLWSLVVLAVERYVVICKPMSNFRFGETHALIGVSCTWIMALACAGPPLLGWSR 2
1 YIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKA 0
0 AAAQQQESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDFGPVFMTIPAFFAKSSAIYNPVIYILMNKQ 0
0 FRNCMIMTLCCGKNPLGDEDTSAGTKTETSTVSTSQVSPA* 0

>RHO1_xenTro frog 354
0 MNGTEGPNFYIPMSNKTGVVRSPFDYPQYYLAEPWKYSALAAYMFLLILLGFPINFMTLYVTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG 1
2 GEMALWSLVVLAIERYVVVCKPMANFRFGENHAIMGVVFTWIMALSCAAPPLFGWSR 2
1 YIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKE 0
0 AAAQQQESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDFGPVFMTVPAFFAKSSAIYNPVIYIVLNKQ 0
0 FRNCLITTLCCGKNPFGDEEGSSAASSKTEASSVSSSQVSPA* 0

>RHO1_Protopterus lungfish partial
0                               LAEPWKYSALAAYMFFLILVGFPINFMTLYVTIQHKKLRTPLNYILLNLAIGGLFMVFGGFTTTMYTSMNGYFVFGVTGCNIEGFFATFG 1
2 GILGLWSLVVLAIERYIVVCKPMSNFRFGENHAIMGIVFTWIMTLACAAPPLFGWSR 2
1 YIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKE 0
0 AAAQQQESATTQKAEKEVTRMVYIMVISFLVCWLPYAFVAFYIFTHKGSDFGPVFMTAPAFFAKTAAFYNPVIYIFMNKQ 0
0 FRNCM                                     *0

>RHO1_danRer Danio zebrafish
0 MNGTEGPAFYVPMSNATGVVRSPYEYPQYYLVAPWAYGFVAAYMFFLIITCNIEGFFATLG 1
2 GEMGLWSLVVLAVERYVVICKPMSNFRFGETHALIGVSCTWIMALACAGPPLLGWSGFPVNFLTLYVTIEHKKLRTPLNYILLNLAIADLFMVFGGFTTTMYTSLHGYFVFGRLGCNLEGFFATLG 1
2 GEMGLKSLVVLAIERWMVVCKPVSNFRFGENHAIMGVAFTWVMACSCAVPPLVGWSR 2
1 YIPEGMQCSCGVDYYTRTPGVNNESFVIYMFIVHFFIPLIVIFFCYGRLVCTVKE 0
0 AARQQQESETTQRAEREVTRMVIIMVIAFLICWLPYAGVAWYIFTHQGSEFGPVFMTLPAFFAKTSAVYNPCIYICMNKQ 0
0 FRHCMITTLCCGKNPFEEEEGASTTASKTEASSVSSSSVSPA* 0

>RHO1_Raja erinacea rod opsin intronated using RHO1_Callorhinchus fragment
0 MNGTEGENFYVPMSNKTGVVRSPFDYPQYYLGEPWMFSALAAYMFFLILTGLPVNFLTLFVTIQHKKLRQPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAGCNFEGFFATLG 1
2 GEVGLWCLVVLAIERYMVVCKPMANFRFGSQHAIIGVVFTWIMALSCAGPPLVGWSR 2
1 YIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKE 0
0 AAAQQQESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDFTPFFMTVPAFFAKSSAVYNPLIYILMNKQ 0
0 FRNCMITTICLGKNPFEEEESTSASASKTEASSVSSSQVAPA* 0

>RHO1_lamprey1 petMar Petromyzon marinus
0 MNGTEGENFYIPFSNKTGLARSPFEYPQYYLAEPWKYSVLAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTMCNFEGFFATLG 1
2 GEMSLWSLVVLAIERYIVICKPMGNFRFGSTHAYMGVAFTWFMALSCAAPPLVGWSR 2
1 YLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKE 0
0 AAAAQQESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTVPAFFAKTSALYNPIIYILMNKQ 0
0 FRNCMITTLCCGKNPLGDEDSGASTSKTEVSSVSTSQVSPA* 0

>RHO1_lamprey2 Geotria australis rodRhA
0 MNGTEGQNFYIPFSNKTDVARSPFEYPQYYLAEPWKFSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTMCSIEGFFATLG 1
2 GEVSLWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVALTWVMALSCAAPPLLGWSR 2
1 YLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKE 0
0 AAAAQQESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ 0
0 FRNCMITTLCCGKNPLGDDDSGASTSKTEVSSVSTSQVAPA* 0

>RHO1_lamprey3 Lethenteron japonicum
0 MNGTEGDNFYVPFSNKTGLARSPYEYPQYYLAEPWKYSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTMCSIEGFFATLG 1
2 GEVALWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVAFTWIMALACAAPPLVGWSR 2
1 YIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKE 0
0 AAAAQQESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ 0
0 FRNCMITTLCCGKNPLGDDESGASTSKTEVSSVSTSQVSPA* 0

>RHO2_galGal rhodopsin-like cone galGal+ ornAna- homSap-
0 MNGTEGINFYVPMSNKTGVVRSPFEYPQYYLAEPWKYRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG 1
2 GQVALWSLVVLAIERYIVVCKPMGNFRFSATHAMMGIAFTWVMAFSCAAPPLFGWSR 2
1 YMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVRE 0
0 AAAQQQESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAVPAFFSKSSSLYNPIIYVLMNKQ 0
0 FRNCMITTICCGKNPFGDEDVSSTVSQSKTEVSSVSSSQVSPA* 0

>RHO2_anoCar rhodopsin-like cone anoCar+ HMGA1 GRM4 RHO2 MLN LEMD2 IHPK3
0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLAEPWKYKVVCCYIFFLIFTGLPINILTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG 1
2 GQVALWSLVVLAIERYIVVCKPMGNFRFSATHALMGISFTWFMSFSCAAPPLLGWSR 2
1 YIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVRE 0
0 AAAQQQESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDFSATLMSVPAFFSKSSSLYNPIIYVLMNKQ 0
0 FRNCMITTICCGKNPFGDEDVSSSVSQSKTEVSSVSSSQVSPA* 0

>RHO2_Gekko gekko 467 Cone visual pigments are present in gecko rod cells
0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLADPWKFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKKLRQPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIGCAIEGFFATIG 1
2 GQVALWSLVVLAIERYIVICKPMGNFRFSATHAIMGIAFTWFMALACAGPPLFGWSR 2
1 FIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVRE 0
0 AAAQQQESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAFSVTFMTIPAFFSKSSSIYNPIIYVLLNKQ 0
0 FRNCMVTTICCGKNPFGDEDVSSSVSQSKTEVSSVSSSQVAPA* 0

>RHO2_Latimeria chalumnae (coelacanth) RH2 opsin
0 MNGTEGMNFYVPLSNRTGLVRSPFEYTQYYLAEPWKFSVLCAYMFLLIILGFPINFLTLLVTFKHKKLRQPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMGCAMEGFFATLG 1
2 GQVALWSLVVLAIERYIVVCKPMGNFRFASSHAIMGIAFTWIMALACAAPPLVGWSR 2
1 YIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKE 0
0 AAAQQQESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEFTATLMTVPAFFSKSSCLFNPIIYVLLNKQ 0
0 FRNCMITTLCCGKNPLGDDDTSSAVSQSKTDVSSVSSSQVSPA* 0

>RHO2_Geotria australis lamprey rod RhB  lamprey
0 MNGTEGANFYIPFHNRTGVVRSPYEYPQYYLADPWMYSAISAYVFTLILIGFPVNFMTLFVTFKLKKLRQPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTGCNIEGFFATLG 1
2 GEVSLWSLVMLAIERYIVVCKPMGNFRFATTHAALGVVFTWVMASACAVPPLVGWSR 2
1 YIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKE 0
0 AAAQQQESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILFSATAMTVPAFFSKSSVLYNPIIYVLLNKQ 0
0 FRTCMVTTLFCGKNPFGEDDSSMVSTSKTEVSSVSSSQVSPS* 0

>SWS1_homSap   5 exons all GT-AG Blue opsin chr7:128,199,783-128,203,087 7q32.1 - 3305 5 348 (tritanopia) [MIM:190900].
0 MRKMSEEEFYLFKNISSVGPWDGPQYHIAPVWAFYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHVCALEGFLGTVA 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFSSKHALTVVLATWTIGIGVSIPPFFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGLDLRLVTIPSFFSKSACIYNPIIYCFMNKQ 0
0 FQACIMKMVCGKAMTDESDTCSSQKTEVSTVSSTQVGPN* 0

>SWS1_macDom Monodelphis
0 MSGDEEFYLFKNISSVGPWDGPQYHIAPAWAFHFQTVFMGFVFCAGTPLNAVVLVATLRYKKLRQPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHVCAMEAFLGSVA 1
2 GLVTGWSLAFLAFERFIVICKPFGNFRFNSKHAMMVVLATWVIGIGVSIPPFFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRA 0
0 VAAQQQESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0
0 FHACIMEMVCRKPMTDDSDVSSSQKTEVSAVSSSQVGPT* 0

>SWS1_galGal violet cone pigment
0 MSSDDDFYLFTNGSVPGPWDGPQYHIAPPWAFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKRLRQPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRVCELEAFVGTHG 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFSSRHALLVVVATWLIGVGVGLPPFFGWSR 2
1 YMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0
0 FRACIMETVCGKPLTDDSDASTSAQRTEVSSVSSSQVGPT* 0

>SWS1_Taeniopygia guttata
0 MDEEEFYLFKNQSSVGPWDGPQYHIAPMWAFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKKLRQPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHMCAFEGFAGATG 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWSR 2
1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGIDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ 0
0 FRACIMETVCGRPMTDDSEVSSSAQRTEVSSVSSSQVGPS* 0

>SWS1_Gekko gecko
0 MSGEEDFYLYANISSVGPFDGPQYHIAPMWAFYFQTAFMGFVFFVGTPLNAIILFAIVKYKKLRQPLNYILVNISAAGFLFCVVAVFTVFISSSQGYFIFGKHICALEAFLGSLA 1
2 GLVTGWSLAFLALERYIVICKPFGNFRFSAKHASLVVAATWFIGIGVSIPPYFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYYSEYYTWFLFVLCFIVPLSIIVFSYSQLLSALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRNHGIDLRMVTIPAFFSKSSCVYNPIIYCFMNKQ 0
0 FRGCILEMVCGKTMAEESEVSSASQKTEVSSVSSSQVGPS* 0

>SWS1_Uta stansburiana
0 MSGEEDFYLFENISSVGPWDGPQYHIAPMWAFYFQTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHICALEAFLGSVA 1
2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSKHALLVVAATWFIGIGVSIPPFFGWSR 2
1 FIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA 0
0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0
0 FRACIMETVCGKPMTDESDVSSSAQKTEVSSVSSSQVSPS* 0

>SWS1_Xenopus laevis
0 MLEEEDFYLFKNVSNVSPFDGPQYHIAPKWAFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKKLRQPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIACSIDAFVGTLT 1
2 GLVTGWSLAFLAFERYIVICKPMGNFNFSSSHALAVVICTWIIGIVVSVPPFLGWSR 2
1 YMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRA 0
0 VAAQQQESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGLDLRLVTIPAFFSKSSCVYNPIIYSFMNKQ 0
0 FRGCIMETVCGRPMSDDSSVSSTSQRTEVSTVSSSQVSPA* 0

>SWS1_Danio rerio
0 MDAWAVQFGNASKVSPFEGEQYHIAPKWAFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKKLRQPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTLCSMEAAMGSIA 1
2 GLVTGWSLAVLAFERYVVICKPFGSFKFGQGQAVGAVVFTWIIGTACATPPFFGWSR 2
1 YIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRA 0
0 VAAQQAESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNKDYRLVAIPAFFSKSSSVYNPLIYAFMNKQ 0
0 FNACIMETVFGKKIDESSEVSSKTETSSVSA* 0

>SWS1_Oryzias latipes
0 MGKYFYLYENISKVGPYDGPQYYLAPTWAFYLQAAFMGFVFFVGTPLNFVVLLATAKYKKLRVPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTLCALEAAVGAVA 1
2 GLVTSWSLAVLSFERYLVICKPFGAFKFGSNHALAAVIFTWFMGVGCACPPFFGWSR 2
1 YIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRA 0
0 VAAQQAESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENKDYRLVTIPAFFSKSSCVYNPLIYAFMNKQ 0
0 FNGCIMEMVFGKKMEEASEVSSKTEVSTDS*0

>SWS1_Geotria australis lamprey UV-sensitive retinal pigment lamprey
0 MSGDEEFYLFKNISKVGPWDGPQFHIAPKWAFYLQAAFMGFVFICGTPLNAIVLVVTIKYKKLRQPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTICALEAFFGSLA 1
2 GLVTGWSLAFLAAERYIVICKPFGNFRFGSKHALVAVGLTWMLGLSVALPPFFGWSR 2
1 YIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA 0
0 VAAQQQESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ 0
0 FRACILETVCGKPITDESETSSSRTEVSSVSTTQMIPG* 0

>SWS2_ornAna blue cone bp 2 gene tandem -FLNB--+MECP2 with MWS1 5 usual exons
0 MHKTHRNLQNELPEDFFIPLPLDTDNITSLSPFLVPQTHLGGSGIFMSLAAFMFLLITLGFPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACKIEGFAATLG 1
2 GMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRA 0
0 VAKQQEQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIPSVFSKASTIYNPIIYVFMNKQ 0
0 FRSCMLKLVFCGKSPFGDEDEISGSSQATQVSSVSSSQVSPA* 0

>SWS2_galGal blue cone pigment galGal- ornAna+ homSap+ 
0 MHPPRPTTDLPEDFYIPMALDAPNITALSPFLVPQTHLGSPGLFRAMAAFMFLLIALGVPINTLTIFCTARFRKLRSHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTACKIEGFAATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCVATWVLGFVASAPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRA 0
0 VARQQEQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSFEVGLASIPSVFSKSSTVYNPVIYVLMNKQ 0
0 FRSCMLKLLFCGRSPFGDDEDVSGSSQATQVSSVSSSHVAPA* 0

>SWS2_Taeniopygia
0 MPKPREMRDELPEDFYIPMSLETPNLTALSPFLVPQTHLGSPGIFKAMAAFMFLLVLLGVPINALTVLCTAKYKKLRSHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLACKIEGFTATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCAITWIFGLIASLPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRA 0
0 VAKQQEQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPFDLGLASIPSVFSKASTVYNPIIYVFMNKQ 0
0 FRSCMLKLVFCGRSPFGDEDDVSGSSQATQVSSVSSSQVSPA* 0 

>SWS2_utaSta
0 MHNSRPHSRDDLPEDFFIPMPLDVANITTLSPFLVPQTHLGSPALFMGMAAFMFLLIILGVPINVLTIFCTFKYKKLRSHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTACKIEGFAATLG 1
2 GMVSLWSLAVVAFERFLVICKPLGNFSFRGTHAIIGCIITWVFGLVASLPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRA 0
0 VAKQQEQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPFDVRLATIPSVFSKASSVYNPVIYVFMNKQ 0
0 FRSCMLKLVFCGKSPFGDEDDVSGSSQTTQVSSVSSSQVSPA* 0

>SWS2_Xenopus
0 MSKGRPDLRMEMPDEFYVPIPLETTNISSLSPFLVPQTHLGTPGIFMSISAFMLFTIIFGFPLNLLTIICTVKYKKLRSHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLACKIEGFTATLG 1
2 GIIGLWSLAVVAFERFLVICKPMGNFTFRESHAVLGCILTWVIGLVAAIPPLLGWSR 2
1 YIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHA 0
0 VAKQQEQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELFDLRMSSVPSVFSKASTVYNPFIYIFMNRQ 0
0 FRSCMMKMIFCGKNPLGDDEETSVSGSTQVSSVSSSQIAPS* 0 

>SWS2_danRer
0 MKQQQQTPELFEDFHMPITLDVSNISAYSPFLVPQDHLGHSGVFMGMSAFMLFLFIAGTAINVLTIVCTIQYKKLRSHLNYILVNLAISNLWVSVFGSSVAFYAFYKKYFVFGPIGCKIEGFTSTIG 1
2 GMVSLWSLAVVALERWLVICKPLGNFTFKTPHAIAGCILPWCMALAAGLPPLLGWSR 2
1 YIPEGLQCSCGPDWYTTNNKFNNESYVMFLFCFCFAVPFSTIVFCYGQLLITLKL 0
0 AAKAQADSASTQKAEREVTKMVVVMVFGFLICWGPYAIFAIWVVSNRGAPFDLRLATIPSCLCKASTVYNPVIYVLMNKQ 0
0 FRSCMMKMVFNKNIEEDEASSSSQVTQVSSVAPEK* 0 

>SWS2_Takifugu
0 MRGVRQHEFQEDFYIPIPLDVDNITALSPFLVPQDHLGSPAVFYGMSAFMFFLFVAGTGINVLTIACTIQYKKLRSHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLGCKIEGFAATLG 1
2 GMVSLWSLAVVAFERWLVVCKPLGNFIFKPDHAIVCCIFTWFFALIISAPPLFGWSR 2
1 YIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLKS 0
0 AKAQAESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPFDLRLATIPACFSKASTVYNPIIYVVLNKQ 0
0 FRSCMKKMLGMSGGDDEESSSQSVTEVSKVSPS* 0 

>SWS2_Geotria australis lamprey blue-sensitive retinal pigment lamprey
0 MYQGKSTQVDDLPEDFYIPIALNVKNMSELSPFLVPQVHLGDSFIFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKKLRSHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLFCKMEGFTATLG 1
2 GMLSLWSLAVLAFERCLVICKPFGNIAFRGTHALIRCGFAWAAAIAASTPPLFGWSR 2
1 YIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRA 0
0 AAAQQQESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPFDLRLATIPSVFSKASTVYNPVIYIFLNKQ 0
0 FRSCMMKTIFCGKNPLGDDEDATSTTTQVSSVSTSQVAPA* 0

>LWS_ornAna LWS green cone -Ultra401:228,567-238,486
0 MTPAWNSGVYAARRRFEDEEDTTRTSVFVYTNSNNTR 1
2 DPFEGPNYHIAPRWAYNVTSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPMCVLEGYTVSLC 1
2 GITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRA 0
0 VAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSELSSTSRTEVSSVSSVSPA* 0

>LWS_galGal green iodopsin red cone long-wave-sensitive missing in assembly
0 MAAWEAAFAARRRHEEEDTTRDSVFTYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNLTSLWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPMCVVEGYTVSAC 1
2 GITALWSLAIISWERWFVVCKPFGNIKFDGKLAVAGILFSWLWSCAWTAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRA 0
0 VAAQQKESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCILQLFGKKVDDGSEVSTSRTEVSSVSNSSVSPA* 0

>LWS_anoCar Anolis
0 MAGTVTEAWDVAVFAARRRNDEDDTTRDSLFTYTNSNNTR 1
2 GPFEGPNYHIAPRWVYNITSVWMIFVVIASIFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPMCVLEGYTVSTC 1
2 GISALWSLAVISWERWVVVCKPFGNVKFDAKLAVAGIVFSWVWSAVWTAPPVFGWSR 2
1 YWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRA 0
0 VAAQQKESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSELSSTSRTEVSSVSNSSVSPA* 0

>LWS_Lithochromis sp
0 MAEEWGKQSFAARRYHEDSTRGSAFAYTNSNNTR 1
2 DPFEGPNYHIAPRWIYNLATLWMFVVVVLSVFTNGLVLVATMKFKKLRHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCIFEGYVVSVC 1
2 GIAALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWVWAAVWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMLTCCIFPLAIIILCYLAVWMAIRA 0
0 VAMQQKESESTQKAEREVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPIIYVFMNRQ 0
0 FRTCIMQLFGKQVDDGSEVSTSKTEVSSVAPA* 0

>LWS_Gasterosteus aculeatus
0 MAEEWGKQAFAARRYNEDTTRGSMFVYTNSNNTK 1
2 DPFEGPNYHIAPRWVYNLSTLWMFIVVALSVFTNGLVLVATAKFKKLQHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYVVSVC 1
2 GITALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWIWSAVWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRA 0
0 VAMQQKESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ 0
0 FRSCIMQLFGKEVDDGSEVSTsKTEVSSVAPA* 0

>LWS_Petromyzon maritimus lamprey traces key to intron 3 position and gapping
0 MTASWQGAMFAARRRQDDEDTTMESLFRYTNENNTK 1
2 DPFEGPNYHIAPRWVFNLTSVWMIIVVVLSLFSNGLVLVATVKFKKLRHPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPMCVFEGYVVSTC 1
2 GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIATILIVFSWVWPASWCSLPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHS 0
0 VAQQQKESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSFHPIAAALPAYFAKGATIYNPIIYVFMNRQ 0
0 FRNCILQLFGKKVDDGSEVSSSSRTEVSSVSNSSVSPA* 0

>LWS_lamprey Lethenteron japonicum
0 MTASWHGAVFAARRRNDDEDTTKDSIFRYTNENNTR 1
2 DPFEGPNYHIAPRWMFNLTSVWMIIVVVLSLFTNGLVLVATMKFKKLRHPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPMCVFEGYVVSTC 1
2 GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIAIILIVFSWVWPACWCSLPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHS 0
0 VAQQQKESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAFHPLTAALPAYFAKSATIYNPVIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSEVSSASRTEVSSVSNSSISPA*

>LWS_Geotria australis lamprey red-sensitive retinal pigment 
0 MAQSWERAMFAARRRQDEDTTKGDLFRYTNENNTR 1
2 DPFEGPNYHIAPRWMYNLTSFWMIIVVILSLFTNGLVLVATLKFKKLRHPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPLCVFEGFTVSVC 1
2 GITALWSLAIISFERWMVVCKPFGNLKFDGKVAIVLIIFSWAWSAGWCAPPIFGWSR 2
1 YWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHT 0
0 VAQQQKESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0
0 FRNCIMQLFGKKVDDGSEVSSSARTEVSSVSNSSVSPA* 0

>PIN_galGal pinopsin pineal pigment non-visual homSap- RPH3AL DOC2B PIN TEX14 RAD51C
0 MSSNSSQAPPNGTPGPFDGPQWPYQAPQSTYVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRMCELEGFMVSLT 1
2 GIVGLWSLAILALERYVVVCRPLGDFQFQRRHAVSGCAFTWGWALLWSTPPLLGWSSYVPE 1
2 GLRTSCGPNWYTGGSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRA 0
0 AAAQQKEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIIIQPVLASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FQSCLLEMLCCGYQPQRTGKASPGTPGPHADVTAAGLRNKVMPAHPV* 0

>PIN_Uta stansburiana pinopsin missing in Anole genome
0 MVNEWSNATPGPFDGPQWPYLAPRSIYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKKLRSPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTACEFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPVGDFRFQQRHAVFGCVFTWMWSLVWTLPPLFGWSSYVPE 1
2 GLRTSCGPNWYTGGSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRA 0
0 VATQQKEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVIQPALASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRSCLLSTMSCGHRPRGAQETTPAMISIPQGPTSALQGSRNKVTPSASEGSGNEAIPS* 0

>PIN_Phelsuma madagascariensis gekko
0 MHVQMANASQASLKNGTLSPFDGPQWPHRASRRVYTSLAALMGVVVLSASLANGLVIAVSVRFKRLRSPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTACRFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPVGDFQFQRRHAVIGCLYTWGWSLIWTVPPLFGWSSYVPE 1
2 GLGTSCGPNWYMGGTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRA 0
0 VAAQQKEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSIQPGLASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRSCLLNTVSCGRIPQTMPGTPATTAVRGGFVLTSEGRGNKVASTELHS* 0

>PIN_Podarcis sicula pinopsin mRNA ruin lizard
0 MQASNASWVEVRNRTPGPFEGPQWPYLAPQSTYISVAVLMGLVVISATLVNGLVIVVSVQFKKLRSPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQATCEFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPVGDFRFPARHAVLGCAFTWGWSFVWTVPPLLGWSSYVPE 1
2 GLRTSCGPNWYSGGSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRT 0
0 VAAQQKEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAIRPALASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRSCLLYKMSCGHRALSSQDTTPAGISLPGRLTTSASKGSRNQVSPS* 0

>PIN_xenTro
0 MRAGNMSAYEAPGPYDGPQWPHLAPRSTFLTVAAVMCMVVILAFFVNGLVIVVTLKYKKLRSPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTMCEFEGFMVSLT 1
2 GIVGLWSLAILAFERYLVICKPMGDFRFQQKHAILGCSFTWVWSFIWTSPPLFGWCSYVPE 1
2 GLRTSCGPNWYTGGTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRA 0
0 VAAQQKDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVIEPTVASLPSYFSKTATVYNPIIYVFMNKQ 0
0 FRNCLMTLLCCGRSFGDDETSSASGRTDVTSVSEAGGNKVTPA* 0

>PIN_Bufo japonicus toad classifies somewhat oddly
0 MHSANMSALETPGPFEGPQWPHVAPRSTYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKKLRSPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLVCELEGFVVSLT 1
2 GIVGLWSLAILAFERYIVICKPMGDFRFQQRHAVMGCAFTWIWAFLWTSPPLIGWCSYVPE 1
2 GLGTSCGPNWYTGGTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRA 0
0 VAAQQKESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVIDPTLASMPSYFSKTATVYNPVIYVFMNKQ 0
0 FRDCLTKLLCCGRNPFGEDETSTTSGRTDVTSVSEGGGNKVTPA* 0

>VAOP_galGal Gallus gallus exon 1 genbank error + too short + frameshift INPP5A NKX6 VAOP ALDH18A stubbed in terminal finch
0 MDVFRALGNESLLSNSSGPARWDPFHHPLDSIQPWHFRLVAAVMFVVTSLSLAENLAVILVTFKFKQLRQPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWACVLEGFAVTFF 1
2 GIVALWSLALLAFERYIVICRPVGNMRLRGKHAAQGIAFVWTFSFIWTIPPTMGWSSYTTSKIGTTCEPNW 2
1 YSGAYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRK 0
0 VSNTQGRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIELDPHLAAIPAFFSKTATVYNPIIYVFMNKQ 0
0 FRMCLIQMFKCSAIETAESNMNPTSERATLTQDKRDSQLSVMAVRSTILKRKTGDEHRADDLWLFRQLQKPKCVPCRAGDGS* 0

>VAOP_anoCar Anole INPP5A NKX6 VAOP new
0 MAGLRREAENDSWLFDPSSSSAPFDPFLQPLDIIEPWNFHLISALMFVVTLFSLSENFTVILVTIKFKQLRQPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWACVLEGFAVTFF 1
2 GIVALWSLALLAFERYVVICRPLGNMRLNGKHAALGVAFVWIFSFIWTVPPTMGWSSYTTSKIGTTCEPNW 2
1 YSGDYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRK 0
0 VSDTQGRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIELDPRLAAIPAFFSKTATVYNPVIYVFMNNQ 0
0 FRKCLVQLFQCSSQETMDANVNPISEKDTLTHTKHCGEMSTVAAHVIVFNPRSEDEQGSCQSFAQLAISENKVYPL* 0

>VAOP_xenTro frog new lost synteny GSTO1 VAOP C10ORF92
0 MPTNVSLLATPENSTVWNPFTGPLKTIEAWNFHLLAALMFVVTSLSIAENFIVILVTAKFKQLRQPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWACVLEGFAVTFF 1
2 GIVALWSLSVLAFERYIVICRPLGNLRLQGKHSALAIIFVWVFSFVWTIPPTMGWSSYTTSKIGTTCEPNW 2
1 YSGEMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRK 0
0 VSDTQGRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDLDPRLAAIPAFFAKTASMYNPIIYVYMNKQ 0
0 FRRCLYQMFNINDPEAKESNLNPTSERGVLTRNNNGGEMLAIATHITSSAVTNREEEKSSSNSFAHIPVSDNKVCPM*

>VAOP_Danio rerio vertebrate ancient long opsin assembly missing exon 3 no synteny
0 MEASSAAVNAVSPAEDPFSAPLSSIAPWNYSVLAALMFVVTALSLSENFTVMLVTFRFQQLRQPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWACVLEGFAVTFF 1
2 GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLVFVWSFSFIWTVPPVLGWSSYTVSRIGTTCEPNW 2
1 YSGNFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRK 0
0 VSNTHGRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHVDPRLAAIPAFVAKTAAVYNPIIYVFMNKQ 0
0 FRKCLVQLLSCSKVTVVEGNNNQTTERAGMTSGSNTGEMSAIAARVSVPKTEENPGDRSTFSHIPIPENKVCPM*

>VAOP_Rutilus rutilus (roach minnow) vertebrate ancient opsin
0 MELFPVAVNGVSHAEDPFSGPLTFIAPWNYKVLATLMFVVTAASLSENFAVMLVTFRFTQLRKPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWACVLEGFAVTYF 1
2 GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLLFVWTFSFIWTIPPVLGWSSYTVSKIGTTCEPNW 2
1 YSGNFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRK 0
0 VSNTHGRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHLDPRLAAAPAFFSKTAAVYNPVIYVFMNKQ 0
0 FRKCLVQLLRCRDVTIIEGNINQTSERQGMTNESHTGEMSTIASRIPKDGSIPEKTQEHPGERRSLAHIPIPENKVCPM* 0

>VAOP_Petromyzon marinus 5 exons validated #123 in traces  pineal gland-specific opsin AH006524
0 MDALQESPPSHHSLPSALPSATGGNGTVATMHNPFERPLEGIAPWNFTMLAALMGTITALSLGENFAVIVVTARFRQLRQPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHACVLEGFAVTYF 1
2 GVVALWSLALLAFERYFVICRPLGNFRLQSKHAVLGLAVVWVFSLACTLPPVLGWSSYRPSMIGTTCEPNW 2
1 YSGELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKK 0
0 ASETQRGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHLDPLLAAVPAFFSKTATVYNPVIYIFMNKQ 0
0 FRDCFVQVLPCKGLKKVSATQTAGAQDTEHTASVNTQSPGNRHNIALAAGSLRFTGAVAPSPATGVVEPTMSAAGSMGAPPNKSTAPCQQQGQQQQQQGTPIPAITHVQPLLTHSESVSKICPV* 0

>PPIN_anaCar Anoles parapinopsin new ACTR8 SELK PPIN CACNA2D3 PPIN syntenic region deleted in chicken VL 00 FNQVAKVGLAQRGSTT too many
0 MDSLDTNTLSPNASTVRVVLMPRIGYTIIAIIMATSCTLSVILNTAVIAITIKYRQLRQPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVGCVTEGFAMAFF 1
2 GIVALCTIAVIAVDRAIVIAKPMGTITFTTRKAMIGVAVSWIWSLVWNTPPLFGWGGYQMEGVMTSCAPDWANSDPINVSYIICYFLFCFTIPFITILASYGYLIWT 0 
0 VAKVGLAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYINPIIATIPMYMAKSSTFYNPIIYIFMNKQ 0
0 FRDCLVRCLLCGRNPCASEQTDEDDLEVSTIAPAPSSRRGKVAPV* 0

>PPIN_Xenopus tropicalis bistable UV pigment lamprey pineal broken contigs ACTR8 SELK PPIN CACNA2D3 genome only last 2
0 MADEALLPPMMNVTNEEMHPGKVLMPRIGYTILALIMAVFCAAALFLNVTVIVVTFKYRQLRHPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVGCVIEGFAVAFF 1
2 GIAALCTIAVIALDRVFVVCKPMGTLTFTPKQALAGIAASWIWSLIWNTPPLFGWGSYELEGVMTSCAPNWYSADPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWT 0
0 LRQVAKLGVAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHIDPIIATVPMYLTKTSTVYNPIIYIFMNKQ 0
0 FQECVIPFLFCGRNPWAAEKSSSMETSISVTSGTPTKRGQVAPA* 0

>PPIN_Ictalurus punctatus (channel catfish) parapinopsin index sequence
0 MASIILINFSETDTLHLGSVNDHIMPRIGYTILSIIMALSSTFGIILNMVVIIVTVRYKQLRQPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVGCVLEGFAVAFF 1
2 GIAGLCSVAVIAVDRYMVVCRPLGAVMFQTKHALAGVVFSWVWSFIWNTPPLFGWGSYQLEGVMTSCAPNWYRRDPVNVSYILCYFMLCFALPFATIIFSYMHLLHT 0
0 LWQVAKLQVADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYINPVIGTIPAYLAKSSTVFNPIIYIFMNRQ 0
0 FRDYALPCLLCGKNPWAAKEGRDSDTNTLTTTVSKNTSVSPL* 0

>PPIN_Danio rerio parapinopsin [not checked wgd or synteny]
0 MESETSTAASGSIAEVMPRMGYTILAVIIGVFSVCGVILNVTVITVTLKYKQLRQPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVGCVLEGFAVAFF 1
2 GIAALCSVAVIALERCMVVCRPVGSISFQTRHAVFGVAVSWLWSFIWNTPPLFGWGRLQLEGVRTSCAPDWYSRDLANVSFIVCYFLLCFALPFSVIVYSYTRLLWT 0
0 LRQVSRLQVCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYIDPVIATVPMYLTKSSTVFNPIIYIFMNRQ 0
0 FRDRALPFLLCGRNPWAAEAEEEEEETTVSSVSRSTSVSPA* 0

>PPIN_Oncorhynchus mykiss
0 MDHQQLLPNLHGNISSSPGSVSEALLSRTGFTILAVIIGVFSVSGVCMNVLVIMVTMRHRKLRQPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLGCVLEGFAVAFF 1
2 GIAGLCSVAVIAVDRYVVVCRPMGAVMFQTRHAVGGVVLSWVWSFLWNTPPLFGWGSFELEGVRTSCSPNWYSREPGNMSYIILYFLLCFAIPFSIIMVSYARILFT 0
0 LHQVSKLKVLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHINPLIATVPMYLAKSSTVYNPIIYVFMNRQ 0
0 FRDCAVPFLLCGLNPWASEPVGSEADTALSSVSKNPRVSPQ*

>PPIN_Lethenteron japonicum lamprey bistable pineal UV/green aligns perfectly
0 MENLTSLDLLPNGEVPLMPRYGFTILAVIMAVFTIASLVLNSTVVIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF 1
2 GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGIAWAWLWSFVWNTPPLFGWGSYELEGVRTSCAPDWYSRDPANVSYITSYFAFCFAIPFLVIVVAYGRLMWT 0
0 LHQVAKLGMGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ 0
0 FRDCAVPFLLCGRNPWAEPSSESATAASTSATSVTLASAPGQVSPS* 0

>PPINa_Ciona intestinalis parapinopsin 8 mostly odd exons 88  +- 200671 202490 1820 opsin1 larval ocellus
0 MNQWLETIMNERKFEIVNSSLQG 2
1 LLKPDSLGMDHDVTPTVDLTDGVPQCKDLNPYVLKGDGWVPQHISRANRSTYSFLCVYMTFVFLLSCSLNILVIVATLKNK 0
0 VLRQPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTMCQIEGYFVSNFGVTGLL 0
0 SIAVMAFERYFVICKPFGPVRFEEKHSIFGIV 0
0 ITWVWSMFWNTPPLIFWDGYDTEGLGTSCAPNWFVKEKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQ 0
0 IAKESSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQ 0
0 IDYALGAAPAFFAKTATIYNPLIYIGLNRQ 0
0 FRDCVVRMIFNGRNPWVDELVGSQVSSTGSQLTAVSSNKVAPA* 0

>PPINb_Ciona intestinalis 137  +- 67340 71147 3808 new 61% classifies perhaps as PIN jgi gene model wrong both ends
0 MTTAETTTECYEKNPYIRNEMGWVPKHILIAERHIYTILAVYMTFIFLLAVSLNGFVIIATMKNK 0
0 KLRQPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTVCILEGYIVSVA 1
2 GVCGLMSISVMAFERYFVVCKPYGPFTLTNTHAAL 1
2 GIGFTWTWSVLWSTPGLIWLDGYVPEGLGTSCAPNWFSKNK 2
1 SERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQVSLY 0
0 ATRQSSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQ 0
0 LDYGLGAVPVFFAKTANIYNPLIYIGLNKQ 0
0 FRDGVIKMVFRGRNPWAEEMSTQQRQRSTEAGQPIVSNEV* 0

>PPIN2_Ciona savignyi new
46 IPQNYSNGNPYATTDSGWVPEHIEIANRSTYSGLCVFMSFVFVLAVPLNLLVIVATYKNKV 104
106 LRRPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSVCIFEGYVMSVTG 162
149 KISSFKKISLGVCGILSITVMAFERYFVVCKPFGQTNLKWSHAITG 194
187 ENVNAMFSLGIVFTWTWSVIWHTPGLFFWNGYEPEGFGTSCAPNWFSQQKR 235
236 ERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKVS 270
269 VSKKSMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPD 315
317 QLSYGIGSVPAFFAKTATIYNPIIYMGLNKKVIRMYILN 358
348 FRDGVIRMLFKGRNPWLDGRNTTSSTSTRAQ 378 

>PARIE_Uta stansburiana lizard some minor problems shift in counterion
0 MENDSSLATELAEGAIVKPTIFPKAGYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQLRNPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKFCIFQGFAVNYF 1
2 GIVSLWSLTILAYERYNVVCQPLGTLQMSTKRGYQLLGFIWVFCLFWAVVPLFGWSSYGPEGVQTSCSIGWEERSWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG 0
0 LNKKVEQLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKQ 0
0 FRDCAVEFITCGQVVLTSPEEDISTSAIPVEGKGPCKINQVTPV* 0

>PARIE_Anole carolinensis KERA DCN BTG1 PARIE EEA1 NUDT4
0 MENESSLVLEGAEGYIVRPTIFPRAGYGVLAFLMFINALFSLFNNFLVIAVTLKNPQLRNPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRFCIFQGFAVNYF 1
2 GIVSLWSLTILAYERYNVVCQPLGTLQMSTQRAYQLLGFIWVFCLFWAVVPLFGWSSYGPEGVQTSCSIGWEERSWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG 0
0 LNKKVEQLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKE 0
0 FRECAVEFITCGKVVLTSPEEDISTSAISDEGIAPCKINQVTPV* 0

>PARIE_Xenopus tropicalis parietopsin
0 MDGNSTTPGIAVNLTVMPTIFPRSGYSILSFLMFLNAVFSICNNAIVILVTLKHPQLRNPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQFCIFQGFAVNYF 1
2 GIVSLWSLTLLAYERYNVVCEPIGALKLSTKRGYQGLVFIWLFCLFWAIAPLFGWSSYGPEGVQTSCSIGWEERSWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQ 0
0 LNRKIEQQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYISPLAATLPTYFAKTSPVYNPIIYIFLNKQ 0
0 FRTYAVQCLTCGHINLDSLEEDTESVSAQAENMLTPKTNQVAPA* 0

>ENCEPH_braBel Branchiostoma belcheri Amphiop5 early dup of ENCEPH extra Nfrag in mrna
0 MLGIYNVVNATEYGNNTTFAAWDFKRNGTGGEEEVEFFGYDAVAGVIAIIGVVGFVSNGAVVVLFLKFPQLRTPFNLLLLNMAVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF 1
2 GLVSLISLAVISFLRYRMVVKPKGPGSSYLTYTKVGLAILFIYLYCLLWTTLPIAGWSSYQLEGPKIGCSVAWEEHSWSNTSYIVVLFITCLFAPLLIIVYSYYRLWHKVKQ 0
0 GSRNLPAAMRKSSQKEQKIAMMVIVMITCFMVCWLPYGAMALVVTFGGERLISHTAAVVPSLLAKSSTCYNPVVYFAMNSQ 0
0 FRRYFQDLLCCGRRLFDVSQSVVTGNTAMPRNNSQGFRKDDSDQKQDNGLPKQSEGPMCDHSSNESQMEGSRHNTAASQQWIEMQTIAVVVKAVEVDTSAANEP* 0

>ENCEPH_homSap encephalopsin OPN3 panopsin encephalopsin 1q43 - 47250 4 chr1:239,823,075-239,870,324 402 cone opsin with intron loss
0 MYSGNRSGGHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLALLLGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF 1
2 GIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDVHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRM 0
0 LRCVEDLQTIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRK 0
0 FRRSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESLSVDDSDKTNGSKVDVIQVRPL* 0

>ENCEPH_monDom encephalopsin_Monodelphis domestica 82% funny business with extra intron, alt splicing
0 MYSDNSSDDGGGGYWGSGRAGGASGTGVTGEPGPEGSPRQAPLFSPGTYELLALLIATIGLLGLCNNLLVLVLYYKFQRLRTPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVGCAWDGFSNTLF 1
2 GIVSIMTLTVLAYERYNRIVHAKVINFSWAWRAITYIWLYSLVWTGAPLLGWNRYTLEIHGLGCSVDWKSKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRM
0 LRCVEELQTIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLVTPTVAIIASLFAKSSTAYNPIIYIFMSRK 0
0 FRRCLLQLLCFRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVTFSSSSIIFIITSDETQMIDENDKNSGTKVNVIQVRPL* 0

>ENCEPH_galGal new
0 MHSGNGTGATSRPQLAAAGHEVPGERPLFSAGTYELLALLIATIGTLGVCNNLLVLVLYYKFKRLRTPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAGCVWDGFSNSLF 1
2 GIVSIMTLTVLAYERYIRVVHAKVIDFSWSWRAITYIWLYSLAWTGAPLLGWNRYTLEIHGLGCSMDWKSKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRM 0
0 LRCVEDFQTSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLVTPTVAIIPSFFAKSSTAYNPVIYIFMSRK 0
0 FRQCLLQLLCFRLMRFQRIMKEPSGAGNVKPIRPIVMSQKVGDRPKKKVTFSSSSIIFIIASDDTQQIDDNSKHNGTKVNVIQVKPL* 0

>ENCEPH_anoCar new
0 MFSANGTRSGAGSDLEPGPGQQQQQREASEEEERGAGLSPFSAGTYELLALLVAAIGLLGLCNNLLVLVLYAKFKRLRTPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAGCVWDGFSNSLF 1
2 GIVSIMTLTVLAYERYIRVVHARVIDFSWSWRAITYIWLYSLAWTGAPLLGWNHYTLEIHGLGCSVDWQSKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRM 0
0 LRCVEDLQSIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLITPTVAIIPSFFAKSSTAYNPVIYIFMSRK 0
0 FRRCLVQLFCVQFLRFKRTLKEQPAIESNKPIRPIVMSQKVGDRPKKKVTFSSSSIIFIITSDDTEQIDVSTKCSDTKINVIQVKPL* 0

>ENCEPH_xenTro new called TMT opsin in fish
0 MPVTNGSHNNSISWLHSKDMFTEDTYHFLALIVATVGFLGLVNNLLVLILYCKFKRLQTPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEMCVFHGFSKNLL 1
2 GIVSFGTLTVVAYERYARVVYGKYVNSSWSKRSITFVWVYSLAWTGFPLIGWNLYTFETHKLDCSFEWTATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQK 0
0 LRSVKNIQNFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFITPTITVMPSLLAIASAAYNPVIHIFTIKK 0
0 FRQCLVQLLPPINFHPPINPPINNFWRLLKNLNGRLAMKKVKPVLGKGRSHNRPEKKVPPINFSSSDFFTRTTSDTGTHGITESTKGKRTNVRLIQVHPL* 0

>ENCEPH_braBel Branchiostoma belcheri Amphiop4 pinopsin encephalopsin introns from braFlo 83%
0 MPLYNTSSGPTQGLPWDTPYSQDPIWNDSSPSNSSEDAVVDQGRGELQDFSDAGYTAIATGLALI 12 GLVGSMNNFVVILLIGCHRQLRTPFNLLLLNVSVADLLVSVCGNTLSFASAVQHRWLWGRPGCVWYGFANSLF 1
2 GIVSLVTLSALAFERYCVVVRSSEMLTYKSSLGMIAFIWMYSLLWTSLPLLGWSSYQFEGHS 00 VGCSVNWVKHNVNNVSYIITLMVTCFFVPMVVVCWSYACIWRTVRM 0
0 SAEMKSEFGNPQNTGRLVTTMVVVMIVCFLVCWTPYTVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ 0
0 FREFLLARLRTFCCRQPRMLRVTPMDDNAHARLVGEGPSHAQQVIPSEENGENVEMRKVQGNQLKADSLSTISE* 0

>ENCEPH_braFlo blat from Amphiop4 Un - 207841137 207851157  10021 new exon12 and 34 + perfect fit 43% galGal 41% homSap
0 MALYNNTSSPSQDLLWDAPYSQGHIWDNSSASNSSEDVMDQGKVELQDFSDAGYTAIATCLALI 12 GFVGFTNNFVVILLIGCHRQLRTPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANSLF 1
2 GIVSLVTLSALAFERYCVVVRSSDMLTYKSSLVVITFIWLYSLLWTSLPLLGWSSYQFEGHN 00 VGCSVNWVQHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM 0
0 SSEAKPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ 0
0 FREFLLARLQRVCCRQQAVPRVTPMDDNVHVRLGGEGPSQSQQFLPAGENVENVDMLEYVQENCKPKADSLSTISE* 0

>ENCEPH2_Apis melifera length=321
0 MEHVIYEDQVSPVMYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILWTPVNVILFNLV 0
0 FGDFLVSIFGNPVAMVSAATGGWYWGYKMCLW 2
1 YAWFMSTLGFASIGNLTVMAVERWLLVARPMQALSIR 2
1 HAVILASFVWIYALSLSLPPLFGWGSYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKKVRKRA 1
2 GASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFN 0
0 AKPSATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRTAVPDSQHTALTALNRQEQRK* 0

>ENCEPH1_Anopheles gambiae XM_312503 GPROP11 adjacent head-to-head tandem GPROP12
0 MYDVTDAAAINSDHQELMAPWAYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ 0
0 LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSICVAYGFFMSLL 1
2 GIASITTLTVLSYERFCLISRPFAAQNRSKQGACLAVLFIWSYSFALTSPPLFGWGAYVNEAANI 1
2 SCSVNWESQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK 0
0 NSARVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFWRIRRSNGVAGQPDSNNTNNSNRDKESARHTAKEGL
ECSLDFCHWTVRGTRVSISSAERNVPAPAARERSGGHSVTGSREESRDRHVTLKTMLSVGPRSPSSVAPVAADCSTTDVPTSGDGSVRIVRQDSELSVIHDGGGGGGGSSSRVLVIKSQKPRSNML* 0

>ENCEPH2_Anopheles gambiae XM_312502 GPROP12
0 MNDAPNDVAASAVDYEDLMAPWAYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ 0
0 LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTLCVAYGFFMSLL 1
2 GITSITTLTVLSYERYCLISRPFSSRNLTRRGAFLAIFFIWGYSFALTSPPLFGWGAYVQEAANI 1
2 SCSVNWESQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE 0
0 NSARVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFSRVRNKGQQAAADQNTTTMQRELTKSSRDMVECSF
DFCRKKSRFKISLVKPTAPLAVVDVSSTSHRDKGTSRSPLDQTVLNETNEDVGRERSGGGGGGGAYAGTRFVRPDFELSVINSGKSILIKSKNFRSNLL* 0

>CILL2_Platynereis dumerilii emb|CT030681 htgs 171779 bp new CILL1 ENCEPH PARIE 44% e-65 393 5 exons 1 missing
0 MDDLGFLGNSSVNYTVPLLQEDPLLLRILYFGPTSYVITAIYLCIVGVIGTLSNGVIMYLYFKDKSLRSPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLG 1
2 GLASEMNLFIISVERYLAVVRPFDVGNLTNRRVIAGG 1
2 VFVWLYSLVFAGGPLVGWSSYRPEGLGTWCSISWQDRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE 0
0 AADAQGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGLPIYAEVLPSLFAKSSQVYNPIIYVLMNKP 0
0  0* 0

>CILL1_Platynereis dumerilii lophotrochozoa ciliary opsin mRNA polychaeta unavailable genomically
0 MDGENLTIPNPVTELMDTPINSTYFQNLNAETDGGNHYIYNAFTATDYNICAAYLFFIACLGVSLNVLVLVLFIKDRKLRSPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLG 1
2 GLAALMTLSVIAFVRCLAVLRLGSFTGLTTRMGVAAMA 1
2 FIWIYSLAFTLAPLLGWNHYIPEGLATWCSIDWLSDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK 0
0 VAKTGGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLLHPVATVIPAMFAKSSTMYNPLIYVFMNKQ 0
0 FRRSLKVLLGMGVEDLNSESERATGGTATNQVAAT*

>RGR_homSap	10q23.1	+ 14136 7 chr10:85,994,789-86,008,924 295 exons distinct retinal pigment epithelium and Mueller cells
0 MAETSALPTGFGELEVLAVGMVLLVE 1
2 ALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLALADSGISLNALVAATSSLLR 2
1 VSHRRWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1
2 RSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDR 2
1 NFTSFLFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQ 0
0 VNTTLPARTLLLGWGPYAILYLYAVIADVTSISPKLQM 0
0 VPALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK* 0

>PER_homSap peropsin RRH 4q25 7 chr4:110,968,599-110,985,310 337 + 16712 retinal photoisomerase Retinal pigment epithelium 
0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1
2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0
0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1
2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2
1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0
0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2
1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0

>PERa_Branchiostoma belcheri Amphiop3 peropsin
MDIPTETPYGAEEDIGESAGWRWTETDKNGFHKYDHLIVGLYLFVIGIIGTIENGITLATFSKFRSLRSPTTMLLVHLAI
ADLGICIFGYPFSGASSLRSHWLFGGVGCQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHDLVDKVNYNTYGVMAALGW
LFAAFWAALPLVGWAEYALEPSGTACTINFQKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQASCFVSKVLKGDIAGDLT
FPVAANVDWEYQNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADIPAWLTLLPPLIAKSSALYNPIIYIIANRR FRNAICSMMKGQDPDVEDDEHADEHRVRSIEDNDKEIISMVNLNMTV*

>PERb_Branchiostoma belcheri Amphiop2 peropsin RRH
MIPTNNNTENNDLEWGLEKEHGVSATIMGVYLTIVGLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFG
YPFSASSGYAHQWLFPDAICTLYGFSCFLLSMVSMHTLCLISAHRYITICRPEHASKLTMNRTVLAVIGTWLYAIAVAVP
PLFNIARYTYEPSGLSCTIDFRVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRKFSRHRFRQVRDIRTSHQRSFEMG
VTMRCILMTLFYLLSWTPYTAVCIWTMVGPPPPVVVSMAAALIAKTHCAFNPILYAFMSEVYRKLVFRTMCPCCFNRISC
KFVGTPTGGSKVSANPDIFTVDYNSRDQAVQINKAPSRRFCFVMETSEDLGSDDTGLTGHSGLWRSGAEVEGLGGLQVTQ
SPSVSGSELSLSLLDFLPPKPSGRAVSAKLPSPPALNSERATCPESSQQPSDRPATGLRQYQKGDTTRSSVGDLILTEDD
VTNLPPASETWGRKKSENPLSYRQTTRRTFGRSRKHSYIVD*

>PERc_Branchiostoma belcheri Amphiop1 peropsin RRH 
MNASPSSWLSSGEFFTDSPENSSEWPWTDGPTDTTWRHHQSVDSVSYEGYLASAIYITLTGLIAFFGNVITITVFLTEKE
FRKKQQNGFVLNLAIADLSVCVFAYPSSAIAGYAGRWVLGDVGCTIYGFLCFTFALVSMVTLCVISIYRYILICKPQYAH
LLTHRRTVYVIIGTWLYALVFTVPPLVGVKRYTYEPMQITCSLDWNVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIF
KSANLKFAALASEKTKMAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDILSAEDLPIIATILPSLFAKSSCMMNPII
YACCNTK FRQAAVKSFRKLCGMCKQKVPLSTPQVVLAMQRNTEFTSTVEPTGQAFPMRVLPSISATHTAL*

>NEUR_homSap neuropsin 6p12.3 chr6:47,857,757-47,902,075 7 354
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYK FACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEE 0
0 WE* 0

>MEL1_homSap  melanopsin rhabdomeric OPN4 10q23.2 +  11903 10 chr10 GLUD1 MMRN2 BMPR1A LDB3 OPN4 WAPAL GRID1
0 MNPPSGPRVPPSPTQEPSCMATPAPPSWWDSSQSSISSLGRLPSISPT 0 
0 APGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2
1 SRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGET 1
2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1
2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2
1 ALQTFGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2
1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2
1 VAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTLTSHTSNLSWISIRRRQESLGSESEV 0
0 GWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAKAPPRPQGHEAETPGK 0
0 TKGLIPSQDPRM* 0

>MEL1_smiCra OPN4 Sminthopsis crassicaudata melanopsin (Opn4)
0 MNPSPMLRHLSCPAQDSNCTKIMASISEWNNTEVDAYHLVDLPPITPT 0
0 AVVLPPYSQKVFPTADVPDYAHYTIGATILVVGFTGVLGNLLVIYTFCR 2
1 SRSLRTPANMFIINLAISDFFMSFTQAPVFFASSLYERWIFGEK 1
2 GCEFYAFCGALFGITSMITLMVIALDRYFVITRPLASIGMISKKKTGLILLGVWLYSLAWSLPPFFGW 1
2 SAYVPEGLLTSCSWDYTTFTPSVRAYTILLFCFVFFIPLTVIIYCYIFIFRAIKDTNKAVQNIGSSE
HTPSLRHFQRMKNEWKMAKIALVVILLFVLSWAPYSTVALVAFAGYSHVLTPYMNSVP
AIIAKASAIHNPIIYAISHPKYRMAIAQNFPCLRAVLGIRHPRTQSFSSYRFTHRSTT
ASQASDISWQSRGRRQLSLGSESEAGWNNIETGLTLRSLEGSCGMDEETMDTRELSAS
TKAKGQSWETLAKTLEEMDDLSLLEAGTLLSSLDLQI* 0

>MEL1_galGal OPN4 melanopsin1 genome screwed up
0 MDLPPRAPT 0
0 KMTVKDVRGAFPTVDVPDHAHYTIGTVILIVGITGTLGNFLVIYAFCR 2
1 SRTLQKPANIFIINLAVSDFLMSITQSPVFFTNSLHKRWIFGEK 1
2 GCELYAFCGALFGITSMITLMVIALDRYFVITKPLASVRVMSKKKALIILVGVWLYSLAWSLPPFFGW 1
2 SAYVPEGLLTSCSWDYMTFTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANK 2
1 SVQTFGCKHGNRELQKQYHRMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAG 2
1 YSHVLTPFMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 TAIATYVPCLGFLLRVSPKESRSFSSYPSSRRTTITSQSSETSGLQKGKRRLSSISDSES 0
0 GCTDTETDITSMISRPASSQVSYEMGEDTTQTSDLGGKPKVKSHDSGIFRK 0
0 TVVDADEIPMVEINDTEHSATSTCKTSEKCNVEEIQ 0
0 RSESLSGIGLREGESRHRTSASQIPSIIITYSNVQGVELHSGYSAGFLHPKNKSHKQNKSSNS* 0

>MEL1_xenTro Xenopus tropicalis melanopsin approx
0 MNYQSVRKGITCPPQDANCSRILESLNSWNNSEVNSYKLVELPPIVTTETPQYEIHHVYPT 0
0 VDVPDHVHYVVGAVILAVGITGMLGNFLVIYAFCR 2
1 SRSLRSPANMFIINLAITDFLMSVTQAPVFFATSLHKRWIFGEK 1
2 GCELYAFCGALFGITSMITLMVIAVDRYFVITRPLTSIGVMSKKRAVLILSGVWLYSLAWSLPPFFGWS 1
2 AYVPEGLLTSCTWDYMTFTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNR 2
1 AVQKIGTDNNKESHKQYQKMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAG 2
1 YASILTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2
1 MAIAKYIPCLGSLLRVKRRDSRSYSSYPSSRRSTVTSHCSQSSDVGGHPKLKNHLPSVSDSES 0
0 GWTDTEADSSVNSRPASRQVSYEMGKDTTETNDLKSKAKLKSHDSGIFEKTSMDADD 0
0 ISLVELGTVDRSSP...IMANKHLNGLGQRKGDSFTRRSPSSRIPSIVVTHSNHQGSPAAVRHNSTLPGIKVSNSQDREKELKRQIEKVKQYVPIVTITSDTENSTGGFSNELLPANTS* 0

>MEL_braFlo Branchiostom floridae 12 exons + chrUn:47589027-47597447 size 8421 +tandem dup assembly error
0 MTELPSFQPPTNSTEEENAVFPTALTEWISE 0
0 VGNQVGEAALKLLSGEGDGMEVTPTPGCTGNASVCNGTDSGGGVVWDIPPLAHYIVGTAVFCVGCCGMFGNAVVVYSFIK 2
1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1
2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0
0 VMFAILLLWIWSLVWALPPLFGWSAYVPEGF 1
2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0
0 AQQERQRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLVTPYLQSIPAVFAKSSAVYNPI 1
2 VYAITHPKFRAAVKKHIPCLSGCLPADEEETKTKTRGATTTASMSMTQTTAPTV 0
0 HDPQASVHSGSSVSVDDSSGVSRQDTMMVK 0
0 VEVDNRMEKAGGGAADTAPKDGTSVPTVSAQIEVRPSGNVNTKAEVIPSPQSAAVAHGASASPVPK 0
0 VAELSSSVSLESAAIPGKIPTPLPSQPIAAPIERHMAAMADDPPPKPRGVATTVNVRRSESGYERSQDSLRKK 0
0 AVSETRSRSFNSTKDHFASERQTSTTLNQPRDMYSGDMVKKTRQSPEKQEYDNPAFDAGIAEIDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDMSINLGKASLMLTEAHDETVL* 0

>MEL_braBel Branchiostoma belcheri (Japanese lancelet) AB205400 Amphi-mop
0 MTEIPSFQPPINATEVEEENAVFPTALTEWFSE 0
0 VGNQVGEVALKLLSGEGDGMEVTPTPGCTGNGSVCNGTDSGGVVWDIPPLAHYIVGTAVFCIGCCGMFGNAVVVYSFIK 2
1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1
2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0
0 VMFAILLLWIWSLVWALPPLFGWSAYVSEGF 1
2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0
0 AQQERQRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLVTPYLQSIPAMFAKSSAVYSPI 1
2 VYAITYPKFREAVKKHIPCLSGCLPASEEETKTKTRGQSSASASMSMTQTTAPV 0
0 HDPQASVDSGSSVSVDDSSGVSRQDTMMVK 0
0 VEVDKRMEKAGGGAADAAPQEGASVSTVSAQIEVRPSGKVTTKADVISTPQTAHGLSASPVPK 0
0 VAELGSSATLESAAIPGKIPTPLPSQPIAAPIERHMAAMADEPPPKPRGVATTVNVRRTESGYDRSQDSQRKK 0
0 VVGDTHRSRSFNTTKDHFASEQPAALIQPKELYSDDTTKKMARQSSEKHEYDNPAFDEGITEVDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDLAINLGKASLMLSEAHDETVL* 0

>MEL1b_Branchiostoma belcheri Amphiop6 melanopsin
MSSNLTNVSLVANRTDQTELSPTDVTMQLIFGSMMLVFGLIGVVGNVVALYAFCRTRSLRRPKNYVVANLCLTDMFVCLV
YCPIVVSRSFSHGFPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPLKSLTILTQRKLLVAVLTVWVYSLLLAF
PPLVGWGRYVREETYISCTFDYLSTDDATRAYVITLVMGAFGFPLLTIAYCYIRVFTTARKHAEERKFMSPLKRPESRTE
IKTAVTACVITTSFCLCWCPYAVVATLGISGVSVQQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASED
VVLTSLPGKTRRMKKVERSQSSNSNVVIEVKESSMAYSTSRESCLLSRAATKRLAGKTKSIVDLVDEFGLQETAPHKESLV*

>MEL2_galGal melanopsin2 GRID2 SMARCAD1 PGDS MEL2 SEC24B COL25A1
0 M 0
0 DLNMGTQPHSVTKSEIPDHVLYTVGTCVLVIGSIGIIGNLLVLYAFYS 2
1 NKKLRTPQNFFIMNLAVSDFLMSASQAPICFVNSLHREWILGDI 1
2 GCDLYAFCGALFGITSMMTLLAISVDRYLVITKPLRSIQWTSKKRTIQIIAAVWLYSLGW 1
2 SVAPLLGWSSYVPEGLMISCTWDYVTYSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGR 2
1 DVQKLGSCSRKSFLSQSMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAG 2
1 RGNTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYRK 2
1 TIHNAVPCLRFLIRISKNDLLRGSINESSFRTSLSSHQSLAGRTKNTCVSSVSTGEA 0
0 NWSDVELDTVEPAHEKLQPRRSHSFSSSLRQKRDLLPDSYSCSEETEEK 0
0 VSLSSSYLEKVLGRSAFPSSPVALVTSSLRAASLPVGLNSSSASRGAGSDISQMKTEESHNNGGLDSIVSNTVPQIIIIPTSETNLFQEEPEEEETELFHFHDKKNNLLDLEGLSSSTEFLEAVEKFLS* 0

>MEL2_anoCar melanopsin2 frag GRID2 SMARCAD1 MEL2 PDILM5 BMPR1B UNC5B
0 MGPHHRTKVDVPDHVLYTVGSCVLVIGCIGITGNLLVLYAFYS 2
1 NKRLRTPPNYFIMNLAVSDFLMSATQAPICFLNSMHKEWVLGDI 1
2 GCNLYAFCGALFGITSMITLLAISVDRYCVITKPLQSIKRTSKKRTCIIIVFVWLYSLGWSVCPLFGW 1
2 SSYIPEGLMISCTWDYVTYSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR 2
1 RKSSISHSIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG 2
1 TLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYRK 2

>MEL2_xenLae Xenopus laevis melanopsin (Xmop) GRID2 SMARCAD1 PGDS MEL2 SEC24B COL25A1 21 12 12 21 21 21 00 00
0 MDLGKTVEYGTHRQDAIAQIDVPDQVLYTIGSFILIIGSVGIIGNMLVLYAFYR 2
1 NKKLRTAPNYFIINLAISDFLMSATQAPVCFLSSLHREWILGDI 1
2 GCNVYAFCGALFGITSMMTLLAISINRYIVITKPLQSIQWSSKKRTSQIIVLVWMYSLMWSLAPLLGW 1
2 SSYVPEGLRISCTWDYVTSTMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGR 2
1 NVQKLGSYGRQSFLSQSMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAG 2
1 HGKSLTPYSKTVPAVIAKASAIYNPIIYGIIHPKYRE 2
1 TIHKTVPCLRFLIREPKKDIFESSVRGSIYGRQSASRKKNSFISTVSTAET 0
0 VSSHIWDNTPNGHWDRKSLSQTMSNLCSPLLQDPNSSHTLEQTLTWPDDPSPKEILLPSSLKSVTYPIGLESIVKDEHTNNSCVR
NHRVDKSGGLDWIINATLPRIVIIPTSESNISETKEEHDNNSEEKSKRTEEEEDFFNFHVDTSLLNLEGLNSSTDLYEVVERFLS* 0

>MEL_Platynereis dumerilii mRNA for rhabdomeric opsin runs as melanopsin unavailable genomically
MSRSEVLVPGSMSLDGLLTTAHPIGNDSIETILHPYWQQFDIENTIPDSWHYAVAAWMTFFGILGVSGNLLVVWTFLKTKSLRTAPNMLLVNLAIGDMAFSA
INGFPLLTISSINKRWVWGKLWRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPLGAAQTMTKKRAFIILTIIWANASLWALAPFFGWGAYIPEGFQTSCTYDYLTQDMNNYTYV
LGMYLFGFIFPVAIIFFCYLGIVRAIFAHHAEMMATAKRMGANTGKADADKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIHPLLAEIPVMMAKASARYNPIIYA
LSHPKFRAEIDKHFPWLLCCCKPKPKAQLPSSTTKGSIASKTEADTSV*

Here are those sequences aligned to show rare genomic events such as indels and intron gains and losses:

>RHO1_homSap LAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWS
>RHO1_monDom LAAYMFMLIVLGFPINFLTLYVTIQHKKLRTPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTGCNLEGFFATLG GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIIGVAFTWVMALACAFPPLIGWS
>RHO1_ornAna LAAYMFMLIMLGFPINFLTLYVTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG GEIALWSLVVLAIERYIVVCKPMSNFRFGENHAIMGVAFTWIMALACALPPLVGWS
>RHO1_galGal LAAYMFMLILLGFPVNFLTLYVTIQHKKLRTPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTGCYIEGFFATLG GEIALWSLVVLAVERYVVVCKPMSNFRFGENHAIMGVAFSWIMAMACAAPPLFGWS
>RHO1_anoCar LAAYMFLLILLGFPINFLTLFVTIQHKKLRTPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVGCNIEGFFATLG GEMGLWSLVVLAVERYVVICKPMSNFRFGETHALIGVSCTWIMALACAGPPLLGWS
>RHO1_xenTro LAAYMFLLILLGFPINFMTLYVTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG GEMALWSLVVLAIERYVVVCKPMANFRFGENHAIMGVVFTWIMALSCAAPPLFGWS
>RHO1_danRer VAAYMFFLIITGFPVNFLTLYVTIEHKKLRTPLNYILLNLAIADLFMVFGGFTTTMYTSLHGYFVFGRLGCNLEGFFATLG GEMGLKSLVVLAIERWMVVCKPVSNFRFGENHAIMGVAFTWVMACSCAVPPLVGWS
>RHO1_Raja   LAAYMFFLILTGLPVNFLTLFVTIQHKKLRQPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAGCNFEGFFATLG GEVGLWCLVVLAIERYMVVCKPMANFRFGSQHAIIGVVFTWIMALSCAGPPLVGWS
>RHO1_lamp1  LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTMCNFEGFFATLG GEMSLWSLVVLAIERYIVICKPMGNFRFGSTHAYMGVAFTWFMALSCAAPPLVGWS
>RHO1_lamp2  LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTMCSIEGFFATLG GEVSLWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVALTWVMALSCAAPPLLGWS
>RHO1_lamp3  LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTMCSIEGFFATLG GEVALWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVAFTWIMALACAAPPLVGWS
>RHO2_galGal VCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG GQVALWSLVVLAIERYIVVCKPMGNFRFSATHAMMGIAFTWVMAFSCAAPPLFGWS
>RHO2_anoCar VCCYIFFLIFTGLPINILTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG GQVALWSLVVLAIERYIVVCKPMGNFRFSATHALMGISFTWFMSFSCAAPPLLGWS
>RHO2_Gekko  LSFYMFFLIAAGMPLNGLTLFVTFQHKKLRQPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIGCAIEGFFATIG GQVALWSLVVLAIERYIVICKPMGNFRFSATHAIMGIAFTWFMALACAGPPLFGWS
>RHO2_Lat    LCAYMFLLIILGFPINFLTLLVTFKHKKLRQPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMGCAMEGFFATLG GQVALWSLVVLAIERYIVVCKPMGNFRFASSHAIMGIAFTWIMALACAAPPLVGWS
>RHO2_Geo    ISAYVFTLILIGFPVNFMTLFVTFKLKKLRQPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTGCNIEGFFATLG GEVSLWSLVMLAIERYIVVCKPMGNFRFATTHAALGVVFTWVMASACAVPPLVGWS
>SWS1_homSap QAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHVCALEGFLGTVA GLVTGWSLAFLAFERYIVICKPFGNFRFSSKHALTVVLATWTIGIGVSIPPFFGWS
>SWS1_macDom QTVFMGFVFCAGTPLNAVVLVATLRYKKLRQPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHVCAMEAFLGSVA GLVTGWSLAFLAFERFIVICKPFGNFRFNSKHAMMVVLATWVIGIGVSIPPFFGWS
>SWS1_galGal QTAFMGIVFAVGTPLNAVVLWVTVRYKRLRQPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRVCELEAFVGTHG GLVTGWSLAFLAFERYIVICKPFGNFRFSSRHALLVVVATWLIGVGVGLPPFFGWS
>SWS1_Taenio QTIFMGLVFVAGTPLNAIVLIVTIKYKKLRQPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHMCAFEGFAGATG GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWS
>SWS1_Gekko  QTAFMGFVFFVGTPLNAIILFAIVKYKKLRQPLNYILVNISAAGFLFCVVAVFTVFISSSQGYFIFGKHICALEAFLGSLA GLVTGWSLAFLALERYIVICKPFGNFRFSAKHASLVVAATWFIGIGVSIPPYFGWS
>SWS1_Utasta QTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHICALEAFLGSVA GLVTGWSLAFLAFERYIVICKPFGNFRFNSKHALLVVAATWFIGIGVSIPPFFGWS
>SWS1_Xenlae QAIFMGMVFLIGTPLNFIVLLVTIKYKKLRQPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIACSIDAFVGTLT GLVTGWSLAFLAFERYIVICKPMGNFNFSSSHALAVVICTWIIGIVVSVPPFLGWS
>SWS1_Danio  QAAFMGFVFIVGTPMNGIVLFVTMKYKKLRQPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTLCSMEAAMGSIA GLVTGWSLAVLAFERYVVICKPFGSFKFGQGQAVGAVVFTWIIGTACATPPFFGWS
>SWS1_Oryzia QAAFMGFVFFVGTPLNFVVLLATAKYKKLRVPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTLCALEAAVGAVA GLVTSWSLAVLSFERYLVICKPFGAFKFGSNHALAAVIFTWFMGVGCACPPFFGWS
>SWS1_Geotri QAAFMGFVFICGTPLNAIVLVVTIKYKKLRQPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTICALEAFFGSLA GLVTGWSLAFLAAERYIVICKPFGNFRFGSKHALVAVGLTWMLGLSVALPPFFGWS
>SWS2_ornAna LAAFMFLLITLGFPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACKIEGFAATLG GMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPPLFGWS
>SWS2_galGal MAAFMFLLIALGVPINTLTIFCTARFRKLRSHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTACKIEGFAATLG GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCVATWVLGFVASAPPLFGWS
>SWS2_Taenio MAAFMFLLVLLGVPINALTVLCTAKYKKLRSHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLACKIEGFTATLG GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCAITWIFGLIASLPPLFGWS
>SWS2_Utasta MAAFMFLLIILGVPINVLTIFCTFKYKKLRSHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTACKIEGFAATLG GMVSLWSLAVVAFERFLVICKPLGNFSFRGTHAIIGCIITWVFGLVASLPPLFGWS
>SWS2_Xenopu ISAFMLFTIIFGFPLNLLTIICTVKYKKLRSHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLACKIEGFTATLG GIIGLWSLAVVAFERFLVICKPMGNFTFRESHAVLGCILTWVIGLVAAIPPLLGWS
>SWS2_Danio  MSAFMLFLFIAGTAINVLTIVCTIQYKKLRSHLNYILVNLAISNLWVSVFGSSVAFYAFYKKYFVFGPIGCKIEGFTSTIG GMVSLWSLAVVALERWLVICKPLGNFTFKTPHAIAGCILPWCMALAAGLPPLLGWS
>SWS2_Takifu MSAFMFFLFVAGTGINVLTIACTIQYKKLRSHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLGCKIEGFAATLG GMVSLWSLAVVAFERWLVVCKPLGNFIFKPDHAIVCCIFTWFFALIISAPPLFGWS
>SWS2_Geotri MSAFMLFLVLAGFPLNFLTVFVTIKYKKLRSHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLFCKMEGFTATLG GMLSLWSLAVLAFERCLVICKPFGNIAFRGTHALIRCGFAWAAAIAASTPPLFGWS
>LWS_ornAna  TSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPMCVLEGYTVSLC GITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPIFGWS
>LWS_galGal  TSLWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPMCVVEGYTVSAC GITALWSLAIISWERWFVVCKPFGNIKFDGKLAVAGILFSWLWSCAWTAPPIFGWS
>LWS_anoCar  TSVWMIFVVIASIFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPMCVLEGYTVSTC GISALWSLAVISWERWVVVCKPFGNVKFDAKLAVAGIVFSWVWSAVWTAPPVFGWS
>LWS_Lithoch ATLWMFVVVVLSVFTNGLVLVATMKFKKLRHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCIFEGYVVSVC GIAALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWVWAAVWCAPPIFGWS
>LWS_Gastero STLWMFIVVALSVFTNGLVLVATAKFKKLQHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYVVSVC GITALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWIWSAVWCAPPIFGWS
>LWS_Petrom  TSVWMIIVVVLSLFSNGLVLVATVKFKKLRHPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPMCVFEGYVVSTC GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIATILIVFSWVWPASWCSLPIFGWS
>LWS_lamprey TSVWMIIVVVLSLFTNGLVLVATMKFKKLRHPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPMCVFEGYVVSTC GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIAIILIVFSWVWPACWCSLPIFGWS
>LWS_Geotria TSFWMIIVVILSLFTNGLVLVATLKFKKLRHPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPLCVFEGFTVSVC GITALWSLAIISFERWMVVCKPFGNLKFDGKVAIVLIIFSWAWSAGWCAPPIFGWS
>PIN_galGal  VAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRMCELEGFMVSLT GIVGLWSLAILALERYVVVCRPLGDFQFQRRHAVSGCAFTWGWALLWSTPPLLGWS
>PIN_UtaSta  VAVLMGLVVVSAAFVNGLVIVVSIQYKKLRSPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTACEFEGFMVSLT GIVGLWSLAILAFERYLVICKPVGDFRFQQRHAVFGCVFTWMWSLVWTLPPLFGWS
>PIN_pheMad  LAALMGVVVLSASLANGLVIAVSVRFKRLRSPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTACRFEGFMVSLT GIVGLWSLAILAFERYLVICKPVGDFQFQRRHAVIGCLYTWGWSLIWTVPPLFGWS
>PIN_podSic  VAVLMGLVVISATLVNGLVIVVSVQFKKLRSPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQATCEFEGFMVSLT GIVGLWSLAILAFERYLVICKPVGDFRFPARHAVLGCAFTWGWSFVWTVPPLLGWS
>PIN_xenTro  VAAVMCMVVILAFFVNGLVIVVTLKYKKLRSPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTMCEFEGFMVSLT GIVGLWSLAILAFERYLVICKPMGDFRFQQKHAILGCSFTWVWSFIWTSPPLFGWC 
>PIN_bufJap  VAVLMGMVVFLAFFVNGMVIVVSLKYKKLRSPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLVCELEGFVVSLT GIVGLWSLAILAFERYIVICKPMGDFRFQQRHAVMGCAFTWIWAFLWTSPPLIGWC
>VAOP_galGal VAAVMFVVTSLSLAENLAVILVTFKFKQLRQPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWACVLEGFAVTFF GIVALWSLALLAFERYIVICRPVGNMRLRGKHAAQGIAFVWTFSFIWTIPPTMGWS
>VAOP_anoCar ISALMFVVTLFSLSENFTVILVTIKFKQLRQPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWACVLEGFAVTFF GIVALWSLALLAFERYVVICRPLGNMRLNGKHAALGVAFVWIFSFIWTVPPTMGWS
>VAOP_xenTro LAALMFVVTSLSIAENFIVILVTAKFKQLRQPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWACVLEGFAVTFF GIVALWSLSVLAFERYIVICRPLGNLRLQGKHSALAIIFVWVFSFVWTIPPTMGWS
>VAOP_danRer LAALMFVVTALSLSENFTVMLVTFRFQQLRQPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWACVLEGFAVTFF GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLVFVWSFSFIWTVPPVLGWS
>VAOP_rutRut LATLMFVVTAASLSENFAVMLVTFRFTQLRKPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWACVLEGFAVTYF GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLLFVWTFSFIWTIPPVLGWS
>VAOP_Petro  LAALMGTITALSLGENFAVIVVTARFRQLRQPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHACVLEGFAVTYF GVVALWSLALLAFERYFVICRPLGNFRLQSKHAVLGLAVVWVFSLACTLPPVLGWS
>PPIN_anaCar IAIIMATSCTLSVILNTAVIAITIKYRQLRQPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVGCVTEGFAMAFF GIVALCTIAVIAVDRAIVIAKPMGTITFTTRKAMIGVAVSWIWSLVWNTPPLFGWG
>PPIN_Xenop  LALIMAVFCAAALFLNVTVIVVTFKYRQLRHPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVGCVIEGFAVAFF GIAALCTIAVIALDRVFVVCKPMGTLTFTPKQALAGIAASWIWSLIWNTPPLFGWG
>PPIN_Ictal  LSIIMALSSTFGIILNMVVIIVTVRYKQLRQPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVGCVLEGFAVAFF GIAGLCSVAVIAVDRYMVVCRPLGAVMFQTKHALAGVVFSWVWSFIWNTPPLFGWG
>PPIN_Danio  LAVIIGVFSVCGVILNVTVITVTLKYKQLRQPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVGCVLEGFAVAFF GIAALCSVAVIALERCMVVCRPVGSISFQTRHAVFGVAVSWLWSFIWNTPPLFGWG
>PPIN_Oncor  LAVIIGVFSVSGVCMNVLVIMVTMRHRKLRQPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLGCVLEGFAVAFF GIAGLCSVAVIAVDRYVVVCRPMGAVMFQTRHAVGGVVLSWVWSFLWNTPPLFGWG
>PPINa_Ciona LCVYMTFVFLLSCSLNILVIVATLKNKVLRQPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTMCQIEGYFVSNF GVTGLLSIAVMAFERYFVICKPFGPVRFEEKHSIFGIVITWVWSMFWNTPPLIFWD
>PPINb_Ciona LAVYMTFIFLLAVSLNGFVIIATMKNKKLRQPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTVCILEGYIVSVA GVCGLMSISVMAFERYFVVCKPYGPFTLTNTHAALGIGFTWTWSVLWSTPGLIWLD
>PPIN_lamp   LAVIMAVFTIASLVLNSTVVIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGIAWAWLWSFVWNTPPLFGWG
>PARIE_Utast LAFLMFLNALFSIFNNSLVIAVTLKNPQLRNPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKFCIFQGFAVNYF GIVSLWSLTILA YERYNVV--CQPLGTLQMSTKR GYQLLGFIWVFCLFWAVVPLFGWS
>PARIE_Anole LAFLMFINALFSLFNNFLVIAVTLKNPQLRNPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRFCIFQGFAVNYF GIVSLWSLTILA YERYNVV--CQPLGTLQMSTQR AYQLLGFIWVFCLFWAVVPLFGWS
>PARIE_Xenop LSFLMFLNAVFSICNNAIVILVTLKHPQLRNPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQFCIFQGFAVNYF GIVSLWSLTLLA YERYNVV--CEPIGALKLSTKR GYQGLVFIWLFCLFWAIAPLFGWS
>ENCEPH_braB VAGVIAIIGVVGFVSNGAVVVLFLKFPQLRTPFNLLLLNMAVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF GLVSLISLAVIS FLRYRMVVKPKGPGSSYLTYTK VGLAILFIYLYCLLWTTLPIAGWS
>ENCEPH_homS LALLLGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF GIVSIATLTVLA YERYIRVV-----HARVINFSW AWRAITYIWLYSLAWAGAPLLGWN
>ENCEPH_monD LALLIATIGLLGLCNNLLVLVLYYKFQRLRTPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVGCAWDGFSNTLF GIVSIMTLTVLA YERYNRIV-----HAKVINFSW AWRAITYIWLYSLVWTGAPLLGWN
>ENCEPH_galG LALLIATIGTLGVCNNLLVLVLYYKFKRLRTPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAGCVWDGFSNSLF GIVSIMTLTVLA YERYIRVV-----HAKVIDFSW SWRAITYIWLYSLAWTGAPLLGWN
>ENCEPH_anoC LALLVAAIGLLGLCNNLLVLVLYAKFKRLRTPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAGCVWDGFSNSLF GIVSIMTLTVLA YERYIRVV-----HARVIDFSW SWRAITYIWLYSLAWTGAPLLGWN
>ENCEPH_xenT LALIVATVGFLGLVNNLLVLILYCKFKRLQTPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEMCVFHGFSKNLL GIVSFGTLTVVA YERYARVV-----YGKYVNSSW SKRSITFVWVYSLAWTGFPLIGWN
>ENCEPH_braB IATGLALIGLVGSMNNFVVILLIGCHRQLRTPFNLLLLNVSVADLLVSVCGNTLSFASAVQHRWLWGRPGCVWYGFANSLF GIVSLVTLSALA FERYCVVV----RSSEMLTYKS SLGMIAFIWMYSLLWTSLPLLGWS
>ENCEPH_braF IATCLALIGFVGFTNNFVVILLIGCHRQLRTPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANSLF GIVSLVTLSALA FERYCVVV----RSSDMLTYKS SLVVITFIWLYSLLWTSLPLLGWS
>ENCEPH2_Api AIALGFIGFFGFTANLLVAIVIVKDAQILWTPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKMCLWYAWFMSTL GFASIGNLTVMA VERWLLVA----RPMQALSIRH AVILASFVWIYALSLSLPPLFGWG
>ENCEPH1_Ano AAVTLFFIGFFGFFLNIFVIALMYKDVQLWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSICVAYGFFMSLL GIASITTLTVLS YERFCLIS--RPFAAQNRSKQG ACLAVLFIWSYSFALTSPPLFGWG
>ENCEPH2_Ano SAVTLFFIGFFGFFLNLFVIALMCKDMQLWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTLCVAYGFFMSLL GITSITTLTVLS YERYCLIS--RPFSSRNLTRRG AFLAIFFIWGYSFALTSPPLFGWG
>CILL2_Platy AIYLCIVGVIGTLSNGVIMYLYFKDKSLRSPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLG GLASEMNLFIIS VERYLAVV--RPFDVGNLTNRR VIAGGVFVWLYSLVFAGGPLVGWS
>CILL1_Platy AAYLFFIACLGVSLNVLVLVLFIKDRKLRSPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLG GLAALMTLSVIA FVR CLAV-LRLGSFTGLTTRM GVAAMAFIWIYSLAFTLAPLLGWN
>MEL1_homSap LGTVILLVGLTGMLGNLTVIYTFCRSRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGETGCEFYAFCGALF GISSMITLTAIA LDRYLVIT-RPLATFGVASKRR AAFVLLGVWLYALAWSLPPFFGWS
>MEL1_smiCra IGATILVVGFTGVLGNLLVIYTFCRSRSLRTPANMFIINLAISDFFMSFTQAPVFFASSLYERWIFGEKGCEFYAFCGALF GITSMITLMVIA LDRYFVIT-RPLASIGMISKKK TGLILLGVWLYSLAWSLPPFFGWS
>MEL1_galGal IGTVILIVGITGTLGNFLVIYAFCRSRTLQKPANIFIINLAVSDFLMSITQSPVFFTNSLHKRWIFGEKGCELYAFCGALF GITSMITLMVIA LDRYFVIT-KPLASVRVMSKKK ALIILVGVWLYSLAWSLPPFFGWS
>MEL1_xenTro VGAVILAVGITGMLGNFLVIYAFCRSRSLRSPANMFIINLAITDFLMSVTQAPVFFATSLHKRWIFGEKGCELYAFCGALF GITSMITLMVIA VDRYFVIT-RPLTSIGVMSKKR AVLILSGVWLYSLAWSLPPFFGWS
>MEL1a_Bran  VGTAVFCIGCCGMFGNAVVVYSFIKSKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDFACELYGFAGGLF GCLSINTLMAIS MDRYLVIT-KPFLVMRIVTKQR VMFAILLLWIWSLVWALPPLFGWS
>MEL2_anoCar VGSCVLVIGCIGITGNLLVLYAFYSNKRLRTPPNYFIMNLAVSDFLMSATQAPICFLNSMHKEWVLGDIGCNLYAFCGALF GITSMITLLAIS VDRYCVIT-KPLQSIKRTSKKR TCIIIVFVWLYSLGWSVCPLFGWS
>MEL2_xenLae IGSFILIIGSVGIIGNMLVLYAFYRNKKLRTAPNYFIINLAISDFLMSATQAPVCFLSSLHREWILGDIGCNVYAFCGALF GITSMMTLLAIS INRYIVIT-KPLQSIQWSSKKR TSQIIVLVWMYSLMWSLAPLLGWS
>NEUR_homSap AGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFF GCGSLITMTAVS LDRYLKIC--YLSYGVWLKRKH AYICLAAIWAYASFWTTMPLVGLG
>RGR_homSap  VLLVEALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLALADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVT ALASICSSAAIA WGRYHHYC-----TRSQLAWNS AVSLVLFVWLSSAFWAALPLLGWG
>PER_homSap  VATYLIMAGMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQVYAGLNIFF GMASIGLLTVVA VDRYLTIC--LPDVGRRMTTNT YIGLILGAWINGLFWALMPIIGWA
>PERa_Branc  VGLYLFVIGIIGTIENGITLATFSKFRSLRSPTTMLLVHLAIADLGICIFGYPFSGASSLRSHWLFGGVGCQWYGFNGMFF  

>PPIN2_cioSa LCVFMSFVFVLAVPLNLLVIVATYKNKVLRRPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSVCIFEGYVMSVT  
>MEL_Platy                                                                                     GIFGLMSINTLA WIAIDRFYVITNPLGAAQTMTKKRAFIILTIIWANASLWALAPFFGWG
>MEL1b_Bran  IFGSMMLVFGLIGVVGNVVALYAFCRTRSLRRPKNYVVANLCLTDMFVCLVYCPIVVSRSFSHGFPSKESCIVEGFMVGVG - 
>MEL2_galGal CFVNSLHREWILGDIGCDLYAFCGALFGITSMMTLLAISVDRYLVITKPLRSIQWTSKKRTIQIIAAVWLYSLGWS

amino terminal alignment of imaging opsins:
((((RHO1,RHO2), SWS1),(LWS,SWS2),(PIN,VAOP)
                                     !!!    ***   !!!     **  ***   * *  *
>RHO1_homSap                        MNGTEGPNFYVPFSNATGVVRSPFEYPQY-YLAEPWQFSM LAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG
>RHO1_monDom                        MNGTEGPNFYVPFSNKTGTVRSPFEEPQY-YLADPWQFSC LAAYMFMLIVLGFPINFLTLYVTIQHKKLRTPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTGCNLEGFFATLG
>RHO1_ornAna                        MNGTEGQDFYIPMSNKTGVVRSPFEYPQY-YLAEPWQYSV LAAYMFMLIMLGFPINFLTLYVTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG
>RHO1_galGal                        MNGTEGQDFYVPMSNKTGVVRSPFEYPQY-YLAEPWKFSA LAAYMFMLILLGFPVNFLTLYVTIQHKKLRTPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTGCYIEGFFATLG
>RHO1_anoCar                        MNGTEGQNFYVPMSNKTGVVRNPFEYPQY-YLADPWQFSA LAAYMFLLILLGFPINFLTLFVTIQHKKLRTPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVGCNIEGFFATLG
>RHO1_xenTro                        MNGTEGPNFYIPMSNKTGVVRSPFDYPQY-YLAEPWKYSA LAAYMFLLILLGFPINFMTLYVTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG
>RHO1_danRer                        MNGTEGPAFYVPMSNATGVVRSPYEYPQY-YLVAPWAYGF VAAYMFFLIITGFPVNFLTLYVTIEHKKLRTPLNYILLNLAIADLFMVFGGFTTTMYTSLHGYFVFGRLGCNLEGFFATLG
>RHO1_Raja                          MNGTEGENFYVPMSNKTGVVRSPFDYPQY-YLGEPWMFSA LAAYMFFLILTGLPVNFLTLFVTIQHKKLRQPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAGCNFEGFFATLG
>RHO1_lamp1                         MNGTEGENFYIPFSNKTGLARSPFEYPQY-YLAEPWKYSV LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTMCNFEGFFATLG
>RHO1_lamp2                         MNGTEGQNFYIPFSNKTDVARSPFEYPQY-YLAEPWKFSA LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTMCSIEGFFATLG
>RHO1_lamp3                         MNGTEGDNFYVPFSNKTGLARSPYEYPQY-YLAEPWKYSA LAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTMCSIEGFFATLG share second glyco, 2aa insert
>RHO2_galGal                        MNGTEGINFYVPMSNKTGVVRSPFEYPQY-YLAEPWKYRL VCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG share second glyco, 2aa insert
>RHO2_anoCar                        MNGTEGINFYVPLSNKTGLVRSPFEYPQY-YLAEPWKYKV VCCYIFFLIFTGLPINILTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG
>RHO2_Gekko                         MNGTEGINFYVPLSNKTGLVRSPFEYPQY-YLADPWKFKV LSFYMFFLIAAGMPLNGLTLFVTFQHKKLRQPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIGCAIEGFFATIG
>RHO2_Lat                           MNGTEGMNFYVPLSNRTGLVRSPFEYTQY-YLAEPWKFSV LCAYMFLLIILGFPINFLTLLVTFKHKKLRQPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMGCAMEGFFATLG
>RHO2_Geo                           MNGTEGANFYIPFHNRTGVVRSPYEYPQY-YLADPWMYSA ISAYVFTLILIGFPVNFMTLFVTFKLKKLRQPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTGCNIEGFFATLG
>SWS1_homSap                       MRKMS-EEEFYL-FKNIS--SVGPWDGPQY-HIAPVWAFYL QAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHVCALEGFLGTVA 2aa indel is above
>SWS1_macDom                          MSGDEEFYL-FKNIS--SVGPWDGPQY-HIAPAWAFHF QTVFMGFVFCAGTPLNAVVLVATLRYKKLRQPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHVCAMEAFLGSVA
>SWS1_galGal                          MSSDDDFYL-FTNGS--VPGPWDGPQY-HIAPPWAFYL QTAFMGIVFAVGTPLNAVVLWVTVRYKRLRQPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRVCELEAFVGTHG
>SWS1_Taeniop                         MD-EEEFYL-FKNQS--SVGPWDGPQY-HIAPMWAFYL QTIFMGLVFVAGTPLNAIVLIVTIKYKKLRQPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHMCAFEGFAGATG
>SWS1_Gekko                           MSGEEDFYL-YANIS--SVGPFDGPQY-HIAPMWAFYF QTAFMGFVFFVGTPLNAIILFAIVKYKKLRQPLNYILVNISAAGFLFCVVAVFTVFISSSQGYFIFGKHICALEAFLGSLA
>SWS1_Utasta                          MSGEEDFYL-FENIS--SVGPWDGPQY-HIAPMWAFYF QTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHICALEAFLGSVA
>SWS1_Xenopus                         MLEEEDFYL-FKNVS--NVSPFDGPQY-HIAPKWAFTL QAIFMGMVFLIGTPLNFIVLLVTIKYKKLRQPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIACSIDAFVGTLT
>SWS1_Danio                           MD--AWAVQ-FGNAS--KVSPFEGEQY-HIAPKWAFYL QAAFMGFVFIVGTPMNGIVLFVTMKYKKLRQPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTLCSMEAAMGSIA
>SWS1_Oryzias                         MG--KYFYL-YENIS--KVGPYDGPQY-YLAPTWAFYL QAAFMGFVFFVGTPLNFVVLLATAKYKKLRVPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTLCALEAAVGAVA
>SWS1_Geotri                          MSGDEEFYL-FKNIS--KVGPWDGPQF-HIAPKWAFYL QAAFMGFVFICGTPLNAIVLVVTIKYKKLRQPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTICALEAFFGSLA
>SWS2_ornAna             MHKTHRNLQNELPEDFFIPLPLDTDNIT--SLSPFLVPQT-HLGGSGIFMS LAAFMFLLITLGFPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACKIEGFAATLG
>SWS2_galGal             MH--PPRPTTDLPEDFYIPMALDAPNIT--ALSPFLVPQT-HLGSPGLFRA MAAFMFLLIALGVPINTLTIFCTARFRKLRSHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTACKIEGFAATLG
>SWS2_Taenio              MPKPREMRDELPEDFYIPMSLETPNLT--ALSPFLVPQT-HLGSPGIFKA MAAFMFLLVLLGVPINALTVLCTAKYKKLRSHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLACKIEGFTATLG
>SWS2_Utasta             MHNSRPHSRDDLPEDFFIPMPLDVANIT--TLSPFLVPQT-HLGSPALFMG MAAFMFLLIILGVPINVLTIFCTFKYKKLRSHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTACKIEGFAATLG
>SWS2_Xenopu             MSKGRPDLRMEMPDEFYVPIPLETTNIS--SLSPFLVPQT-HLGTPGIFMS ISAFMLFTIIFGFPLNLLTIICTVKYKKLRSHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLACKIEGFTATLG
>SWS2_Danio                MKQQQQTPELFEDFHMPITLDVSNIS--AYSPFLVPQD-HLGHSGVFMG MSAFMLFLFIAGTAINVLTIVCTIQYKKLRSHLNYILVNLAISNLWVSVFGSSVAFYAFYKKYFVFGPIGCKIEGFTSTIG
>SWS2_Takifu                MRGVRQHEFQEDFYIPIPLDVDNIT--ALSPFLVPQD-HLGSPAVFYG MSAFMFFLFVAGTGINVLTIACTIQYKKLRSHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLGCKIEGFAATLG 
>SWS2_Geotri             MYQGKSTQVDDLPEDFYIPIALNVKNMS--ELSPFLVPQV-HLGDSFIFYG MSAFMLFLVLAGFPLNFLTVFVTIKYKKLRSHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLFCKMEGFTATLG
>LWS_ornAna      MTPAWNSGVYAARRRFEDEEDTTRTSVFVYTNSNNT---RDPFEGPNY-HIAPRWAYNV TSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPMCVLEGYTVSLC 1 aa indel
>LWS_galGal      MAAWEA--AFAARRR-HEEEDTTRDSVFTYTNSNNT---RGPFEGPNY-HIAPRWVYNL TSLWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPMCVVEGYTVSAC
>LWS_anoCar  MAGTVTEAWDVAVFAARRR-NDEDDTTRDSLFTYTNSNNT---RGPFEGPNY-HIAPRWVYNI TSVWMIFVVIASIFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPMCVLEGYTVSTC
>LWS_Lithoch     MAEEWGKQSFAARRY---HEDSTRGSAFAYTNSNNT---RDPFEGPNY-HIAPRWIYNL ATLWMFVVVVLSVFTNGLVLVATMKFKKLRHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCIFEGYVVSVC
>LWS_Gastero     MAEEWGKQAFAARRY---NEDTTRGSMFVYTNSNNT---KDPFEGPNY-HIAPRWVYNL STLWMFIVVALSVFTNGLVLVATAKFKKLQHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYVVSVC
>LWS_Petrom      MTASWQGAMFAARRR-QDDEDTTMESLFRYTNENNT---KDPFEGPNY-HIAPRWVFNL TSVWMIIVVVLSLFSNGLVLVATVKFKKLRHPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPMCVFEGYVVSTC
>LWS_lamprey     MTASWHGAVFAARRR-NDDEDTTKDSIFRYTNENNT---RDPFEGPNY-HIAPRWMFNL TSVWMIIVVVLSLFTNGLVLVATMKFKKLRHPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPMCVFEGYVVSTC
>LWS_Geotria     MAQSWERAMFAARRR--QDEDTTKGDLFRYTNENNT---RDPFEGPNY-HIAPRWMYNL TSFWMIIVVILSLFTNGLVLVATLKFKKLRHPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPLCVFEGFTVSVC
>PIN_galGal                             MSSNSSQAPPNGT---PGPFDGPQWPYQAPQSTYVG VAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRMCELEGFMVSLT P 1aa insert non-ancestral unless VAOP earlier
>PIN_utaSta                                 MVNEWSNAT---PGPFDGPQWPYLAPRSIYTS VAVLMGLVVVSAAFVNGLVIVVSIQYKKLRSPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTACEFEGFMVSLT these opsins are very limited frog to liz/bird
>PIN_pheMad                         MHVQMANASQASLKNGT---LSPFDGPQWPHRASRRVYTS LAALMGVVVLSASLANGLVIAVSVRFKRLRSPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTACRFEGFMVSLT
>PIN_podSic                           MQASNASWVEVRNRT---PGPFEGPQWPYLAPQSTYIS VAVLMGLVVISATLVNGLVIVVSVQFKKLRSPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQATCEFEGFMVSLT
>PIN_xenTro                               MRAGNMSAYEA---PGPYDGPQWPHLAPRSTFLT VAAVMCMVVILAFFVNGLVIVVTLKYKKLRSPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTMCEFEGFMVSLT the glyco is either not homologous or lost in frogs
>PIN_bufJap                               MHSANMSALET---PGPFEGPQWPHVAPRSTYLT VAVLMGMVVFLAFFVNGMVIVVSLKYKKLRSPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLVCELEGFVVSLT
>VAOP_galGal                       MDVFRALGNESLLSNSSGPARWDPFHHPLD-SIQP-WHFRL VAAVMFVVTSLSLAENLAVILVTFKFKQLRQPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWACVLEGFAVTFF no consistent glyco, much older than PIN
>VAOP_anoCar                        MAGLRREAENDSWLFDPSSSSAPFLQPLD-IIEP-WNFHL ISALMFVVTLFSLSENFTVILVTIKFKQLRQPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWACVLEGFAVTFF losing all the invariant residues almost
>VAOP_xenTro                            MPTNVSLLATPENSTVWNPFTGPLK-TIEA-WNFHL LAALMFVVTSLSIAENFIVILVTAKFKQLRQPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWACVLEGFAVTFF
>VAOP_danRer                              MEASSAAVNAVSPAEDPFSAPLS-SIAP-WNYSV LAALMFVVTALSLSENFTVMLVTFRFQQLRQPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWACVLEGFAVTFF
>VAOP_rutRut                              MELFPVAVNGVSHAEDPFSGPLT-FIAP-WNYKV LATLMFVVTAASLSENFAVMLVTFRFTQLRKPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWACVLEGFAVTYF
>VAOP_Petro              MDALQESPPSHHSLPSALPSATGGNGTVATMHNPFERPLE-GIAP-WNFTM LAALMGTITALSLGENFAVIVVTARFRQLRQPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHACVLEGFAVTYF
>PPIN_anaCar                                    MDSLDTNTLSPNASTVRVVLMPRIGYTI IAIIMATSCTLSVILNTAVIAITIKYRQLRQPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVGCVTEGFAMAFF
>PPIN_Xenop                                 MADEALLPPMMNVTNEEMHPGKVLMPRIGYTI LALIMAVFCAAALFLNVTVIVVTFKYRQLRHPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVGCVIEGFAVAFF
>PPIN_Ictal                                 MASIILINFSETDTLHLGSVNDHIMPRIGYTI LSIIMALSSTFGIILNMVVIIVTVRYKQLRQPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVGCVLEGFAVAFF
>PPIN_Danio                                        MESETSTAASGSIAE-VMPRMGYTI LAVIIGVFSVCGVILNVTVITVTLKYKQLRQPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVGCVLEGFAVAFF
>PPIN_Oncor                                MDHQQLLPNLHGNISSSPGSVSEALLSRTGFTI LAVIIGVFSVSGVCMNVLVIMVTMRHRKLRQPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLGCVLEGFAVAFF
>PPINa_Ciona                  MDHDVTPTVDLTDGVPQCKDLNPYVLKGDGWVPQHISRANRSTYSF LCVYMTFVFLLSCSLNILVIVATLKNKVLRQPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTMCQIEGYFVSNF
>PPINb_Ciona                MTTAETTTECYEKNPYIRNEMGWVPKHILIAERHIYTILAVYMTFIFL LAVYMTFIFLLAVSLNGFVIIATMKNKKLRQPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTVCILEGYIVSVA
>PPIN2_cioSa                               IPQNYSNGNPYATTDSGWVPEHIEIANRSTYSG LCVFMSFVFVLAVPLNLLVIVATYKNKVLRRPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSVCIFEGYVMSVT
>PPIN_lamp                                         MENLTSLDLLPNGEVPLMPRYGFTI LAVIMAVFTIASLVLNSTVVIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF 
>PARIE_Utast       ME-NDSSLATEL--------------------AEGAIVKPTIFPKAG-------YGV LAFLMFLNALFSIFNNSLVIAVTLKNPQLRNPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKFCIFQGFAVNYF 
>PARIE_Anole       ME--NESSLVLE--------------------AEGYIVRPTIFPRAG-------YGV LAFLMFINALFSLFNNFLVIAVTLKNPQLRNPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRFCIFQGFAVNYF 
>PARIE_Xenop       MD-GNSTTPGI---------------------AVNLTVMPTIFPRSG-------YSI LSFLMFLNAVFSICNNAIVILVTLKHPQLRNPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQFCIFQGFAVNYF 
>ENCEPH_braBe       EY-GNNTTFAAWDFKRNGTGGEEEVEF-----------------FG-------YDA VAGVIAIIGVVGFVSNGAVVVLFLKFPQLRTPFNLLLLNMAVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF 
>ENCEPH_homS       MYSGNRS--------GGHGYWDGGGAAGAEG------PAPAGTLSPAPLFSPGTYER LALLLGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF 
>ENCEPH_monD       MYSDNSS------DDGGGGYWGSGRAGGASGTGVTGEPGPEGSPRQAPLFSPGTYEL LALLIATIGLLGLCNNLLVLVLYYKFQRLRTPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVGCAWDGFSNTLF 
>ENCEPH_galG       MHSGNGT---------------------GATSRPQLAAAGHEVPGERPLFSAGTYEL LALLIATIGTLGVCNNLLVLVLYYKFKRLRTPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAGCVWDGFSNSLF 
>ENCEPH_anoC       MFSANGT---------RSGAGSDLEPGPGQQQQQREASEEEERGAGLSPFSAGTYEL LALLVAAIGLLGLCNNLLVLVLYAKFKRLRTPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAGCVWDGFSNSLF 
>ENCEPH_xenT       MPVTNGS-----------------------------HNNSISWLHSKDMFTEDTYHF LALIVATVGFLGLVNNLLVLILYCKFKRLQTPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEMCVFHGFSKNLL 
>ENCEPH_braB       MPLYNTSSGPTQGLPWDTPYSQDPIWNDSSPSNSSEDAVVDQGRGELQDFSDAGYTA IATGLALIGLVGSMNNFVVILLIGCHRQLRTPFNLLLLNVSVADLLVSVCGNTLSFASAVQHRWLWGRPGCVWYGFANSLF 
>ENCEPH_braF       MALYNNTSSPSQDLLWDAPYSQGHIW-DNSSASNSSEDVMDQGKVELQDFSDAGYTA IATCLALIGFVGFTNNFVVILLIGCHRQLRTPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANSLF 
>ENCEPH2_Api                                              MEHVIYEDQVSPVMYIGA AIALGFIGFFGFTANLLVAIVIVKDAQILWTPVNVILFNLVFGDFLVSIFGNPVAMVSAATGGWYWGYKMCLWYAWFMSTL 
>ENCEPH1_Ano                                      MYDVTDAAAINSDHQELMAPWAYNGA AVTLFFIGFFGFFLNIFVIALMYKDVQ-LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSICVAYGFFMSLL 
>ENCEPH2_Ano                                     MNDAPNDVAASAVDYEDLMAPWAYNAS AVTLFFIGFFGFFLNLFVIALMCKDMQ-LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTLCVAYGFFMSLL 
>CILL2_Platyr                        MDDLGFLGNSSVNYTVPLLQEDPLLLRILYFGPTSYVIT AIYLCIVGVIGTLSNGVIMYLYFKDKSLRSPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLG 
>CILL1_Platyr            MDGENLTIPNPVTELMDTPINSTYFQNLNAETDGGNHYIYNAFTATDYNIC AAYLFFIACLGVSLNVLVLVLFIKDRKLRSPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLG
                                     !!!    ***   !!!     **  ***   * *  *
>RHO1_homSap                        MNGTEGPNFYVPFSNATGVVRSPFEYPQY-YLAEPWQFSM LAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG


Here's the distal alignment of the opsins and a blurb on how the gene classifier can be used:

>RHO1_homSap   RYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQ FRNCMLTTIC CGKNPLG DDEASATVS  KTE     TSQVAPA
>RHO1_monDom   RYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNFGPIFMTIPAFFAKSSSVYNPVIYIMMNKQ FRTCMITTLC CGKNPLG DDEASATAS  KTE     TSQVAPA
>RHO1_ornAna   RYIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTVPAFFAKSSAIYNPVIYIMMNKQ FRNCMLTTIC CGKNPLG DDEASATAS  KTEQSSVSTSQVSPA
>RHO1_galGal   RYIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKE AAAQQQESATTQKAEK----EVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDFGPIFMTIPAFFAKSSAIYNPVIYIVMNKQ FRNCMITTLC CGKNPLG DEDTSAG    KTETSSVSTSQVSPA
>RHO1_anoCar   RYIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKA AAAQQQESATTQKAER----EVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDFGPVFMTIPAFFAKSSAIYNPVIYILMNKQ FRNCMIMTLC CGKNPLG DEDTSAGT   KTETSTVSTSQVSPA
>RHO1_xenTro   RYIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKE AAAQQQESATTQKAEK----EVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDFGPVFMTVPAFFAKSSAIYNPVIYIVLNKQ FRNCLITTLC CGKNPFG DEEGSSAASS KTEASSVSSSQVSPA
>RHO1_danRer   RYIPEGMQCSCGVDYYTRTPGVNNESFVIYMFIVHFFIPLIVIFFCYGRLVCTVKE AARQQQESETTQRAER----EVTRMVIIMVIAFLICWLPYAGVAWYIFTHQGSEFGPVFMTLPAFFAKTSAVYNPCIYICMNKQ FRHCMITTLC CGKNPFE EEEGASTTAS KTEASSVSSSSVSPA
>RHO1_Raja     RYIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKE AAAQQQESESTQRAER----EVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDFTPFFMTVPAFFAKSSAVYNPLIYILMNKQ FRNCMITTIC LGKNPFE EEESTSASAS KTEASSVSSSQVAPA
>RHO1_lamprey1 RYLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKE AAAAQQESASTQKAEK----EVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTVPAFFAKTSALYNPIIYILMNKQ FRNCMITTLC CGKNPLG DEDSGASTS  KTEVSSVSTSQVSPA
>RHO1_lamprey2 RYLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKE AAAAQQESASTQKAEK----EVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ FRNCMITTLC CGKNPLG DDDSGASTS  KTEVSSVSTSQVAPA
>RHO1_lamprey3 RYIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKE AAAAQQESASTQKAEK----EVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ FRNCMITTLC CGKNPLG DDESGASTS  KTEVSSVSTSQVSPA
>RHO2_galGal   RYMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVRE AAAQQQESATTQKAEK----EVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAVPAFFSKSSSLYNPIIYVLMNKQ FRNCMITTIC CGKNPFG DEDVSSTVSQSKTEVSSVSSSQVSPA
>RHO2_anoCar   RYIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVRE AAAQQQESASTQKAER----EVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDFSATLMSVPAFFSKSSSLYNPIIYVLMNKQ FRNCMITTIC CGKNPFG DEDVSSSVSQSKTEVSSVSSSQVSPA
>RHO2_Gekko    RFIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVRE AAAQQQESATTQKAEK----EVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAFSVTFMTIPAFFSKSSSIYNPIIYVLLNKQ FRNCMVTTIC CGKNPFG DEDVSSSVSQSKTEVSSVSSSQVAPA
>RHO2_Latime   RYIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKE AAAQQQESASTQKAEK----EVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEFTATLMTVPAFFSKSSCLFNPIIYVLLNKQ FRNCMITTLC CGKNPLG DDDTSSAVSQSKTDVSSVSSSQVSPA
>RHO2_Geot     RYIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKE AAAQQQESASTQKAER----EVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILFSATAMTVPAFFSKSSVLYNPIIYVLLNKQ FRTCMVTTLF CGKNPFG EDDSSMVSTS KTEVSSVSSSQVSPS
>SWS1_homSap   RFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKA VAAQQQESATTQKAER----EVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGLDLRLVTIPSFFSKSACIYNPIIYCFMNKQ FQACIMKMV  CGKAMTDESDTCSSQ    KTEVSTVSSTQVGPN
>SWS1_macDom   RFIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRA VAAQQQESATTQKAER----EVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ FHACIMEMV  CRKPMTDDSDVSSSQ    KTEVSAVSSSQVGPT
>SWS1_galGal   RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ FRACIMETV  CGKPLTDDSDASTSAQ   RTEVSSVSSSQVGPT
>SWS1_Taeni    RYIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGIDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ FRACIMETV  CGRPMTDDSEVSSSAQ   RTEVSSVSSSQVGPS
>SWS1_Gekko    RFIPEGLQCSCGPDWYTVGTKYYSEYYTWFLFVLCFIVPLSIIVFSYSQLLSALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCLCYVPYAALAMYMVNNRNHGIDLRMVTIPAFFSKSSCVYNPIIYCFMNKQ FRGCILEMV  CGKTMAEESEVSSASQ   KTEVSSVSSSQVGPS
>SWS1_Uta      RFIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA VAAQQQESATTQKAER----EVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSACVYNPIIYCFMNKQ FRACIMETV  CGKPMTDESDVSSSAQ   KTEVSSVSSSQVSPS
>SWS1_Xenop    RYMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRA VAAQQQESASTQKAER----EVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGLDLRLVTIPAFFSKSSCVYNPIIYSFMNKQ FRGCIMETV  CGRPMSDDSSVSSTSQ   RTEVSTVSSSQVSPA
>SWS1_Danio    RYIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRA VAAQQAESASTQKAEK----EVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENKDYRLVTIPAFFSKSSCVYNPLIYAFMNKQ FNACIMETV  FGKKIDESSEVSS      KTETSSVSA
>SWS1_Oryzia   RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA VAAQQQESASTQKAER----EVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ FNGCIMEMV  FGKKMEEASEVSS      KTEVSTDS
>SWS1_Geotri   RYIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA VAAQQQESASTQKAER----EVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ FRACILETV  CGKPITDESETSSS     RTEVSSVSTTQMIPG
>SWS2_ornAna   RYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRA VAKQQEQSATTQKAER----EVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIPSVFSKASTIYNPIIYVFMNKQ FRSCMLKLVF CGKSPFGDEDEISGSS   QATQVSSVSSSQVSPA
>SWS2_galGal   RYIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRA VARQQEQSATTQKADR----EVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSFEVGLASIPSVFSKSSTVYNPVIYVLMNKQ FRSCMLKLLF CGRSPFGDDEDVSGSS   QATQVSSVSSSHVAPA
>SWS2_Taeni    RYIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRA VAKQQEQSASTQKAER----EVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPFDLGLASIPSVFSKASTVYNPIIYVFMNKQ FRSCMLKLVF CGRSPFGDEDDVSGSS   QATQVSSVSSSQVSPA
>SWS2_Uta      RYIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRA VAKQQEQSATTQKAER----EVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPFDVRLATIPSVFSKASSVYNPVIYVFMNKQ FRSCMLKLVF CGKSPFGDEDDVSGSS   QTTQVSSVSSSQVSPA
>SWS2_Xeno     RYIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHA VAKQQEQSATTQKAER----EVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELFDLRMSSVPSVFSKASTVYNPFIYIFMNRQ FRSCMMKMIF CGKNPLGDDEETSVSG    STQVSSVSSSQIAPS
>SWS2_Danio    RYIPEGLQCSCGPDWYTTNNKFNNESYVMFLFCFCFAVPFSTIVFCYGQLLITLKL AAKAQADSASTQKAER----EVTKMVVVMVFGFLICWGPYAIFAIWVVSNRGAPFDLRLATIPSCLCKASTVYNPVIYVLMNKQ FRSCMMKMVF NKNIEEDEASSSS      QVTQVSSVAPEK
>SWS2_Takif    RYIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLKS AAKAQAESASTQKAER----EVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPFDLRLATIPACFSKASTVYNPIIYVVLNKQ FRSCMKKMLG MSGGDDEESSS          QSVTEVSKVSPS
>SWS2_Geotria  RYIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRA AAAQQQESASTQKAEK----EVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPFDLRLATIPSVFSKASTVYNPVIYIFLNKQ FRSCMMKTIF CGKNPLGDDEDATSTTT    QVSSVSTSQVAPA
>LWS_ornAna    RYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRA VAKQQKESESTQKAEK----EVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCIMQL   FGKKVDDGSELSSTS    RTEVSSVSS  VSPA
>LWS_galGal    RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRA VAAQQKESESTQKAEK----EVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCILQL   FGKKVDDGSEVSTS     RTEVSSVSNSSVSPA
>LWS_anoCar    RYWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRA VAAQQKESESTQKAEK----EVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCIMQL   FGKKVDDGSELSSTS    RTEVSSVSNSSVSPA
>LWS_Lithoch   RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMLTCCIFPLAIIILCYLAVWMAIRA VAMQQKESESTQKAER----EVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPIIYVFMNRQ FRTCIMQL   FGKQVDDGSEVSTS     KTEV     SSVAPA
>LWS_Gastero   RYWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRA VAMQQKESESTQKAER----DVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ FRSCIMQL   FGKEVDDGSEVSTs     KTEV     SSVAPA
>LWS_Petromy   RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHS VAQQQKESETTQKAER----DVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSFHPIAAALPAYFAKGATIYNPIIYVFMNRQ FRNCILQL   FGKKVDDGSEVSSSS    RTEVSSVSNSSVSPA
>LWS_lamprey   RYWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHS VAQQQKESETTQKAER----DVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAFHPLTAALPAYFAKSATIYNPVIYVFMNRQ FRNCIMQL   FGKKVDDGSEVSSAS    RTEVSSVSNSSISPA
>LWS_Geotria   RYWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHT VAQQQKESETTQKAER----DVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ FRNCIMQL   FGKKVDDGSEVSSSA    RTEVSSVSNSSVSPA
>PIN_galGal    SYVPEGLRTSCGPNWYTGGSNNN--SYILSLFVTCFVLPLSLILFSYTNLLLTLRA AAAQQKEADTTQRAER----EVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIIIQPVLASLPSYFSKTATVYNPIIYVFMNKQ FQSCLLEMLC CGYQPQRTGKASPGTPGPHADVTAAGLRNKVMPAHPV
>PIN_Uta sta   SYVPEGLRTSCGPNWYTGGSGNN--SYIMALFVTCFALPLGMIIFSYASLLLTLRA VATQQKEVETTQQAEK----EVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVIQPALASLPSYFSKTATVYNPIIYVFMNKQ FRSCLLSTMS CGHRPRGAQETTPAMISIPQGPTSALQGSRNKVTPSA
>PIN_Phelsuma  SYVPEGLGTSCGPNWYMGGTNNN--SYIVALFVTCFALPLSMILFSYANLLLTLRA VAAQQKEQETTQRAEK----EVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSIQPGLASLPSYFSKTATVYNPIIYVFMNKQ FRSCLLNTVS CGRIPQTMPGTPATTAVRGGFVLTSEGRGNKVASTEL
>PIN_Podarcis  SYVPEGLRTSCGPNWYSGGSSNN--SYIMTLFVTCFAMPLSTILFSYANLLMTLRT VAAQQKEQETTQRAER----EVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAIRPALASLPSYFSKTATVYNPIIYVFMNKQ FRSCLLYKMS CGHRALSSQDTTPAGISLPGRLTTSASKGSRNQVSPS
>PIN_xenTro    SYVPEGLRTSCGPNWYTGGTNNN--SYIMALFLTCFIMPLSTIIFSYSNLLMALRA VAAQQKDSETTQRAEK----EVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVIEPTVASLPSYFSKTATVYNPIIYVFMNKQ FRNCLMTLLC CGRSFGDDETSSASGRTDVTSVSEAGGNKVTPA
>PIN_Bufo      SYVPEGLGTSCGPNWYTGGTNNN--SYILALFTTCFMMPLTTIIFSYSNLLLALRA VAAQQKESETTQRAER----EVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVIDPTLASMPSYFSKTATVYNPVIYVFMNKQ FRDCLTKLLC CGRNPFGEDETSTTSGRTDVTSVSEGGGNKVTPA
>CILL2_Platyn  SYRPEGLGTWCSISWQDRSMNTM--SYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE AADAQGAGTAGKAEKS-----IFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGLPIYAEVLPSLFAKSSQVYNPIIYVLMNKP 
>ENCEPH_braBel SYQFEGHSVGCSVNWVKHNVNNV--SYIITLMVTCFFVPMVVVCWSYACIWRTVRM SAEMKSEFGNPQNTGR----LVTTMVVVMIVCFLVCWTPYTVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ FREFLLARLRTFCCRQPRMLRVTPMDDNAHARLVGEGPSHAQQVIPSEEN
>ENCEPH_braFlo SYQFEGHNVGCSVNWVQHNPDNV--SYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM SSEAKPECGNSQNAGR----LVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ FREFLLARLQRVCCRQQAVPRVTPMDDNVHVRLGGEGPSQSQQFLPAGEN
>VAOP_galGal   SYTTSKIGTTCEPNWYSGAYNDR--SYIIAFFTTCFIVPLLVILVSYGKLLQKLRK VSNTQGRLRTARKPER----QVTRMVVVMIIAFLICWMPYAVFSILATAYPSIELDPHLAAIPAFFSKTATVYNPIIYVFMNKQ FRMCLIQMFK CSAIETAESNMNPTSERATLTQDKRDSQLSVMAVRST
>VAOP_anoCar   SYTTSKIGTTCEPNWYSGDYNDH--TFIITFFTTCFILPLLVILVSYGKLMRKLRK VSDTQGRLGTTRKPER----QVTGMVVIMILAFLICWSPYAAFSILVTACPSIELDPRLAAIPAFFSKTATVYNPVIYVFMNNQ FRKCLVQLFQ CSSQETMDANVNPISEKDTLTHTKHCGEMSTVAAHVI
>VAOP_xenTro   SYTTSKIGTTCEPNWYSGEMRDH--TYIITFLTTCFVFPLLVIFMSYGKLMRKLRK VSDTQGRLGSTRKPEK----EVTRMVVIMILAFLICWTPYAAFSILITAHPTIDLDPRLAAIPAFFAKTASMYNPIIYVYMNKQ FRRCLYQMFN INDPEAKESNLNPTSERGVLTRNNNGGEMLAIATHIT
>VAOP_Danio    SYTVSRIGTTCEPNWYSGNFHDH--TFIITLFSTCFIFPLGVIIVCYCKLIRKLRK VSNTHGRLGNARKPER----QVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHVDPRLAAIPAFVAKTAAVYNPIIYVFMNKQ FRKCLVQLLS CSKVTVVEGNNNQTTERAGMTSGSNTGEMSAIAARVS
>VAOP_Ruti     SYTVSKIGTTCEPNWYSGNFHDH--TFIIAFFITCFILPLGVIVVCYCKLIKKLRK VSNTHGRLGNARKPER----QVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHLDPRLAAAPAFFSKTAAVYNPVIYVFMNKQ FRKCLVQLLR CRDVTIIEGNINQTSERQGMTNESHTGEMSTIASRIP
>VAOP_Petrom   SYRPSMIGTTCEPNWYSGELHDH--TFILMFFSTCFIFPLAVIFFSYGKLIQKLKK ASETQRGLESTRRAEQ----QVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHLDPLLAAVPAFFSKTATVYNPVIYIFMNKQ FRDCFVQVLP CKGLKKVSATQTAGAQDTEHTASVNTQSPGNRHNIAL
>PPIN_anaCar   GYQMEGVMTSCAPDWANSDPINV--SYIICYFLFCFTIPFITILASYGYLIWTLRQ VAKVGLAQRGSTTKAEA---QVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYINPIIATIPMYMAKSSTFYNPIIYIFMNKQ FRDCLVRCLL CGRNPCASEQTDEDDLEVSTIAPAPSSRRGKVAPV
>PPIN_Xeno     SYELEGVMTSCAPNWYSADPVNM--SYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQ VAKLGVAEGGTTSKAEV---QVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHIDPIIATVPMYLTKTSTVYNPIIYIFMNKQ FQECVIPFLF CGRNPWAAEKSSSMETSISVTSGTPTKRGQVAPA
>PPIN_Icta     SYQLEGVMTSCAPNWYRRDPVNV--SYILCYFMLCFALPFATIIFSYMHLLHTLWQ VAKLQVADSGSTAKVEV---QVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYINPVIGTIPAYLAKSSTVFNPIIYIFMNRQ FRDYALPCLL CGKNPWAAKEGRDSDTNTLTTTVSKNTSVSPL
>PPIN_Danio    RLQLEGVRTSCAPDWYSRDLANV--SFIVCYFLLCFALPFSVIVYSYTRLLWTLRQ VSRLQVCEGGSAARAEA---QVSCMVVVMILAFLLTWLPYASFALCVILIPELYIDPVIATVPMYLTKSSTVFNPIIYIFMNRQ FRDRALPFLL CGRNPWAAEAEEEEEETTVSSVSRSTSVSPA
>PPIN_Oncorhy  SFELEGVRTSCSPNWYSREPGNM--SYIILYFLLCFAIPFSIIMVSYARILFTLHQ VSKLKVLEGNSTTRVEI---QVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHINPLIATVPMYLAKSSTVYNPIIYVFMNRQ FRDCAVPFLL CGLNPWASEPVGSEADTALSSVSKNPRVSPQs
>PPIN_lamp     SYELEGVRTSCAPDWYSRDPANV--SYITSYFAFCFAIPFLVIVVAYGRLMWTLHQ VAKLGMGESGSTAKAEA---QVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ FRDCAVPFLL CGRNPWAEPSSESATAASTSATSVTLASAPGQVSPS
>PARIE_Uta     SYGPEGVQTSCSIGWEERSWSNY--SYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG LNKKVEQLGGKSSPEEEF--RAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKQ FRDCAVEFIT CGQVVLTSPEEDISTSAIPVEGKGPCKINQVTPV
>PARIE_Anole   SYGPEGVQTSCSIGWEERSWNNY--SYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG LNKKVEQLGGKSNPEEEF--RAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKE FRECAVEFIT CGKVVLTSPEEDISTSAISDEGIAPCKINQVTPV
>PARIE_Xenop   SYGPEGVQTSCSIGWEERSWSNY--SYIISYFLTCFIIPVGIIGFSYGSILRSLHQ LNRKIEQQGGKTNPREEK--RVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYISPLAATLPTYFAKTSPVYNPIIYIFLNKQ FRTYAVQCLT CGHINLDSLEEDTESVSAQAENMLTPKTNQVAPA
>ENCEPH_braBel SYQLEGPKIGCSVAWEEHSWSNT--SYIVVLFITCLFAPLLIIVYSYYRLWHKVKQ GSRNLPAAMRKSSQKEQ---KIAMMVIVMITCFMVCWLPYGAMALVVTFGGERLISHTAAVVPSLLAKSSTCYNPVVYFAMNSQ FRRYFQDLLC CGRRLFDVSQSVVTGNTAMPRNNSQGFRKDDSDQKQD
>ENCEPH_homSap RYILDVHGLGCTVDWKSKDANDS--SFVLFLFLGCLVVPLGVIAHCYGHILYSIRM LRCVEDLQTIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRK FRRCLLQLLC FRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVT
>ENCEPH_monDom RYTLEIHGLGCSVDWKSKDPNDS--SFVIFLFFGCLMLPVGVMAYCYGHILYAIRM LRCVEELQTIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLVTPTVAIIASLFAKSSTAYNPIIYIFMSRK FRRCLLQLLC FRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVT
>ENCEPH_galGal RYTLEIHGLGCSMDWKSKDPNDT--SFVLLFFLGCLVAPVVIMAYCYGHILYAVRM LRCVEDFQTSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLVTPTVAIIPSFFAKSSTAYNPVIYIFMSRK FRQCLLQLLC FRLMRFQRIMKEPSGAGNVKPIRPIVMSQKVGDRPKKKVT
>ENCEPH_anoCar HYTLEIHGLGCSVDWQSKEPSDS--SFVLFFFLGCLAAPVGIMAYCYGHILHAIRM LRCVEDLQSIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLITPTVAIIPSFFAKSSTAYNPVIYIFMSRK FRRCLVQLFC VQFLRFKRTLKEQPAIESNKPIRPIVMSQKVGDRPKKKVT
>ENCEPH_xenTro LYTFETHKLDCSFEWTATDPKDT--AFVLLFFLACITLPLSIMAYCYGYILYEIQK LRSVKNIQNFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFITPTITVMPSLLAIASAAYNPVIHIFTIKK FRQCLVQLLP PINFHPPINPPINNFWRLLKNLNGRLAMKKVKPVLGKGRS
>ENCEPH1_Anoph AYVNEAANISCSVNWESQTANAT--SYIIFLFIFGLILPLAVIIYSYINIVLEMRK NSARVGRVNRAERRVT-------SMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFWRIRR SNGVAGQPDSNNTNNSNRDKESARHTAKEGL
>ENCEPH2_Anoph AYVQEAANISCSVNWESQTKNAT--TYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE NSARVGRINRAEQRVT-------SMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFSRVRN KGQQAAADQNTTTMQRELTKSSRDMVECSF
>ENCEPH2_Apis  SYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKK VRKRAGASGRREAKIT-------KMVALMITAFLLAWSPYAALAIAAQYFNAKPSATV-AVLPALLAKSSICYNPIIYAGLNNQ FSRFLKKIFD ARGSRTAVPDSQHTALTALNRQEQRK
>CILL1_Platyn  HYIPEGLATWCSIDWLSDETSDK--SYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK VAKTGGSVAKAEREVL-------RMTLLMVSLFMLAWSPYAVICMLASFGPKDLLHPVATVIPAMFAKSSTMYNPLIYVFMNKQ FRRSLKVLLG MGVEDLNSESERATGGTATNQVAAT
>PER_homSap    SYAPDPTGATCTINWRKNDRSFV--SYTMTVIAINFIVPLTVMFYCYYHVTLSIKH HTTSDCTESLNRDWSD--QIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK FRRAMLAMFK CQTHQTMPVTSILPMDVSQNPLASGRI
>PERa_Bran     EYALEPSGTACTINFQKNDSLYI--SYVTSCFVLGFVVPLAVMAFCYWQASCFVSK VLKGDIAGDLTFPVAAN.QNHFSKMCLAMVAAFVVAWTPYSVLFLFAAFWNPADIPAWLTLLPPLIAKSSALYNPIIYIIANRR FRNAICSMMK GQDPDVEDDEHADEHRVRSIEDNDKEIISMVNLNMTV
>PERc_Bran     YTYETPMQITCSLDWNVQHPGEK--AYIAAVLVIVYVLQVLIMCFCYFNIIFKSAN LKFAALASEKTKMAAKKDTWKTSVMCLTMVVSFLIAWTPYAVSSTWDILSAE-DLPIIATILPSLFAKSSCMMNPIIYACCNTK FRQAAVKSFR KLCGMCKQKVPLSTPQVVLAMQRNTEFTSTVEPT
>NEUR_homSap   DYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKS SSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYK FACCQTGGLK ATKKKSLEGFRLHTVTTVRKSSAVLEIHEEWE
>MEL_Platy     AYIPEGFQTSCTYDYLTQDMNNY--TYVLGMYLFGFIFPVAIIFFCYLGIVRAIFA HHAEMMATAKRMGAN...EIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIKPHSEMFIH.AEIPVMMAKASARYNPIIYALSHPK FRAEIDKHFPWLLCCCKPKPKAQLPSSTTKGSIASKTEADTSV
>MEL1_homSap   AYVPEGLLTSCSWDYMSFTPAVR--AYTMLLCCFVFFLPLLIIIYCYIFIFRAIRE TGRALQTFGACKGNG.QSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPK YRVAIAQHLP CLGVLLGVSRRHSRPYPSYRSTHRSTLTSHTSNL

>PPINa_Ciona   YDTEGLGTSCAPNWFVKEKRERL--FIILYFVFCFVIPLAVIMICYGKLILTLRQ IAKESSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQ IDYALGAAPAFFAKTATIYNPLIYIGLNRQ FRDCVVRMIF NGRNPWVDELVGSQVSSTGSQLTAVSSNKVAPA
>PPINb_Ciona   GYVPEGLGTSCAPNWFSKNKSER--IFIFVYFVFCFFIPLLVIIICYGKIVLFLKQVSLY ATRQSSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQ LDYGLGAVPVFFAKTANIYNPLIYIGLNKQ FRDGVIKMVF RGRNPWAEEMSTQQRQRSTEAGQPIVSNEV
>PPIN2_Ciona   SVIWHTPGLFFWNGYEPEGFGTS--CAPNWFSQQKRERIFIFAYFAFCFLTPLTIIFACYLKLILFIRKVSVSKKSMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPD FRDGVIRMLF KGRNPWLDGRNTTSSTSTRAQ

>scallop SCOP2 Patinopecten yessoensis scallop Mollusca
MPFPLNRTDTALVISPSEFRIIGIFISICCIIGVLGNLLIIIVFAKRRSVRRPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIGCKIYAFLCFNS
GVISIMTHAALSFCRYIIICQYGYRKKITQTTVLRTLFSIWSFAMFWTLSPLFGWS
SYVIEVVPVSCSVNWYGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKNGIRAQQRYTPRFIQDIEQRVTFISFLMMAAFMVAWTPYAIMSALAIGSFNVENSFAALPTLFAKASCAYNPFIYAFTNAN
FRDTVVEIMAPWTTRRVGVSTLPWPQVTYYPRRRTSAVNTTDIEFPDDNIFIVNSSVNGPTVKREKIVQRNPINVRLGIKIEPRDSRAATENTFTADFSVI

>scallop       
SYVIEVVPVSCSVNWYGHGLGDV--SYTISVIVAVYVFPLSIIVFSYGMILQEKVCSKNGIRAQQRYTPRFIQDIEQRVTFISFLMMAAFMVAWTPYAIMSALAIGSFNVENSFAALPTLFAKASCAYNPFIYAFTNAN



The key to timing rare genomic changes are opsin sequences from non-genomic lampreys which provide suitable probe to traces of Petromyzon maritimus, the lamprey chosen for a genome project. From traces, it can be verified that the 6 bp insert plus contemporary intron positions/phases (novel 12 upstream intron in LWS and 21 post-GWSR intron exclusively in rod/cone opsins) were already established by the time of lamprey divergence. Both rare genomic events occur between transmembrane helices TM2 and TM3, ie in extracellular loop 2. These two rare genomic changes -- in conjunction with  the absence of contradictory events and conventional clustering -- provide overwelming evidence against the oft-published tree sistering pinopsin with LWS to the exclusion of other cone opsins.

Despite 6 sequenced opsin mRNAs and an assembly in Branchiostoma, no rod/cone opsin can be located in it or earlier diverging deuterostomes (tunicate, amphioxus, urchin, acornworm, xenoturbella). These species may have larval eye spots and related photoreceptors but lack imaging eyes. Lamprey lack to date any opsin classifying within the PIN subtree but do contain modernly configured VAOP and PPIN, the latter in its pineal gland. Hagfish, sister group to lamprey, have imaging eyes but have not been studied; their opsins situation may be derived due to deepwater marine habitat (similarly deepwater coelocanth opsins are adapted to 420 nm). The next-diverging chondrichtyes have inadequate data -- only RHO1 from skate and fragments from Callirhyncus. 

In summary, full-blown pentachromatic color vision and rhodopsin appeared in the stem between amphioxus and lamprey divergence nodes. The oldest known fossil lamprey, Priscomyzon, dates at 360 myr to the Devonian has a full complement of modern opsins (other than parietal). Molecular clocks place lamprey divergence at approximately 430 myr, a hundred million years after Chengjiang.

The fossil record is unsatisfactory here: less than 1 bilateran in 10,000 in Chengjiang and Burgess Shale fossils is even a candidate for deuterostomy. Low numbers of specimens and poor preservation conspire with career ambitions to produce grandiose claims (published in Nature under controversial editor Henry Gi) that greatly exceed the data in the analysis of Hou, discoverer of the Chinese lagerstaette. Myllokunmingia is in the best situation with 500 specimens but Haikouichthys as stem deuterostome, Metaspreggina as post-ediacaran, and Yunnanozoan are all problematic. While signs of bilaterily disposed eyes are sometimes inferred, it does not follow these were image-forming eyes. Indeed contemporary Branchiostoma and tunicate larva have an eye-spot (ocellus); the genomes contain ciliary opsins clustering to approximately ENCEPH and PPIN -- still a long long road to imaging opsins. Echinoderms and hemichordates genomes have opsins but even more remote. Sea urchin genome encodes at least six opsins, four of these cluster classify to rhabdomeric, ciliary and Go-type. Tube feet are apparently the photosensory organ in adult urchins.

Meanwhile, thousands of high-quality Cambrian arthropod fossils unmistakably show stalked paired eyes. Imaging eyes of contemporary arthropods and lophotrochozoan are rhabdomeric, utilizing depolarizing Gq-type receptor, phospholipase C, phosphoinositola, diacylglycerol, and transient receptor potential TRP and TRPL channel signaling. However their genomes can also contain ciliary opsins, using hyperpolarizing Gt-type transducins and phosphodiesterase cGMP second-messaging (as well as Go-type gustducin ciliary opsins in other types of photoreceptors). Vertebrates are just the opposite, having crossed over to a ciliary opsin-based imaging system, while retaining rhabdomeric signaling in melanopsin retinal ganglion cells. Cnidarian opsins are available from Hydra and Nematostella genomes. Hydra expresses a ciliary-type opsin in ectodermal sensory nerve cells whereas Nematostella has opsins classifying between melanopsin and encephalopsin. It must not be thought that bilaterans invented imaging eyes because earlier diverging cubomedusan jellyfish Carybdea marsupialis has 4 eyestalks each with 6 photoreceptors of 4 types: simple eyespots, pigment cups, complex pigment cups with lenses, and camera-type eyes with a cornea, lens, and retina. This species needs a genome project.
 
Thus there is no evidence whatsoever -- and every reason to doubt from genomic analysis -- that deuterostomes had imaging eyes during the Cambrian. Despite this, a BBC series, Walking With Monsters, portrayed a school of 25 mm Haikouichthys attacking and wounding a 500 mm Anomalocaris. It is easy to guess at the scientific advisory panel. [This recurrent anthropocentric theme is echoed by fantastic museum imagery of early mammals nimbly predating on dinosaur nests -- dioramas quietly dismantled after Yucatan meteriorite discovery.] Imaging eyes are not essential to survival; even today subterranean mammals such as blind mole rat flourish without them. Discounting ray-finned fish numbers, a very substantial proportion of extant animal species lack imaging eyes 525 myr after the Cambrian. Of 33 animal phyla, a one-third have no specialized organ for detecting light, one-third have light-sensitive organs, and the remaining 6 have imaging eyes (Cnidaria, Mollusca, Annelida, Onychophora, Arthropoda, and Chordata). Thus 82% of animal phyla have survived well over 500 myr without imaging eyes despite competition/predation from animals with them. 

>RHO1_homSap rhodopsin                 <----------TM1--------->    c1     <----------TM2------->    x1      <c--i------TM3--------->           c2       <----------TM4-------->               x2         <----------TM5------c->     c3
MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 
AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0
           c3       <----------TM6------->    x3      <--------b-TM7---gprot>  helix8    palm cyto tail

 urochordate (1?), Cathaymyrus (cepahochordate, 1)

lungfish opsins> Neoceratodus can't find it ... have to query with rhodopsin as ncbi tool is too stupid.

did find doubly partial seq

>RHO1_Protopterus lungfish doubly partial seq
LAEPWKYSALAAYMFFLILVGFPINFMTLYVTIQHKKLRTPLNYILLNLAIGGLFMVFGGFTTTMYTSMNGYFVFGVTGCNIEGFFATFGGILGLWSLVV
LAIERYIVVCKPMSNFRFGENHAIMGIVFTWIMTLACAAPPLFGWSRYIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFLIPLIIIFFCYGRLVCTV
KEAAAQQQESATTQKAEKEVTRMVYIMVISFLVCWLPYAFVAFYIFTHKGSDFGPVFMTAPAFFAKTAAFYNPVIYIFMNKQ 0
0 FRNCM      *0
  

The optics of the growing lungfish eye: Lens shape, focal ratio and pupillary movements in Neoceratodus forsteri (Krefft, 1870).
  Helena J Bailes , Ann E O Trezise , Shaun P Collin
Lungfish (order Dipnoi) evolved during the Devonian period and are believed to be the closest living relatives to the land vertebrates. Here we describe the previously unknown morphology of the lungfish eye in order to examine ocular adaptations present in early sarcopterygian fish. Unlike many teleosts, the Australian lungfish Neoceratodus forsteri possesses a mobile pupil with a slow pupillary response similar to amphibians. The structure of the eye changes from juvenile to adult, with both eye and lens becoming more elliptical in shape with growth. This change in structure results in a decrease in focal ratio (the distance from lens center to the retina divided by the lens radius) and increased retinal illumination in adult fish. Despite a degree of lenticular correction for spherical aberration, there is considerable variation across the lens. A re-calculation of spatial resolving power using measured focal ratios from cryosectioning reveals a low ability to discriminate fine detail. The dipnoan eye shares more features with amphibian eyes than with most teleost eyes, which may echo the visual needs of this living fossil.

AAD30519 coelocanth
>RHO1_Scyliorhinus canicula (catshark)
mngtegenfy ipmsnktgvv rspfdypqyy laepwkfsvl aaymffliia gfpvnfltly
       61 vtiqhkklrq plnyillnla vadlfmifgg fpstmitsmn gyfvfgpsgc nfegffatlg
      121 geiglwslvv laieryvvvc kpmsnfrfgs qhafmgvglt wimamacafp plvgwsryip
      181 egmqcscgid yytlkpevnn esfviymfvv hfsipltiif fcygrlvctv keaaaqqqes
      241 ettqraerev trmviimvia flicwlpyas vaffifcnqg sefgpifmti paffakaasl
      301 ynpliyilmn kqfrncmitt iccgknpfee eestsasask teassvsssq vapa

browser details OPNLW_ornAna    1087     1   364   364 100.0%  Ultra401  +-     228519    239398  10880
browser details OPNSW2_ornAna   1085     1   363   363 100.0%  Ultra401  +-     258788    263967   5180
>OPNLWS_ornAna green cone Ultra401:228519-239398
>OPNSWS2_ornAna blue cone Ultra401:258788-263967
Ultra401:239378-258808

platypus inline tandems intergenic gap 19,391 bp 10880 span 5180 span
<--LWS--<19k<--SWS2<
any internal intron markers? larger gen duP
how far back? chicken/liz? xenopus? teleost? 

probably very old association because 5 opsins in lamprey already. so if truly tandem, then has to be very old conserved-synteny dupl pre-lamprey
chicken genome missing but in mrna don't need
lizard both  scaffold_406:780,156-815,073
frog both scaffold_275:171,667-235,246 but showing signs of 3rd lws frag pseudo
zfish oddly has 4=3+1 in a row!  chr11:26,179,088-26,677,303 but still all about sws2 and lws1
unfortunately no chance of callo contig
lamprey be a mess
branchio lacks all cones

>OPNLW_ornAna green cone -Ultra401:228519-23939 minus strand
MTPAWNSGVYAARRRFEDEEDTTRTSVFVYTNSNNTRDPFEGPNYHIAPRWAYNVTSLWM
IFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFI
LGHPMCVLEGYTVSLCGITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWA
AVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYL
QVWLAIRAVAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFH
PLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLFGKKVDDGSELSSTSRTEVSSVSSVSPA*

>OPNSW2_ornAna blue cone -Ultra401:258788-263967 minus strand
MHKTHRNLQNELPEDFFIPLPLDTDNITSLSPFLVPQTHLGGSGIFMSLAAFMFLLITLG
FPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACK
IEGFAATLGGMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPP
LFGWSRYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLR
AVAKQQEQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIP
SVFSKASTIYNPIIYVFMNKQFRSCMLKLVFCGKSPFGDEDEISGSSQATQVSSVSSSQVSPA*


Here's a memo i wrote on 4 color vision in platypus ancestor:

	From: 	  tom@cyber-dyne.com
	Subject: 	proof that ancestral mammal had trichromatic vision
	Date: 	September 7, 2007 11:41:38 AM PDT
	To: 	  platypus@watson.wustl.edu

wes,

I really like the new title. The broader constituency probably prefers ancestral mammal genome content to platypus-specific biology; the platypus divergence date nicely breaks up the long branch to chicken and anole. I worked out a longwinded example below illustrating how the platypus genome can successfully bridge a long-standing gap in our understanding of mammalian color vision evolution.

I had to carefully distinguish genomic reconstructions of last common ancestors (beyond our remit) from initial inferences based on early assemblies of extant species. Absence of assembly evidence does not constitute evidence of genomic absence but more commonly lack of coverage, even when unused raw traces among the 33,353,710 total are consulted.

Protein-coding genes in the platypus have been evolving fairly rapidly under selection over the same 166 million years as eutherians -- the tree of life is right-justified. While 'living fossils' do exist (species with slowly evolving proteomes), the platypus is not one of them. On the contrary, picking a protein at random from pre-bilateran Nematostella and employing blastp against all mammals, the human ortholog is most commonly top hit (ie least evolved). Platypus proteins rarely surface as best-blast.

Comparative genomics does not support the conceit of evolutionary progress chugging steadily along to a pinnacle of human innovation even as primitive creatures drop by the wayside for want of manifest destiny. We know today the platypus genome does not encode any ancient, ancestral, antiquated, archaic, dead-end, failed-experiment, fossil, frozen, primitive, primordial, relic, retro or stationary genes. Indeed platypus genes in conserved syntentic position to bird and lizard homologs but lost in human represent genome content (and often functionality) conserved in ancestral mammal but gone in placentals.

As well-known example, contemporary amniote eyes have demonstrably superior color vision (four cone opsins and improved spectral resolution from pigmented oil droplets) relative to those of nocturnally degenerated placental mammal, as proposed by Gordon Walls in 1942 (ref 1). 

Opsins don't lend themselves to simple homology-based parsimony reconstructions because of evolutionary convergence of experimentally accessible (spectroscopic) properties and lineage-specific expansions and contractions. What's needed here are complete genomes and syntenic validation of orthology, which we lack to a certain degree here in key species (eg 3 of 6 known chicken opsins are missing from its current assembly).

However using the platypus genome, I've now been able to reliably reconstruct the cone opsin situation in ancestral mammal, building on ref 2. This is a difficult argument because platypus assembly initially exhibits only exon 5 of an opsin gene clustering to the short wavelength sensitive opsin SWS1 (as observed in ref 2). 

Close synteny analysis of the gapless platypus assembly contig 45.4 using a conserved non-coding phastCons marker and adjacent CALU gene shows that exons 1-4 do not lie in adjacent assembly gaps but were lost in a multi-kbp deletion. Exon 5 exhibits a small number of disabling non-synonymous changes despite retaining overall high (63%) sequence identity to human, consistent with millions of years of normal divergence followed by rapid neutral evolution after the deletion of exons 1-4 pseudogenized residual exon 5. The deletion can be heuristically dated to perhaps 40 mya, an estimate that echidna SWS1 sequence would greatly improve. No opsin data currently exists for echidna; they may retain functional SWS1.

It follows that ancestral monotreme resp. ancestral mammal had 3-color vision (one step down from ancestral amniote) even though the platypus itself today has 2-color vision (similar to marsupials and most placentals). (This assumes intact SWS1 was not exapted to some unrelated function; this is testable from its spectral reconstruction.)

contempory:
 chicken   (SWS1  SWS2  LWS RHO2 RHO1) 4-color cone vision + 1 rod opsin
 platypus  (SWS1- SWS2  LWS ---  RHO1) 2-color cone vision + 1 rod opsin
 marsupial (SWS1  ---   LWS ---  RHO1) 2-color cone vision + 1 rod opsin  
 elephant  (SWS1  ---   LWS ---  RHO1) 2-color cone vision + 1 rod opsin
 human     (SWS1    LWSa/LWSb    RHO1) 3-color cone vision + 1 rod opsin

ancestral:
 amniote   (SWS1  SWS2  LWS RHO2 RHO1) 4-color cone vision + 1 rod opsin
 mammal    (SWS1  SWS2  LWS ---  RHO1) 3-color cone vision + 1 rod opsin
 theran    (SWS1  ---   LWS ---  RHO1) 2-color cone vision + 1 rod opsin  
 placental (SWS1  ---   LWS ---  RHO1) 2-color cone vision + 1 rod opsin
 hominids  (SWS1    LWSa/LWSb    RHO1) 3-color cone vision + 1 rod opsin


Details:

Queries by known SWS1 genes -- but not other opsins -- return a strong match to platypus chr10:3,960,795-3,960,908, which translates to FHACIMEMMRGKLMVDDSESSSQETKTSTVSSRQVGPS* with conserved intron position and phase zero characteristic of SWS1 exon 5. The 11,205 bp contig 45.4 also contains terminal exons of long-conserved syntenic gene CALU in the expected opposite strand orientation. Total coding gene span seems short enough (3,646 bp in opossum, 3,248 bp in human, 11,959 bp in anole) that the contig might be expected to contain some or all of exons 1-4 as well, even adjusting contig length to reflect observed lineage-specific retroposons of platypus. However exons 1-4 are missing from the contig and indeed the assembly. Furthermore missing exons cannot be found among GenBank transcripts now raw trace reads which include singleton reads and reads from other centers not used in the assembly.

This raises the question whether platypus lacks full length SWS1 or whether exons 1-4 merely lie in nearby gaps of the current 6x assembly. Possibly earlier introns have expanded so much that exons 1-4 have been pushed off the contig into these gaps. However contig 45.4 contains the phylogenetically conserved non-coding phastCons element lod109 (ref 3) at its extreme 3' end chr10:3,964,439-3,964,588 (the contig ends at 3,964,658!).

The UCSC genome browser provides putative orthologs of lod109 in marsupial and placental. These blat as standalone queries into full syntenic position with matching strand orientation with respect to marsupial and mouse where they are named lod15+lod45 and lod78 respectively. Here phastCons strand convention, which is inherently arbitrary, can be taken as positive for platypus and carried forward to mouse and oppossum. CALU is positive but SWS1 exon 5 is negative in all three species. 

Thus exons 1-4 of platypus are required to lie between SWS1 exon 5 and lod109, a region spanned by a single gapless contig, but they do not. This requires either a deletion or translocation elsewhere (to a trace read and assembly gap). A local inversion might move exons 1-4 into an assembly gap but that would disrupt reading frame. Implausibly, the missing exons could be translocated into an adjacent bridged gap of estimated modest size preceding the right-flanking gene NAG6 while retenting strand; this is incompatible with all four exons remaining recalcitrant to repeated attempts at genomic amplification (ref 2) and still missing in 6x coverage. 

In summary, SWS1 is non-functional in platypus because the highly conserved exons 1-4 critical to opsin structure and function have been deleted. Conceivably this could be a copy-number polymorphism specific to the individual female platypus chosen for the genome project -- a seemingly viable concern given such polymorphisms are common for the tandem chrX opsin array in human. [Note the current human assembly shows 3 tandem opsins on chrX!] However platypus SWS1 does not lie in tandem array that might foster frequent inhomogeneous recombination. Human colorblindness involving loss of orthologous SWS1 is quite rare. 

Further close analysis of exon 5 shows the deletion event lies between ancient and recent:

Alignment of orthologous exons from 19 species phylogenetically bracketing platypus establish an arginine anomalously replacing conserved cysteine and a one residue deletion in platypus but otherwise very respectable percent identity that is not compatible with gene loss 166 mya ago (date for divergence taken in platypus paper). Non-selected pseudogenized coding dna does not leave detectable remnants at this age -- indeed no remnants can be detected of amniote RHO2 in mammals nor SWS2 in therans even focusing the search on syntenic dna, ref 2 and here. 

>OPNSWS1_xenTro FRGCIMETVCGRPMTDDSSVSSTSQKTEVSTVSSSQVSPA* amphibian
>OPNSWS1_anoCar FRACILETVCGKPMSDESDVSSSAQKTEVSSVSSSQVSPS* lizard
>OPNSWS1_pheMad FRGCIMEMVCGKPMSDDSEASTS-QKTEVSSVSSSQVSPS* lizard
>OPNSWS1_gekGek FRGCILEMVCGKTMAEESEVSSASQKTEVSSVSSSQVGPS* lizard
>OPNSWS1_utaSta FRACIMETVCGKPMTDESDVSSSAQKTEVSSVSSSQVSPS* lizard
>OPNSWS1_galGal FRACIMETVCGKPLTDDSDASTSAQRTEVSSVSSSQVGPT* bird
>OPNSWS1_taeGut FRACIMETVCGRPMTDDSEVSSSAQRTEVSSVSSSQVGPS* bird
>OPNSWS1_phaCar FRACIMETVCGKPMADDSEASSSAQRTEVSSVSSSQVSPS* bird
>OPNSWS1_ancMam FHACIMEMVCGKPMTDDSdVSSS-QKTEVSTVSSSQVGPS* ancestral mammal
>OPNSWS1_ornAna FHACIMEMMRGKLMVDDSE-SSS-QETKTSTVSSRQVGPS* monotreme
>OPNSWS1_monDom FHACIMEMVCRKPMTDDSDVSSS-QKTEVSAVSSSQVGPT* marsupial opossum Didelphimorphia
>OPNSWS1_macEug FHACIMEMVCRKPMTDDSEASSS-QKTEVSTVSSSQVGPS* marsupial wallaby Diprotodontia
>OPNSWS1_smiCra FHACIMEMICKKPMTDDSETTSS-QKTEVSTVSSSQVGPS* marsupial dunnart Dasyuromorphia
>OPNSWS1_setBra FHACIMEMVCRKPMTDDSEASSS-QKTEVSTVSSSQVGPS* marsupial quokka Diprotodontia
>OPNSWS1_isoObe FHACIMEMICRKPMTDDSETSSS-QKTEVSTVSSSQVSPS* marsupial bandicoots Peramelemorphia
>OPNSWS1_canFam FRACIMEMVCGKSMTEDSEMSSS-QKTEVSTVSPSQVGPN* placental
>OPNSWS1_susScr FRACIMEMVCGKPMTDESDMSSS-QKTEVSTVSSTQVGPN* placental
>OPNSWS1_homSap FQACIMKMVCGKAMTDESDTCSS-QKTEVSTVSSTQVGPN* placental
>OPNSWS1_sciBol FRACIMEMVCGKAMTDESDISSS-QKTEVSTVSSSQVGPN* placental
>OPNSWS1_cavPor FRACIMELVCRKPMADESDMSTS-QKTEVSAVSSSKVGPN* placental
>OPNSWS1_musMus FRACILEMVCRKPMADESDVSGS-QKTEVSTVSSSKVGPH* placental
     Consensus  FraCImEmvcgkpMtD#S#.sss.QkTevStVSSsqVgP.*

The anomalous arginine can be validated in raw trace data, necessary because the trace assembly process resolves discrepancies by quality scores weighting that might disadvantage a read more accurate locally at the cysteine position. Viewing original data requires blastn against sepaate platypus WGS and OTHER databases at the NCBI trace archive repository. Some 752,000 additional traces from TIGR and NISC are found there in addition to 3,595,283 cDNA WUGSC traces. These numbers are a significant addition to the 27,607,516 conventional WGS traces used (but not used up) in the current platypus assembly.

The usual error types occur in these traces but none would result in R --> C and similarly for the 3bp indel. This cysteine is deeply conserved in homologous position in other opsins such as RHO1, SWS2, and LWS though there is 6 residues of spacing in the latter two families instead of 5 here.Therefore arginine, manifestly unsuited for a disulfide bond, is not acceptable structurally in this position.  

The amino acid in platypus is a bit unusual at four other sites, highlighted in red in the alignment. Some of these changes may be incapacitating but others tolerated or even advantageous.  While other species have their anomalies as well, platypus is a real outlier in this regard which can be seen for example by MultiAlign tree construction. http://bioinfo.genopole-toulouse.prd.fr/multalin/multalin.html  

The alignment contains a heuristic prediction of exon 5 in the last common ancestor of mammals that is useful in counting changes in descendant lineages. Overall platypus exon 5 has evolved anomalously in that its variations are not concentrated at commonly variable positions but disproportionately at conserved ones, a pattern characteristic of a pseudogene. Opsin change is difficult to assess because to be retained over long timeframes, SWS1 and SWS2 in the same species may push each other apart (subfunctionalize) at significant sites that affect absorption spectra (eg color vision) rather than just drift at soft positions. 

Platypus exon 5 can only have become nonfunctional fairly recently -- retaining 63% identity to human given 332 mya of round-trip evolutionary time is quite respectable and implies long periods of conserved functionality. (Chicken/human have 65% identity.) This degree of conservation is incompatible with ancestral monotreme already having pseudogenized SWS1 from consideration of neutral rates. Thus it appears that a platypus ancestor, possibly prior to echidna divergence at 25 mya (ref 4), still had functional SWS1 and so trichromatic cone vision. That would also be true further back at the ancestral mammalian node because platypus retains SWS2 and LWS, still in amniote-syntenic tandem position at Ultra401:228519-263967 (blat genome or see ref 2).

In summary, the preferred scenario is platypus ancestors possessing a functional SWS1 gene evolving at a moderate rate until the fairly recent deletion of exons 1-4. After the deletion the remaining exon 5 had no constraints and so evolved at the faster neutral rate of pseudogenic dna.

How recent was this deletion? That would require partitioning changes in platypus exon 5 since divergence from ancestral mammal between drift and selective change during the functional era from subsequent neutral change in the pseudogene era, based on anomalous character and what is known generally about opsin evolution and neutral rates in general. That calculation would greatly benefit from determining the sequence and status of exon 5 in echidna. If echidna still had functional SWS1 and the divergence time was 25 mya, the deletion event in platypus must post-date that. On the other hand, if echidna also contained the indel, the departure of its exon 5 sequence from platypus would significantly inform event partitioning and so improve dating of the indel.

It cannot be proven that intact SWS1 functioned as a cone cell color vision receptor in ancestral platypus (because opsins can be exapted to other purposes); echidna sequence might at least allow appropriateness of its adsorption spectrum to be adduced.

ref 1
Walls, GL
The Vertebrate Eye and its Adaptive Radiation
Cran brook Institute 1942
TOC http://www.crsltd.com/research-topics/walls/index.html

ref 2
Davies WL, Carvalho LS, Cowing JA, Beazley LD, Hunt DM, Arrese CA
Visual pigments of the platypus: a novel route to mammalian colour vision.
Curr Biol. 2007 Mar 6;17(5):R161-3. 

ref 3
Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, et al. 
Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. 
Genome Res. 2005 Aug;15(8):1034-50.
 
ref 4
Pettigrew JD.	
Electroreception in monotremes.
J Exp Biol. 1999 May;202(Pt 10):1447-54.
http://jeb.biologists.org/cgi/reprint/202/10/1447

>lod109_ornAna+ chr10:3964439-3964588 
tgctgctgatgtggcaaagaagaccacatctgtccagctacttctatcaccggctcgttggtctaggggtatgattctcgcttagggtgcgagaggtcccgggttcaaatcccggacgagccctgtttttcccacctggttttatttggg

>lod109_monDom+  chr8:184,539,020-184,560,157 from platypus alignment
tgctgatgatgtggcaaaaccacatctggacaactattttcaacactggctcgttggtctaggggtatgattctcgcttagggtgcgagaggtcccgggttcaaatcccggacgagcccaattttac

>lod78_musMus+ chr6:29338830-29338906 from platypus alignment
tgttgatgatgtggtagag-aggatacatctggccaggagtaaccgccagtggctcgttggtctaggggtatgattctcgcttagggtgcgagaggtcccgggttcaaatcccggacgagccctgcttatc

genomic data availability relative to chicken
LWS  galGal- ornAna+ homSap++
SWS2 galGal- ornAna+ homSap+  
SWS1 galGal- ornAna- homSap+
RHO2 galGal+ ornAna- homSap- 
RHO1 galGal+ ornAna+ homSap+
PPNG galGal+ ornAna- homSap-

>OPNLW_ornAna green cone Ultra401:228519-23939
MTPAWNSGVYAARRRFEDEEDTTRTSVFVYTNSNNTRDPFEGPNYHIAPRWAYNVTSLWM
IFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFI
LGHPMCVLEGYTVSLCGITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWA
AVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYL
QVWLAIRAVAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFH
PLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIMQLFGKKVDDGSELSSTSRTEVSSVSSVSPA*

>OPNSW2_ornAna blue cone pigment Ultra401:258788-263967
MHKTHRNLQNELPEDFFIPLPLDTDNITSLSPFLVPQTHLGGSGIFMSLAAFMFLLITLG
FPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACK
IEGFAATLGGMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPP
LFGWSRYIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLR
AVAKQQEQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIP
SVFSKASTIYNPIIYVFMNKQFRSCMLKLVFCGKSPFGDEDEISGSSQATQVSSVSSSQVSPA*

>OPNRH_ornAna rhodopsin rod pigment Ultra20:114,479-118,391
MNGTEGQDFYIPMSNKTGVVRSPFEYPQYYLAEPWQYSVLAAYMFMLIMLGFPINFLTLY
VTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG
GEIALWSLVVLAIERYIVVCKPMSNFRFGENHAIMGVAFTWIMALACALPPLVGWSRYIP
EGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKEAAAQQQES
ATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTVPAFFAKSSAI
YNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATASKTEQSSVSTSQVSPA*

>OPNSWS1_ornAna terminal exon chr10:3,935,884-3,985,883 
FHACIMEMMRGKLMVDDSESSSQETKTSTVSSRQVGPS*


[[Category:Comparative Genomics]]