Opsin evolution: Neuropsin phyloSNPs: Difference between revisions

From genomewiki
Line 135: Line 135:
59calMil  .............RV..S.LI.V.T.........III.....I.........A..........QNH.S...N..........................................................................................................*
10homSap  PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV*
phyloSNP  ...........................................B............B......CA.......................................................B.................................B...B.BBA............A..*
</pre>
 
== Neuropsins compared to Newropsins ==
 
 
Newropsins are a new gene family closely related to neuropsins. Like so many opsin families, they persist from chondrichthyes to archosaurs but vanish without a trace in platypus, marsupials, and placentals. (The syntenic order B4GALT6 NEWR KIAA1012 remains conserved in mammals, but no NEWR debris remains.) Newropsins retain many key attributes of GPCR signaling proteins, and indeed of opsins, such as the seven-transmembrane arrangement, Schiff base lysine, counterion tyrosine, amino-terminal glycosylation site, and disulfide, but have a very odd replacement of the G-protein binding site DRY with an invariantly conserved VCC.
 
This motif must be an ancient derived feature that followed the gene duplication event with neuropsin, since the much older DRY could not plausibly have re-evolved in neuropsin from VCC. Newropsins very likely link covalently with a retinal and interact with light according to some lambda-max type action spectrum. The VCC motif has been conserved over billions of years of branch length, so it cannot reflect simple loss of DRY functionality; however, its signaling capabilities, if any, are most unclear.
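The DRY-to-VCC replacement can be checked directly on short windows around the motif. The following is a minimal sketch, not a tool used for this page: the three windows are hand-trimmed excerpts of alignment rows on this page, and the names `WINDOWS`, `MOTIF_START`, `motif_at`, and `classify` are hypothetical, introduced only for illustration.

```python
# Hand-trimmed windows around the motif, excerpted from alignment rows on
# this page. The window boundaries are an assumption made for illustration.
WINDOWS = {
    "NEUR_homSa": "AVSLDRYLKICY",
    "NEWR2_galG": "LLSVVCCLKICF",
    "NEWR2_danR": "ALSSVCCLKVCF",
}

MOTIF_START = 4  # 0-based offset of the motif inside each window (assumed)
MOTIF_LEN = 3    # the motif is a tripeptide (DRY or VCC)


def motif_at(window: str, start: int = MOTIF_START, length: int = MOTIF_LEN) -> str:
    """Return the residues occupying the motif columns of one window."""
    return window[start:start + length]


def classify(windows: dict) -> dict:
    """Map each sequence name to the tripeptide found at the motif columns."""
    return {name: motif_at(seq) for name, seq in windows.items()}


if __name__ == "__main__":
    for name, motif in classify(WINDOWS).items():
        print(f"{name}\t{motif}")
```

Because the windows share the flanking residues after the motif, a fixed offset is enough here; applied to full-length rows, the offset would have to come from the alignment columns instead.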
 
[[Image:Opsins_NEWR.png]]
 
<br clear="all" />
<pre>
position    ...................................................................................................1.........1.........1.........1.........1.........1.........1.........1........1
position    .........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7........7
position    12345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
excMemCy    eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeee
keyResid    ...GLC.................................................................................................diS..cIon.................DRY?..............................................
NEWR2_galG                MDPSFANS-TFQSKITEAADIVVGTCYMVFGICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHRWLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAYGNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPY
NEWR2_anoC                MESYFANT-TFHSKITEAADVIVGVFYIVFGICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHRWLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVYGNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPY
NEWR2_xenT                MGNKSDAS-AFYSSISETDDIVLGVLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAYGNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPY
NEWR2_danR                MGNVSKTA-LFMSTISRQHDILMGSLYSVFFVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHRWLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNYGNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPY
NEWR2_pimP                MGNVSETA-LFVSTISRQHDILMGSLYSVFCVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHRWLFGEVVCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCCPNYGNKFSSNHACVMVIGVWCYASVFAVGPLIRWGSFAPEPY
NEWR2_tetN                MGNASDTSDAFNSKISKEHDFLIGSIYSVFCVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHRWLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNLGSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPY
NEWR2_takR                MGNASEASDIFLSKISKEHDILIGSIYSVFGLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHRWLFGEITCQLYAMCGVLFGLCSLTNLTALSLVCCLKVCFPNHGSRFSSSHARLLVVGVWCYASVFAVGPLVQWGHYGPEPY
NEWR2_gasA                MGNASDTSAVFASTISKERDILMGSLYSVFGVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHRWLFGELTCQLYAMCGVLFGLCSLTNLTALSFVCCLKVCFPNHGNRFSSSHARLLVVAVWGYASVFAVGPLARWGRYSPEPY
NEWR2_oryL                MGNVSDTSSLFASSISREHDILMGSLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFGEITCQLYAMCGVLFGLSSLTNLTALSLVCCLKVCFPNHGNKFSFSHARLLVAGVWCYASVFAVGPLARWGRYSAEPY
NEWR2_calM                                              GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHRWLFVSKPCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAYMSVVMIV-MFLLAWSPYSIVCLWASFGNPKLIPPAMAII
NEUR_homSa  MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPF
NEUR_canFa  MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF
NEUR_musMu  MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF
NEUR_loxAf  MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGISVVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF
NEUR_monDo  MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF
NEUR_ornAn    MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPF
NEUR_calMi  MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVTGLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQYGSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPF
NEUR_galGa  MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF
NEUR_xenTr  MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRYGTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPF
NEUR_danRe    MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVIGILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGISVSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRYGTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPF
NEUR_takRu    MENDTSIPSGYVPHYLLRGDPFASKLSKEADIVAAFYILVIGVLSATGNGYVIYQTIKRKTKLKPPEFMTLNLAVFDFGISVTGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSLDRYLKICHLRYGTWFKRHHAFLCLVFTWLYAAFWATMPVVGWGNYAPEPF
NEUR_tetNi    MENETWTHSSYVPHYLLRGDPFASRLSKEADIVAALYICIIGLMSATGNGYVLYMTFKRKTKLKPPELMTLNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGAWLKRHHAFLCLASVWAYAAFWATMPLVGWGSYAPEPF
NEUR_gasAc    MDNETRSHPSYVPHYLLRGDPFASRLSKEADIVAAFYIFIIGVMSATGNGYVLYMTFKRKTKLKPPELMTVNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRHHAFVCLALVWAYAAFWATMPLVGWGSYAPEPF
NEUR_oryLa    MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPF
NEUR_pimPr    MENTSWPHSSYVPHYLLRGDPFASRLSKEADIVAAFYILIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHIFLCLVFVWIYAAFWATMPLVGWGSYAPEPF
NEUR_anoCa      MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGISVVGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSYGTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPF


position    1.........1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.........3.........3.....3
position    8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.........4.........5.....6
position    012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456
excMemCy    eeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc*
keyResid    .diS................................................................................................................K............................................................
NEWR2_galG  GTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAKLSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVR---CCFCVKELQ--TYRSTFNTGLRTFKG
NEWR2_anoC  GTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAKLSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLK---SCFCPRGMQNCSYRSALEAPLKSFKG
NEWR2_xenT  GTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIKLSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAV---VCGWCR-----TPAVKTPCPHKDLKT
NEWR2_danR  GTACCINWYTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVKLSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQSSQQCNNKDGSI
NEWR2_pimP  GTACCINWYIPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILLRVRGSRQAVQKHVSPKTKETNAHTLIVKLSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK---AQSSQQC--KDATI
NEWR2_tetN  GTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVKLSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIY----RGSPLSGPRDRSGGVTQR--HKDLSV
NEWR2_takR  GTACCIDWRAPNHELSSLSYIVCLFFFCYVLPCATIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHSLIVKLSVAVCIGFLGAWSPYAIVAMWAAFGDATWVPPDAFAIAAILAKSSTIYNPVVYLLCKPNFRECLYKDTSTLRQRIY----RGSPQSEPRERFGGTSQR--HKDLSI
NEWR2_gasA  GTACCIDWHAPNHELAALSYIVCLFVFCYALPCATIFLSYTFILLTVRGSRQAVQQHVSPQTKTTNTHALIVKLSVAVCIGFLGAWTPYAVVAIWAAFGDATLVPPDAFALAAMFAKSSTIYNPVVYLLCKPNFRACLYRDTTLLRQRIY----RGSPRSEPKAHFGSTSQR--NKDMSV
NEWR2_calM  APLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV 
NEWR2_oryL  GTACCIDWHAPNHELWALSYILCLFIFCYALPCTIIFLSYAFILLTVRGSRQAVQQHVSPQTKTTNAHTLIVKLSVAVCIGFLGAWTPYAVIAMWAAFGDATQVPPTAFALAAVFAKSSTIYNPMVYLLCKPNFRECLCRDTSLLRHMIY----RGSP--QPQERFGSDSRR--NKDITA
NEUR_homSa  GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEGFRLHTVT-TVRKSSAVLEIHEEV   
NEUR_calJa  GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEDFRLHTVT-TVRKSSAVLEIHEEV   
NEUR_canFa  GTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKA-TKKKSLEDFRLNTVT-TVRKSSAVLEIHQEV   
NEUR_musMu  GTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRG-TKKKSLEDFRLHTVT-TVRKSSAVLEIHQEV   
NEUR_loxAf  GTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRA-TKKKSLEGFRLHTVT-TVKKSSAVLEVHQEV   
NEUR_monDo  GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKA-AKKESLRTYRLHTVT-TVRRSSAVLEIHQEV   
NEUR_ornAn  GTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKT-GKKESLKNSRSHSMS-TIRKPSAVSGPHQEV   
NEUR_calMi  GTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK                                                                                                         
NEUR_galGa  GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESRMYTIS-SHRDSAALSGTQLEV   
NEUR_xenTr  GTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTKVAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKK------DKSLQNTTSRVYTIS-TFRKSTTSAR       
NEUR_danRe  GTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTKVAMLICAGFLIAWIPYAVVSVMSAFGEPDSVPIPVSVVPTLLAKSSAMYNPIIYQVIDCKKKCVKSCCFQAWRKKKPSKTSRFYTISGSIKQR-PGDEASIEI   
NEUR_takRu  GTSCTLDWWLAQASVSGQSFVMCMLIFCLVLPTGVIVFSYVMIILQVKSSAQEVSHFDTQNKNKHHLEMKLTKVAMLICAGFLIAWIPYAVVSVVSAFGDPDSVPISISVVPTLLAKSSAMYNPIIYQVIDCKKNCAKLSCFQAWSKRKHYKTSRFYSISASMKKR-PANEVPTEI   
NEUR_tetNi  GTSCTLDWWLAQASVSGQSFVMAILFFCLILPTGIIVFSYVMIIFKVKSSAKEISHFDARIRNSHDLEIKLTKVAMLICAGFLIAWIPYAVVSVISAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVVDVKTSCTNFSCCKALKERIHFRKSRLYTISGSLRDPLPPKEAHIEM   
NEUR_gasAc  GTACTLDWWLAQASVSGQSFVMAILFFCLVLPTGIIVFSYIMIIFKVKSSAKEISHFDARIKNSHSLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVADLKTSCTSSSCCKALKERVLFRKARLYTISGSLRDTLPPKEAHIEM   
NEUR_oryLa  GTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSRFYTISGSVKDNTTAKEAQIEM   
NEUR_pimPr  GTSCTLDWWLAQASVSGQSFVMSILFFCLVLPAGIIVFSYVMIICKVKSSSKEVSSFDARIKNSHTLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSIPIPVSVIPTLLAKSSAMYNPIIYQLVDLKNSC-STCCAKVIRKRTHFRNSRFYTISGSLKDTAPAKEAHIEI   
NEUR_anoCa  GTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTKVAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSR                           
</pre>
== Curated Set of 51 deuterostome neuropsins ==


Line 558: Line 635:


</pre>
 
== Curated Set of 10 vertebrate newropsins ==
<pre>
>NEWR_galGal GenBank 3 garbage 5'UTR translated exons -B4GALT6 -NEWR2_galGal -KIAA1012
0 MDPSFANSTFQSKITEAADIVVGTCYMVF 1
2 GICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHR 2
Line 568: Line 645:
CSYFPSEKGSHTFECFKSYPNCFQERLSTMGCHLQDCESLENDLQVEVTQGSRNSMKVVEQEEKSTELDNLEITLEAVPVSCTFTDL* 0

>NEWR_anoCar Anolis carolinensis
0 MESYFANTTFHSKITEAADVIVGVFYIVF 1
2 GICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHR 2
Line 576: Line 653:
YFPCEKCHDPFECFKNYPKCCQGRLNVMDHTPRESISVENNMQSKTKHASEKYIKVVIRGEKNTDIDNLEITLEHIPTDIKFANL* 0

>NEWR_xenTro abundant transcripts
0 MGNKSDASAFYSSISETDDIVLGVLYSVF 1
2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2
Line 584: Line 661:
LKTTSKPPSSFKKSQGVCRNCVDTFECFRNYPRCCSVGNVDAAQPMAASLVRIPPANGAPQQTVQLVVSSSRTRSGVETVEVSTEAPMSDFIKDFI* 0

>NEWR_danRer Danio rerio acquired new intron
0 MGNVSKTALFMSTISRQHDILMGSLYSVF 1
2 FVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHR 2
Line 593: Line 670:
SQQCNNKDGSISTPFSSGQAESYGACHVYAEAGPHYQQISRQITARVLEGSVQSEIPVKQLTEKMQNDLL* 0

>NEWR_tetNig Tetraodon nigroviridis gene mix
0 MGNASDTSDAFNSKISKEHDFLIGSIYSVF 1
2 CVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHR 2
1 WLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNL 1
2 GSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPYGTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVK 0
0 LSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIYRGSPLSGPRDRSGGVTQRHKDLSVSTR
LSNGQQDSYGTCLHCAEDAELGHVTGSRRTACILTGSTFTEVTLSQLSATPADLL* 0


>NEWR_takRub Fugu rubripes
0 MGNASEASDIFLSKISKEHDILIGSIYSVF 1
2 GLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHR 2
Line 609: Line 686:
LSNGQQDSYGTCLHCADDAERGHVTTSQRTACILTGSTFTEVTVGQLSAAPADLL*

>NEWR_gasAcu Gasterosteus aculeatus
0 MGNASDTSAVFASTISKERDILMGSLYSVF 1
2 GVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHR 2
Line 617: Line 694:
APCHVMTPQRTACILTESTNREVTVSRLADKPQADFL*

>NEWR_oryLat Oryzias latipes
0 MGNVSDTSSLFASSISREHDILMGSLYSVF 1
2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2
Line 625: Line 702:
TGLCQLASPQNTACILTGSTYAEVTVQQLVDKQQPDFL* 0

>NEWR_pimPro Pimephales promelas
0 MGNVSETALFVSTISRQHDILMGSLYSVF 1
2 CVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHR 2
Line 632: Line 709:
0 LSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK-AQSSQQCKDATISTPFSSGQAESYGTCHIYAEAEPHFQQISPQRTVRILEGIIQSEISVRHMTDRMQNDLL* 0

>NEWR_calMil frag
0 1
2 GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHR 2
1 WLFVSKpCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAY 1

Revision as of 14:18, 17 March 2008

Neuropsin backgrounder

Neuropsin (OPN5) is a deeply diverged member of the opsin family with a single publication on it and considerable confusion over the name (mostly used for an unrelated kallikrein serine protease, KLK8, which is not an opsin). There are no known disease associations or described knockout phenotypes; it is expressed primarily in brain, spinal cord, and testes.

Neuropsin has all the classical attributes of a rhodopsin-class GPCR and indeed opsin photoreceptor: Schiff base lysine at expected position, standard tyrosine counterion and DRY motif, seven transmembrane configuration, disulfide at expected position, proximal glycosylation and distal palmitoylation and kinase sites. It is most closely related to peropsin and rgropsin in terms of blast clustering and intron positioning. Its G-protein signalling partner is not known.
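The seven-transmembrane configuration can be read off the topology annotation that accompanies the alignments. The sketch below is illustrative only: the annotation string is transcribed from the two excMemCy rows of the neuropsin/newropsin alignment, reading e / M / c as extracellular / membrane / cytoplasmic is an assumption based on the row label, and `EXC_MEM_CY` and `membrane_segments` are hypothetical names.

```python
import re

# Topology annotation transcribed from the two excMemCy rows (first and
# second alignment block, joined). Assumption: 'M' marks membrane-spanning
# columns; segment lengths should be re-checked against the source rows.
EXC_MEM_CY = (
    "eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeee"
    "eeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc"
)


def membrane_segments(annotation: str) -> list:
    """Return (start, end) 1-based inclusive columns of each run of 'M'."""
    return [(m.start() + 1, m.end()) for m in re.finditer(r"M+", annotation)]


if __name__ == "__main__":
    segments = membrane_segments(EXC_MEM_CY)
    print(len(segments), "membrane segments")
    for start, end in segments:
        print(start, end)
```

The segment count does not depend on exact run lengths, so it is robust to small transcription slips; the reported column coordinates are only as good as the transcription.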

The evolution is illuminated by the massive comparative genomics study described here, which extracts and compares over 50 full-length sequences from various genome projects. Neuropsin can be located outside chordates but not outside deuterostomes. However, like peropsin and rgropsin, it must have originated much earlier, in pre-bilaterians. Thus its absence in earlier-diverging species must be due either to gene loss or to divergence beyond recognition.

Within placental mammals, neuropsin is extraordinarily conserved: percent identity relative to the human protein averages 96% over 31 species (likely above the 95th percentile proteome-wide). That conservation drops considerably in marsupials and monotremes (86%), is less striking in other tetrapods (78%), and is unremarkable in teleost fish (68%). This pattern suggests that neuropsin acquired significant new adaptive functionality on the placental mammal stem, leading to marked resistance to fixation of any further variation.
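Percent identity to human can be computed from dot-notation alignment rows, where a dot means "same residue as the reference". The sketch below is a hedged illustration rather than the procedure actually used for the figures above: it uses only the first 30 columns of the 10homSap, 22musMus, and 29canFam rows, so its numbers will not reproduce the full-length percentages, and skipping gap columns is an assumption. `percent_identity` and `mean_identity` are hypothetical names.

```python
# First 30 columns of the human reference row (10homSap).
REF = "MALNHTALPQDERLPHYLRDGDPFASKLSW"

# First 30 columns of two dot-notation rows: '.' = identical to reference.
ROWS = {
    "22musMus": "." * 20 + "E" + "." * 9,
    "29canFam": "." * 7 + "R" + "." * 11 + "E" + "." * 10,
}


def percent_identity(ref: str, row: str) -> float:
    """Percent identity of one dot-notation row against the reference.

    Columns where either sequence has a gap ('-') are skipped (assumption).
    """
    if len(ref) != len(row):
        raise ValueError("reference and row must have the same length")
    compared = same = 0
    for r, c in zip(ref, row):
        if r == "-" or c == "-":
            continue
        compared += 1
        if c == "." or c == r:
            same += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * same / compared


def mean_identity(ref: str, rows: dict) -> float:
    """Average percent identity over all rows."""
    values = [percent_identity(ref, row) for row in rows.values()]
    return sum(values) / len(values)


if __name__ == "__main__":
    for name, row in ROWS.items():
        print(f"{name}\t{percent_identity(REF, row):.1f}")
    print(f"mean\t{mean_identity(REF, ROWS):.1f}")
```

Averaging per-species identities, as done here, weights every species equally; whether gapped or missing stretches should count as mismatches is a choice that changes the result for the partial sequences.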

The structure of the neuropsin gene is rather odd. In human, a weak splice signal appears to have developed that results in a very short seventh exon. However, a stop codon is encountered if the splice is not made, which gives two slightly different alternative carboxy-terminal sequences, QVE vs XXX. Very few transcripts exist in this region for any species, but it appears that the recent ancestral form of the protein utilizes only the stop codon in exon 6.

Novel neuropsins in amphioxus and sea urchin

The genome of Branchiostoma (amphioxus, lancelet) contains two distinct neuropsins about 75% identical to each other and 42% identical to human. These cluster unambiguously with vertebrate opsins and share critical conserved residues. An extra intron distinguishes them from the vertebrate neuropsin pattern. Recall that Branchiostoma has three rather diverged (and well-studied) peropsins but no evident Rgr opsin. This raises the question of whether neuropsin and peropsin developed substantial visual roles in this species as an alternative to the ciliary imaging opsin pathway seen by lamprey divergence. Sea urchins, but not the acorn worm Saccoglossus, contain a single neuropsin that is quite diverged.

These neuropsins are newly reported here, meaning they were not localized in recent in situ hybridization studies. That is especially unfortunate in view of the antecedent role the Branchiostoma ancestral node plays in the evolution of the chordate eye and the complexity of photoreceptor tissues in the extant species.

PhyloSNPs in vertebrate neuropsins

Alignment analysis coming shortly. Neuropsin has rather few phyloSNPs.

position  ...................................................................................................1.........1.........1.........1.........1.........1.........1.........1......1..
position  .........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7......7..
position  123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567..
excMemCy  eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeee..
keyResid  ...GLC.................................................................................................diS..cIon.................DRY...............................................
exonNumb  111111111111111111111111111111111111111111122222222222222222222222222222222222222223333333333333333333333333333333333333333333333333333333334444444444444444444444444444444444444..
10homSap  MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPE..
11panTro  ...................................................................................................................................................................................
12gorGor  ............................................................................................................................................------------------.....................
13ponPyg  ...................................................................................................................................................................................
14nomLeu  ...................................................................................................................................................................................
15rheMac  .................................I............................................................................................................................................A....
16papHam  .................................I............................................................................................................................................A....
17calJac  ......S.......................................................................................................................................................................A....
18tarSyr  ....................................................................................I.........................................................................................A....
19otoGar  ............LR..........................................................V...................................................................----------------------..M......E.......
20micMur  ......V.................................I..........................................---------------------------------------------------------..............L....V..............A....
21tupBel  ............S.............................................................................................................................................V...................A....
22musMus  ....................E..................................................................................F..................................................V...................A....
23ratNor  ....................E..................................................................................F..................................................V...................A....
24speTri  ............H.......E................................................................................................................................F....V...................A....
25dipOrd  ..F....GT.GQG.....PEE...T........I........................................................................................................................V........................
26cavPor  .......P..N.H..R..Q.E...V.......................V...........................I..................R.............................V.................................V..............A....
27oryCun  ............H......E..............................................................................................................................R.......L...................A....
28ochPri  ....D.......H....F.........................................................................................L....D.....................------......R.......V...................A....
29canFam  .......R...........E........................................................I.............................................................................V...................A....
30felCat  .......P...........E.................................................--------------.......................................................................V...................A....
31bosTau  .......P.P...R.P........................................................V...I................................................................I............V.....A.............A....
32turTru  ......................K..........I.............................................................................V...................A....
33susScr  .......P.P...R.....E....................................................V...I.............................................................................V.....A.............A....
34vicVic  .......P.P...R.RH...............................L..................................................M........................................-------------------------------------..
35equCab  ............................................................................I.............................................................................V........................
36myoLuc  ............G.....Q.............................V.......................V...........................................................................T.....F........................
37pteVam  ......V.....H....V..............................V...........................I.............................................................................V........................
38sorAra  ............N..........................................................................................M.................................................VV...................A....
39eriEur  .S..Q.......G.........................................................................................................................................L...V...................A....
40loxAfr  .T.....P...D...Q..Q.....T.................................C..............................V................................................................V...................A....
41proCap  .T....V..E.D..S.........T................V......C.........Y..............................I...S..........................................H...-------------------------------------..
42echTel  .......P...NS...........V.........G.....I.................Y....................................S.......T..................................................V........................
43dasNov  ...........D...............................................K..............................................................................................V........................
44choHof  ......G....DS....F......................I....................R............................................................................................V.............L..........
45monDom  .....SVS...DYI..............................V...L.....I....K............V......................S.......V................................H....T....H..F....L.....T..A.V.FA.V.S.A....
46macEug  .V...L.....I....K............V..............................V................................H....T....H......VI.....T..A....A...N.A....
47ornAna  MT.YS.PQLGDY......E....V............V..V...V...L.....I.................V..................V...........M................................H....T....H.......I........A........N.A....
48galGal  ..SDCNSSS.E.Y....MQQE........R...II......V......L.....IF...K............V................S...F.S...I...M................................H.A..T....H..F....L.....T..A.V.FA.V.S.A....
49taeGut  ..SEYNNSS.E.YI....QEE........R...II.............L.....IF...K............V................S...F.S...M......C.............................H....T....H..F....I.....M..A.V.FA.V.S.A....
50anoCar  .EQGQNISS..DN----QQEE........V...I...V..LV......L.....I...TQ.....K......V................S..AF.S...I...S.............I..............F...H....T....H.VF...GI..S..A..A.I.FA.F.N.A....
51xenTro  ..G.SSYREESGYI...E..S........R...IF..V..MA......L.....I..ACS........................T....A.V...S.......NA..................L.V..........H.R..T....R..F.A..V.....TL.A.L....V.N.A....
52danRer  .E-NET-SISSGYI....LR.........K...I..A..ILV.....AT.....M..TFK..T..K.P....L...IF.F....S....F.V.S.S...L...Q...Y.................I..F.......H.R..T....H..FLSVVF.....A..A...V..W.N.A....
53pimPro  .E-NDT-SIPSGYV....LR.........K...I..A..ILV..V..AT.....I.QTIK..T..K.P.F..L....F.F....T....F.V.S.S...L...Q...Y.................I..........H.R..T.F..H..FL..VFT.L..A..A...V..W.N.A....
54takRub  .E-NET-WTHSSYV....LR......R..K...I..AL.IC...LM.AT........TFK..T..K.P.L..L...IF.F....T....F.V.SLS...L...E...F.................V..........H.R..A....H..FL...SV....A..A......W.S.A....
54tetNig  .D-NET-RSHPSYV....LR......R..K...I..A..IF...VM.AT........TFK..T..K.P.L..V...IF.F....T....F.V.SLS...L...E...F.................V..........H.R..T....H..FV...LV....A..A......W.S.A....
56gasAcu  .E-NET-WTHPSYI....LR......R..K...II.A..IC....M.AT.....I..TIK..S..K.P.L..V....F.F....T....FVV.S.A...L...E...F.................V..........H.R..T....Q..FL..VFV.M..A..A......W.N.A....
57oryLat  .E-N.S-W.HSSYV....LR......R..K...I..A..IL....M.AT.....I..TIK..S..K.P.L..V....F.F....T....FVV.S.S...L...E...F.................V..........H.R..T....Q.IFL..VFV.I..A..A......W.S.A....
59calMil  .TFDNSTALYSGYWL.DSLH....V........IISAC..IVT.L...L.....I.L.ITQ.R..K.P..LIT...IS.F.M..G.Q..L.....S...I...V....H................V..........H.Q..S..Q.R.VFMS..F..F..A..A......W.N.A....
10homSap  MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPE..
phyloSNP  ..................AA......B......A..A..B.....B.BB.........B......C.A.........B.A....A..........B......................................A...B..A.......A.............A...............
  ..
  ..
position  1.1.........1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.........3.........3.....3
position  7.8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.........4.........5.....6
position  89012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456
excMemCy  eeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc*
keyResid  ...diS................................................................................................................K...........................................
exonNumb  44444444444444444444444444444444444444444444444444444444444444444444444444455555555555555555555555555555555555555555555555555555555555555555555555555555555556666666666666666666666
10homSap  PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV*
11panTro  .................................................................................................................................................E.....-..........-...............*
12gorGor  ...............................................................P.....................................K.................................................-....Q.....-...............*
13ponPyg  .....................................................................................................S.................................................-..........-...............*
14nomLeu  .......................................................................................................................................................-..........-S..............*
15rheMac  .......................................................................................................................................................-..........-...............*
16papHam  .......................................................................................................................................................-..........-...............*
17calJac  .......................................................................................................................................................-..D.......-...............*
18tarSyr  ......................................................G.........T......................................................................................-..D.......-...............*
19otoGar  ..-...........-.L...............................................G................................................................................T.....-..D.......-............Q..*
20micMur  .......................................................I.............................................................................----------------------H......-A...........Q..*
21tupBel  ....................I..................................I..........................................................................................................-............Q..*
22musMus  ................G.......S.................A...........................V...............................................................R.....A...RG.....-..D.......-............Q..*
23ratNor  ................G.......S.................A...........................V................................N..............................R.........R......-..D.......-A...........Q..*
24speTri  .......................-..............E..............E..............-------.......................................S....................................-..D.......-A.......V...Q..*
25dipOrd  ................LA.................S...........................P.......................................................................................-.......................Q..*
26cavPor  ................A...I...H...........M..................I.............................................................................SR.....NA.........-..D.......-.D...-......Q..*
27oryCun  ........................................................................................................................................S..R.S.........-..D.......-............Q..*
28ochPri  ................................................................G.......................................................................S..R.....Q.....-..D.......-............Q..*
29canFam  ................L...I.........................................................................................................................R........-..D...N...-............Q..*
30felCat  ....................I..................................................................................................................................-..D.......-............Q..*
31bosTau  ....................I.................................................V................................................................................-..D.......-..........V.Q..*
32turTru  ....................I..........................................P..........................................................V............................-..D.......-............Q..*
33susScr  .....................................................................................V.................................................................-..D.......-...........RQ..*
34vicVic  ---------------------------------------------------------------------------............................................................................-..D....A..-............Q..*
35equCab  ....................I...........................................G.....V................................................................................-..D.......-............Q..*
36myoLuc  ...............T....I.................................K.........-----------.............................................S..............L........R......-..N.......-............Q..*
37pteVam  ....................I...........................................G..M.........................................................................S..R......-..D.....I.-...EA.......Q..*
38sorAra  ....................I..................................................................................N..............................R.....S...R......-.DD.......-...E........Q..*
39eriEur  ................L...I.................................K............M..............................................................................N....-.KDY...................q..*
40loxAfr  ....................I..................................I...........M............................................................................R......-..........-..K.......V.Q..*
41proCap  ---------------------------------------------------------------------------..............................V.................................R.R..R...E..-...V......-............Q..*
42echTel  .......................................................I...........M............................................................................................I.-...........HQ..*
43dasNov  ................................................................................................................................................R......-..D.......-...E......V.Q..*
44choHof  ...................................................................M.......................................................................R....R......-F.........-............Q..*
45monDom  .................A..A.V.S.......F.............L.....T.....Y.T..QN..I.................................Q.....V.F.......................C......S..Q..A..E.-.RTY......-...R........Q..*
46macEug  .................T..T...............................T..........Q..............................................................................................RHTVSTIRKSSSVSETYQ..*
47ornAna  .................A..A...............................T..........QN....................................Q.......F.......................CRIS..RL..P.TG..E.-.KNS.S.SMS-.I..P...SGP.Q..*
48galGal  .................A..A.V.S.......F.............L.....T.....Y.T..QN..I.................................Q...V...F.......................C.....RS..P.TLQ...S.KES.MY.IS-SH.D.A.LSGTQL..*
49taeGut  .................A....V.S.....................L.....T.....Y.T..QN..I.................L...................V...F......................ECRL...RP..*
50anoCar  ..............G..A..A..........V......M.C........Q..T.....Y.T..QNQ..................MF...................V..KV..I.............V......C.S...RP.N.QPLQ..NSR*
51xenTro  ....T............K..I.V.SM......F..M......A.........A.......T.NQNN.T..I.................F............Q......E......MM....S...........C.P...KKD--.SLQNTT----S.VY.IS-.F...TTSAR*
52danRer  ............T....S..S.VMCM.....IF..VI......M..F.....A...S...T.NKNN.S............................M....E...V..PV..........S............C.KK.VKSCCFQ.WR..KPSKTS.FY.ISGSIKQRPGD-.ASI.I*
53pimPro  .................S..S.VMCM.I...V...G.......M..LQ....AQ..S...TQNKNK.H............................V....D...V..SI..........S............C.KN.AKLSCFQ.WS.RKHYKTS.FYSISASMK.RP.N-.VPT.I*
54takRub  .................S..S.VMA......I...GI......M..F.....A..IS...A..RN..D..I.........................I....E...V..PV..I.......S..........V.V.TS.TNFSCC..L.ERIHFRKS..Y.ISGSL.DPLPPK.A.I.M*
54tetNig  ....A............S..S.VMA......V...GI.....IM..F.....A..IS...A..KN..S..I.........................V....E...V..PV..I.......S..........A.L.TS.TSSSCC..L.ERVLFRKA..Y.ISGSL.DTLPPK.A.I.M*
56gasAcu  .................S..S.VVA......V..AGI......M..F.....A..ISN..A..KN..N..I.........................V....E...V..SV..I.......S..........L.L.NS.MKSSCF.GL..PRHFRKS.FY.ISGS.KDNTTAK.AQI.M*
57oryLat  .................S..S.VMS......V..AGI......M..C.........SS..A..KN..T..I.........................V....E......PV..I.......S.........LV.L.NS.-S.CCA.VIR.RTHFRNS.FY.ISGSLKDTAPAK.A.I.I*
59calMil  .............RV..S.LI.V.T.........III.....I.........A..........QNH.S...N..........................................................................................................*
10homSap  PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV*
phyloSNP  ...........................................B............B......CA.......................................................B.................................B...B.BBA............A..*
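In the difference alignment above, a `.` appears to mark identity with the human reference row (10homSap) and `-` a gap; that reading is inferred from the layout rather than stated on the page. Under that assumption, a minimal sketch for turning a dotted row back into a full sequence and listing its explicit substitutions could look like this. The function names are hypothetical, and column numbers count from the start of the fragment, not from the `position` ruler.

```python
def expand_row(reference: str, row: str, identity: str = ".") -> str:
    """Replace identity marks in `row` with the reference residue at that column."""
    # zip() stops at the shorter string, so truncated rows are expanded as far as they go.
    return "".join(r if c == identity else c for r, c in zip(reference, row))


def substitutions(reference: str, row: str, identity: str = ".", gap: str = "-"):
    """List (column, reference_residue, row_residue) for every explicit difference."""
    diffs = []
    for col, (r, c) in enumerate(zip(reference, row), start=1):
        if c not in (identity, gap) and c != r:
            diffs.append((col, r, c))
    return diffs


# First 25 columns of the 10homSap and 22musMus rows from the block above.
human = "PFGTSCTLDWWLAQASVGGQVFILN"
mouse = "." * 16 + "G" + "." * 7 + "S"

print(expand_row(human, mouse))
print(substitutions(human, mouse))
```

The phyloSNP row uses its own letter codes (A, B, C) and is not interpreted by this sketch.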

Neuropsins compared to Newropsins

Newropsins are a newly recognized gene family closely related to neuropsins. Like many opsin families, they persist from chondrichthyes to archosaurs but have vanished without a trace in platypus, marsupials, and placentals. (The syntenic gene order B4GALT6 NEWR KIAA1012 remains conserved in mammals, but no NEWR debris remains.) Newropsins retain many key attributes of GPCR signaling proteins, and of opsins in particular: the seven-transmembrane arrangement, the Schiff base lysine, the counterion tyrosine, an amino-terminal glycosylation site, and the disulfide. However, the G-protein binding motif DRY is replaced, very oddly, by an invariantly conserved VCC.

The VCC motif must be an ancient derived feature that arose after the gene duplication separating newropsin from neuropsin, since the much older DRY could not plausibly have re-evolved in neuropsin from VCC. Newropsins very likely bind retinal covalently and respond to light according to an action spectrum with some characteristic lambda max. The VCC motif has been conserved over billions of years of branch length, so it cannot reflect simple loss of DRY functionality; its signaling capabilities, if any, remain unclear.
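The DRY versus VCC contrast can be checked mechanically on the aligned sequences from this page. The sketch below is only a heuristic: it assumes the three residues of interest sit immediately before a conserved `[LM]K[IV]C` stretch, which holds for the fragments shown here but is not claimed as a general rule. The function name and pattern are illustrative.

```python
import re

# Assumed context: the motif is followed by L/M, K, I/V, C (e.g. DRY-LKIC, VCC-LKIC, VCC-MKVC).
MOTIF_CONTEXT = re.compile(r"([A-Z]{3})(?=[LM]K[IV]C)")


def motif_at_dry_site(seq: str):
    """Return the tripeptide preceding the conserved context, or None if it is not found."""
    match = MOTIF_CONTEXT.search(seq.upper())
    return match.group(1) if match else None


# Short fragments copied from the NEUR/NEWR alignment on this page.
fragments = {
    "NEUR_homSa": "SLITMTAVSLDRYLKICYLSY",
    "NEWR2_galG": "SLSTLTLLSVVCCLKICFPAY",
    "NEWR2_calM": "SLTNLTVLSTVCCMKVCFPAY",
}

for name, frag in fragments.items():
    print(name, motif_at_dry_site(frag))
```

On full-length sequences the pattern could match elsewhere first, so restricting the search to the window around the end of the third transmembrane helix would be a sensible refinement.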

[Figure: Opsins NEWR.png]


position     ...................................................................................................1.........1.........1.........1.........1.........1.........1.........1........1
position     .........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7........7
position     12345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
excMemCy     eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeee
keyResid     ...GLC.................................................................................................diS..cIon.................DRY?..............................................
NEWR2_galG                MDPSFANS-TFQSKITEAADIVVGTCYMVFGICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHRWLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAYGNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPY
NEWR2_anoC                MESYFANT-TFHSKITEAADVIVGVFYIVFGICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHRWLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVYGNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPY
NEWR2_xenT                MGNKSDAS-AFYSSISETDDIVLGVLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAYGNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPY
NEWR2_danR                MGNVSKTA-LFMSTISRQHDILMGSLYSVFFVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHRWLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNYGNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPY
NEWR2_pimP                MGNVSETA-LFVSTISRQHDILMGSLYSVFCVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHRWLFGEVVCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCCPNYGNKFSSNHACVMVIGVWCYASVFAVGPLIRWGSFAPEPY
NEWR2_tetN                MGNASDTSDAFNSKISKEHDFLIGSIYSVFCVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHRWLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNLGSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPY
NEWR2_takR                MGNASEASDIFLSKISKEHDILIGSIYSVFGLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHRWLFGEITCQLYAMCGVLFGLCSLTNLTALSLVCCLKVCFPNHGSRFSSSHARLLVVGVWCYASVFAVGPLVQWGHYGPEPY
NEWR2_gasA                MGNASDTSAVFASTISKERDILMGSLYSVFGVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHRWLFGELTCQLYAMCGVLFGLCSLTNLTALSFVCCLKVCFPNHGNRFSSSHARLLVVAVWGYASVFAVGPLARWGRYSPEPY
NEWR2_oryL                MGNVSDTSSLFASSISREHDILMGSLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFGEITCQLYAMCGVLFGLSSLTNLTALSLVCCLKVCFPNHGNKFSFSHARLLVAGVWCYASVFAVGPLARWGRYSAEPY
NEWR2_calM                                              GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHRWLFVSKPCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAYMSVVMIV-MFLLAWSPYSIVCLWASFGNPKLIPPAMAII
NEUR_homSa   MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPF
NEUR_canFa   MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF
NEUR_musMu   MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF
NEUR_loxAf   MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGISVVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF
NEUR_monDo   MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF
NEUR_ornAn    MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPF
NEUR_calMi  MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVTGLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQYGSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPF
NEUR_galGa   MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF
NEUR_xenTr   MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRYGTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPF
NEUR_danRe     MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVIGILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGISVSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRYGTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPF
NEUR_takRu     MENDTSIPSGYVPHYLLRGDPFASKLSKEADIVAAFYILVIGVLSATGNGYVIYQTIKRKTKLKPPEFMTLNLAVFDFGISVTGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSLDRYLKICHLRYGTWFKRHHAFLCLVFTWLYAAFWATMPVVGWGNYAPEPF
NEUR_tetNi     MENETWTHSSYVPHYLLRGDPFASRLSKEADIVAALYICIIGLMSATGNGYVLYMTFKRKTKLKPPELMTLNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGAWLKRHHAFLCLASVWAYAAFWATMPLVGWGSYAPEPF
NEUR_gasAc     MDNETRSHPSYVPHYLLRGDPFASRLSKEADIVAAFYIFIIGVMSATGNGYVLYMTFKRKTKLKPPELMTVNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRHHAFVCLALVWAYAAFWATMPLVGWGSYAPEPF
NEUR_oryLa     MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPF
NEUR_pimPr     MENTSWPHSSYVPHYLLRGDPFASRLSKEADIVAAFYILIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHIFLCLVFVWIYAAFWATMPLVGWGSYAPEPF
NEUR_anoCa       MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGISVVGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSYGTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPF

position    1.........1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.........3.........3.....3
position    8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.........4.........5.....6
position    012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456
excMemCy    eeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc*
keyResid    .diS................................................................................................................K............................................................
NEWR2_galG  GTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAKLSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVR---CCFCVKELQ--TYRSTFNTGLRTFKG
NEWR2_anoC  GTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAKLSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLK---SCFCPRGMQNCSYRSALEAPLKSFKG
NEWR2_xenT  GTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIKLSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAV---VCGWCR-----TPAVKTPCPHKDLKT
NEWR2_danR  GTACCINWYTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVKLSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQSSQQCNNKDGSI
NEWR2_pimP  GTACCINWYIPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILLRVRGSRQAVQKHVSPKTKETNAHTLIVKLSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK---AQSSQQC--KDATI
NEWR2_tetN  GTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVKLSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIY----RGSPLSGPRDRSGGVTQR--HKDLSV
NEWR2_takR  GTACCIDWRAPNHELSSLSYIVCLFFFCYVLPCATIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHSLIVKLSVAVCIGFLGAWSPYAIVAMWAAFGDATWVPPDAFAIAAILAKSSTIYNPVVYLLCKPNFRECLYKDTSTLRQRIY----RGSPQSEPRERFGGTSQR--HKDLSI
NEWR2_gasA  GTACCIDWHAPNHELAALSYIVCLFVFCYALPCATIFLSYTFILLTVRGSRQAVQQHVSPQTKTTNTHALIVKLSVAVCIGFLGAWTPYAVVAIWAAFGDATLVPPDAFALAAMFAKSSTIYNPVVYLLCKPNFRACLYRDTTLLRQRIY----RGSPRSEPKAHFGSTSQR--NKDMSV
NEWR2_calM  APLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV   
NEWR2_oryL  GTACCIDWHAPNHELWALSYILCLFIFCYALPCTIIFLSYAFILLTVRGSRQAVQQHVSPQTKTTNAHTLIVKLSVAVCIGFLGAWTPYAVIAMWAAFGDATQVPPTAFALAAVFAKSSTIYNPMVYLLCKPNFRECLCRDTSLLRHMIY----RGSP--QPQERFGSDSRR--NKDITA
NEUR_homSa  GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEGFRLHTVT-TVRKSSAVLEIHEEV    
NEUR_calJa  GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEDFRLHTVT-TVRKSSAVLEIHEEV    
NEUR_canFa  GTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKA-TKKKSLEDFRLNTVT-TVRKSSAVLEIHQEV    
NEUR_musMu  GTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRG-TKKKSLEDFRLHTVT-TVRKSSAVLEIHQEV    
NEUR_loxAf  GTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRA-TKKKSLEGFRLHTVT-TVKKSSAVLEVHQEV    
NEUR_monDo  GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKA-AKKESLRTYRLHTVT-TVRRSSAVLEIHQEV    
NEUR_ornAn  GTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKT-GKKESLKNSRSHSMS-TIRKPSAVSGPHQEV    
NEUR_calMi  GTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK                                                                                                           
NEUR_galGa  GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESRMYTIS-SHRDSAALSGTQLEV    
NEUR_xenTr  GTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTKVAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKK------DKSLQNTTSRVYTIS-TFRKSTTSAR         
NEUR_danRe  GTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTKVAMLICAGFLIAWIPYAVVSVMSAFGEPDSVPIPVSVVPTLLAKSSAMYNPIIYQVIDCKKKCVKSCCFQAWRKKKPSKTSRFYTISGSIKQR-PGDEASIEI    
NEUR_takRu  GTSCTLDWWLAQASVSGQSFVMCMLIFCLVLPTGVIVFSYVMIILQVKSSAQEVSHFDTQNKNKHHLEMKLTKVAMLICAGFLIAWIPYAVVSVVSAFGDPDSVPISISVVPTLLAKSSAMYNPIIYQVIDCKKNCAKLSCFQAWSKRKHYKTSRFYSISASMKKR-PANEVPTEI    
NEUR_tetNi  GTSCTLDWWLAQASVSGQSFVMAILFFCLILPTGIIVFSYVMIIFKVKSSAKEISHFDARIRNSHDLEIKLTKVAMLICAGFLIAWIPYAVVSVISAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVVDVKTSCTNFSCCKALKERIHFRKSRLYTISGSLRDPLPPKEAHIEM    
NEUR_gasAc  GTACTLDWWLAQASVSGQSFVMAILFFCLVLPTGIIVFSYIMIIFKVKSSAKEISHFDARIKNSHSLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVADLKTSCTSSSCCKALKERVLFRKARLYTISGSLRDTLPPKEAHIEM    
NEUR_oryLa  GTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSRFYTISGSVKDNTTAKEAQIEM    
NEUR_pimPr  GTSCTLDWWLAQASVSGQSFVMSILFFCLVLPAGIIVFSYVMIICKVKSSSKEVSSFDARIKNSHTLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSIPIPVSVIPTLLAKSSAMYNPIIYQLVDLKNSC-STCCAKVIRKRTHFRNSRFYTISGSLKDTAPAKEAHIEI    
NEUR_anoCa  GTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTKVAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSR                             

Curated Set of 51 deuterostome neuropsins
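
Each record in this set appears to list one exon per line, with the integers before and after the residues giving the codon phase at the exon's start and end. That reading is inferred from header fields such as `1_6 43 0 1` (exon 1 of 6, 43 residues, phases 0 and 1) and is not stated on the page. Assuming it holds, a minimal sketch for parsing the records and joining exons into a protein could look like this; the `Exon` class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Exon:
    start_phase: int
    seq: str
    end_phase: int


def parse_records(text: str) -> dict:
    """Map record name -> list of Exon, one per 'phase residues phase' line."""
    records = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # keep only the identifier, drop header fields
            records[name] = []
            continue
        tokens = line.split()
        if name is None or len(tokens) < 2:
            continue
        if not (tokens[0].isdigit() and tokens[-1].isdigit()):
            continue  # not an exon line in the assumed layout
        # An exon missing from the assembly shows up as two phases with nothing between.
        records[name].append(Exon(int(tokens[0]), "".join(tokens[1:-1]), int(tokens[-1])))
    return records


def join_exons(exons) -> str:
    """Concatenate exon residues and drop the trailing stop marker."""
    return "".join(e.seq for e in exons).rstrip("*")


EXAMPLE = """\
>NEUR_homSap OPN5 1_6 43 0 1  2_6 40 2 1  3_6 57 2 1  4_6 112 2 0  5_6 81 0 2  6_6 19 1 0
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0
"""

exons = parse_records(EXAMPLE)["NEUR_homSap"]
protein = join_exons(exons)
print(len(exons), len(protein))
```

Lowercase residues, which occur in some records, are passed through unchanged so that whatever they flag in the curated set is not lost.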

>NEUR_homSap OPN5 1_6 43 0 1  2_6 40 2 1  3_6 57 2 1  4_6 112 2 0  5_6 81 0 2  6_6 19 1 0
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0

>NEUR_panTro
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKETKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0

>NEUR_gorGor
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 YASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIPSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGKPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFQ 2
1 LHTVTTVRKSSAVLEIHEEv* 0

>NEUR_ponPyg
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGSPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0

>NEUR_nomLeu
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTSVRKSSAVLEIHEEv* 0

>NEUR_macMul
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADIVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0

>NEUR_papHam
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADIVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2
1 LHTVTTVRKSSAVLEIHEEv* 0

>NEUR_calJac
0 MALNHTSLPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHEEV* 0

>NEUR_tarSyr
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VIGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKGVAHFDSRIHTSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHEEv* 0

>NEUR_otoGar
0 MALNHTALPQDELRPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 WTMMPLVGLEDYVPEPFTSCTLDWWLAQSLGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKTTKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEV* 0

>NEUR_micMur
0 MALNHTVLPQDERLPHYLRDGDPFASKLSWEADLVAGFYLIII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2  1
2 GVWLKRKHAYICLALIWAYVSFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDHR 2
1 LHTVTAVRKSSAVLEIHQEv* 0

>NEUR_tupBel
0 MALNHTALPQDESLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHVLEMKLTK 0
0  2
1 LHTVTTVRKSSAVLEIHQEV* 0

>NEUR_musMus
0 MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRGTKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEV* 0

>NEUR_ratNor
0 MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 gVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPNSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQTGGLRATKKKSLEDFR 2
1 LHTVTAVRKSSAVLEIHPEv* 0

>NEUR_speTri
0 MALNHTALPQDEHLPHYLRDEDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAFICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFINILFFCLLLPTAVIEFSYVKIIAKVKSSSEEVAHFDSRIHSSHV 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPSLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTAVRKSSAVVEIHQEv* 0

>NEUR_dipOrd
0 MAFNHTAGTQGQGLPHYLPEEDPFTSKLSWEADIVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASLAGQVFILNILFFCLLLPTSVIVFSYVKIIAKVKSSSKEVAHFDSRIPSSHVLEMKLTK 0
0  2
1  * 0

>NEUR_cavPor
0 MALNHTAPPQNEHLPRYLQDEDPFVSKLSWEADLVAGFYLTII 1
2 GILSTVGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1
2 vVGKPFTIISCFRHRWVFGWIGCRWYGWAGFFFGCGSLITMTVVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAAIWAYVSFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASAGGQIFILHILFFCLLLPTAMIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDSRFACCQNAGLKATKKKSLEDFR 2
1 LHTVTTDRKSAVLEIHQEV* 0

>NEUR_oryCun
0 MALNHTALPQDEHLPHYLREGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRRHAYICLALIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFSCCRTSGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_ochPri
0 MALNDTALPQDEHLPHYFRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRLYGWADFFFGCGSLITMTAVSLDRYLK 1
2 GVWLKRRHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFSCCRTGGLKQTKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_canFam
0 MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKATKKKSLEDFR 2
1 LNTVTTVRKSSAVLEIhQEV* 0

>NEUR_felCat
0 MALNHTAPPQDERLPHYLREGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAE 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_bosTau
0 MALNHTAPPPDERRPPYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAICDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GIWLKRKHAYICLAVIWAYAAFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEVHQEv* 0

>NEUR_turTru
0  1
2 GILSTFGNGYVLYMSSRRKKKLKPAEIMTINLAICDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIPSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAVYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_susScr
0 MALNHTAPPPDERRPHYLREGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAICDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYAAFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLVAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIRQEV* 0

>NEUR_vicVic
0 MALNHTAPPPDERRPRHLRDGdPFASKLSWEADLVAGFYLTII 1
2 GILSTLGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWMFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2  0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHAVTTVRKSSAVLEIHQEV* 0

>NEUR_equCab
0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEVKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2
1 LHTVTTVRKSSAVLEIHQEV* 0

>NEUR_myoLuc
0 MALNHTALPQDEGLPHYLQDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTVGNGYVLYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHTYICLAFIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQATVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKKVAHFDSRIH 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSSAMYNPIIYQVIDYKLACCQTGGLRATKKKSLENFR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_pteVam
0 MALNHTVLPQDEHLPHYVRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTVGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHMLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTSGLRATKKKSLEDFR 2
1 LHTITTVREASAVLEIHQEV* 0

>NEUR_sorAra
0 MALNHTALPQDENLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLVVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPNSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQSGGLRATKKKSLDDFr 2
1 LHTVTTVRESSAVLEIHQEV* 0

>NEUR_eriEur
0 MSLNQTALPQDEGLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 gVWLKRKHAYLCLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKKVAHFDSRIHSSHMLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKANKKKSLKDYR 2
1 

>NEUR_loxAfr
0 MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTII 1
2 GILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRATKKKSLEGFR 2
1 LHTVTTVKKSSAVLEVHQEv* 0

>NEUR_proCap
0 MTLNHTVLPEDDRLSHYLRDGDPFTSKLSWEADLVAGFYLTVI 1
2 GILSTCGNGYVLYMSYRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFIIISSFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1
2  0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSVPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCRTRGLRATKEKSLEGVR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_echTel
0 MALNHTAPPQDNSLPHYLRDGDPFVSKLSWEADLGAGFYLIII 1
2 GILSTFGNGYVLYMSYRRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFSHRWVFGWTGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTK 0
0  2
1 LHTITTVRKSSAVLEIHQEV* 0

>NEUR_dasNov
0 MALNHTALPQDDRLPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 gILSTFGNGYVLYMSSKRKKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRATKKKSLEDFR 2
1 LHTVTTVRESSAVLEVHQEV* 0

>NEUR_choHof
0 MALNHTGLPQDDSLPHYFRDGDPFASKLSWEADLVAGFYLIII 1
2 GILSTFGNGYVLYMSSRRRKKLRPAEIMTINLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1
2 GVWLKRKHAYICLAVIWAYASFWTTMPLLGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHMLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCRTGGLRATKKKSFEGFR 2
1 LHTVTTVRKSSAVLEIHQEv* 0

>NEUR_monDom
0 MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTII 1
2 GVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1
2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRTYR 2
1 LHTVTTVRRSSAVLEIHQEv* 0

>NEUR_macEug
0  1
2 GVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIISCFCHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSy 1
2 GTWLKRHHAYICLVIIWAYATFWATMPLAGLGNYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQSSHVLEMKLTK 0
0  2
1 RHTVSTIRKSSSVSETYQEV* 0

>NEUR_ornAna
0 MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVII 1
2 GVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1
2 GTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKNSR 2
1 SHSMSTIRKPSAVSGPHQEV* 0

>NEUR_galGal
0 MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVI 1
2 GILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAY 1
2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESR 2
1 MYTISSHRDSAALSGTQLEV* 0

>NEUR_taeGut
0 MASEYNNSSQEEYIPHYLQEEDPFASKLSREADIIAGFYLTII 1
2 GILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGIS 1
2 VVGKPFSIISFFSHRWMFGWIGCCWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1
2 GTWLKRHHAFICLAIIWAYAMFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQVFVLSILFFCLLLPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0
0 VAMLICAGFLLAWIPYAVVSVWSAFGRPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIECRLACCRPGG 2
1 

>NEUR_anoCar
0 MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVI 1
2 GILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGIS 1
2 VVGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSY 1
2 GTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTK 0
0 VAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSR 2
1 

>NEUR_xenTro
0 MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAI 1
2 GILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGIS 1
2 VTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRY 1
2 GTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTK 0
0 VAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKDKSLQNTTSR 2
1 VYTISTFRKSTTSAR* 0

>NEUR_danRer
0 MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVI 1
2 GILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGIS 1
2 VSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRY 1
2 gTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPFGTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVMSAFGEPDSVPIPVSVVPTLLAKSSAMYNPIIYQVIDCKKKCVKSCCFQAWRKKKPSKTSR 2
1 FYTISGSIKQRPGDEASIEI* 0

>NEUR_takRub
0 MENDTSIPSGYVPHYLLRGDPFASKLSKEADIVAAFYILVI 1
2 GVLSATGNGYVIYQTIKRKTKLKPPEFMTLNLAVFDFGIS 1
2 VTGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSLDRYLKICHLRY 1
2 GTWFKRHHAFLCLVFTWLYAAFWATMPVVGWGNYAPEPFGTSCTLDWWLAQASVSGQSFVMCMLIFCLVLPTGVIVFSYVMIiLQVKSSAQEVSHFDTQNKNKHHLEMKLTK 0
0 VAMLICAGFLIAWIPYAVVSVVSAFGDPDSVPISISVVPTLLAKSSAMYNPIIYQVIDCKKNCAKLSCFQAWSKRKHYKTSR 2
1 FYSISASMKKRPANEVPTEI* 0

>NEUR_tetNig
0 MENETWTHSSYVPHYLLRGDPFASRLSKEADIVAALYICII 1
2 gLMSATGNGYVLYMTFKRKTKLKPPELMTLNLAIFDFGIS 1
2 VTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1
2 GAWLKRHHAFLCLASVWAYAAFWATMPLVGWGSYAPEPFGTSCTLDWWLAQASVSGQSFVMAILFFCLILPTGIIVFSYVMIIFKVKSSAKEISHFDARIRNSHDLEIKLTK 0
0 VAMLICAGFLIAWIPYAVVSVISAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVVDVKTSCTNFSCCKALKERIHFRKSR 2
1 LYTISGSLRDPLPPKEAHIEM* 0

>NEUR_gasAcu
0 MDNETRSHPSYVPHYLLRGDPFASRLSKEADIVAAFYIFII 1
2 GVMSATGNGYVLYMTFKRKTKLKPPELMTVNLAIFDFGIS 1
2 VTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1
2 GTWLKRHHAFVCLALVWAYAAFWATMPLVGWGSYAPEPFGTACTLDWWLAQASVSGQSFVMAILFFCLVLPTGIIVFSYIMIIFKVKSSAKEISHFDARIKNSHSLEIKLTK 0
0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVADLKTSCTSSSCCKALKERVLFRKAr 2
1 LYTISGSLRDTLPPKEAHIEM* 0

>NEUR_oryLat
0 MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICII 1
2 gIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGIS 1
2 VTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1
2 GTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTK 0
0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSR 2
1 FYTISGSVKDNTTAKEAQIEM* 0

>NEUR_pimPro
0 MENTSWPHSSYVPHYLLRGDPFASRLSKEADIVAAFYILII 1
2 GIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGIS 1
2 VTGKPFFVVSSFSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1
2 GTWLKRQHIFLCLVFVWIYAAFWATMPLVGWGSYAPEPFGTSCTLDWWLAQASVSGQSFVMSILFFCLVLPAGIIVFSYVMIICKVKSSSKEVSSFDARIKNSHTLEIKLTK 0
0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSIPIPVSVIPTLLAKSSAMYNPIIYQLVDLKNSCSTCCAKVIRKRTHFRNSr 2
1 FYTISGSLKDTAPAKEAHIEI* 0

>NEUR_calMil
0 MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVT 1
2 GLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMS 1
2 VGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQY 1
2 GSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK 0
0  2
1  * 0

>NEUR1_braFlo from cDNA and genome chrUn  ++  176419943 176431046 11104 FE548698 
0 MATTPADRLDGLTPAGRGATTAETHADDFASKLSREADIVIGVYLILI 1
2 GTGAILGNGRVLWLSYRCRARLRPVEMFVVSLAAADVGLSLVGHPFSAASSLMGRWSFGSAGCTW 1
2 YGFVVFFLGIASIATMALMSIMRFMIVHKRY 1
2 GQYPSRRASCVLVAAAWLYGLFWACAPLA 1
2 GWSQYHPEPYGLSCSVDWGGFSRGAGGSSFIICMLLFCTAVPVVVMVTSYAAIFALYRQAQKGVVLNLQVNATFGGKRQRTER 0
0 IALAVCGGFLLAWLPYAVVGLWASVAGVDAVPLALASAAPLFAKSNSLWNPIIYLGMNERFR 2

>NEUR2_braFlo from traces and genome chrUn  ++  187375671 187384042 8372 nearly identical chrUn ++ 32271780 32281075 9296
0 MATTPGLPLDGLAPTGRGVTAADTLDDDFASKLSREADIVIGVYLLLI 1
2 GTGSILGNGRVLWLSYRNWAKLRPVELFVVSLAVTDVGISVFGYPFAASSSLLGRWSFGSAGCTW 1
2 YGFTGFFFGLTSIANMALMSIMRFMIVYKGY 1
2 GPYPSRRATSGLIAAAWLYGLFWACAPLA 1
2 GWSQYHVEPFGLSCTVDWGSFSRDAGGMSFIICLLVFCVAIPVTAIMASYVAISAIYRQAKKSIAGHLQDNSAMCKKRNKLE 0
0 MALAVCGGFLLAWLPYAVVGLWSAVAGVDAVPLALASAAPLFAKSSSLWNPIIYLGMNDRFR 2
 
>NEUR_strPur Strongylocentrotus purpuratus XM_001197837 CX694910 CX690664
0 MDVNAKWWTNETLRTRDQFSDDHYTSVLSYEGDIWAGVYLMFI 1
2 SLIAFIGNISVIVISLRKREKLKPIDLLTINLAIADFLICVVSYPLPMISAFRHR 0
0 WSFGKFGCVWYGFTSFLFAVGSMATLMVIALLRYAKLCRENV 1
2 DQYQSRPFVIKVIVAIWGFAFFTTAPPLFGWS 2
1 SYVPEPYHLSCTIDFADTSPSGLSYTYFTTIVVFFMPLMIIVLCYVAIARKMIHHNRRINVGHNAGRMLLEIRLLK 0
0 TACMITMAYTISWTPYAVIAMWVTYIPVNQIPDAFRILPAFCAKTSSVYNPIIYCIFNKSFRQDLSSLICCCACQCYTITINLDINSHAQQQFRRIEERR
DEVGTYKRRPLMICSNPFAWSRDFHETWRQRRIRGIHRNCRNNVRVENINVNFRRDTDMVELNAPTPAEIHRPELNTASTRSGARTKSMATHLPALEEVPSG
APQCSALLHNTPIPRSLQGTPLPYQPQPSTSDLHDEFLNPSVVSRNMCVIVVKPNIEEELSTD* 0

Curated Set of 51 vertebrate newropsins

>NEWR_galGal GenBank 3 garbage 5'UTR translated exons -B4GALT6 -NEWR2_galGal -KIAA1012
0 MDPSFANSTFQSKITEAADIVVGTCYMVF 1
2 GICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHR 2
1 WLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAY 1
2 GNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPYGTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAK 0
0 LSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVRCCFCVKELQTYRSTFNTGLRTFKGKNESSCNALPIMEG
CSYFPSEKGSHTFECFKSYPNCFQERLSTMGCHLQDCESLENDLQVEVTQGSRNSMKVVEQEEKSTELDNLEITLEAVPVSCTFTDL* 0

>NEWR_anoCar Anolis carolinensis
0 MESYFANTTFHSKITEAADVIVGVFYIVF 1
2 GICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHR 2
1 WLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVY 1
1 GNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPYGTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAK 0
0 LSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLKSCFCPRGMQNCSYRSALEAPLKSFKGRNESSSNSVQIVGGCS
YFPCEKCHDPFECFKNYPKCCQGRLNVMDHTPRESISVENNMQSKTKHASEKYIKVVIRGEKNTDIDNLEITLEHIPTDIKFANL* 0

>NEWR_xenTro abundant transcripts
0 MGNKSDASAFYSSISETDDIVLGVLYSVF 1
2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2
1 WLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAY 1
2 GNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPYGTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIK 0
0 LSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAVVCGWCRTPAVKTPCPHKD
LKTTSKPPSSFKKSQGVCRNCVDTFECFRNYPRCCSVGNVDAAQPMAASLVRIPPANGAPQQTVQLVVSSSRTRSGVETVEVSTEAPMSDFIKDFI* 0

>NEWR_danRer Danio rerio acquired new intron
0 MGNVSKTALFMSTISRQHDILMGSLYSVF 1
2 FVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHR 2
1 WLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNY 1
2 GNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPYGTACCINW 2
1 YTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVK 0
0 LSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQS
SQQCNNKDGSISTPFSSGQAESYGACHVYAEAGPHYQQISRQITARVLEGSVQSEIPVKQLTEKMQNDLL* 0

>NEWR_tetNig Tetraodon nigroviridis gene mix
0 MGNASDTSDAFNSKISKEHDFLIGSIYSVF 1
2 CVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHR 2
1 WLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNL 1
2 GSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPYGTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVK 0
0 LSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIYRGSPLSGPRDRSGGVTQRHKDLSVSTR
LSNGQQDSYGTCLHCAEDAELGHVTGSRRTACILTGSTFTEVTLSQLSATPADLL* 0

>NEWR_takRub Fugu rubripes
0 MGNASEASDIFLSKISKEHDILIGSIYSVF 1
2 GLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHR 2
1 WLFGEITCQLYAMCGVLFGLCSLTNLTALSLVCCLKVCFPNH 1
2 GSRFSSSHARLLVVGVWCYASVFAVGPLVQWGHYGPEPYGTACCIDWRAPNHELSSLSYIVCLFFFCYVLPCATIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHSLIVK 0
0 LSVAVCIGFLGAWSPYAIVAMWAAFGDATWVPPDAFAIAAILAKSSTIYNPVVYLLCKPNFRECLYKDTSTLRQRIYRGSPQSEPRERFGGTSQRHKDLSISTR
LSNGQQDSYGTCLHCADDAERGHVTTSQRTACILTGSTFTEVTVGQLSAAPADLL* 0

>NEWR_gasAcu Gasterosteus aculeatus
0 MGNASDTSAVFASTISKERDILMGSLYSVF 1
2 GVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHR 2
1 WLFGELTCQLYAMCGVLFGLCSLTNLTALSFVCCLKVCFPNH 1
2 GNRFSSSHARLLVVAVWGYASVFAVGPLARWGRYSPEPYGTACCIDWHAPNHELAALSYIVCLFVFCYALPCATIFLSYTFILLTVRGSRQAVQQHVSPQTKTTNTHALIVK 0
0 LSVAVCIGFLGAWTPYAVVAIWAAFGDATLVPPDAFALAAMFAKSSTIYNPVVYLLCKPNFRACLYRDTTLLRQRIYRGSPRSEPKAHFGSTSQRNKDMSVSVRSSNGQQDSYGACTENA
APCHVMTPQRTACILTESTNREVTVSRLADKPQADFL* 0

>NEWR_oryLat Oryzias latipes 
0 MGNVSDTSSLFASSISREHDILMGSLYSVF 1
2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2
1 WLFGEITCQLYAMCGVLFGLSSLTNLTALSLVCCLKVCFPNH 1
2 GNKFSFSHARLLVAGVWCYASVFAVGPLARWGRYSAEPYGTACCIDWHAPNHELWALSYILCLFIFCYALPCTIIFLSYAFILLTVRGSRQAVQQHVSPQTKTTNAHTLIVK 0
0 LSVAVCIGFLGAWTPYAVIAMWAAFGDATQVPPTAFALAAVFAKSSTIYNPMVYLLCKPNFRECLCRDTSLLRHMIYRGSPQPQERFGSDSRRNKDITASTRFSNGQQESYGACLNCTEN
TGLCQLASPQNTACILTGSTYAEVTVQQLVDKQQPDFL* 0

>NEWR_pimPro Pimephales promelas
0 MGNVSETALFVSTISRQHDILMGSLYSVF 1
2 CVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHR 2
1 WLFGEVVCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCCPNY 1
2 GNKFSSNHACVMVIGVWCYASVFAVGPLIRWGSFAPEPYGTACCINWYIPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILLRVRGSRQAVQKHVSPKTKETNAHTLIVK 0
0 LSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK-AQSSQQCKDATISTPFSSGQAESYGTCHIYAEAEPHFQQISPQRTVRILEGIIQSEISVRHMTDRMQNDLL* 0

>NEWR_calMil frag
0  1
2 GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHR 2
1 WLFVSKpCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAY 1
2  0
0 MSVVMIVMFLLAWSPYSIVCLWASFGNPKLIPPAMAIIAPLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV