Opsin evolution: RPE65: Difference between revisions

From genomewiki
Jump to navigationJump to search
Line 122: Line 122:


<pre>
<pre>
>RPE65r_hg8
>hg18_1_
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 138: Line 138:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_panTro
>panTro2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 154: Line 154:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_gorGor
>gorGor1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 170: Line 170:
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_ponAbe
>ponAbe2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 186: Line 186:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_rheMac
>rheMac2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 202: Line 202:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_calJac
>calJac1
MSIQ
MSIQ
vEHPAGGYKKLFETVEELSSPLTAHVT
vEHPAGGYKKLFETVEELSSPLTAHVT
Line 218: Line 218:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_tarSyr
>tarSyr1
MSVQ
MSVQ
VEHPAGGYKKLFETVEELSSPLPAHIT
VEHPAGGYKKLFETVEELSSPLPAHIT
Line 234: Line 234:
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_micMur
>micMur1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLMAHVT
VEHPAGGYKKLFETVEELSSPLMAHVT
Line 244: Line 244:
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKg
VWLHIADKKRRKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLSNTTATAILCSDETIWLEPEVLFSGPRQ
ADTGKNLVTLSNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
Line 249: Line 250:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFRKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFRKS*


>RPE65r_otoGar
>otoGar1
MSMQ
MSMQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 265: Line 266:
GVILSVVVSPGPGQKPAYLLVLNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVILSVVVSPGPGQKPAYLLVLNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_tupBel
>tupBel1
MSIQ
MSIQ
VEHPAAGYKKLFETVEELSSPLTAHVT
VEHPAAGYKKLFETVEELSSPLTAHVT
Line 281: Line 282:
GVVLSVVVSPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_mm9
>mm9_1_1
MSIQ
MSIQ
IEHPAGGYKKLFETVEELSSPLTAHVT
IEHPAGGYKKLFETVEELSSPLTAHVT
Line 297: Line 298:
GVVLSVVVSPGAGQKPAYLLVLNAKDLSEIARAEVETNIPVTFHGLFKRS*
GVVLSVVVSPGAGQKPAYLLVLNAKDLSEIARAEVETNIPVTFHGLFKRS*


>RPE65r_rn4
>rn4_1_1
MSIQ
MSIQ
IEHPAGGYKKLFETVEELSTPLTAHVT
IEHPAGGYKKLFETVEELSTPLTAHVT
Line 313: Line 314:
GVVLSVVVSPGAGQKPAYLLVLNAKDLSEIARAEVETNIPVTFHGLFKKP*
GVVLSVVVSPGAGQKPAYLLVLNAKDLSEIARAEVETNIPVTFHGLFKKP*


>RPE65r_dipOrd
>dipOrd1
MSIQ
MSIQ
IEHPAGGYKKLFETVEELSTPLTAHVT
IEHPAGGYKKLFETVEELSTPLTAHVT
Line 329: Line 330:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEIARAEVEINIPVTFHGLFKKP*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEIARAEVEINIPVTFHGLFKKP*


>RPE65r_cavPor3
>cavPor3
MSIQ
MSIQ
VEHPASGYRKLFETVEELSSPLTAHVT
VEHPASGYRKLFETVEELSSPLTAHVT
Line 345: Line 346:
gVVLSVVVSPGPGQKPASLLILNAKDLSEVARAEVEVNIPVTFHGLFKKA*
gVVLSVVVSPGPGQKPASLLILNAKDLSEVARAEVEVNIPVTFHGLFKKA*


>RPE65r_speTri
>speTri1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 361: Line 362:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEIARAEVEINIPVTFHGLFKKP*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEIARAEVEINIPVTFHGLFKKP*


>RPE65r_oryCun
>oryCun1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 377: Line 378:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_ochPri
>ochPri2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 393: Line 394:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_vicPac
>vicPac1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 409: Line 410:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_turTru
>turTru1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 425: Line 426:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_bosTau4
>bosTau4
MSSQ
MSSQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 441: Line 442:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_equCab
>equCab2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 457: Line 458:
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_felCat3
>felCat3
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 473: Line 474:
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_canFam
>canFam2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 489: Line 490:
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_myoLuc
>myoLuc1
MSIQ
MSIQ
VEHPAGGYKKLYETVEELSSPVTAHVT
VEHPAGGYKKLYETVEELSSPVTAHVT
Line 505: Line 506:
GVVLSVVVSPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVVSPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_pteVam
>pteVam1
MSIQ
MSIQ
VEHPASGYKKLFETVEELSSPLTAHVT
VEHPASGYKKLFETVEELSSPLTAHVT
Line 515: Line 516:
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHVADKKRRKYLNNKYRTSPFNLFHHINTFEDNEFLIVDLCCWKg
VWLHVADKKRRKYLNNKYRTSPFNLFHHINTFEDNEFLIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNASKAPQPEVRRYVLPLNIDK
ADTGKNLVSLPNTTATAILCSDETIWLEPEVLFSGPRQ
ADTGKNLVSLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPYSYPSEPIFVSHPDALEEDD
LCKLNVKTKETWVWQEPYSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEVNIPVTFHGLFKKS*
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEVNIPVTFHGLFKKS-


>RPE65r_eriEur
>eriEur1
MSAQ
MSAQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 536: Line 538:
GVVLSVVISPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVISPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_sorAra
>sorAra1
MSAQ
MSAQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 552: Line 554:
GVVLSVVISPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSVVISPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_loxAfr
>loxAfr2
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 568: Line 570:
gVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_proCap
>proCap1
MSIQ
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 584: Line 586:
GVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_echTel
>echTel1
MSIQ
MSIQ
MEHPAGGYKKLFETVEELSSPLTAHVT
MEHPAGGYKKLFETVEELSSPLTAHVT
Line 600: Line 602:
GVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
GVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_dasNov
>dasNov2
MSIQ
MSIQ
IEHPAGGYKKLFETVEELSSPLTAHVT
IEHPAGGYKKLFETVEELSSPLTAHVT
Line 610: Line 612:
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYLDCFESNETMG
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYLDCFESNETMG
VWLHIADKKRRMYLNNKYRTSSFNLFHHINTYEDNGFVIVDLCCWKg
VWLHIADKKRRMYLNNKYRTSSFNLFHHINTYEDNGFVIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNINK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
Line 615: Line 618:
gVVLSVVVSPGAGQKPAYLLVLNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSVVVSPGAGQKPAYLLVLNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_choHof
>choHof1
MSIQ
MSIQ
IEHPAGGYKKLFETVEELSSPLTAHVT
IEHPAGGYKKLFETVEELSSPLTAHVT
Line 625: Line 628:
FGLTPNYIVFVETPVKINLFKFLSVWSLWGANYMDCFESNETMG
FGLTPNYIVFVETPVKINLFKFLSVWSLWGANYMDCFESNETMG
VWLHIADKKRRMYLNNKYRTSSFNLFHHINTYEDNGFVIVDLCCWKg
VWLHIADKKRRMYLNNKYRTSSFNLFHHINTYEDNGFVIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNINK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
Line 630: Line 634:
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*


>RPE65r_monDom4
>monDom4
MSHQ
MSHQ
VEHPAGGYKKLFETVEELSSPLPAHVR
VEHPAGGYKKLFETVEELSSPLPAHVR
Line 646: Line 650:
GVVLSVVVSPGAGQKPACLLVLNAKDMTEVARAEVETNIPVTFHGLFRRS*
GVVLSVVVSPGAGQKPACLLVLNAKDMTEVARAEVETNIPVTFHGLFRRS*


>RPE65r_ornAna
>ornAna1
MSNQ
MSNQ
VEHPAGGYKKLFETVEELSSPLTAHVT
VEHPAGGYKKLFETVEELSSPLTAHVT
Line 662: Line 666:
GVVLSVVVNPGPGQKPAYLLILNAKDMSEVARAEVEINIPVTFHGLFRKS*
GVVLSVVVNPGPGQKPAYLLILNAKDMSEVARAEVEINIPVTFHGLFRKS*


>RPE65r_galGal3
>galGal3
MSSQ
MSSQ
VEHPAGGYKKLFETAEELSSPVTAHVT
VEHPAGGYKKLFETAEELSSPVTAHVT
Line 678: Line 682:
GVVLSIVISPGSGPKPAYLLILNAKDMSEVARAEVEVNIPVTFHGLFKRA*
GVVLSIVISPGSGPKPAYLLILNAKDMSEVARAEVEVNIPVTFHGLFKRA*


>RPE65r_taeGut
>taeGut1
MSSQ
MSSQ
VEHPAGGYKKLFETAEELSSPVTAHVT
VEHPAGGYKKLFETAEELSSPVTAHVT
Line 694: Line 698:
GVVLSIVISPGSGPKPAYLLILNAKDMSEVARAEVEVNIPVTFHGLFKRA*
GVVLSIVISPGSGPKPAYLLILNAKDMSEVARAEVEVNIPVTFHGLFKRA*


>RPE65r_anoCar
>anoCar1
MSSQ
MSSQ
VEHPAGGYKKLFETVEELATPITTHVT
VEHPAGGYKKLFETVEELATPITTHVT
Line 710: Line 714:
GVVLSIIVSPGNGPKPAYLLILSAKDMSEVARAEVDINIPVTFHGFFKKA*
GVVLSIIVSPGNGPKPAYLLILSAKDMSEVARAEVDINIPVTFHGFFKKA*


>RPE65r_xenTro
>xenTro2
MTNH
MTNH
VEHPAGGYKKLFETAEELATPMATHVT
VEHPAGGYKKLFETAEELATPMATHVT
Line 726: Line 730:
GIVLSAVISPAVGHKPSYLLILDAKDMSEIARAEVDTIIPVTFHGMFKRA*
GIVLSAVISPAVGHKPSYLLILDAKDMSEIARAEVDTIIPVTFHGMFKRA*


>RPE65r_tetNig
>tetNig1
----
----
VEHPAGGYKKIFETVEELDEPVAAKIS
VEHPAGGYKKIFETVEELDEPVAAKIS
Line 742: Line 746:
GVLLSVVVKPGA--RPGFLLVLDAKTLSELGRAEVSVNIPVTLHGMFKLT*
GVLLSVVVKPGA--RPGFLLVLDAKTLSELGRAEVSVNIPVTLHGMFKLT*


>RPE65r_fr
>fr2_1_1
----
----
VEHPAAGYKKIFETVEELNEPIAATIS
VEHPAAGYKKIFETVEELNEPIAATIS
Line 758: Line 762:
GVLLSVVVKPGA--RPGSLLVLDAKTLTEVGRAEVSVNIPVTLHGMFKLT*
GVLLSVVVKPGA--RPGSLLVLDAKTLTEVGRAEVSVNIPVTLHGMFKLT*


>RPE65r_gasAcu
>gasAcu1
----
----
VEHPAAGYKKIFETVEELNEPIPARVI
VEHPAAGYKKIFETVEELNEPIPARVI
Line 774: Line 778:
GVLLSIVVKPGA--RPSFLLVLDAVKLTELARAEVNTIIPVTLHGTYKlt*
GVLLSIVVKPGA--RPSFLLVLDAVKLTELARAEVNTIIPVTLHGTYKlt*


>RPE65r_oryLat
>oryLat2
----
----
VEHPAAGYKKIFETVEELDEPIPAQIS
VEHPAAGYKKIFETVEELDEPIPAQIS
Line 790: Line 794:
GVLLSIVVKPGT--RPGFLLVLDAMKLTEVARAEVNAIIPVTLHGLYKMT*
GVLLSIVVKPGT--RPGFLLVLDAMKLTEVARAEVNAIIPVTLHGLYKMT*


>RPE65r_danRer5
>danRer5
----
----
LEHPAGGYKKVFESCEELAEPIPAHVS
LEHPAGGYKKVFESCEELAEPIPAHVS
Line 805: Line 809:
FCKLNVKSKETWIWQEPDAYPSEPLFVQSPDAEDEDD
FCKLNVKSKETWIWQEPDAYPSEPLFVQSPDAEDEDD
GVLLSIVVKPGA--RPAFLLILKATDLTEIARAEVDVLIPLTLHGIYTLt*
GVLLSIVVKPGA--RPAFLLILKATDLTEIARAEVDVLIPLTLHGIYTLt*
</pre>
</pre>


[[Category:Comparative Genomics]]
[[Category:Comparative Genomics]]

Revision as of 14:09, 7 January 2009

Introduction to the RPE65 gene family

This gene family is critical to the metabolism of provitamin A dietary carotenoids and thus to retinoic acid signaling and the recycling of all-trans retinal needed to replenish vertebrate ciliary opsins after exposure to light. RPE65, as the illegimate gene name suggests, is a protein of 65 kdaltons found in the retinal pigment epithelium adjacent to photoreceptor layers.

In vertebrates including human, RPE65 has two full-length paralogs on different chromosomes, BCMO1 (ß-carotene-15,15'-oxygenase cleaving symmetricaly to yield two retinals) and BCO2 (ß-carotene-9',10'-oxygenase cleaving asymmetricaly to yield a retinoic acid precursor and also cleaving lycopene).

The nomenclature of these three genes violates the spirit and letter of international agreement for naming human gene families, yet the responsible committee has approved the names! While an improvement over monstrosities such as B-DIOX-II and BCDO2 beta-carotene dioxygenase (for a mono-oxygenase!), it should be noted that RPE65 is 61 kd and expressed in a great many other tissues (such as skin) beyond retinal pigment epithelium (where BCMO1 also occurs). A more appropriate nomenclature would be BCO1S, BCO2A, and BCO3R, where the letters denote symmetric, asymmetric, and retinal.

This gene family is readily tracked back to bacteria where a 3D structure has been determined that suffices to model the entire family in all species. Four invariant histidines that hold the catalytic ferrous iron lie on the axis of a seven-bladed beta-propeller fold. The Fe2+ is accessible to carotenoids via a long nonpolar tunnel capable of promoting cis-trans double bond conversions. RPE65 shares all these features even though it is merely an isomerase.

Since this complex structure consists of a single coherent domain fold, alternative coding splices can be dismissed out of hand as transcriptional noise not leading to functional enzyme. The domain structure implies all homologs from all species must be full length or around 550 amino acids, which is validated below over tens of billions of years of evolutionary branch length. That conclusion must be qualified to the extent that N- and C-terminal extensions could be trimmed post-translationally and small indels not affecting the propellers might be tolerated.

REP65 structure.jpg

RPE65 is a well-known disease gene for retinitis pigmentosa and type II Leber congenital amaurosis. The other two are not currently associated with human disease but have various consequences in knockout mice consistent with enzymatic expectations.

Retinol dehydrogenase RDH5 is associated with fundus albipunctatus; LRAT lecithin retinol acyltransferase with severe, early-onset retinal dystrophy.

The confused history of RPE65

RPE65 was the subject of recent studies with serious experimental and interpretive error, such as attributing its enzymatic activity to LRAT (ecithin:retinol acyl transferase), assigning palmitolyation to various cysteine residues that did not occur, and attributing its (non-passaging) membrane association to these lipids when in fact that is intrinsic, and misunderstanding enzyme mechanism.

While this never made sense in view of full-length alignment of RPE65 with its two well-studied paralogous carotenoid oxygenases, the most interesting aspect of the debacle is the light it sheds on the multi-year delay cycle in updates at primary hub databases such as RefSeq, SwissProt, and OMIM. None of these have been corrected 18 months later. It's not clear when they will ever be revisited. Consequently hundreds of derived databases such as UCSC perpetuate serious misunderstandings about this important but straighforward disease gene and the visual pigment cycle in vertebrates.

The current view of retinal cycling in vertebrate eyes envisions the primary photosensitive event as isomerization of opsin-bound 11-cis retinal to all-trans retinol. After importation to the adjacent retinal pigment epithelia, that is esterified by LRAT palmitolyation and subsequently isomerized by RPE65 to 11-cis retinol, which must then be oxidized to 11-cis retinal by RDH5 before it can be re-exported back to the retina and re-form its Schiff base in a newly recharged opsin molecule. This system seems quite bizarre in comparison to self-recharging insect melanopsins.

RPE65 cycle.jpg

Reference gene collection: non-teleosts

Here the primary focus is obtaining reliable full length genes for the three members of this gene family from early diverging species (rather than standard teleost sequences more closely related to human), the idea being to possibly correlate gene expansions of the RPE65 family with the origin of imaging vision in deuterostomes. However partial sequences are also of valuable for purposes of establishing paralog numbers and presence.

Here it must be immediately noted that many Gnomon predicted sequences at GenBank are so deeply flawed that their uncritical use would hopelessly taint any comparative genomics effort. Without transcripts and given the rates of divergence, it is quite difficult to recover full length genes by blast into incomplete and sometimes garbled assemblies. Of the early deuterostome invertebrates, only Ciona has an adequate (tileable) set of transcripts vis-a-vis this gene family. Yet tunicates have lost one family member altogether, BCO2.

This gene family is quite unusual in that new members are quite difficult to classify by blast clustering relative to a reference collection. The match qualities tend to be quite similar, possibly attributable to gap issues and regions of uncertain and unpersuasive alignment. Here it is best to classify by individual exons because each paralog is intronated distinctively.

Because intronation patterns are strongly conserved from human back to pre-bilaterans (eg cnidarian and placamorpha), the fact that the three paralogs are intronated quite differently implies gene duplication and divergence occured very early, prior to the main era of intron establishment in early pre-metazoan eukaryotes. This is consistent with full length counterparts with conserved 3D structure already existing in bacteria.

This in turn implies that loss of one or more family members has occured in many lineages. For example, arthropods (notably many insect species) contain but a single gene copy (denoted NinaB), lophotrochozoan genomes have none (despite ciliary opsins), and tunicates have two. Gene expansions have also occured in species such as Nematostella yet no expansion occured in deuterostomes despite supposed 1R and 2R whole genome expansions.

>RPE65_homSap length=547 14 exons
0 MSIQ 21 VEHPAGGYKKLFETVEELSSPLTAHVT 1
2 GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR 2
1 FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR 2
1 FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ 0
0 VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA 1
2 DKEDPISKSEIVVQFPCSDRFKPSYVHS 21 FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG 0
0 VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG 21 FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK 0
0 ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ 1
2 AFEFPQINYQKYCGKPYTYAYGLGLNHFVPDR 0
0 LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD 1
2 GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS* 0

>BCMO1_homSap length=547 11 exons 51,666 bp
0 MDIIFGRNRKEQLEPVRAKVT 1
2 GKIPAWLQGTLLRNGPGMHTVGESRYNHWFDGLALLHSFTIRD 1
2 GEVYYRSKYLRSDTYNTNIEANRIVVSEFGTMAYPDPCKNIFSK 2
1 AFSYLSHTIPDFTDNCLINIMKCGEDFYATSETNYIRKINPQTLETLEK 0
0 VDYRKYVAVNLATSHPHYDEAGNVLNMGTSIVEKGKTKYVIFKIPATVP 1
2 EGKKQGKSPWKHTEVFCSIPSRSLLSPSYYHSFGVTENYVIFLEQPFRLDILKMATAYIRRMSWASCLAFHREEK 0
0 TYIHIIDQRTRQPVQTKFYTDAMVVFHHVNAYEEDGCIVFDVIAYEDNSLYQLFYLANLNQDFKENSRLTSVPTLRRFAVPLHVDK 0
0 NAEVGTNLIKVASTTATALKEEDGQVYCQPEFLYE 1
2 GLELPRVNYAHNGKQYRYVFATGVQWSPIPTK 0
0 IIKYDILTKSSLKWREDDCWPAEPLFVPAPGAKDEDD 1
2 GVILSAIVSTDPQKLPFLLILDAKSFTELARASVDVDMHMDLHGLFITDMDWDTKKQAASEEQRDRASDCHGAPLT* 0

>BCO2_homSap length=579 12 exons alt leader peptide not shown
0 MGNTPQKKAVFGQCRGLPCVAPLLTTVEEAPRGISARVWGHFPKWLNGSLLRIGPGKFEFGKDK 2
1 YNHWFDGMALLHQFRMAKGTVTYRSKFLQSDTYKANSAKNRIVISEFGTLALPDPCKNVFERFMSRFELPGKAA 1
2 AMTDNTNVNYVRYKGDYYLCTETNFMNKVDIETLEKTEK 0
0 VDWSKFIAVNGATAHPHYDLDGTAYNMGNSFGPY 1
2 GFSYKVIRVPPEKVDLGETIHGVQVICSIASTEKGKPSYYHSF 1
2 GMTRNYIIFIEQPLKMNLWKIATSKIRGKAFSDGISWEPQCNTRFHVVEKRTGQ 0
0 LLPGRYYSKPFVTFHQINAFEDQGCVIIDLCCQDNGRTLEVYQLQNLRKAGEGLDQ 0
0 VHNSAAKSFPRRFVLPLNVSLNAPEGDNLSPLSYTSASAVKQADGT 0
0 IWCSHENLHQEDLEKEGGIEFPQIYYDRFSGKKYHFFYGCGFRHLVGDSLIKVDVVNKTLK 0
0 VWREDGFYPSEPVFVPAPGTNEEDGGVILSVVITPNQ 0
0 NESNFILVLDAKNFEELGRAEVPVQMPYGFHGTFIPI* 0

>RPE65_cioInt 40% but about equally similar, slightly different intronation
0 MFAIQRRPFFAVFRNFNKMSAPSKTKSYVKLLQKAEERANAECVVT 1
2 GCIPEWLNGDVLRNGPAEFDIGPDTFKHWFDGHALLHK 2
1 FSMFEGKVTYSSKFLRSGTYKTNHENSRIIIGEFGTASRPDPCKNMFSR 2
1 FFTNFVEIAPRSDNANVSVAQLGEAYYAITDGPTAYGFDPETLETKNLITDCGPANMTVTAAHPHY 1
2 DRNGDYLNLGTTFGRTPHYHVIKVPAAKMTSPDPMNELEVFMKFPSTTSNASYHHS 2
1 FGLSENWIIFHEQPFSFSTPKLLIGLKLWNPILSSFYEDKQTIS 0
0 FHIINKTTGEKIATKYEARGMFCFHHINAYETKENDGKRFIVVDMCGSDRSLVWLL 2
1 GLDTLLDEEAHDKVVSNLDEKYLTRPRRIVIPLDISSDTPN 1
2 DTNLVTIPGCKATAMLNKSGVVSLTYELLVPDDFPNTELGIELPRINYDGYNGREYK 2
1 FIYAISSEYILPSHLVKINVETKEIKYWKEK 0
0 DKYTSEPIFVPRPGSQDEDDGVVLSTVISPTDDKTFLLILDGQSFKEIARAE 0
0 IETKMSYPLHGLFSK* 0

>BCMO1_cioInt 525 aa tiled transcripts agree with genomic
0 MDFPVSAFPHLTALATTKNIEYAEAVQGKVQ 1
2 GEVPSWLNGSWYRNGPGVVHFREESVKHWFDGMALARK 2
1 FCIEDGKVSYMSRLVDGESLQKNTAAGRVVVAEFGTTTHSEGFLGR 2
1 VKSALTMPEFTDNCLINFMNLGDHLFAITESNFIRQIDPVTLDTKDK 0
0 VDLAKHLPINIMSSHPLVDGEGNVYTFSSSIFNMGRTKYNLLKFPAAAP 1
2 GTPLETILSQSESICSIDSSWRVSPSYHHSFAMSEKYAVFVEMPLKIDIPKMAVAHLRHMCYSDCIEVLEDTK 0
0 TRIYLVNKETGKQHPITFLCDPLIVYHHVNAYDDGDHVVLDLSCYKKNSFYDKFTMSNLEKTPQEFSKLFDSDEQAVKAMRIVLPLANDS 0
0 KTTGNLVSVANTSCTAEFQGNNIFCTSEMLSVGTECAVINNKYIGKKYKYFYSPGGLKLPPGEM 0
0 LTKIDVETKQRVQTWQEKGCWASQPVFVAKPGATQEDE 1
2 GILMSSVVNENGNPFLLMLDAKSFTEVARIHFDANIPPDVHGVFVPKA* 0

Regularized reference gene collection: teleost RPE65

It is easy to compile large sets of bony vertebrate exons for these three genes using the 44-species genomic alignments at UCSC. To do this, look up the human gene by name at GeneSorter (or blat in a reference sequence) and click on "protein fasta" on the gene details page. Among the various output configurations provided, select all species checking the options as below. Note these alignments, being exon-aware, are intrinsically homological unlike blast alignments in gappy regions which lack this local constraint.

44way.jpg

For some species, no data is available (shown as dashes). For others, some exons or parts of exons are misssing. This can be due to incompleteness of the respective genome or technical difficulties at exon edges due to split codons. To facilitate uniform comparison of paralogs, the output can be 'regularized' by filling in missing data using the nearest species in the taxonomic sense, for example using chicken data if finch is missing an exon. Most human genes can be completely regularized back to teleost fish. In some cases, notably frog, it may be necessary to stub in an orthologous region from salamander cDNA.

This introduces artefacts that have the effect of understating comparative genomics variability at the locus but that error is offset by the benefits of having 'complete' copies of the three genes in the same set of 44 species. As long as regularization is limited (topologically) to species diverging off the same node (eg lizard could regularize finch and chicken but not frog or platypus), the impact on gene history at the ancestral nodes leading to human will be very minimal. The regularized set works quite well in a blast classificatory tool.

The sequences for RPE65, BCMO1, and BCO2 have been completely regularized in this manner below. This allows comparison of variation by various techniques such as percent identity at nodes (ie averaging human matches to all species coming off that node if ancestral sequences are not computed) and residue-by-residue conservation by Multalin and similar tools.

Regularization can be done by hand in a spreadsheet using fill-down and similar commands. Begin by creating a template whose first column consists of the 44 species. Color rows by subclade as a visual assist to filling in using tree topology as constraint. Now copy the exons, replace all Z by *, replace double carriage returns by a placeholder, replace remaining carriage returns by tabs, replace placeholder with single carriage return, place into spreadsheet, and reverse rows and columns. Now each column is a separate exon and guided regularization can begin.

>hg18_1_
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>panTro2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFS-PRQ
AFEFPQINYQKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>gorGor1
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>ponAbe2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>rheMac2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLITLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>calJac1
MSIQ
vEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYFNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYRKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKEIWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>tarSyr1
MSVQ
VEHPAGGYKKLFETVEELSSPLPAHIT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEITDNALVNIYPVGEDYYACTETNYITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYRKYCGKPYTYAYGLGLNHFVPDR
LCKLNVKTKEIWVWQEPDSYPSEPIFVSHPDALEEDD
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>micMur1
MSIQ
VEHPAGGYKKLFETVEELSSPLMAHVT
GRIPLWLSGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
dKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLSNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFRKS*

>otoGar1
MSMQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLSGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCRNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKVNPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNESMG
VWLHIADKKRRKYINNKYRTSPFNLFHHINTYEDSGFLIVDLCCWKG
FEFVYNYLYLANLRGNWDEVKKSARKAPQPEVRRYVLPLSMDK
ADTGKNLVTLPHTTATAVLCSDDTIWLEPEVLFSGPRQ
AFEFPQINYQKYGGRPYTYAYGLGLNHFVPDR
LCKLNVRTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVILSVVVSPGPGQKPAYLLVLNAKDLSEVARAEVEINIPVTFHGLFKKS*

>tupBel1
MSIQ
VEHPAAGYKKLFETVEELSSPLTAHVT
GRIPLWLSGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
dKDDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSSFNLFHHINTYEDNEFLVVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTFAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>mm9_1_1
MSIQ
IEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFKGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYISVNGATAHPHIESDGTVYNIGNCFGKNFTVAYNIIKIPPLKA
DKEDPINKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNESMG
VWLHVADKKRRKYFNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKRNAMKAPQPEVRRYVLPLTIDK
VDTGRNLVTLPHTTATATLRSDETIWLEPEVLFSGPRQ
AFEFPQINYQKFGGKPYTYAYGLGLNHFVPDK
LCKMNVKTKEIWMWQEPDSYPSEPIFVSQPDALEEDD
GVVLSVVVSPGAGQKPAYLLVLNAKDLSEIARAEVETNIPVTFHGLFKRS*

>rn4_1_1
MSIQ
IEHPAGGYKKLFETVEELSTPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYYRK
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFKGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFTVAYNIIKIPPLKA
DKEDPINKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNESMG
VWLHVADKKRRKYFNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKRNAMKAPQPEVRRYVLPLTIDK
ADTGRNLVTLPHTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKCGGKPYTYAYGLGLNHFVPDK
LCKLNVKTKEIWMWQEPDSYPSEPIFVSQPDALEEDD
GVVLSVVVSPGAGQKPAYLLVLNAKDLSEIARAEVETNIPVTFHGLFKKP*

>dipOrd1
MSIQ
IEHPAGGYKKLFETVEELSTPLTAHVT
gRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRK
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFKGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCKYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTSNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWIHIAHKRQRKYLNNKYRTSSFNLFHHINTYEDDGFLIVDLCCWKG
FEFVYNYLYLSNLRENWEEVKRNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPNSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEIARAEVEINIPVTFHGLFKKP*

>cavPor3
MSIQ
VEHPASGYRKLFETVEELSSPLTAHVT
GRVPLWLTGSLLRCGPGLFEVGAEPFYHLFDGQALLHKFDFKEGHVTYYRR
FIRTDAYVRAMTEKRIVITEFGTYAYPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITRINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIIKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGTNYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDREFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNASKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPNSYPSEPIFVSHPDALEEDD
gVVLSVVVSPGPGQKPASLLILNAKDLSEVARAEVEVNIPVTFHGLFKKA*

>speTri1
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPVWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGTNYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDREFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPNSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEIARAEVEINIPVTFHGLFKKP*

>oryCun1
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
gRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSPFNLFHHINTYEENDFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNFDK
ADTGKNLVTLPNTTDTAILCSDETIWLEPEVLFSGPRQ
aFELPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>ochPri2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRK
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSPFNLFHHINTYEDNDFLVVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTDTAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>vicPac1
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNEFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>turTru1
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYINNKYRTSPFNLFHHINTYEDSEFLVVDLCCWKG
FEFVYNYLYLANLRENWEEVKRNARKAPQPEVRRYVLPLNTDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>bosTau4
MSSQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKVNPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYINNKYRTSPFNLFHHINTYEDHEFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>equCab2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKVNPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSSFNLFHHINTYEENEFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>felCat3
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
gRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
dKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSSFNLFHHINTYEENEFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>canFam2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLLKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRKKYLNNKYRTSSFNLFHHINTYEDNEFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATATLRSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>myoLuc1
MSIQ
VEHPAGGYKKLYETVEELSSPVTAHVT
GRIPFWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHkFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEITDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHVADKKRRKYLNNKYRTSPFNLFHHINTYEDNEFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNASKAPQPEVRRYVLPLNIDK
ADTGKNLVTLSNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYGGKPYTYTYGLGLNHFVPDR
LCKLNVKTKETWVWQEPDSYPSEPIFVSHPEALEEDD
GVVLSVVVSPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>pteVam1
MSIQ
VEHPASGYKKLFETVEELSSPLTAHVT
GRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYYRR
FIRTDAYVRAMTEKRIVITEFGTYAFPDPCKNIFSR
FFSYFRGVEITDNALVNVYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
dKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHVADKKRRKYLNNKYRTSPFNLFHHINTFEDNEFLIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNASKAPQPEVRRYVLPLNIDK
ADTGKNLVSLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEPYSYPSEPIFVSHPDALEEDD
GVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEVNIPVTFHGLFKKS-

>eriEur1
MSAQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRVPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKVNPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSVAYNIVKIPPLQA
DKEDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNFIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHVADKKRRKYLNNKYRTSPFNLFHHINTFEDNEFLIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLTIDK
ADTGKNLVTLSNTTATAILCSDETVWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKEIWIWQEPESYPSEPIFVSHPDALEEDD
GVVLSVVISPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>sorAra1
MSAQ
VEHPAGGYKKLFETVEELSSPLTAHVT
gRIPVWITGSLLRCGSGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIIKIPPLQA
DKEDPISKSEIVVQFPCSDRFKPSYVHS
FGLTPNFIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHVADKKRRKYLNNKYRTSPFNLFHHINTFEDNEFLIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ANTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYRTYGGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEADSYPSEPIFVAHPDALEEDD
GVVLSVVISPGTGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>loxAfr2
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
gRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTENRIVITEFGTCAFPDPCKNIFSR
FFSYFKGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKQDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWG-DYMDCFESNENMG
VWLHIADKKRRKYLNNKYRTSSFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEADSYPSEPIFVSHPEAMDEDD
gVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>proCap1
MSIQ
VEHPAGGYKKLFETVEELSSPLTAHVT
gRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSR
FFSYFKGVEITDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
dKQDPISKSEIVVQFPCSDRFKPSYVHs
FGLTPNYIVFVETPVKINLFKFLSSWSLWG-DYMDCFESNENMG
VWLHIADKKRRKYLNNKYRTSSFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEADSYPSEPIFVSHPEAMDEDD
GVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>echTel1
MSIQ
MEHPAGGYKKLFETVEELSSPLTAHVT
GRIPLWLSGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FIRTDAYVRAMTEKRIVLTEFGTCAFPDPCKNIFSR
FFSYFRGVEITDNALVNIYPVGEDYYACTETNLITKVNPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKQDPISKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHIADKKRRKYLNNKYRTSSFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNINK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEADSYPSEPIFVSHPEAMDEDD
GVVLSIVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>dasNov2
MSIQ
IEHPAGGYKKLFETVEELSSPLTAHVT
gRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRK
FIRTDAYVRAMTEKRIVLTEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPINKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSSWSLWGANYLDCFESNETMG
VWLHIADKKRRMYLNNKYRTSSFNLFHHINTYEDNGFVIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNINK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEADSYPSEPIFVSHPEAMDEDD
gVVLSVVVSPGAGQKPAYLLVLNAKDLSEVARAEVEINIPVTFHGLFKKS*

>choHof1
MSIQ
IEHPAGGYKKLFETVEELSSPLTAHVT
GRMPLWLTGSLLRCGPGLFEvGSEPFYHLFDGQALLHKFDFKEGHVTYHRK
FIRTDAYVRAMTEKRIVLTEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYVSVNGATAHPHIESDGTVYNIGNCFGKNFSIAYNIVKIPPLQA
DKEDPINKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLFKFLSVWSLWGANYMDCFESNETMG
VWLHIADKKRRMYLNNKYRTSSFNLFHHINTYEDNGFVIVDLCCWKg
FEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNINK
ADTGKNLVTLPNTTATAILCSDETIWLEPEVLFAGPRQ
AFEFPQINYQKYSGKPYTYAYGLGLNHFVPDR
LCKLNVKTKETWVWQEADSYPSEPIFVSHPEAMDEDD
gVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS*

>monDom4
MSHQ
VEHPAGGYKKLFETVEELSSPLPAHVR
GRIPPWLAGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKDGRVTYHRR
FIRTDAYVRAMTEKRIVLTEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYISINGMTAHPHIENDGTVYNIGNCFGKNFAIAYNIVKIPPLQA
dKEDPMNKSEVVVQFPCSDRLKPSYVHS
FGLTPNYIVFVETPVKINLLKFLSSWTLWGANYMDCFESNETMG
VWLHVADKKRGKYLNNKYRTSSFNLFHHINTYEEDNFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKRNAKKAPQPEVRRYVLPLAINK
ADTGKNLVSLPNTTAAAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYAFAYGLGLNHFVPDR
LCKLNVVTKETWLWHEPDSYPSEPIFVSHPDAQEEDD
GVVLSVVVSPGAGQKPACLLVLNAKDMTEVARAEVETNIPVTFHGLFRRS*

>ornAna1
MSNQ
VEHPAGGYKKLFETVEELSSPLTAHVT
GRIPVWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRK
FIRTDAYVRAMTEKRIVLTEFGTCAFPDPCKNIFSR
FFSYFRGVEVTDNALVNIYPVGEDYYACTETNFITKINPETLETIKQ
VDLCNYISINGMTAHPHIENDGTVYNIGNCFGKNFAIAYNIVKIPPLQA
DKEDPINKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPIKINLFKFLSSWSLWGANYMDCFESNETMG
VWLHVADKNQGRCLNHKYRTSSFNLFHHINTYEDNGFLIVDLCCWKG
FEFVYNYLYLANLRENWEEVKRNAKKAPQPEVRRYVLPLNITK
ADTGKNLVSLPYTTATAILCSDETIWLEPEVLFSGPRQ
AFEFPQINYQKYGGKPYTYTYGLGLNHFVPDR
LCKLNVKTKDTWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSVVVNPGPGQKPAYLLILNAKDMSEVARAEVEINIPVTFHGLFRKS*

>galGal3
MSSQ
VEHPAGGYKKLFETAEELSSPVTAHVT
GRIPTWLRGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
VGRTDAYVRAMTEKRIVITEFGTYAYPDPCKNIFSR
FFSYFKGVEVTDNALVNVYPVGEDYYACTETNFITKINPDTLETIKQ
VDLCKYVSVNGATAHPHVENDGTVYNIGNCFGKNFSLAYNIIRIPPLQA
DKEDPMNKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLLKFLSSWSLWGANYMDCFESNETMG
VWLHVAEKKKGRLLNIKYRTSAFNLFHHINTFEDNGFLIVDLCTWKG
FEFVYNYLYLANLRANWDEVKKQAEKAPQPEARRYVLPLRIDK
ADTGKNLVTLPYTTATATLRSDETVWLEPEVIFSGPRH
AFEFPQINYKKYGGKPYTYTYGLGLNHFVPDR
lCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDD
GVVLSIVISPGSGPKPAYLLILNAKDMSEVARAEVEVNIPVTFHGLFKRA*

>taeGut1
MSSQ
VEHPAGGYKKLFETAEELSSPVTAHVT
GRIPTWLRGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRR
FVRTDAYVRAMTEKRIVITEFGTYAYPDPCKNIFSR
FFTYFKGVEVTDNALVNVYPVGEDYYACTETNFITRINPDTLETIKQ
VDLSKYVSVNGATAHPHIENDGTVYNIGNCFGKNFSLAYNIIRIPPLQA
DKEDPINKSEVVVQFPCSDRFKPSYVHS
FGLTPNYIVFVETPVKINLLKFLSSWSLWGANYMDCFESNETMG
VWLHVAEKKKGRLLNLKFRTSAFNLFHHINTYEDNGFLIVDLCTWKG
FEFVYNYLYLANLRANWDEVKRQAEKAPQPEARRYVLPLSIDK
ADTGKNLVTLPYTTATATLRSDETIWLEPEVIFSGPRH
AFEFPQINYTKYGGKPYTYTYGLGLNHFVPDR
lCKLNVKTKETWVWQEPDAYPSEPIFVSHPDALEEDD
GVVLSIVISPGSGPKPAYLLILNAKDMSEVARAEVEVNIPVTFHGLFKRA*

>anoCar1
MSSQ
VEHPAGGYKKLFETVEELATPITTHVT
GRIPVWLRGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFEIKEGHVTYYRR
FIRTDCYVRAITEKRIVITEFGTYAYPDPCKNIFSR
FFTYFQGVEVTDNALVNVYPVGEDFYACTETNFLTKINPDNLETLKK
VDISKIVSVNGVTAHPHIENDGTVYNIGNCFGKNFSIAYNIVRIPPLQA
DRKDPMTKCEVVVQFPCSDRFKPSYVHS
FGLTPNYLVFVETPVKINLLKFLSSWSLWGANYMDCFESNESMG
VWMHVADKKKGKYLNIKYRTSPFNLFHHINTYEENGFLIVDLCTWKG
YEFIYNYLYLANLRDNWEEVKKHAQKAPQPEVRRYVLPLNIEK
ADTGKNLITLPNTTATATLNSDETIWLEPEVIFSGPRQ
AFEFPQINYTKYSGKPYTFAYGLGLNHFVPDR
lCKINIKTRETWVWQEPDAYPSEPIFVSHPDALEEDD
GVVLSIIVSPGNGPKPAYLLILSAKDMSEVARAEVDINIPVTFHGFFKKA*

>xenTro2
MTNH
VEHPAGGYKKLFETAEELATPMATHVT
GRVPPWLSGSLLRCGPGLFEVGSEQFYHLFDGQALLHKFEIREGHVSYHRR
FVRTDAYVRAMTEKRIVITEFGTFAYPDPCRNIFSR
FFSYFKGLEVTDNALVNVYPVGEDYYACTETNYITKVNPETLETVKK
VDLCNYISINGVTAHPHIESDGTVYNIGNCFGKNFAIAYNVVKTPPLQA
DKEDPITKSTVVVQFPCSDRFKPSYVHS
FGMTPNYLVFVEQPVKINLLKFLSAWSIWGANYMDCFESHETMG
VWMHVAEKHTGEYLNIKYRTSAFNIFHHINTYEDNGFLILDVCCWKG
FEFIYNYLYLANLRENWEEVKKHAEKAPQPEARRYVLPLDINK
NDVGKNLVSLNYTTATATLHSDGTIWLEPEVLFSGPRQ
AFEFPQINYKKYGGKDYSYAYGLGLNHFIPDR
lTKLNVKTKETWVWQEPNAYPSEPIFVQAPDAIEEDD
GIVLSAVISPAVGHKPSYLLILDAKDMSEIARAEVDTIIPVTFHGMFKRA*

>tetNig1
----
VEHPAGGYKKIFETVEELDEPVAAKIS
GTLPAWLSGSLLRMGPGLFEVGDEPFRHLFDGQALIHKFDLKEGHVTYHRK
FIRTDAYVRAMTENRVVITELGTAAYPDPCKNIFSR
FFTYFKGVEVTDNCMVNIYTIGEDFYAVTETNFITKVDPDSLETLKK
VDLSKYLSVNGVTAHPHKDADGTVYNIGNCFGKNMSLAYNIVKIPPAQK
DSLEPFEKARVVVQLPSSERLKPSYIHS
FGMSDNYFVFVEPPVKINLIKFLSAWSIRGATYMDCFESNETLG
TWFHLATKDPADYSSYKFRTSAFNIFHHINTYEDQGFMVVDLCTWKG
HEFVYNYLYVANLRQEWEEVKKAAMKAPQPEVRRYVLPLDISS
EDLGKNLVSLSYTTATAVLRRDGTIWLEPEVLFSGPRL
AFEFPQINYSEYSGKMYHFTYGLGLNHFIPDR
LMKLNVQTKETQEWREEECYPSEPLFVPTPGATDEDD
GVLLSVVVKPGA--RPGFLLVLDAKTLSELGRAEVSVNIPVTLHGMFKLT*

>fr2_1_1
----
VEHPAAGYKKIFETVEELNEPIAATIS
GVIPTWLSGSLLRMGPGLFEVGDQPLHHLFDGQALIHKFDLKDGRVTYYRK
FIRTDAYVRAMTENRVVITEFGTAAYPDPCKNIFSR
FFTYFKGIEVTDNCLVNIYTIGEDFYAVTETNFITKVDPDSLETLKK
VDLSKYVSVNGITAHPHKDPDGTVYNIGNCFGKNMSLAYNIVKIPPAPK
DSLEPFEKARVVVQLPSSERLKPSYIHS
FGMTENYFVFVEPPVKINLLKFLSAWSVRGATYMDCFESNETLG
TWFHLATKDPADYSSYKFRTSAFNVFHHINTYKDQDFIVVDLCTWKG
FEFVYNYLWLANLRANWDEVKKAAMMAPQPEVRRYVIPLDVHK
EDMGKNLVSLSYTTATAVLRSDGTIWLEPEVLFSGPRH
AFEFPQINYSEYSGKMYRFAFGLGLNHFIPDR
lMKLNVQTKEMREWREEECYPSEPLFVATPGATDEDD
GVLLSVVVKPGA--RPGSLLVLDAKTLTEVGRAEVSVNIPVTLHGMFKLT*

>gasAcu1
----
VEHPAAGYKKIFETVEELNEPIPARVI
GVLPSWLGGSLLRMGPGLFEVGGEPFNHLFDGQALIHKFDLKNGQVTYYRK
FIRTDTYVRAMTENRVVITEFGTAAYPDPCKNIFSR
FFTYFKGIEVTDNCLVNICPIGEDFYAVTETNYITKVDPDSLETLKK
vDLCKYVSVNGVTAHPHTDADGTVYNIGNCFGKNFSLAYNIVKIPPPQE
DKSDPLERAQVVLQLPSSERLKPSYLHS
FGMTDNHFVFVEQPVKINVLKYLSAWSVRGTTYMDCFESNDSMG
TWFHLATKDPADYSSYKFRTSAFNIFHHINAYEDQGFIVVDLCTWKG
HDFVYDYLYLANLKGEWEEVKKAAMRAPQPEVRRYVLPLDIHR
EEQGRNLVSLSYTTATAALRSDGTIWLEPEVLFSGPRQ
AFEFPQINYSLCRGKKYSFAYGLGLNHFIPDR
VVKLNVQTKETWVWQEDECYPSEPLFVPTPAATEEDD
GVLLSIVVKPGA--RPSFLLVLDAVKLTELARAEVNTIIPVTLHGTYKlt*

>oryLat2
----
VEHPAAGYKKIFETVEELDEPIPAQIS
GVIPSWLGGSLLRMGPGLFEVGDEPFHHLFDGQALMHKFDLRNGQVTYNRK
FIKTDSYVRAMTENRVVITEFGTIAYPDPCKNIFSR
FFSYFKGIEVTDNCLVNIYPIGEDFYAVTETNYITKVDPDSLETLEK
VDVCKYLSVNGLTAHPHIEADGTVYNIGNCFGKDMSLAYNIVKIPPAQK
DKSDPIEKSQIVVQLPSSERLKPSYVHS
FGMTNNYFVFVEQPVKINLLKFLSAWSIRGTTYMDCFESCESMG
TWFHLTTKNPAAYSSHKFRTSAFNLFHHINTYEDQGFIVVDLCTWKG
HEFVYNYLYLANLREEWEEVKKAASRAPQPEVRRYVLPLDIYK
EEQGKNLVSLSYTTATAVLCSDGTIWLEPEVLFSGPRQ
AFEFPQINYSQCCGKKYFFAYGLGLNHFIPDR
VVKLNVQTKETWVWQEEDCYPSEPIFVATPGATQEDD
GVLLSIVVKPGT--RPGFLLVLDAMKLTEVARAEVNAIIPVTLHGLYKMT*

>danRer5
----
LEHPAGGYKKVFESCEELAEPIPAHVS
GEIPAWLSGSLLRMGPGLFEVGDEPFYHLFDGQALLHKFDLKDGRVTYHRR
FIRTDAYVRAMTEKRVVITEFGTTAYPDPCKNIFSR
FFTYFQGIEVTDNCLVNIYPIGEDFYACTETNFITKVDPDTLETVKK
VDLCNYLSVNGLTAHPHIEADGTVYNIGNCFGKNMSLAYNIVKIPPLQE
DKSDQFEKSKILVQFPSSERFKPSYVHS
FGITENHFVFVETPVKINLLKFLTSWSIRGSNYMDCFESNDKMG
TWFHLAAKNPGKYIDHKFRTSAFNIFHHINCFEDQGFIVVDLCTWKG
HEFVYNYLYLANLRQNWEEVKKAALRAPQPEVRRYVLPLDIHR
EEQGKNLVSLPYTTATAVMRSDGTVWLEPEVLFSGPRQ
AFEFPQINYSKFNGKDYTFAYGLGLNHFVPDR
FCKLNVKSKETWIWQEPDAYPSEPLFVQSPDAEDEDD
GVLLSIVVKPGA--RPAFLLILKATDLTEIARAEVDVLIPLTLHGIYTLt*