Selenoprotein evolution: introduction

From genomewiki
Jump to navigationJump to search

Introduction to selenoprotein evolution

(more selenoprotein classes shortly)

SEPP2: lost in placentals

SEPP2 is a newly discovered paralog of SEPP1 with a UxxC motif (plus a later conserved cysteine and distal CxxC motif) and high-scoring putative SECIS element in 3' UTR that is quite conserved in vertebrates but only through marsupials. No corresponding gene -- or even decayed debris -- can be found in syntentic position in any placental mammal, including Atlantogenata. Also, cysteine has displaced the selenocysteine in Xenopus, a species much depleted in selenoproteins. SEPP2 does not appear at GenBank non-redundant other than mis-annotated SEPP1 chicken mRNAs. SEPP2 transcripts are available only in skate, zebrafish, and echidna (a short 454 read) .

The two paralogs have 4 coding exons with identical intron phases so clearly represent a segmental gene duplication. The gene duplication can be tracked back to before chondricthyes divergence but only one copy can be located in earlier diverging deuterostomes (sea urchin and acorn worm). SEPP2 retains the selenocysteine TGA only in its first exon and lacks the cluster of them in exon 4 of SEPP1. Exon 4 in SEPP2 is diverging very rapidlly, about neutrally, and is difficult to recover from blast searches without transcripts.

It can be surmised the first seleoncysteine in both SEPP1 and SEPP2 has a different functional role from terminal selenocysteines in SEPP1, possibly a conserved redox role distinct from selenostorage function of exon 4 in SEPP1. SEPP2 is very likely functional in all the species in which it occurs because of its conservation. There is no indication it is "on its way out" in marsupials -- the SECIS would be obliterated very quickly if the gene were non-functional.

Selenoprotein SELU: 3 paralogs, variable timing losses

SELU: This family consists of three deeply diverged (distinct exon patterns) paralogs. The encoding gene has 5 average exons with anomalously short introns like many selenoproteins. In the SELU1 group, selenocysteine occurs in a UxxC motif already in the earliest deuterostome but drops out in mammals after monotremes, being replaced by CxxC in marsupials and placentals. Amphibia separately lost selenocysteine.

The second paralog SELU2 has selenocysteine in bilaterans only to the node of sea urchin, suggesting it was lost early in the deuterostome ancestor. It is the closer paralog of SelU1, 36% vs 27% percent identity. No vestigal SECIS element persists in living species that encode cysteine. (The decayed SECIS elements still identifiable in 3' UTR of cysteine-containing GPX6 genes in rodents and human GPX5 represent much more recent loss of selenocysteine.)

The third paralog SELU3 has cysteine in all species for which a sequence is available. It might be called virtual selenoprotein supposing orthologs in early diverging eukaryotes could be located that contained selenocysteine. This would suggest a scenario in which selenocysteine was present in an ancestral gene prior to gene duplications followed by conversion to cysteine in different phylogenetic patterns within each gene subfamily.

This family exhibits the "selenocysteine rachet": if selenocysteine happens to be replaced by ordinary cysteine (despite catalytic inferiority) in some stem lineage, the unselected 3' UTR SECIS element then deteriorates over a few million years from accrued mutations, for the same reason (lack of purifying selection) the crayfish in the cave loses its imaging opsins. Consequently the whole following clade will contain cysteine -- a reversion to TGA at the cystein codon might occur but it would simultaneously require a multi-step reversion or de novo evolution of a SECIS element, ie all SECIS elements are ancient and selenocysteines cannot wink back on paraphyletically. (However the overall selenoproteome can still increase over time because of gene duplications elsewhere.)

A phylogenetic overview of the occurence of selenocysteine in SELU1 in 38 vertebrates:

                                        .........................*.....
C  Homo sapiens                  genome EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C  Pan troglodytes         AACZ02115591 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Pongo abelii            ABGA01228099 EPRTLKAKELWEKNGAVIMAVRRPGCFLCRE
C  Macaca mulatta          AANU01282766 EPRTLKAKELWEKNGAVIMAVRRPGCFLCRE
C  Microcebus murinus      ABDC01489848 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Otolemur garnettii      AAQR01538573 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Tupaia belangeri        AAPY01309022 EPRTFKAKELWGERGAVIMAVRRPGCFLCRE
C  Mus musculus            AAHY01113156 EPRTFKAKELWEKNGAVIMAVRRPGCFLCR.
C  Rattus norvegicus       AAHX01086750 EPRTFKAKELWEKNGAVIMAVRRPGCFLCR.
C  Spermophilus tridec     AAQQ01288000 EPRTFKAKELWEKSGAVIMAVRRPGCFLCRE
C  Cavia porcellus         AAKN02044618 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Oryctolagus cuniculus   AAGW01591660 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Canis familiaris        AAEX02011808 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Bos taurus              AAFC03065652 ...TFKAKALWEKNGAVIMAVRRPGCFLCRE
C  Equus caballus          AAWR02000382 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Myotis lucifugus        AAPE01631988 EPRTFKAKELWEEKGAVIMAVRRPGCFLCRE
C  Sorex araneus           AALT01607337 zPKTFKAKELWSKSGAVIMAVRRPGCFLCRE
C  Boreoeuthere ancestralis   ancestral EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C  Echinops telfairi       AAIY01623759 ...TFQSKGALGKNGAVIMAVRRPGCFLCRE
C  Dasypus novemcinctus    AAGV01392885 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Monodelphis domestica   AAFR03024314 SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
C  Trichosurus vulpecula     transcript SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
C  Macropus eugenii              genome ..KTFKARELWEHRGAVIMAVRRPGCFLCRE
U  Ornithorhynchus anatin  AAPN01249400 EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U  Tachyglossus aculeatus        genome EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U  Anolis carolinensis     AAW.01013574 ..RTFKAEELWKKNGAVIMAVRRPGUFLCRE
U  Gallus gallus           AADN02035315 EPRTFKASELWKKNGAVIMAVRRPGUFLCRE
U  Taeniopygia guttata           genome EKRTFKAGELWKQNGAVIMAVRRPGUFLCRE
C  Xenopus tropicalis            genome EPKSFKAKDLWEKNGAVVMAVRRPGCFLCRE
C  Xenopus laevis            transcript EPRLFKAKDLWERDGAVIMAVRRPGCFLCRE
U  Danio rerio             CAAK04015812 DDRVFKARELWESSGAVIMAVRRPGUFMCRE
U  Tetraodon nigroviridis  CAAE01014976 ETKTFKAKTLWEKCGAVVMAVRRPGUFLCRE
U  Fugu rubripes           CAAB01000016 ETKTFKAKSLWENSGAVVMAVRRPGUFLCRE
U  Gasterosteus aculeatus  AANH01005113 ...VIKGRSLWDKNGAVVMAVRRPGUFLCRE
U  Oryzias latipes         BAAE01190338 DTKIIKAKSLWDKNGAVVMAVRRPGUFLCRE
U  Fundulus heteroclitus     transcript .....KAKSLWEKNGAVVMAVRRPGUFLCRE
U  Oncorhynchus mykiss         CR369769 .....KAKALWEKTGAVVMAVRRPGUFLCRE
U  Callorhinchus milii     AAVX01258517 ENRTFRASELWAGRGAVIMAVRRPGUFLCRE

C  Gasterosteus  aculeatus AANH01005113 ......AKTLWDKTGAVVMVVRRPGCLLCRE (anomalous gene duplication with cysteine)

Selenoprotein SEPW1: odd paralog SEPV gained in placentals

Selenoprotein SEPW1 is one of the shortest known mammalian proteins at 87 aa. With its CxxU motif, it is likely limited to simple redox reactions. Curiously, despite the small size protein still has 5 coding exons. One of these is of relatively recent origin because chondrichtyes and telost fish have the second and third exons fused (which, given the tree and extreme rarity of intron gain/loss must be the ancestral condition.

Like many genes on the densely packed human chromosome 19, SEPW1 seems to have given rise to a segmental duplication, here during the placental stem (ie, is absent marsupial and earlier diverging vertebrates). The second copy, called SELV, retains the same intron placements and phases but with greatly expanded and compositionally anomalous exon 1 which is highly prone to large indels (from replication slippage).

This long exon retains some conservation at its distal end to the last 5 residues of SEPW1 (unsurprising because the cysteine of the CxxU motif is at the end). There is observable conservation also at the amino terminus but otherwise the exon is evolving unconstrained. Possibly the initial methionine of SEPW1 was lost at the time of duplication but the terminal residues and splice donor were retained. Transcription far upstream lead to a remote methionine serving as replacement.

This is a very odd gene but transcripts in a half dozen species and moderate conservation (75% within placental mammals) in exons 2-5 over hundreds of millions of years witin placentals proves that it is evolving under selective constraint and so is functional. Distal exons however retain only 50% conservation to those of SEPW1, far too little to support ongoing gene conversion.

DIO1, DIO2, and DIO3: a curious history

The iodothyronine deiodinases of selenoproteins have been thoroughly studied because of their role in thyroid hormone metabolism. The three paralogs seen in mammals diverged long ago -- the percent identity today is about 45% and exon breaks and phases are completely unrelated. One scenario is that DIO2 and DIO3 in mammals arose as an initial retroprocessed gene that then duplicated again, with DIO2 subsequently acquiring an intron and DIO3 remaining a single coding exon. The critical era for exon acquisition and loss is poorly represented in the databases.

DIO2 and DIO3 are found on the same chromosome arm of chr 14 in human but separated by 22 Mbp, rather far for a tandom duplication. This structure is conserved in the chicken genome with only 8.7 Mbp intervening. Again, it is hard to sort out coincidence from correlation. Note the SECIS element might well accompany both retroprocessing and tandem duplications.

No 3D structures are available for this family of proteins (which have a single N-terminal transmembrane segment) but deiodinases have thioredoxin-like fold (which includes glutathione peroxidase selenoproteins) according to SuperFamily. This domain might serve to define trimming for purposes of better protein trees as well as serve as a partial template for folding (noting too a short insert with strong similarities to the active site of iduronidase, a GH-A-fold of glycoside hydrolase).

DIO1 catalyzes the outer ring deiodination of T4 to the biologically active hormone T3, as well as the degradation of T3 and sulfated iodothyronines. DIO2 is also involved in outer ring deiodination. DIO3 converts T(4) to T(3) and T(3) to 3, 3'-diiodothyronine (T2) by inner-ring deiodination. This gene is imprinted and has an anti-sense transcript. These genes are of great interest in the evolution of the thyroid (resp. endostyle) and its hormones.

The putative redox motif varies significantly across the three paralogs, with the CxU motif of DIO1 and DIO3 becoming AxU in DIO2 (meaning it cannot form a local seleno-disulfide). All three proteins contain a distal conserved cysteine; DIO2 has an additional conserved cysteine still further downstream.

DIO1 GSCTuPSF...PQCPV
DIO2 GSATuPPF...PQCRV...RVCIV...KRuKK
DIO3 GSCTuPPF...PGCAL

Remarkably, a second selenocysteine may have arisen in near-terminal position post-lungfish divergence (ie after teleost fish divergence). No experiment specifically addresses this TGA codon but it is conserved here over billions of branch length years and could be supported by the same SECIS element that supports the first selenocysteine. It may have arise from read-through of a TGA stop codon; no clear counterpart is seen in this position in DIO1 or DIO2. Note ancestral CTU in the proximal selenocysteine had already morphed into ATU by lamprey and chondrichthyes. Alanine of course has a chemically inert sidechain; possibly the second selenocysteine now contributes the second half of a redox pair. Additional species might further buttress the second selenocysteine but the possibility always remains that the TGA is the actual stop codon and is conserved for other unknown reasons.

DIO2_homSa    SKRuKKT
DOI2_macMu    .......
DOI2_canFa    ....N--
DOI2_otoGa    ....N--
DOI2_tupBe    ....N--
DIO2_ratNo    ....ILD
DIO2_susSc    .....LD
DOI2_equCa    .E..N.L
DOI2_echTe    ....T--
DOI2_sorAr    ....NSL
DOI2_bosTa    .....--
DOI2_dasNo    ....N--
DOI2_monDo    ....NPG
DIO2_ornAn    ....NPD
DIO2_galGa    ....NPL
DIO2_anoCa    ....NPL
DIO2_neoFo    R...VP.
DIO2_xenTr    G...T
DIO2_gasAc    G..FSR.
DIO2_takRu    GPHLI
DIO2_funHe    GR.
DIO2_oryLap   ...K...
DIO2_tetNi    G..
DIO2_danRe    G.u
DIO2_calMi    GQ.WG

The collection of DIO1 below from 49 species illustrates the importance of rare genomic events in defining the topology of the mammalian species tree. Here we see a 5 residue insert in the first exon restricted to Pegasoferea (ie excluding artiodactyls) with no hint of homoplasy. Additionally a longer deletion is restricted to Afrotheres.

DIO1_homSa   MGLPQPGLWLKRLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGQRCNIWEFMQ
DIO1_panTr   ..................................................................................................H.............
DIO1_macMu   .....S...V.K................................................................................................D...
DIO1_calJa   ....G..................A......T.......K......D........N...........................................H.........D...
DIO1_otoGa   ....R..........F.......A...M..........SQ.....QQ.V.AK.............LQHPV.LVCPEGPL.....M..R.........SAS...K....D...
DIO1_musMu   .....LW......VIF.Q..LE.A.....MT...G...QS.....Q....A...R.AP...V.....I................RA.F.......T..C....K....D.I.
DIOI_ratNo   ...S.LW......VIF.Q..LE.AT....MT...E...Q......Q........R.AP...V.....I................RA.Y.......T.......K..V.D.I.
DIO1_oryCu   ....R...........VQ...E.A.....MT...E...Q......Q...IAQ..N.AQ.S........................A..P.......S.......Q.S..D..R
DIOI_cavPo   ...TW...........VQ...E.AM....MT...E.I.KS..............Q..................I.........E.A.......D.S..C...EKRT..D..H
DIO1_canFa   ....R.V...R......Q...Q.A....F.K...A...QH.V..NGN-----K....Y...A..LY.M.........Q......R..P....................D...
DIO1_ursAr   ....R.V...R......Q..M..A..........E...QQV...NK.-----.....Y...L...Y.M................R..P....................D... 
DIO1_felCa   ...S.L....R.....FQ..LQ.A....F.....S...QH.V..NR.-----.....Y...A..LY.V................R..P.................S..D..K
DIO1_equCa   ....RA...........Q..LQ.A......T.......QH.V..NQ.-----.....Y...V..LY...........H........KR......S.............D...
DIO1_myoLu   ..............I..Q..L..TL...Q.K...R...QH....NR.-----.....Y...A..L...P....I..........K..E.S...............H..D...
DIO1_pteVa   .E..W..R.........Q..L..A....Q.T...R...Q..V..NR.-----.....F...L..L...............Q.....KE..........C.........D...
DIO1_sunMu   ....GL..L...FG..VR..LK.A......T.W.SAIRPHL...S.....AK..R.TYED.A...............N..Q...R.KQ.DI..DS...H.....ARL.D...
DIO1_bosTa   ....S...........FQ..L..AI.....T...R...Q...................E..............I..........M..Q..............E..S..D...
DIO1_turTr   ....L...........FQ.GL..AM.....T...R...Q.....S.....AK.....YE........A.....I..........M..Q..R.................D...
DIO1_susSc   .E..L...........FQ..L..AM....MT...G...QD....SQ....AK......E........A................K..E..........S......H..D...
DIO1_vicVi   ...SL...........FQ.VL..AL.....T...G...QD....SQR...AQ.....YE..............I.............Q.....D....C..-D.VH..D...
DIO1_eriEu   ....S...........FQ..L..AI.....T...R...Q...................E..............I..........M..Q..............E..S..D...
DIO1_dasNo   ...S..........I.FQ..L..A...T..T...G...Q....KSQ.SHKAE....PY...G....N......L..IG......K..Q..........H.........D...
DIO1_choHo   ...SW............Q..L..AM..I..T...G...Q.....SRRANN.KD.Q.PY...G....N.................K..Q..........H..R......D...
DIO1_loxAf   ..............IF.K..L..AM.........G...K....Q--------....AY.M.GS.L..IP....I...Y......K..E..P..D....C........SD...
DIO1_proCa   ......V..........R..L..AM.....A...G...K....Q--------....AY.M.CS.L..VP........Y......K..E..........H.....R...D...
DIO1_monDo   .LRLWLW.........Q.VG..LM..LMKM.S...M.QH..G..Q.SSIFQ..N.KYE..G....TLP..L...R........QALQ..P..D....S.R..PRRL.D..HA
DIO1_triVu   . AG.L..VR.F.A..Q..F......L.KT...NMM.KH..SL.QRSSISQ.TQ.AYE..G.....I...F............QALQ......P...T.K.ESRH..D..H
DIO1_anoCa   . FKA.RLVLKT.L..Q.CLSTA...LFM....ATA..Y..KQS.RSS.G...N.VYE..G.....F..LL.....K.K....KALQ.CP...T...DFD.KIHH.LD...

Selenoprotein SELH: rapid evolution

SELH is another small selenoprotein with a conserved CxxU redox motif split by a phase 21 intron. The introns in mammalian SELH are exceedingly short (eg, 93 and 162 bp in human with gene coding span very short at 619 bp) but the level of transcription is not remarkable so provides no explanation for the lack of retroposons. Some species like orangutan have processed pseudogenes, implying transcription in germ-line tissues. Zebrafish has a diverged duplicate gene with tryptophan at the seleocysteine site in addition to an intronless transcribed gene with TGA selenocysteine -- it appears that this has displaced the normal three-intron gene (seen in other fish). That can work in selenoproteins since retropositioning begins at the 3' end of transcripts, meaning the SECIS likely accompanies the coding region.

Selh.png

Protein conservation is below average -- SELH percent identities (to human) drop to the low 80's within Laurasiatheres, to 72% with marsupial, and 57% with chicken. Further, rodents exhibit significant residue loss upstream of the CxxU motif, very unusual in such a short protein but indicative of an inessential structural region. Indeed observed conservation is primarily centered in the middle of the protein. No phylogenetic conversion of selenocysteine to cysteine is observed within vertbrates

Selenoprotein SELM: retained in ER

Another selenoprotein of unknown function, SELM with CxxU motif, surfaces during Blastp searches using selenoprotein SEP15 (which oddly has a CxU motif) as query. A 3' UTR SECIS element cannot be located with SECISearch 2.19 yet it seemingly must be located in the comparative genomic peaks of conservation lying between SELM and the neighboring gene.

SELM SECIS.png

Since the protein is quite short with 5 introns, complete SELM sequences are best recovered from cDNAs and genomic alignments in the UCSC 28way. This is an ancient protein recoverable from vertebrates, amphioxus, sea urchin, shrimp, mite, moths, and hydra, and plants. Moths (silkworm and hawkmoth) -- but not their outgroups -- encode cysteine in a CxxC motif in place of CxxU. No loss of selenocysteine is seen in 22 species of phylogenetically dispersed vertebrates.

SELM may begin with a signal peptide. There are no glycosylation sites nor additional conserved cysteines beyond the CxxU motif. SuperFamily finds no similarity to proteins of known 3D structure. The terminal residues appear to be a phylogenetically conserved KDAL-class endoplasmic retention signal

In whole-mount zebrafish embryos, SELM is expressed within the notochord and anterior somites, axial fin fold, dorsal spinal chord neurons, then in lateral line neuromasts. SelM is located in the ER/Golgi as is its distant homolog Sep15 (found associated with UDP-glucose glycoprotein glucosyltransferase, an ER-resident protein involved in quality control of protein folding).

Selenoprotein MSRB123: methionine sulfoxidases

MSRB1 is a short odd protein, rich in cysteines, with two CxxC motifs, a near-amino terminal cysteine and a more distal selenocysteine in a motif with serine. MSRB is now known to be a stereospecific methionine-R-sulfoxide reductase repairing oxidative damage to methionine in native proteins. The two pairs of cysteines bind a zinc atom. The all-beta structure has been determined in a bacterial homolog with a internal structural duplication so weak it needs an xray determination to be revealed. Humans have 9 such domains in 9 proteins according to SuperFamily. These could potentially encode other selenoproteins at least in some species.

The fold exists as a small family of paralogs, three in mammals, with specialization to cell compartment -- cytosol, mitochondria and endoplasmic reticulum (via KAEL* signal), respectively.). All contain catalytic zinc. This multiplicity of MSRB contrasts with a single non-homologous methionine-S-sulfoxide reductase (not a selenoprotein in any species).

MSRB1 has selenocysteine in its active site in conserved motif UxxS, whereas MSRB2 and MSRB3 contain Cys in the well-conserved motif CINS. The cysteine in the Ux53xC motif of prokaryotes is replaced by serine or threonine in all eukaryotic MSRBs as U/CxxS/T. While serine and threonine have polar hydroxyl groups reminiscent of a cysteine, whether they form a covalent bond with the selenocysteine or otherwise contribute to catalysis remains unresolved. Thus this protein has a defunct CxxU motif that was unusual among selenoproteins in having 53 intervening residues.

The exon structures, relative to reliably locatable CxxC anchors, show MSRB2 and MSRB3 more closely related (having a distinctive break within the second CxxC motif), whereas MSRB1 introns are placed altogether differently. This suggests two rounds of gene duplication with intronation of MSRB2/3 prior to the second round. A tree based on the alignable conserved core of these proteins indicates the same result.

The phylogenetic distribution of orthologs of MSRB1 is orderly, with selenocysteine back through fish, with no counterpart currently locatable in amphioxus or tunicates (which otherwise have 3 MRSB genes). Further, there is no sporadic appearance of selenocysein in MRSB2/3 -- cysteine is always found, even prior to bilatera, suggesting though selenocysteine is ancestral (viz prokaryotes) but became cysteine prior to the second duplication giving rise to MRSB2/3.

Care must be taken at greater depth because of possible confusion among paralog members (and lineage-specific expansions and contractions). Synteny is lost so intron phasing and siting must be used along with blast clustering. The selenoprotein rachet predicts as more species are sequenced, some clades may exhibit lost selenocysteine in MRSB1 but none will acquire it in MRSB2 or MRSB3 lineages.

This raises the question that if selenocysteine is not necessary, why is it retained in MSRB1? Possibly that has some connection to localization in the reduced cytosol whereas more oxidizing mitochondrial and ER compartments utilize cysteines. In this view, selenocysteine will not be lost in any vertebrate clade no matter how many species are sequenced for MSRB1; whereas in MSRB23 cysteine was an adaptive change rather than mere rift, allowing wider intracellular distribution. Evidently methionine sulfoxide forms in all three compartments.

32 vertebrate MSRB1 aligned in exon 3:
VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK	homSap Homo sapiens (human)
VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK	panTro Pan troglodytes (chimp)
VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK	ponPyg Pongo pygmaeus (orang_sumatran)
VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK	macMul Macaca mulatta (rhesus)
VSCGRCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFIPK	otoGar Otolemur garnettii (bushbaby)
VSCGRCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	micMur Microcebus murinus (mouse_lemur)
VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFVPK	musMus Mus musculus (mouse)
VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	ratNor Rattus norvegicus (rat)
VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	cavPor Cavia porcellus (guinea_pig)
VSCGKCGHGLGHEFLNDGPKPGQSRFuIFSSSLKFIPK	oryCun Oryctolagus cuniculus (rabbit)
VFCGKCGHRFGHEFLNDGLKPGQSRFuIFSNTLKFVPK	ochPri Ochotona princeps (pika)
VSCGKCGNGLGHEFLNDGPKPGKSRFuIFSSSLKFIPK	canFam Canis familiaris (dog)
VSCGRCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFIPK	felCat Felis catus (cat)
VSCGRCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	bosTau Bos taurus (cow)
VSCGRCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	susScr Sus scrofa (pig)
VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFVPK	equCab Equus caballus (horse)
VSCGRCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPR	eriEur Erinaceus europaeus (hedgehog)
VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	loxAfr Loxodonta africana (elephant)
VSCGKYGHGLGHEFLNDGPNWGQSRFuIFSSSLKFIPK	echTel Echinops telfairi (tenrec)
VSCGKCGNGLGHEFLNDGPKKGQSRFuIFSNTLKFVPK	triVul Trichosurus vulpecula
VSCGKCGNGLGHEFLNDGPRRGQSRFuIFSSSLKFIPK	ornAna Ornithorhynchus anatinus (platypus)
VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK	galGal Gallus gallus (chicken)
VLCGKCGNGLGHEFINDGPKKGQSRFuIFSSSLKFVPK	anoCar Anolis carolinensis (lizard)
VSCGKCGNGLGHEFINDGPKKGQSRFuIFSSSLKFIPK	xenTro Xenopus tropicalis (frog)
VRCGKCGNGLGHEFVNDGPKHGLSRFuIFSSSLKFIPK	danRer Danio rerio (zebrafish)
VRCGKCGNGLGHEFLNDGPSRGLSRFuIFSSSLKFIPK	tetNig Tetraodon nigroviridis (pufferfish)
VRCGKCGNGLGHEFVNDGPSRGLSRFuIFSSSLRFIPK	takRub Fugu rubripes (fugu)
VRCGKCGNGLGHEFVNDGPAKGVSRFuIFSSSLKFIPK	gasAcu Gasterosteus  aculeatus (stickleback)
VRCGKCGNGLGHEFVNDGPSKGLSRFuIFSSSLKFIPK	oryLap Oryzias latipes (medaka)
IRCGKCNNGLGHEFLNDGPKHGLSRFuIFSSSLKFV..	ictFur Ictalurus furcatus (fish)
VRCGKCGNGLGHEFVGDGPKKGLSRFuIFSSSLKFV..	oncMyk Oncorhynchus mykiss (trout)
VRCGKCGNGLGHEFVGDGPKKGLSRFuIFSSSLKFV..	salSal Salmo salar (salmon)
.SCGKCGNGLGHEFLNDGLKAGQSRYuIFSNSLKFVPK	calMil Callorhinchus milii (elephantfish)

SEPHS1 and SEPHS2: selenophosphate synthetases

Selenophosphate synthetases are biosynthetic enzymes that capture selenium in a chemical form suitable for selenoprotein biosynthesis. One of the two mammalian selenophosphate synthetase paralogs, SEPHS2, itself contains a selenocysteine in a conserved UGCK motif several dozen residues in from the amino terminus. This motif contains threonine as TGCK in SEPHS1 which apparently functions in a selenium salvage system recycling catabolic selenocysteine (tRNA is charged with precursor serine), whereas the SEPHS2 enzyme functions in selenite assimilation (selenophosphate from selenide and ATP). Unsurprisingly only SEPHS2 has a selenocysteine insertion sequence (SECIS) element in 3' UTR.

This gene family has a curious evolutionary history: every additional level of phylogenetic resolution adds another twist -- even 29 genomes from cnidarian to placental do not provide adequate genomic saturation. The original single-copy gene (itself an ancient internal tandem duplicatation fold with active site on dimeric cleft) procedes uneventfully from ur-bacteria to hagfish/lamprey divergence.

However, prior to teleost fish divergence, block duplication of the 8 exon gene took place. No flanking synteny persists in extant species. The copy that would remain closest to the ancestral gene then lost its selenocysteine to nucleophile threonie, its 3' UTR SECIS insertion element decayed, and the old function strayed to salvaging selenide from free selenocysteine (arising from selenoprotein catabolism or diet). The copy we call SEPHS2 diverged rapidly ( rather unconstrained amino terminus in both length and sequence) but with conservation about the selenocysteine active site motif and beyond.

Between monothere and marsupial divergence, a fully processed retrogene (no introns) carrying the 3' UTR SECIS insertion element became functionally established, remarkably displacing the previous 8-exon gene (now an undetectable decayed pseudogene or deletion), to the extent that every species of placental mammal retains only the processed retrogene and threonine copies. Opossum, but not platypus nor elephant or tenrec, retains all 3 gene copies.

Here the processed retrogene mechanism (begin at 3' end of UTR but not extend necesarily to initial methionine or 5' regulatory regions) brings along the SECIS selenocysteine insertion loop making a new selenoprotein feasible and while explaining non-homologous start peptides. Humans have an assortment of fragmentary and processed pseudogenes for which SEPHS1 is usually the parent, most notably a mis-spliced processed pseudo omitting exon 7 (build 35, chr7:63756925-63757879).

Reference set of 341 vertebrate selenoproteins

SEPP2: 12 vertebrate sequences

>SEPP2_monDom Monodelphis domestica (opossum) chr2:9,451,774-9,455,977 1 selenocys tga stop taa 
0 MPPPGLSLAVLLGLLGASLAFENRTRFCQPAPPWQVGGGRAPMEEALGNVTVVALLKASuHFCLKQAAS 2
1 MGSLQERLARMGAPDVRFIIVNEKSPQSRALHGELELHAPPGVPVYGQPELGPDIWSILGGAKDDFLIYDR 2
1 CGRLTFHIRLPFSFLHFPYVESAIRFTHRQDSCGNCSFYPAQ 0
0 VNSTDERKGESKHSPGLEGEGQEPLGEKPDSRGLTLGSSAPTAHAHDPVHGGMEKPSPSLPSPALEPSLHEGEAPG* 0

>SEPP2_macEug Macropus eugenii (wallaby) trace archive tga stop taa
0 MGLPGLAAALLLGLVGATLASENGTRICQPAPKWQVDRGASPMEEALGQVTVVALLKASuHFCLKQAAS 2
1 IGNLQERLARVGATGVRFIIINEKSPLSQALYGELELHAPSGVLVYNQQGPGPDIWSILGGAKDDFLIYDR 2
1 CGRLTFHLPLPLSFLYFPYVESAIRFTHQQDHCGNCSFYPAQ 0
0 VNNTDKGKAESLTQSPRLEGEGQESLAEGPSTHGPTLWPSTPHPAHARGPVHSPGSKRLSPSLPPPSLEWPPHEEANK* 0

>SEPP2_ornAna platypus Ultra131:348583-348720 Zswim5 synteny last exon uncertain
0 MAGSGLLGPALTLATLLAAAGALPDLENGTRICQPAPRWTVNGVAPMEGTEGQVIVVALLKASuHFCLKQAAR 2
1 LAGLRERLAGHGAGNVSFLIVNQRDPTAQLLHTELERHAPPGVPVYAQDGPDPDVWSVLGGDKDDFFVYDR 2
1 CGRLTFHIQLPFSFLHFPYVEAAVRFTHRRDFCGNCSYYFPQVGT 0
0 VNDTTTQESELEKSPGAPGEEPEGSPVREPDRPQSQDPTGPFSGVLLQGKENKIIPWKTPLQAAPRKPSHPPGAHD* 0

>SEPP1_tacAcu Tachyglossus aculeatus (echidna) EUEMSW408ERBZB length=227 run=R_2007_08_22_12_11_10_ 59% 
1         VPVYAQDGPDPDVWSILGGGKDDFLVYDR 2
1 CGRLTFHIRLPFSFLHFPYVEAAVLFSHRHDFCGNCS

>SEPP2_galGal Gallus gallus (chicken) taa stop  synteny Zswim5 HPDL MUTYH
0 MGSLLLALASCLGLAVASEGATNGSRLCHEAPAWRINGSSPMEGAAGQVTVVALLKASuHFCLLQARS 2
1 LGALRERLGQQGVSDVRYVIVNEQAPLSRAMFGELQRHAPPGVPVLQQQPHEPDVWQLLGGDKDDFLVYDR 2
1 CGRLAFHIQLPYSFLHLPYVESAIRFTHRKDFCGNCSLYPNSTQE 0
0 ANSTMEVPATLTPLPKQEEKESETPAHHQPNHLHPHHRAVGNGTAPEPSGDHRPAHAHHHHGAHGKLHPKGQTPEGRDP* 0

>SEPP2_taeGut Taeniopygia guttata (finch)
0 MGLLVLALATWLGLGLASASEETANSSRICQEAPAWTINGSSPMEGAAGQVTVVALLKASuQFCLRQAHS 2
1 LGGLRERLARQGMADVRYMIVNEKAPLSRAMLPELQRHAPPGVPVFQPEREDPDVWQVLGGDKDDFLVYDR 2
1 CGRLAFHIQLPFSFLHFPYVEAAIRSSHIKDFCGNCSLYPNTTQE 0
0 ANSTMEGPATPSSLPEHEGMESETPVHQHKPLHPHRHHEVNSERDTNPSEDHKPATHAHHHHGDHGQLHHKGRKQKEGDEH* 0

>SEPP2_anoCar Anolis carolinensis (lizard) scaffold_4:5,993,109-5,994,846
0 MDYSLATRILLLGLVVISATQAEVTENKTRICQPAPLWKINGTAPMAGMEGQVTVVALLKASuPFCLKQAAK 2
1 IGGLQKKLSNEGVANVSFLIVNEKTPLSRAMYWELKRNAPQGIPVYQQQILEPDVWQILDGDKDDFLIYDR 2
1 CSRLTFHIQLPYSFLHLPYVEAAVHYTHRKDYCGNCSRYYSE 0
0  * 0

>SEPP2_xenTro Xenopus tropicalis (frog) NM_001006907 misannotated, no selenocysteine
0 MHNLALTVSILMGLLGQVSSSEQTNSSICKPPPKWSIEGEVPMAEALGKVTVVALLQASCGFCLVQAAR 2
1 MGPLRYKLSLQGMTDIKYMIVNDQSLHSANMFPELKRWAPEGIPVYQQTPGQDDVWELLDGNKDDFLIYDR 2
1 CGRLTFHVRLPLSFLHFPYVEAAILSTYNESFCGNCSFTSNSTLIPM 0
0 NGTTVSPSGDDSSSPLQNKDEPVNKEPSPTLEKHNDQRKLDSELRLHDHSQHHPINSHKRQENQNNHPRNLIKNGKQN* 0

>SEPP2_tetNig Tetraodon nigroviridis (pufferfish) from cDNA, genome misassembled
0 MSSPWLLWLQVALTGLLWASQGQSATSRICKAAPRWEVGGQAPMEALVGRVVVVALLKASuQFCHIQASK 2
1 IGPLREKLSRRNVTEVSFVIVNEQEPVSRALYWQLRRRAPPGVPVYQQAPLQDDVWEALDGDKDDFLIYDR 2
1 CGQLTFHIGLPYSSLRYTYVEAAVIATHQGNICNCS 0
0 ANFTSLSISNSSGSGGMPSQTNQTVTAETDGPHTTHHHPHPHRHHHHHQHLSPEQPTPTAMPGQATPTPA*

>SEPP2_gasAcu Gasterosteus aculeatus (stickleback0 chrIII:3,915,059-3,917,020
0 MTPPGSASRPPHWTIKERAPMQELLGNVVVVALLKASuQFCLTQASK 2
1 IGGLRDKLNRSNLTDVSFMIVNEREPHSRAMYWELKRRAPPGVPVYQQAALQSDVWEALDGDKDDFLVYDR 2
1 CGLLTFHIVLPYSFLHNVYVEAAIRATYLKNICNCT 0
0 VDSVVSSLNNSVMNNETDFNVSQTNATPRIQPDTNDPEGAGTPPPPT 

>SEPP2_leuEri Leucoraja erinacea (skate) cDNA
0 MGLQRSLLIILVGAAITLAAANNDTRICQVAPHWEIGNQSPMEQLSGQLVVVALLKASuQFCLTQAAKLGILRDKLSLQGLKNIHYIIVNEKT
LESRAMFWKLKLKTPKNITVYQQSAFQPDVWRILRGNKDDFLIYDRCGKLTFHITSPYSYLNFRYVEAAIMATYNTDYCGNCMGSSTTLEATS

>SEPP2_calMil elephant fish Callorhinchus milii (elephantfish)
1 LGSLQDKLARQGLKDVHYMIVNEKAPESRAMLWELKRHVPNNVSVYQQSPIQPDVWHSLQGGKDDFLIYDR 2
1 CGRLTFHVVLPYSSLQYPYIEAAIRATHKRDICGECTIT 0

SELU1: 13 vertebrate sequences

>SELU1_homSap Homo sapiens (human) processed pseudogenes chr8 and chr12
0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1
2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREE 0
0 AADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0
0 KKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGSGKQ 0
0 GILLEHRENEFGDKVNLLSVLEAAKMIKPQTLASEKK* 0

>SELU2_homSap Homo sapiens (human) 7 exons chr1p36.32 36% id NM_152371 
0 MSTVDLARVGACILKHAVTGE 0
0 AVELRSLWREHACVVAGLRRFGCVVCRWIAQDLSSLAGLLDQHGVRLVGVGPEALGLQEFLDGDYFAG 1
2 ELYLDESKQLYKELGFKR 2 
1 YNSLSILPAALGKPVRDVAAK 0
0 AKAVGIQGNLSGDLLQSGGLLVVSK 1
2 GGDKVLLHFVQKSPGDYVPKEHILQVLGISAEVCASDPPQ 0
0 CDREV* 0

>SELU3_homSap Homo sapiens (human) 6 exons chr9q22.32 25% id processed pseudogene chrX
0 MAAPAPVTRQVSGAAALVPAPSGPDSGQPLAAAVAELPVLDARGQRVPFGALFRERRAVVVFVR 0
0 HFLCYICKEYVEDLAKIPRSFLQ 0
0 EANVTLIVIGQSSYHHIE 0
0 PFCKLTGYSHEIYVDPEREIYKRLGMKRGEEIASS 1
2 GQSPHIKSNLLSGSLQSLWRAVTGPLFDFQGDPAQQGGTLILGP 1
2 GNNIHFIHRDRNRLDHKPINSVLQLVGVQHVNFTNRPSVIHV* 0

>SELU1_borAnc Boreoeuthere ancestralis (northern beast) 5 exons no selenocyseine
0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1
2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREV 0
0 AADLSSLKPKLDELGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0
0 KKFYGPQRRKMMFMGFVRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGPGKQ 0
0 GILLEHREKEFGDKVNPVSVLEAARKIKPQTSASEKK* 0

>SELU1_triVul Trichosurus vulpecula (brushtail opossum) EC360881
0 MSFLDLSFFSMGMWSLGAGALGAAVLSLILANTNLFLTKSVTATLEFLEEIELKTLDN 1
2 ESPKTFKARELWEHRGAVIMAVRRPGCFLCREE 0
0 AAELSALKPQLDQLGIPLYAVVKEKIGSEVENFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFILGGVYVIGPGKQ 0
0 GILLEHREKEFGDKVDPASVLEAA   * 0

>SELU1_macEug Macropus eugenii (tammar wallaby) EX196548 full
0 MSFLDLSFLSMGMWSLGAGALGAAVLSLILANTDVFLTKSVTATLEFLEDIELKTLDN 1
2 KTFKARELWEHRGAVIMAVRRPGCFLCREE 0
0 AADLSALKPQLDQLGIPLYAVVKEKIGSEVEDFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFILGGVYVIGPRKQ 0
0 GILLDHREKELGDKVNPASVLEACKKIKLHA* 0

>SELU1_monDom Monodelphis domestica (opossum) tgt-cys
0 MSFLDLNFFSMSMWSLGAGALGAAALSLILANTDLFLTKSVDATLEFLEEIQLKTLDN 1
2 ESPKTFKARELWEHRGAVIMAVRRPGCFLCREV 0
0 AADLSALKPQLDLLGVPLYAVVKEKIGSEVENFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFVLGGVYVIGPGKQ 0
0 GILLEHREKEFGDKVNPASVLEAAKKIKPHTSTSEGK* 0

>SELU1_ornAna Ornithorhynchus anatinus taa early stop full
0 MPLPPDLGLFNLGMWSVGVGALGAAAVGLLLANTDLLLTKPEKATLEYLEDTELKTLGK 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDRLGVPLYAVVKEKIGTEVEDFQPYFKGEIFLDER 0
0 KKFYGPHKRKMLFLGFIRLGVWQNFLRARNRGFSGNLEGEGLILGGVYVLGAGKQ 0
0 GILLEHREREFGDKVSPASVLEAAQRIKPQPL* 0

>SELU1_tacAcu Tachyglossus aculeatus (echidna) 454:EUEMSW405C31QQ (74%) tSASEKK terminus? frag
0 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDQLGVPLYAVVKENIGTEVEDFQPYFKGEIFLDER 0
0 KRFYGPHKRKMLFLGLIRLGVWQNFIRARNKGFPPVTWEGEG     0
0 GVLLEHREREFGDKVSPASVLEAAQKIKPQ* 0

>SELU1_gga Gallus gallus (chicken)
0 MSFLPDFGIFTMGMWSVGLGAVGAAITGIVLANTDLFLSKPEKATLEFLEAIELKTLGS 1
2 EPRTFKASELWKKNGAVIMAVRRPGuFLCREE 0
0 ASELSSLKPQLSKLGVPLYAVVKEKIGTEVEDFQHYFQGEIFLDEK 0
0 RSFYGPRKRKMMLSGFFRXGVWQNFFRAWKNGYSGNLEGEGFTLGGVYVIGAGRQ 0
0 GVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS* 0 

>SELU1_tgu Taeniopygia guttata (finch)
0 msflpdfgiFTMGMWSVGLGAIGAAVTGIVLANTDLFLSKPEKATLEFLEEIELKTLGS 1
2 EKRTFKAGELWKQNGAVIMAVRRPGuFLCREE 0
0 ASELSSLKPQLSKLGVPLYAVVKENIGTEVEDFQHYFKGEIFLDEK 0
0 KGFYGPRRRKMMLSGFFRLGVWQNFVRAWRSGYSGNLEGEGFTLGGVYVIGAGRQ 0
0 GVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS* 0 

>SELU1_anoCar Anolis carolinensis (lizard)
0 MWTIGLGAIGAAVTGIILANTDLFLSKAEQASLDFLEAIDLKTLGE 1
2 NQRTFKAEELWKKNGAVIMAVRRPGuFLCREV 0
0 AAELSSLKPQLDKLGVPLYAVVKENLGTEVMDFQPYFKGEIFLDEK 0.
0 KQFYGPQKRKMLFMGFIRCSVWRNFFRAWKSGYTGNIDGEGFVLGGVFVVGPGKQ 0
0 GVLLEHREKEFGDKVSLDAVLEAVKNIQPQPSEKDK* 0

>SelU1_fugRer Fugu rubripes (fugu)
0 MGLLAKLLAAVGGFVTAVMNSVTDAFLTPPLRATLEHLEETDLKTLSG 1
2 ALVIRLIPTRTETKTFKAKSLWENSGAVVMAVRRPGuFLCRE 0
0 EAAELSSLKPRLDQLGVPLYAVVKEDVGTEIQNFRPYFQGEIFLDEK 0
0 RRFYGPRERKMGLLGFLRVGVWMNGLRAFRSGFMGNVLGEGFVLGGVFVIGREQQ 0
0 GILLEHREREFGDKVNIEDVIQAVDRIAQELMPVTQN* 0

>SELU1_gasAcu Gasterosteus aculeatus (stickleback) chrVI.790.1 length=214
MGMWSLGLGAVGAALAGIFLANTDLCLPKAASASLENLEDADLRS
KGRSLWDKNGAVVMAVRRPGuFLCREV
ASGLSSLKPQLEELGVPLVAVVKEDVGTEIRDFRPHFAGDIFIDEK
SFYGPLQRKMGGLGFIRLGVWQNFMRAWRSGYQGNMNGEGFILGGVFVFGAGNQ
GILLEHREKEFGDKVQIADVLEAVKKIVPAK*

>SELU1_calMil Callorhinchus milii (elephantfish) frag
2 ENRTFRASELWAGRGAVIMAVRRPGuFLCRE 0
0 AAALSSLRPSLAQLGVPL
0 GHLLEHREKEFGDAVNLTAVMEAAGKISPRQSAE* 0	
	
>SELU1_squAca Squalus acanthias (spiny dogfish) also selenocysteine
0 MVVVVEDFHMGLWTLGLGALGAAITGVILANTDLLLPKAETASLAYLSGAELRTLDR 1
2 EERTLKAGDLWSRSGAVIMVVRRPGuFLCREE 0
0 AAEISSLRPQLDELGVPLYGVIKENINNELKNFQPFFKGEIFLDVE 0
0 MRFYGPKPRTMGLMGFMRLGVWKNFVRAWQKGFSGNTDGEGFILgGVFVIGAGQQ 0
0 GVLLEHREKEFGDVVNISSVLEARRKIETQRTEP* 0 

SELV: 19 placental mammal sequences

>SELV_homSap Homo sapiens (human) chr 19 uc002oly.1  66 pro + 32 thr +21 ser first exon 
0 MNNQARTPAPSSARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGTSPLVLTPAPAQIPTLVPTPALARIPRLVPPPAPAWIPTPVPT
PVPVRNPTPVPTPARTLTPPVRVPAPAPAQLLAGIRAALPVLDSYLAPALPLDPPPEPAPELPLLPEEDPEPAPSLKLIPSVSSEAGPAPGPLPTRTPL
AANSPGPTLDFTFRADPSAIGLADPPIPSPVPSPILGTIPSAISLQNCTETFPSSSENFALDKRVLIRVTYC 2
1 GLuSYSLR 0
0 YILLKKSLEQQFPNHLLF 0
0 EEDRAAQATGEFEVFVNGRLVHSKK 0
0 RGDGFVNESRLQKIVSVIDEEIKKR* 0

>SELV_panTro Pan troglodytes (chimp) essentially identical to humna
MNNQARTPAPSSARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGTPPLVLTPAPAQIPTLVPTPALARIPRLVPPPAPAWIPTPVPTPV
PVRNPTPVPTPARTLTPPVRVPAPAPAQLLAGIRAALPVLDSYLAPALPLDPPPEPAPELPLLPEEDPEPAPSLKLIPSVSSEAGPAPGPLPTRTPLAANS
PGPTLDFTFRADPSATGLADPPIPSPVPSPILGTIPSAISLQNSTETFPSSSENFALDKRVLIRVTYC 2
1 GLuSYSLR 0
0 YILLKKSLEQQFPNHLLF 0
0 EEDRAAQATGEFEVFVNGRLVHSKK 0
0 RGDGFVNESRLQKIVSVIDEEIKKR* 0

>SELV_ponPyg Pongo abelii (orang)
0 MNNQARTPAPSSARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGTPPLVLTPAPAQIPTLVPTPALARIPRLVPPPAPAWIPTPVPTPVPVRNPTPVPTPARTLTTPV
RVPAPAPAPPPAQVLAGIRAALPILDSYLAPALRLHPPPEPAPELPLSPEEDPEPAPSLKLIPSVSSEAGPAPGPLPARTPQAANSPGPTLDFTFRADLSATGLADPPIPSPVPSPILGT
TSSAISLQNSTENFASSSENFALDKRVLIRVTYc 2
1 GLuSYSLR 0
0 YILLKKSLEQQFPNHLLF 0
0 EEDRAAQATGEFEVFVNGRLVHSKK 0
0 RGDGFVNESTLQKIVSVIDEEIKKR* 0

>SELV_macMul Macaca mulatta (rhesus) distal seq from M. nemestrina transcript
0 MNNQARTPAPSSARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTQTPVLTPSPARLPALVQPPAPAHIPTQVPTPALARIPGLVPPPARAWIPTPVPTPARVRNPTPVPTPARTLTPPV
RVPAPAPDPAQVLAGIRTALPALDSYPAPALSLDSPPEPAQEPPLSPEEDPEPAPSLKLIPSVSSEAGPALGPLPAHTPLAAKSSGPTLDFTFRADPSATGLADPHIPSPVPAPILGTIP
SALSLQNFTETLVSTSENFALDKRVLIRVTYC 2
1 GLuSYSLR 0
0 YILLKNSLEQQFPSHLLF 0
0 EEDRAAQATGEFEVFVNGRLVHSKK 0

>SELV_calJac Callithrix jacchus (marmoset)
0 MNNQARTPAPSSARTSTSVRASTPTRTPTPVRTPTAVRIRTPIRTLTPSLAGTPALVPTPTPARISRLVPTPAPARTPTPIPTLVRTLTPVPLPAPARIPAPAPAPAPAPAPAPSPALVP
AGIRATLPVLDSYPALALPWDPPPEPVPEPLVSVSSEEDPEPAPSLKLVPSVSGETGPAPGPLPACTPLATNPPEPTLDFTADSSATELAVPTIPGSVPAPILGTIPLAASLLNSTESFL
SASENFALDKRVPIRVTYc 2
1 GLuSYSLR 0
0 YILLKKSLEQRFPDCLLF 0
0 EEDRAAQATGDFEVFVDGRLVHSKK 0
0 RGDGFVDEASLQKIVSVIDEEIKKR* 0

>SELV_otoGar Otolemur garnettii (bushbaby)
0 MNSQARASVHPSRTSTAVRASIPARVHPRARTTPVQPRTLITQDRIPAPVRVPP
1 GLuSYGLQ 0
0 YILLRQNLEHHFPNRLLF 0
0 EEGRAAQATGEFEVFVDGKLVHSKK 0
0 NGDGFVDEIRLQKIVNIIDEEIKKRQ 0

>SELV_musMus Mus musculus (mouse) AV279316 58 Pro, 25 Thr, 25 Ser syntenic chr 7 46% hsa!
MNNKARVPAPSSVRANTPARTPAPIRTATPVRAPNPAHNSTPVRTSIRVRAPAQVPNPVPIRFPTPAPVPAPTLTPAPTPAPVRHAAPVRTPAPVRAPNLGRV
FPKISPGRFFPSLASPTAQPLSSRAASALLKDPTLAQNQKPSIHSLAEAIQGPLPVLTPSSSKTQGSIPDTASPIDSLASTAMASSTLGPIPGPNPTLEFLAS
PLKETPGLGKLSTISPAPSFGSTKEIPSTSEDVPTPNRILIRVMYC 2
1 GLuSYGLR 0
0 YIILKRTLEHQFPNLLEF 0
0 EEERATQVTGEFEVFVDGKLIHSKK 0
0 KGDGFVDESGLKKLVGAIDEEIKKR* 0

>SELV_ratNor Rattus norvegicus (rat) chr 1 synteny 83% mmu 
MNNKARNPAPSSVRANTPSRTPTPVRTATPVRASTPAHNRSPVRTSIRVRTPANPVPIRFPTPAPAPAPTPTPAPTPAPVPAAAPVRTPAPVRASIQGRSFPTIS
PVRFLRNLALPAAQPLSSGGAGSLSKDLTLAQKQKPSIHSLAEAIQGPFPVLTPSASSETHGSIPDPAPPTDSLASTAMASSTLDPIPGPKPTLEFLASPLKETP
DLGKLSTISPAQNFVSTKEVPSTSEDVPTANRILIRVMYC 2
1 GLuSYGLR 0
0 YILLKKTLEHQFPNLLEF 0
0 EEERATQVTGEFEVFVDGKLIHSKK 0
0 KGDGFVDETSLKKLVGAIDEEIKKR* 0

>SELV_cavPor Cavia porcellus (guinea_pig)
1 GLuSYGFQ 0
0 YTLLKMSLKQQFPNLLRF 0
0 EDERATQVTGEFEVFVEGKLVHSKK 0
0 QGDGFVDDNRMQTIVNAINEEIKRR* 0

>SELV_oryCun Oryctolagus cuniculus (rabbit)
0 MNNQARTPAPAQARTSSVVRASAPTRVSTQIRTPATGWTPTPVQASTPVRTQTPVRTPTLVQASTPVRTPTLGQASTPVRTPTPVQALTPVRTPTPVSGPDSGRTPTPVGTPVGTL
1 GLuSYGLR 0
0 YILLKKNLEELFPDCLLF 0
0 EEERATQASGEFEVFVNGKLVHSKK 0
0 KGDGFVDEVKLRKIVTAINEEIK... 0

>SELV_canFam Canis familiaris (dog) CO599097
0 MNNQARAPPRTSARVLAWVRASTPVRTSIPVRTPTPARIPTSSRAPTSVQTPAPARTPNPVQIPTPVQTSTSARIPNPARTLTPVQTPASAWTPNPVQMLTPARTPTPVPTP
VPTPIPARTPTPARTPTPVEAAAAAPASESFGSSALPLEPPPEPASEPTTSPHQDLSPTPSVKPLPSVTNGFGSTQEPLPDLTPPATDFLGPTLGSTSRADSSATKLTDSSESVR
VPIPGTPSATALATSTNTFAPVGESCSVKIAVRVIYC
1 GLuSYGLR 0
0 YILLKKSLEQQFPNCLLF 0
0 EEERAAQATGEFEVFVDGKLVHSKK 0
0 KGDGFVDEARLQKIMNVIEEEVRKR* 0

>SELV_felCat Felis catus (cat)
0 MNNQARAPNPSPARVLALVRASTPVRSSIPVRIPTPARIPARTRNPQTPVRAPNPVQIPNPVQISARASNPARAPRPVQIPARTPNPPQTLTPGRAPNPVPTPVQTPTLVQTP
TPVQTPTPVWTETPVQTPTAVGTPTRVQTLTPVIPRVRTPTPIRPLWPDPSPNLIPSGPDHPAPESSLRNSAPSFWINSPDSLPAPILETPSTAFASTFENLPEDSKILIRVIYC
1 GLuSYSLR 0
0 YILLRKSLELQFPNCLRF 0
0 EEERSAQATGEFEVFVDGKLVHSKK 0
0 KGDGFVDEARLQKIVSTIDEEIRKR* 0

>SELV_bosTau Bos taurus (cow) DT829759
0 MNNQPRTPAPTPARASTPVRGSTLHRTSIQVRTPTPGPDSGPTRITTLLRTPALIRTLTPIRTPTPVRTPTPVPPSTPVRSPIPVRTPTPVPPSTPVRTPTPVPLSTPVRPPTPVRPPTPVRPPTPVRPPTP
IGTPTLIRSPTPVQIPIPEPIPTPIPSRVLIPPLESFPDSALPSGPPLELEPTLTVSPAKNLEPSPAKNLEPSPRVKQVSSAANGFPPIQEPLPALTPLATDLRSPSLGSPLRTDTSTTNLIASSSGHVPGTPILGAIQAILPVPATALASISGNLKEENKIMIRVVYC
1 GLuSYSLR 0
0 YILLRKSLEQQFPNSLIF 0
0 EEEISAQATGEFEVFVDGKLIHSKK 0
0 NGDGFVDEVKLQTIVNLLNEEFKKR* 0

>SELV_susScr Sus scrofa (pig) ti|851198642 CX061656
0 MNNPARAPAPSPVRPSASLRPLASVRASTPARGSTLARTSLLVRTPRTPNLVPASGPGPIRTPTPVRTPTPVRTPTPVRTLTPVRTPTPVRTPTPVRTPTPVRTPTPVRTPTPVQVPTPVRVPTTVGIPTP
TLSQVLVPALESLPNPALPSLDPPLEPDPELTLSPDEDPAPTPRAKHLPLVANGFVPVQEPLPALSPLATNLLESTPGAGTDSSTTKLTDSTSGHVPGTPILATIPLAVALPVSTNALASTSENIQVEQQILIRVVYC 2
1 GL*SYGLR 0
0 YILLKKSLEQQFPNCLVF 0
0 EEDISAQATGAFEVFVDGKLVHSKK 0
0 KGDGFVDETKLQKIVSHINEEIKTKVAGSC* 0

>SELV_equCab Equus caballus (horse)
1 GLuSYGLR 0
0 YILLRKSLDQQFPNRLVF 0
0 EEDVGAQASGEFEVFVEGKLVHSKK 0
0 KGDGFVDEARLRKIVSAINEEIKRR* 0

>SELV_myoLuc Myotis lucifugus (microbat)
0 MNHQARASHPFPARTPASVRASIPVRPSNQTPAPSSAPTRTSIPVPASTAVRTPTLVRTPIPFQAVSPVRTSTRFSASVPVQFPTPVSASTPVQTPTRVPAPVRTSTRFPASV
PVQFPTPVPASTPVRTSTPVPASTPVQTPTRVPAPAQVRTSIPVPAPAPARTSTPVPAPAPVRTSTTVPAPVPFRTPTPVPAPAPVRTSTPVPAPAPVRTSTPVPAPGPNPDSSPCPGPG
1 GLuSYNLR  0
0 YILLKKSLEQQFPNCLYF 0
0 EEDISEQATGEFEVFVDGKLVHSKK 0
0 KGDGFVDEVKMQKIVNIIDEEIRKN* 0

>SELV_pteVam Pteropus vampyrus (macrobat) 
0 MNNKARASAPSLPGTSALVRASAPVRPSTPVRAPTLIRTPTPVRTPNPVPASTPARVPTPVRTLTPVRTPTPIRTPTPVRTQIPFRAANPVRAANPVRTPTPLRTPTPVRTSIPVRTL
APVSTPTPIPPPTPPAVPAVVPAAVPGLEFFPSPPPEPVPEPALSPDKDPAPTQSVKHVPSVANEFGLTQEPLSALAPLATDLLGPTRASIRRADPEATKLTGSTAEPSPASILELISSAHIGIASEYLQVDTGSVIRVILL 
1 GLuSYNFR 0
0 YILLRKNLEQQFPHGLLF 0
0 EEEVSAEGTGEFEVLVDGKLIHSKK 0
0 RGDGFVDEARLQKIVNVINE IRKR* 0

>SELV_dasNov Dasypus novemcinctus (armadillo) ti|593909023 
1 GLuSYALR  0
0 YILLKKSLEMQFPNRLIF 0
0 EEARSSQAVGEFEVFVDGALVHSKK 0
0 RGDGFVDDSKMEKIVSSIEEAVKKT* 0

>SELV_loxAfr Loxodonta africana (elephant)
0  2
1 GLuSYGLR 0
0 YILLRKSLEQQFPNHLIF 0
0 EEDISSQATGEFEVFVDGKLVHSKK 0
0 KGDGFVDDTRLQKIVNSINETIKKE* 0

>SELV_proCap Procavia capensis (hyrax)
0  2
1 GLuSYSLR 0
0 YILLRKNLEQQFPNHLLF 0
0 EVDISSQATGEFEVFVDGKLVHSKK 0
0 KGDGFVDDAQLQKIVNSINETIQRR* 0

SEPW1: 26 vertebrate sequences

>SEPW1_homSap  Homo sapiens (human) Selenoprotein W  chr19 87 aa uc002phn.1 has retroprocessed pseudogene
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_panTro Pan troglodytes (chimp)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 RGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_ponPyg Pongo pygmaeus (orang_sumatran) CR926472 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_macMul Macaca mulatta (rhesus)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_macFas Macaca fascicularis (cynomolgus_monkey) AB169486 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_papAnu Papio anubis (baboon) EY285690 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_calJac Callithrix jacchus (marmoset)
0 MALTVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 SGEGTPQATGFFEVTVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_micMur Microcebus murinus (mouse_lemur)
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 CGEGTPQATGFFEVMVAGKLVHSKK 0
0 GDGYVDTESKFLKLV

>SEPW1_musMus Mus musculus (mouse) 
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEHEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFRKLVTAIKAALAQCQ* 0

>SEPW1_ratNor Rattus norvegicus (rat) BC087625 
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEHEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFRKLVTAIKAALAQCQ* 0

>SEPW1_cavPor Cavia porcellus (guinea_pig)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEDEFPGCLDI 0
0 CGEGTPQTTGFFEVTVAGKLVHSKK 0
0 GGDGFVDTEGKFRKLVAAIKAALAQG* 0

>SEPW1_oryCun Oryctolagus cuniculus (rabbit)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKKKLEDEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_ochPri Ochotona princeps (pika)
0 MALSVRVVYW 2
1 GAuGYKPK 0
0 YLQLKKRLEDEFPGCLDI 0
0 GEGTPQVTGFFEVMVAGKLVHSKK 0
0 SGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_canFam Canis familiaris (dog)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQATGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFLRLVAAIKTALAQG* 0

>SEPW1_felCat Felis catus (cat)
0 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQATGFFEVMVGGKLVHSKK 0
0 RGDGYVDTESKFLKLVAAIKAALAQG* 0 

>SEPW1_ bosTau Bos taurus (cow) 
0 MAVVVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPSRLDI 0
0 RGEGTPQVTGFFEVFVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQA* 0

>SEPW1_oviAri Ovis aries (sheep)
0 MAVVVRVVYC 2
1 GAuGYKPK 0
0 YLQLKKKLEDEFPSRLDI 0
0 CGEGTPQVTGFFEVFVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQA* 0 

>SEPW1_susScr Sus scrofa (pig) AF380118 
0 MGVAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQVTGFFEVLVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_eriEur Erinaceus europaeus (hedgehog) 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQGTGFFEVLVAGKLVHSKK 0
0 KGDGYVDTETKFLKLVTAIKAALAQG* 0

>SEPW1_sorAra Sorex araneus (shrew)
0 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCVDV 0
0 CGEGTPQVTGFFEVMVAGKLVHSKK 0
0 RGDGYVDSESKYVRLVTAIKTALAQA* 0

>SEPW1_Choloepus hoffmanni (sloth)
0 MALAVRVVYW 2
1 GAuGYKPK 0
0 YVQLKKKLEDEFPGCLDI 0
0 SGEGTPQTTGFFEVMVAGKLVHSKK 0
0 QKGDGFVDTESKFLRLVAAIKAALAQG* 0

>SEPW1_monDom Monodelphis domestica (opossum) diverged
0 MAIQVRVVYW 2
1 GAuGYKPK 0
0 YLLLKKKLEDEYPGLLRH 0
0 NGEGTPEVTGFFEVTVAGKLVHSKK 0
0 AGHGFVDTADKYLQIVAEIKAALA* 0

>SEPW1_ornAna Ornithorhynchus anatinus (platypus) 
0 MASLEAFPRGVVPVHVVYC 2
1 GAuGYKPK 0
0 FLQLKKKLENEFPGQVEI 0
0 SGEGTPQVTGWFEVTVAGKLVHSKK 0
0 EGDGFVDSESKFAKIRMAIKAALVPGY* 0

>SELW_galGal Gallus gallus (chicken) tga confirmed
0 MPLRVTVLYC 2
1 GAuGYKPK 0
0 YERLRAELEKRFPGALEM 0
0 RGQGTQEVTGWFEVTVGSRLVHSKK 0
0 NGDGFVDTNAKLQRIVAAIQAALP* 0

>SELW_anoCar Anolis carolinensis (lizard) 
1 GAuGYSPK 0
0 YQQLKRGLEKEFPGKLEI 0
0 TGEGTPQVTGWFEVTVAGKLVHSKK 0
0 NGDGFVDNDTKLHKILMAIKAALA* 0

>SELW_xenTro Xenopus tropicalis (frog) tga confirmed 
0 MPDTMVKVNVVYC 2
1 GAuGYLSK 0
0 FRRLKKELEQRFPGKLSI 0
0 DGEGTERMTGWFEVSINGKLVHSKK 0
0 NGDGYVDNDAKLQKIILAIEAALKQ* 0

>SELW_danRer Danio rerio (zebrafish) tga confirmed
0 MTVKVHVVYC 2
1 GGuGYRPK 0
0 FIKLKTLLEDEFPNELEI 0
0 TGEGTPSTTGWLEVEVNGKLVHSKK 0
0 NGDGFVDSDSKMQKIVTAIEQAMGK* 0

>SEPW1_takRub
0 MGVTIRVEYC 2
1 GGuGYGPR 0
0 YEELARVVRAEFPDADVSGFVGRM 1
2 GSFEIQINEQLIFSKLETGGFPYEDD 0

>SEPW2_calMil Callorhinchus milii (elephantfish)
1 GAuGYEPRYQKLAIVIKDEFPDADVSGKVGRT 1
2 GSFEIEINGQLIFSKLETGGFPYEND 0
0 ISEAVQKANNGEELQKIENSRPPCVIL* 0
0 VMHAIQCVSDGKPVEKITKSRPPCVIM* 0

DIO123: 112 vertebrate deiodinases

>DIO1_homSap Homo sapiens (human) iodothyronine deiodinase type I 4 exons chr1 uc001cwb.1
0 MGLPQPGLWLKRLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGQRCNIWEFMQ 1
2 GNRPLVLNFGSCTuPSFMFKFDQFKRLIEDFSSIADFLVIYIEEAHAS 1
2 DGWAFKNNMDIRNHQNLQDRLQAAHLLLARSPQCPVVVDTMQNQSSQLYAALPERLYIIQEGRILYK 0
0 GKSGPWNYNPEEVRAVLEKLHS* 0

>DIO1_panTro Pan troglodytes (chimp)
0 MGLPQPGLWLKRLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVHLSGQRCNIWEFMQ 1
2 GNRPLVLNFGSCTuPSFMFKFDQFKRLIEDFSSIADFLVVYIEEAHAS 1
2 DGWAFKNNMDIRNHQNLQDRLQAAHLLLARSPQCPVVVDTMQNQSSQLYAALPERLYVIQEGRILYK 0
0 GKSGPWNYNPEEVRAVLEKLHS* 0

>DIO1_ponPyg Pongo pygmaeus (orang_sumatran)
0 MGLPQPGLWLKRLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTETGGLAPNCPVVRLSGQRCNIWEFMQ 1

>DIO1_macMul Macaca mulatta (rhesus)
0 MGLPQSGLWVKKLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGQRCNIWDFMQ 1
2 GNRPLVLNFGSCTuPSFMFKFDQFKRLIEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNMDIRNHRNLQDRLQAAHLLLARSPQCPVVVDTMQNQSSQLYAALPERLYVIQEGRILYK 0
0 GKSGPWNYNPEEVRAVLEKLYS* 0

>DIO1_macFas Macaca fascicularis (crab-eating macaque)
0 MGLPQSGLWVKKLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGQRCNIWDFMQ 1

>DIO1_macNem Macaca nemestrina (pigtailed macaque)
0 MVLPQSGLWVKKLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGPSCNIWDFMQ 1

>DIO1_calJac Callithrix jacchus (marmoset)
0 MGLPGPGLWLKRLWVLLEVAVHVAVGKVLLTLFPDRVKKNILAMGDKTGMTRNPNFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVHLSGQRCNIWDFMQ 1
2 GNRPLVLNFGSCTuPSFMFKFDQFKRLTEDFSSVADFLIIYIEEAHAS 1
2 DGWAFKNNVDIRNHQNLQDRLQAAHLLLARSPQCPVVVDTMQNQSSELYAALPERLYIIQEGRILYK 0
0 GKSGPWNYNPEEVRDVLEKLHS* 0

>DIO1_otoGar Otolemur garnettii (bushbaby)
0 MGLPRPGLWLKRLWVFLEVAVHVAVGKMLLILFPDRVKSQILAMGQQTVMAKNPHFSHDNWIPTFLQHPVFLVCPEGPLQRLEDMTERGGLAPNCPVSASSGQKCNIWDFMQ 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLVEDFSSVADFLIIYIEEAHAS 1
2 DGWAFKNNVDIRTHRNLQDRLRAAHLLLARSPQCPVVVDTMENQSSQLYAALPERLYVLQEGRILYK 0
0 GKSGPWNYQPEEVRAVLEKFDN* 0

>DIO1_micMur Microcebus murinus (mouse_lemur)
0 MGLPCPGLWLKRLWVLLQVAVHVAVGKVLLTLFPERVMQHILSIGQKTGMARNPHFTPDNWVPTFFSTQYFWFVLKVRWQQLEDVTERGGLAPNCPVVRLSGQRCNIWDFMQ 1

>DIO1_tupBel Tupaia belangeri (tree_shrew)
0       RAGLWLKRLWVFLQLTVQVAVGKVLLTLFPERVKQHILALGQKTGIARNPNFAHDNWIPTFFSTQYFWFVLKVYWQRLEDTTEPGGLAPNGPVVHLSGQRCDIWDFMQ 1
2 GDRPLVLNFGSCTuPSFLFKFDQFKRLIEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVNIRNHQNLQDRLQAAHLLLDRSPQCPVVVDTMQNQSSQLYAALPERLYVLQEGRILYK 0
0 GKPGPWNYDPEEVRAVLEKLRS* 0

>DIO1_musMus Mus musculus (mouse)
0 MGLPQLWLWLKRLVIFLQVALEVAVGKVLMTLFPGRVKQSILAMGQKTGMARNPRFAPDNWVPTFFSIQYFWFVLKVRWQRLEDRAEFGGLAPNCTVVCLSGQKCNIWDFIQ 1
2 GSRPLVLNFGSCTuPSFLLKFDQFKRLVDDFASTADFLIIYIEEAHAT 1
2 DGWAFKNNVDIRQHRSLQERVRAARMLLARSPQCPVVVDTMQNQSSQLYAALPERLYVIQEGRICYK 0
0 GKAGPWNYNPEEVRAVLEKLCTPPRHVPQL* 0

>DIOI_ratNor Rattus norvegicus (rat) 
0 MGLSQLWLWLKRLVIFLQVALEVATGKVLMTLFPERVKQNILAMGQKTGMTRNPRFAPDNWVPTFFSIQYFWFVLKVRWQRLEDRAEYGGLAPNCTVVRLSGQKCNVWDFIQ 1
2 GSRPLVLNFGSCTuPSFLLKFDQFKRLVDDFASTADFLIIYIEEAHAT 1
2 DGWAFKNNVDIRQHRSLQDRLRAAHLLLARSPQCPVVVDTMQNQSSQLYAALPERLYVIQEGRICYK 1
0 GKPGPWNYNPEEVRAVLEKLCIPPGHMPQF* 0

>DIOI_speTri Spermophilus tridecemlineatus (squirrel)
0 MGLLRPGLWLKKLWILLLVIVEVAMGKVLMTLFPERATQNILAMGQKTGMTRNPQFSPDNWVPTFFSIQYFWFVLKVRWQRLEDKAELGGLAPNCPVIRLSGEKCNIWDFIQ 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLIEDFSPIADFLIIYIEEAHAS 1
2 1
0 * 0

>DIOI_dipOrd Dipodomys ordii (kangaroo_rat)
0 MELSRLGLWLKRLWIFLQVVVEVAMGKMLMILFPERVKKHILAMGQKTGMTRNPRFSPDNWVPTFFSTQYFWFVLKVRWQRLEDKAMYGGLAPNCSVISLSGQRCSIWDFMQ 1
2 GNRPLVLNFGSCTuTSFLFKFDQFKRLVEDFDSTADFLIIYIEEAHAS 1
2 DGWAFKNNVNISHHRNLQDRLQAAQLLLDQKPQCPVVVDTMENQSSQLYAALPERLYVLQEGRILYK 1
0 GKPGPWNYNPEEVRAVLEKLCT* 0

>DIOI_ochPri Ochotona princeps (pika)
0 MAWPQVRLWLRRLWVLVQVAVEVAVGKVLMTLFPERVKQSILAMGQKTGLAQNPLFTHDNWIPTFFSIQYFWFILKVRWQRLEDATQPGGLAPNCPVVCLSGQECHIWDFMQ 1
2 MTGNRPLVLNFGSCTuPSFLSKFNQFKRLIEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVDIRNHRNLQERLQAAHLLLPRSPQCPVVVDTMQNQSSQHYAALPERLYVLQQGRILYK 1
0 GQPGPWNYDPEEVRAVLLELHS* 0

>DIOI_cavPor Cavia porcellus (guinea_pig) 
0 MGLTWPGLWLKRLWVLVQVAVEVAMGKVLMTLFPERIKKSILAMGEKTGMTRNPQFSHDNWIPTFFSTQYFWFILKVRWQRLEETAELGGLAPDCSVVCLSGEKRTIWDFMH 1
2 GNRPLVLNFGSCTuPSFMFKFDQFKRLIEDFSSIADFLVIYIEEAHAS 1
2 DGWAFKNNVDIRQHQNLQDRMRAAHLLLAKSPQCPVVVDTMQNESSQLYAALPERLYVQEGRILYK 1
2 GKSGPWNYNPEEVRGVLEKLHT* 0

>DIO1_oryCun Oryctolagus cuniculus (rabbit)
0 MGLPRPGLWLKRLWVLVQVAVEVAVGKVLMTLFPERVKQNILAMGQKTGIAQNPNFAQDSWIPTFFSTQYFWFVLKVRWQRLEDATEPGGLAPNCSVVRLSGQQCSIWDFMR 1
2 GNRPLVLNFGSCTuPSFLSKFDQFKRLIQDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVDIKNHRNLQDRLRAASLLLARSPQCPVVVDTMQNQSSQLYAALPERLYVLRQGRILYK 0
0 GESGPWNYNPEEVRAVLEELHS* 0

>DIO1_canFam Canis familiaris (dog)
0 MGLPRPVLWLRRLWVLLQVAVQVAVGKVFLKLFPARVKQHIVAMNGNKPHFSYDNWAPTLYSMQYFWFVLKVQWQRLEDRTEPGGLAPNCPVVRLSGQRCNIWDFMQ 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLIEDFCSTADFLIIYIEEAHAS 1
2 DGWAFKNNVNIRTHQTLQDRLQAARLLLDRAPPCPVVVDTMRNQSSQFYAALPERLFVLQEGRILYK 0
0 GKPGPWNYHPEEVRAVLEKLHS* 0

>DIO1_ursArc Ursus arctos AM748338
0                   VAMHVAVGKVLLILFPERVKQQVLAMNKKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVRLS

>DIO1_melUrs Melursus ursinus AM748341
                     AMHVAVGKVLLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVRLSG

>DIO1_ursAme Ursus americanus AM748339
                     AMHVAVGKVLLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVRLSG

>DIO1_ursMar Ursus maritimus AM748337
                     AMHVAVGKVLLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVRLSG

>DIO1_ailMel Ailuropoda melanoleuca AM748344
                     AMHVAVGKVFLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYGMQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVQLSG

>DIO1_treOrn Tremarctos ornatus AM748343
                     AMHVAVGKVLLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVQLSG

>DIO1_helMal Helarctos malayanus AM748340
                            GKVLLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAPNCP

>DIO1_ursthi Ursus thibetanus AM748342
                      AMHVAVGKVLLILFPERVKQQVLAMNQKNPHFSYDNWLPTFYSMQYFWFVLKVRWQRLEDRTEPGGLAP

>DIO1_felCat Felis catus (cat)
0 MGLSQLGLWLRRLWVLFQVALQVAVGKVFLILFPSRVKQHIVAMNRKNPHFSYDNWAPTLYSVQYFWFVLKVRWQRLEDRTEPGGLAPNCPVVRLSGQRCSIWDFMK 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLIEDFCSIADFLIIYIEEAHAS 1
2 DGWAFKNNVNIRNHRNLQDRLQAACLLLDRSPRCPVVVDTMKNQSSRLYAALPERLYVLQAGRILYK 0
0 GKPGPWNYHPEEVRAVLEKLHS* 0

>DIO1_equCab Equus caballus (horse)
0 MGLPRAGLWLKRLWVLLQVALQVAVGKVLLTLFPDRVKQHIVAMNQKNPHFSYDNWVPTLYSTQYFWFVLKVHWQRLEDTTKRGGLAPNSPVVRLSGQRCNIWDFMQ 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLIEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVVIRNHRNLQDRLRAARLLLDRSPPCPVVVDSMENRSSQLYAALPDRLYVLQARRILYK 0
0 GKPGPWNYHPEEVRAVLEKLHS* 0

>DIO1_myoLuc Myotis lucifugus (microbat)
0 MGLPQPGLWLKRLWILLQVALHVTLGKVQLKLFPRRVKQHILAMNRKNPHFSYDNWAPTLFSTPYFWFILKVRWQRLEDKTEEGSLAPNCPVVRLSGQRCHIWDFMQ 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLIEDFSAIADFLVIYIEEAHAS 1
2 DGWAFKNNVVIKNHRNLQDRLQAAHLLLDRSPRCPVVVDTMKNQSSQLYAALPDRLYVLQEGRILYK 0
0 GKPGPWNYHPEEVRAVLEKLRS* 0

>DIO1_pteVam Pteropus vampyrus (macrobat)
0 MELPWPGRWLKRLWVLLQVALHVAVGKVQLTLFPRRVKQNIVAMNRKNPHFSFDNWLPTLFSTQYFWFVLKVRWQQLEDTTKEGGLAPNCPVVCLSGQRCNIWDFMQ 1
2 GKRPLVLNFGSCTuPSFLLKFDQFKKLIEDFSSIADFLVIYIEEAHAS 1
2              ERLQAARMLLDRSPPVPVVVDTMKNQSSHLYAALHERLYVLQEGRILYK 0
0 GKPGPWNYHPEEVHAVLEKLHS* 0

>DIO1_bosTau Bos taurus (cow)
0 MGLPSPGLWLKRLWVLFQVALHVAIGKVLLTLFPRRVKQNILAMGEKTGMTRNPHFSHENWIPTFFSTQYFWFILKVRWQRLEDMTEQGGLAPNCPVVRLSGERCSIWDFMQ 1
2 GNRPLVLNFGSCTuPSFIFKFDQFKRLIEDFGSVADFLIIYIEEAHAS 1
2 DGWAFKNNVDIKNHRNLQDRLRAAHLLLDRSPPCPVVVDTMTNQSSSCYAALPERLYVLQEGRVLYK 0
0 GKPGPWNYHPEEVRAVLEKLHS* 0

>DIO1_turTru Tursiops truncatus (dolphin)
0 MGLPLPGLWLKRLWVLFQVGLHVAMGKVLLTLFPRRVKQNILAMSEKTGMAKNPHFSYENWIPTFFSAQYFWFILKVRWQRLEDMTEQGGRAPNCPVVRLSGQRCNIWDFMQG 1
2 GNRPLVLNFGSCTuPSFIFKFDQFKRLIEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVDIKNHQHLQDRLRAARLLLDRSPQCPVVVDTMKNQSSQLYAALPERLYVLQDGRILYK 0
0 GKPGPWNYRPEEVRAVLEKLHS* 0

>DIO1_susScr Sus scrofa (pig)
0 MELPLPGLWLKRLWVLFQVALHVAMGKVLMTLFPGRVKQDILAMSQKTGMAKNPHFSHENWIPTFFSAQYFWFVLKVRWQRLEDKTEEGGLAPNCPVVSLSGQRCHIWDFMQ 1
2 GNRPLVLNFGSCTuPSFIFKFDQFKRLIEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVDIKNHQNLQDRLRAAHLLLDRSPQCPVVVDTMKNQSSRLYAALPERLYVLQAGRILYK 0
0 GKPGPWNYHPEEVRAVLEKLHS* 0

>DIO1_vicVic Vicugna vicugna (vicugna)
0 MGLSLPGLWLKRLWVLFQVVLHVALGKVLLTLFPGRVKQDILAMSQRTGMAQNPHFSYENWIPTFFSTQYFWFILKVRWQRLEDTTEQGGLAPDCPVVCLSDRVHIWDFMQ 1
2                     FDQFKRLIEDFNSIADFLIIYIEEAHAS 1
2 DGWAFKNNVDIKNHRNLQDRLRAAHLLLDRSPQCPVVVDTMKNQSSQLYAALPERLYVLQKGRILk 0
0 GKPGPWNYHPEEVRAVLEKLHS* 0

>DIO1_eriEur Erinaceus europaeus (hedgehog)
0 MGLPSPGLWLKRLWVLFQVALHVAIGKVLLTLFPRRVKQNILAMGEKTGMTRNPHFSHENWIPTFFSTQYFWFILKVRWQRLEDMTEQGGLAPNCPVVRLSGERCSIWDFMQ 1
2 1
2 DRWAFKNNVDIRTHRNLQDRMRAALLLLDRDPQCPVVVDTMENQSSQLYAALPERLYVLQEGRILYK 0
0 GKPGPWDYQPQEVRAVLEKLRGKCGQTLPKL* 0

>DIO1_sunMur Suncus murinus (shrew)
0 MGLPGLGLLLKRFGVLVRVALKVAVGKVLLTLWPSAIRPHLLAMSEKTGMAKNPRFTYEDWAPTFFSTQYFWFVLKVNWQQLEDRTKQGDIAPDSPVVHLSGQRARLWDFMQ 1
2 GNRPLVLNFGSCSuPSFLFKFDQFKRLVEDFSSVADFLTVYIEEAHAS 1
2 DGWAFKNNVDIRRHRDLQERLQAARLLLDRNPGCPVVVDTMENRSSQLYAALPERLYVLQEGRILYK 0
0 GGPGPWNYHPEEVHAVLEQLCRSSAQSPRL* 0

>DIO1_loxAfr Loxodonta africana (elephant)
0 MGLPQPGLWLKRLWIFLKVALHVAMGKVLLILFPGRVKKNILAQNPHFAYDMWGSTLFSIPYFWFILKVYWQRLEDKTEEGGPAPDCPVVCLSGQRCNISDFMQ 1
2 GIRPLVLNFGSCTuPSFLSKLDQFKRLVEDFSSMADFLIIYIEEAHAT 1
2 DGWAFKNNVAIRNHRNLQDRLQAAHLLLDRSPQCPVVVDTMQNVSSQLYAALPERLYVLQEGRILYK 0
0 GKPGPWNYHPEEVRAVLEKLNS* 0

>DIO1_proCap Procavia capensis (hyrax)
0 MGLPQPVLWLKRLWVLLRVALHVAMGKVLLALFPGRVKKNILAQNPHFAYDMWCSTLFSVPYFWFVLKVYWQRLEDKTEEGGLAPNCPVVHLSGQRRNIWDFMQ 1
2 GIRPLVLNFGSCTuPSFLSKLDQFKRLVEDFSSMADFLIIYIEEAHAT 1
2 DGWAFKNNVAIRTHRNLQDRLQAARLLLDRSPQCPVVVDTMQNVSSQFYAALPERLYVLQEGRILYK 0
0 GKPGPWNYHPEEVRAFLEKFGS* 0

>DIO1_echTel Echinops telfairi (tenrec)
0 1
2 GTRPLVLNFGSCT*PSFLLKFDQFKRLMDDFHATADFLI 1
2 DGWAFKNNVTIRTHQNLEDRLRAARLLLDRGPQCPVVVDTMENESGRLYAALPERLYVLQEGRILYK 0
0 GKPGPWNYRPEEVRAVLEKLDS* 0

>DIO1_choHof Choloepus hoffmanni (sloth)
0 MGLSWPGLWLKRLWVLLQVALHVAMGKILLTLFPGRVKQNILAMSRRANNTKDPQFPYDNWGPTFFNTQYFWFVLKVRWQRLEDKTEQGGLAPNCPVVHLSRQRCNIWDFMQG 1
2 GNRPLVLNFGSCTuPSFLFKLDQFKRLVDDFSSIADFLIIYIEEAHAS 1
2                    RRRAARLLLDRSPQCPVVVDTMQNQSSQLYAALPERLYMLQEGRILYK 0
0* 0

>DIO1_dasNov Dasypus novemcinctus (armadillo)
0 MGLSQPGLWLKRLWILFQVALHVAVGKTLLTLFPGRVKQNILAKSQKSHKAENPHFPYDNWGPTFFNTQYFWFLLKIGWQRLEDKTEQGGLAPNCPVVHLSGQRCNIWDFMQ 1
2 GNRPLVLNFGSCTuPSFLFKFDQFKRLVEDFSSIADFLIIYIEEAHAS 1
2 DGWAFKNNVDIRNHRNLQDRQRAACLLLDRSPQCPVVVDTMQNQSSQLYAALPDRLYVLQEGRILYK 0
0 GKPGPWDYQPEEVRAVLETLNG* 0

>DIO1_monDom Monodelphis domestica (opossum)
0 MAELLRLWLWLWLRLQRLWVLLQVVGHVLMGKLMKMLSPDRMKQHILGMGQKSSIFQNPNFKYENWGPTFFTLPYFLFVLRVRWQRLEDQALQGGPAPDCPVVSLRGQPRRLWDFMH 1
2 ANRPLVLSFGSCTuPSFIFKFDQFHRIMEDFSSVADFLIIYIEEAHAT 1
2 DGWAFANNIDIKQHRTLQDRMEAARLLMDRKPLCPIVVDTMDNLATRKYAALPERFYILLEGHILYK 0
0 GGPGPWSYNPEEVRAVLETLSR* 0

>DIO1_triVul Trichosurus vulpecula (possum)
0 MAGPLLWVRRFWALLQVAFHVVVGKLLKTLFPNMMKKHILSLGQRSSISQNTQFAYENWGPTFFSIQYFFFVLKVRWQRLEDQALQGGLAPNPPVVTLKGESRHIWDFMH 1
2 GNRPLVLNFGSCTuPSFILKFDQFRKLIEDFSSIADFLIIYIEEAHAA 1
2 DGWAFENNIDIKQHRTLQDRRETAQLLMDRKPLCPIVLDTMDNLTSKKYAALPERFYVLLEGRILFK 0
0 GDPGPWNYHPEEVQAVLQ* 0

>DIO1_ornAna Ornithorhynchus anatinus (platypus)
0 1
2 GSRPLVLTFGSCTuPSFLFKFDQFNQLVQDFNSIADFLIIYIEEAHPT 1
2 DGWAFANNVDIPSHRSLQERQEAARRLLARGPRCPVVVDTMDNASSRQYAALPERLYLLREGKVVYK 0
0 GGPGPWNYNPGEVRAVLEKLS 0*

>DIO1_galGal Gallus gallus (chicken)
0 MLSIRVLLHKLLILLQVTLSVVVGKTMMILFPDTTKRYILKLGEKSRMNQNPKFSYENWGPTFFSFQYLLFVLKVKWRRLEDEAHEGRPAPNTPVVALNGEMQHLFSFMR 1
2 DNRPLILNFGSCTuPSFMLKFDEFNKLVKDFSSIADFLIIYIEEAHAV 1
2 DGWAFRNNVVIKNHRSLEDRKTAAQFLQQKNPLCPVVLDTMENLSSSKYAALPERLYILQAGNVIYK 0
0 GGVGPWNYHPQEIRAVLEKLK* 0

>DIO1_anoCar Anolis carolinensis (lizard)
0 MFKAGRLVLKTWLLLQVCLSTAVGKLFMILFPATAKRYILKQSERSSMGRNPNFVYENWGPTFFSFQYLLFVLKVKWKRLEDKALQGCPAPNTPVVDFDGKIHHILDFMQ 1
2 DNRPLVLAFGSCTuPSFMFRFGEFKKLIEDFSFAADFLVIYIEEAHAS 1
2 DGWAFKNNIVIKSHRTLHDRMQAAEILLKQYPLCSVVMDTMENLSSSTYAALPERLYVLQGGNIVYK 0
0 GGVGPWNYNPQEVREVLEKL* 0

>DIO1_xenTro Xenopus laevis (frog) tga confirmed
0 MLRYIQKALILFFLFLYVVVGKVLMFLFPQTMASVLKSRFEISGVHDPKFQYEDWGPTFFTYKFLRSVLEIMWMRLEDEAFVGHSAPNTPVVDLSGELHHIWDYLQ 1
2 GTRPLVLSFGSCTuPPFLFRLGEFNKLVNEFNSIADFLIIYIDEAHAA 1
2 DEWALKNNLHIKKHRSLQDRLAAAKRLMEESPSCPVVLDTMSNLCSAKYAALPERLYILQEGKIIYK 0
0 GKMGPWGYKPEEVCSVLEKKK* 0

>DIO1_danRer Danio rerio (zebrafish)
0 MGSAVGFALRKLFVYISAVLMVCAAILQMSMLKLLSFISPGRMRKIHMKMGERSTMTQNPKFRYEDWGPAFFSLAFIKTLFFVNWCSLGLEAFEGHAAPDSALITLDRQKTSVHRFLK 1
2 GNRPLVLSFGSCTuPPFLYKLDEFKQLVKDFSNVADFLIVYLAEAHAT 1
2 DAWAFKNNVDISVHKNLEERLAAARTLLKEDPPCPVVVDEMNNITASKYGALPERLYVIQSGKVIY 0
0 QASDLGGQA* 0

>DIO1_takRub Takifugu rubripes (fugu) genome glitches
0 MLLQKLAMYLSTAGLFCFMITLNVVLWILNIVAPALAKKIALKMGEKATMTQDPLFKYEDWGLTFASTALVKTASRHMWLSLGQEAFAGLEAPDSPVVTMERKRSSIGEFMK 1
2 TNRPLVLNFGSCTuPPFMFKLEEFKQLVRDFSDVADFLVVYIAEAHST 1
2 DGWAFKNNFDINQHRNLEDRLSAAQILVQKDPLCPVVVDDMNNSCAIKYGALPERLYVLQAGKVLYK 0
0 GAVGPWGYDPREVRSYLEKMK* 0

>DIO1_calMil Callorhinchus milii (elephantfish) weak match but introns ok
0 MWRRVVVYMQTVLLLVCICVRVAVGRVMLTLFPATTRRLELRNGLKTTMTLNPRFRFEDWGPSMFSLSSLRAVTTSIIANSGDRAFPGQPAPDTTLIDLDNTAHTIRSFIR 1
2 1
2 0
0 SGLGPWGYKPEEVRGVLHTLE* 0


>DIO2_homSap Homo sapiens (human)iodothyronine deiodinase type II 2 exons chr14 uc001xuu.1
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSSAEGGDNSGNGTQEKIAEGATCHLLDFASPERPLVVNFGSATuPPFTSQLPAFRKLVEEFSSVADFLLVYIDEAHPSDG
WAIPGDSSLSFEVKKHQNQEDRCAAAQQLLERFSLPPQCRVVADRMDNNANIAYGVAFERVCIVQRQKIAYLGGKGPFSYNLQEVRHWLEKNFSKRuKKTRLAG* 0

>DOI2_macMul Macaca mulatta (rhesus) 
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSSAEGGDNSGHGTQEKIAEGAACHLLDFASPERPLVVNFGSATuPPFTSQLPAFRKLVEEFSSVADFLLVYIDEAHPSDG
WAIPGDSSLSFEVKKHQNQEDRCAAAQQLLERFSLPPQCRVVADRMDNNANIAYGVAFERVCIVQRQKIAYLGGKGPFSYNLQEVRHWLEKNFSKRuKKTRLAG* 0

>DOI2_otoGar Otolemur garnettii (bushbaby) 
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGEEAPNSSVVHVSNPEKGDSSGNGAPENAAAECHLLDFASPERPLVVNFGSATuPPFTSQLPAFRKLVEEFSAVADFLLVYIDEAHPSDG
WAVPGDSSLSFEVKKHRNQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFFYNLQEVRRWLEKNFSKRuNRLAG* 0

>DOI2_tupBel Tupaia belangeri (tree_shrew) 
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSNLEGGHNGGNGTQEKTADGAECHLLDFANSERPLVVNFGSATuPPFTSQLPAFSKLLEESSVADFLVVYIDEAHPSDG
WAVPGDSSLSFEVKKHRNQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFYYNLQEVRRWLEKNFSKRuNSLAG* 0

>DIO2_ratNor Rattus norvegicus (rat)
0 MGLLSVDLLITLQILPVFFSNCLFLALYDSVILLKHVALLLSRSKSTRGEWRRMLTSEGLRCVWNSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSNPEAGNNCASEKTADGAECHLLDFACAERPLVVNFGSATuPPFTRQLPAFRQLVEEFSSVADFLLVYIDEAHPSDG
WAVPGDSSMSFEVKKHRNQEDRCAAAHQLLERFSLPPQCQVVADRMDNNANVAYGVAFERVCIVQRRKIAYLGGKGPFSYNLQEVRSWLEKNFSKRuILD* 0

>DOI2_canFam Canis familiaris (dog)
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGMRCIWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSNSEGGDNSRNGAQVKIVDGAECHLLDFASPERPLVVNFGSATuPPFTSQLPAFSKLVEEFSSVADFLLVYIDEAHPSDGWAVPGD
SSLSFEVKKHRNQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFYYNLQEVRRWLEKNFSKRuNRLAG* 0

>DOI2_bosTau Bos taurus (cow)
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGQWRRMLTSEGMRCIWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSSPEGGDTSGNGAQEKTVDGTECHLLDFASPERPLVVNFGSATuPPFTNQLPAFSKLVEEFSSVADFLLVYIDEAHPSDGWAVPGD
SSLFFEVKKHRNQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFFYNLQEVRRWLEKNFSKRuKRLAG* 0

>DIO2_susScr Sus scrofa (pig) tga tag
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGMRCIWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSNPEGSNNHGHGTQEKTVDGAECHLLDFANPERPLVVNFGSATUPPFTSQLPAFSKLVEEFSSVADFLLVYIDEAHPSDG
WAVPGDSSLSFEVKKHQNQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFYYNLQEVRRWLEKNFSKRuKLD* 0

>DOI2_equCab Equus caballus (horse)
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGMRCIWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSNPDRGDNRGNGAQGKTVDGTECHLLDFASSERPLVVNFGSATuPPFISQLPAFSKLVEEFSSVADFLLVYIDEAHPSDG
WAVPGDSSLSFEVKKHQNQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFYYNLQEVRRWLEKNFSERuNKLG* 0

>DOI2_sorAra Sorex araneus (shrew)
0 MGNLSVHLLITLQILPVFFSNCLFLALYDSVILLKHVVLMLSRSKSTRGEWRRMLTSEGMRCIWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSSSEDGSRNGAHEKTVDGAECHLLDFASPERPLVVNFGSATuPPFTSQLPAFSKLVEEFSSVADFLLVYIDEAHPSDG
WAVPRDSSLSFEVKKHRSQEDRCAAAHQLLERFSLPPQCRVVADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFCYNLQEVRDWLEKNFSKRuNSL* 0

>DOI2_echTel Echinops telfairi (tenrec) 
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVLLLKHVMLLLSRSKSTCGEWRRLLTFEGLRCVWKSFLLDAYKQ 0
0 VKLGEDAPNSGVVHVSNFEGGGNRGPGAQEKTADGAQCHLLDFARAERPLVVNFGSATuPPFTSQLPAFSQLVEEFSSVADFLLVYIDEAHPSDG
WAVPGDSSSSFEVKKHRNQEDRCAAAHRLLERFSLPPQCLVVADRMDNNANVAYGVAFERVCIVQQKIAYLGGKGPFCYNLQEVRCWLEKNFSKRuTRVAGQ* 0

>DOI2_dasNov Dasypus novemcinctus (armadillo) 
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGEDAPNSGVVHVSNPEGGDNSGNSAQEKTEDGAECHLLDFASPERPLVINFGSATuPPFTSQLPAFSKLIEEFSSVADFLLVYIDEAHPSDG
WAVPGGSSLSFEVKKHRNQEDRCAAVHKLLDRFSLPPQCHVVADRMDNNANIAYGVAFERVCIVQRQKIVYLGGKGPFCYNLQEVQHWLEKNFSKRuNRLAG* 0

>DOI2_monDom Monodelphis domestica (opossum) tga tag
0 MGLLSIDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGGDAPNSSVIHITSAEVGSGQQNNSRWKRFDGAECHLLDFANPERPLVVNFGSATuPPFTSQLPAFSKLVEEFSTVADFLLVYIDEAHPSDG
WAVPGNSSVSFEVKKHRNQEDRCAAAHQLLERFSLPPQCQVVADCMDNNANIAYGVSFERVCIVQRQKIAYLGGKGPFFYNLQEVRLWLEKNFSKRuNPG* 0

>DIO2_ornAna Ornithorhynchus anatinus (platypus) tga tga then tag
0 MGLLSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWNSFLMDAYKQ 0
0 VKLGGDAPNSSVVHVANANGETSGGNSPKWKNFSGRYGAECHLLDFASSERPLVVNFGSATuPPFISQLPAFSKLVEEFSAVADFLLVYIDEAHPSDG
WAVPGEFSLPFEVRKHQNQEDRCAAAHQLLERFSLPPQCQVVADCMDNNANVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLQEVRLWLEQNFSKRuNPD* 0

>DIO2_galGal Gallus gallus (chicken) 2 exons 
0 MGLLSADLLITLQILPVFFSNCLFLALYDSVILLKHMVLFLSRSKSARGEWRRMLTSEGLRCVWNSFLLDAYKQ 0
0 VKLGGEAPNSSVIHIAKGNDGSNSSWKSVGGKCGTKCHLLDFANSERPLVVNFGSATuPPFTSQLSAFSKLVEEFSGVADFLLVYIDEAHPSDG
WAAPGISPSSFEVKKHRNQEDRCAAAHQLLERFSLPPQCQVVADCMDNNANVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLQEVRLWLEQNFSKRuNPLSTEDLSTDVSL* 0

>DIO2_anoCar Anolis carolinensis (lizard)
0 MGLLSVDLLITLQILPVFFSNCLFLALYDSVILLKHMVLFLSRSKSARGEWRRMLTSEGLRCVWNSFLLDAYKQ 0
0 VKLGGEAPNSSVIHIAKGNDGSNSSWKSVGGKCGTKCHLLDFANSERPLVVNFGSATuPPFTSQLSAFSKLVEEFSGVADFLLVYIDEAHPSDG
WAAPGISPSSFEVKKHRNQEDRCAAAHQLLERFSLPPQCQVVADCMDNNANVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLQEVRLWLEQNFSKRuNPLSTEDLSTDVSL* 0

>DIO2_xenTro Xenopus tropicalis (frog) tga tag
0 MGLLSVDLLITLQILPGFFSNCLFLALYDSVVLVKHVLLQLSRSKSSQSQWRRMLTPEGLRCVWNSFLLDAYKQ 0
0 VKLGQDAPNSNVIQVSNKICKSVQRKLVGKCHLLDFASSERPLVVNFGSATuPPFISQLPAFSKLVEEFSSVADFVLVYIDEAHPSDG
WAAPGTTSYEVKKHRNQEERCAAASKLLEHFSIPPQCQVVADCMDNNANVAYGVSFERVCIVQKQKIVYLGGKGPFFYNIQEIRRWLELSFGKRuT* 0

>DIO2_neoFor Neoceratodus forsteri (lungfish) exons unknown
0 MGLLSVDLLITLQILPWFFSNCLFLALYDSVVLLKHVILLLSCSKSSRGEWRRMLTSEGLRTVWNSFLLDAYKQ 0
0 VKLGGDAPNSKVVRVTSGCCRRRSFSGKGESECHLLDFASSNRPLVVNFGSATUPPFISQLPTFRKLVEEFSDVADFLLVYIDEAHPADG
WAAPGVATKSFEVKKHRSQEERCVAAHKLLEHFSLPPQCQVVADCMDNNTNVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLKEVRHWLEQTYRKRuVPTCELIM* 0

>DIO2_gasAcu Gasterosteus  aculeatus (stickleback) cDNA taa stop
0 MGMASGGLRVTLQILPGFFSNCLFLALYDSVVLLKAVVSLLSCSRAAGRGARRRMLTSAGLRSVWRSFLLDAYKQ 0
0 VKVGLEAPNSKVVKVPGSPQRSSNLSNLISLPPGAMIANEVECHLLDFESSDRPLVVNFGSATuPPFISHLPAFRQLVEDFSDVANFLLVYIDEAHPSDG
WVAPALGSCSFDVPKHQNLEERLGAARKLIEHFSLPPQCQLVADCMDNNANVAYGVSNERVCIVKQRKIAYLGGQGPFFNLKDVQHWLEQNFGKRFSRTSAEKDMSHISKKGIIHQ* 0

>DIO2_oryLap Oryzias latipes (medaka) tga stop
0 MGSAGAELLVTLQILPGFFSNCLFMALYDSVVLLKRVVSLLSCSRSVSCGEWRRMLTSAGLRAIWNSFLLDAYKQ 0
0 VKVPESPRWSSSIKSMTSVPRGARAQTGDECRLLDFESSDRPLVVNFGSATuPPFISHLPAFRQLVEDFSDVANFLLVYIDEAHPSDG
WVAPQMGPCSFSFRKHQNLEERMGAARQLTEHFSLPPQCQLVADCMDNNANVAYGVSNERVCIVHQRKIAYLGGKGPFFYNLKEVRQWLEQLRQTVGPNTEE* 0

>DIO2_danRer Danio rerio (zebrafish) cDNA tga taa 
0 MGLLSVDLLVTLQILPGFFSNCLFFVLYDSIVLVKRVVSLLSCSGSTGEWQRMLTTAGVRSIWNSFLLDAYKQ 0
0 VKLGEAAPNSKVVKVTGINRCWSISGKTHNQCHLLDFESPDRPLVVNFGSATuPPFISQLPVFRRMVEEFSDVADFLLVYIDEAHPSDG
WVGPPMENFSFEVRKHRNLEERMFAARTLLEHFSLPPQCQLVADCMDNNANIAYGVSYERVCIVQKNKIAYLGGKGPFFYNLKDVRRWLEKCYGKu* 0

>DIO2_tetNig Tetraodon nigroviridis (pufferfish) tag 
0 MGMASEDLLITLQILPGFFSNCLFLALYDSVVLVRRVVSRLSCSRSAGAKEWRPMLTSAGLRSIWNSFLLDAYKQ 0
0 VKVPNGPRWSSISNMLPGANLRNGIECHLLDFESSNRPLVVNFGSATuPPFISHLPAFRQLVEDFSDVADFLLVYIDEAHPSDG
WEAPPMGPCSFNVRKHQNLEERLGAARKLIEHFSLPPQCQLVADCMDNNANVAYGVSNERVCIVQQRKIAYLGGKGPFFYNLKEVRQWLEHSYGKR* 0

>DIO2_takRub Takifugu rubripes (fugu) taa
0 MGMASEDLLITLQILPGFFSNCLFLALYDSVVLVRRVVSRLSCSRSAGPKEWRPMLTSAGLRSIWNSFLLDAYKQ 0
0 VKLGCEAPNSKLVKVPDGARWSSINNITNMLPGASLCNGIECHLLDFESSNRPLVVNFGSATuPPFISHLPAFRQLVEDFSDVADFLLVYIDEAHPSD
GWKAPPMGPISFNVRKHQNLEERIGAAQKLIEHFSLPPQCQLVADCMDNNANVAYGVSNERVCIVQQRKITYLGGKGPFFYNLKEVRQWLEQSYGPHLI* 0

>DIO2_funHet Fundulus heteroclitis tag tag
0 MGSASEDLLVTLQILPGFFSNCLFLALYDSVVLVKRVVALLSRSRSAGCGEWRRMLTSEGLRSIWNSFLLDAHKQ 0
0 VKLGCEAPNSKVVKVPDGPRWSSTVVPCGSRIQTGGECRLLDFESSDRPLVVNFGSATuPPFISHLPAFRQLVEDFSDVADFLLVYIDEAHPSDG
WVAPQMGACSFSFRKHQNLEERIGAARKLIEHFSLPPQCQLVADCMDNNANVAYGVANERVCIVHQRKIAYLGGKGPFFYSLKDVRQWLELSYGRR* 0

>DIO2_calMil Callorhinchus milii (elephantfish)no intron tga stop many extra aa subsequent
0 MGLLSMDLIMKLQILPGFSNCLFLAAYDSFVLLRQAVSLLSCSGLGPDPQHRMLTAEGMQVVWQSFLLDALKQ 0
0 VKVGLEAPNSAVARLDGGAPCRLLDFASRDRPLVVNFGSATuPPFVSRLPAFRQMVERYAEVADFLLVYVDEAHPSDGWALRSRFQLRRHRSQEER
CSAAGLLAREFGLPAACGVVADLMDNNANRAYGVAFERLCVVQSQKIAYLGGKGPFFYNLNGVREWLERHSGQRWG* 0

>DIO2_petMar Petromyzon marinus (lamprey)
0 MTAELNVSVFVALRILPGFFTNCLFLGLRDVLALLARSTRALFARHVPASCCPCPPEACRRVLTRAGMRAVWRSFLLDARRE 0
0 AQRAGDPAPNPRVALLADARSSSPAAAAAIPCRLQELSREGRPLVINMGSASuPPFVGRLPEFRRVVDDFSHAADFLLVYVEEAHPSDGWAVPGALQVSRARSLENNNSMA

>DIO2_petMar Petromyzon marinus (lamprey) frag
0 SRPVVDSLLILPGVFSNCLFLALYDAVSFLRRALQASLTHSAKGDAQHPRMLAGQGMLSVWRSYVLDAHKK
VRLGGEAPNSSV RPSPPQPPPPQLRQAPPCRLLDFARAHRPLVVNFGSASuPPFVEQLGEFCDLVRDFADVADFLVVYIEEAHPSDAWPAPGGLEVPRHLALGDRCVAASQLRGLMPPLGRCPVVADAMDNNANIDYGVSYERLYVIQDG
RIRYLGGKGPFFYRVREVKSFLESVKASR* 0

>DIO3_homSap Homo sapiens (human) 1 exon chr14
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLGRRRRGQPEPEVELNSEGEEVPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQSQHILDYAQGNRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYIIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDEQLHGARPRRV* 0

>DIO3_macMul Macaca mulatta (rhesus)
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLGRRRRGQPEPEVELNSEGEEVPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQSQRILDYAQGNRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYIIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDEQLHGARPRRV* 0

>DIO3_musMus Mus musculus (mouse)
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLRRRHPDHPEPEVELNSEGEEMPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVRPDGFQSQRILDYAQGTRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYVIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDEQLHGTRPHRF* 0

>DIO3_ratNor Rattus norvegicus (rat)
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLRRRHPDHPEPEVELNSEGEEMPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVRPDGFQSQRILDYAQGTRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYVIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDEQLHGTRPRRL* 0

>DIO3_cavPor Cavia porcellus (guinea_pig)
0 MLRSLLLHSLRLCAQIASCLVLFPRFLGTAFMLWLLDFLCIRKHFLRRRGLGQPEPEAELNSEGEEVPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQSQRILDYEHGNRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYIIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDEQLHGPRHGRF* 0

>DIO3_canFam Canis familiaris (dog)
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLGRRRRGQPEPAVELDSDGEELPPDDPPVCVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPNGFQNQHILDYARGNRPLVLNFGS
CTuPPFMARMSAFHRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAAQVLQQGAPSCSLVLDTMANSSSSAYGAYFERLYVIQNGTVMYQGGRGPDGYQVSELRTWLERYDEQLHGAQPRRV* 0

>DIO3_felCat Felis catus (cat)
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLGRRRRGQPEPAAELDSDGEEVPPDDPPVCVSDDNRLCTLASLKAVWHGQKuDIFKQAHEGGPAPNSEVVLPNGFQNQHILDYARGNRQLVLNFGR
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAAEVLQQGAPSCSLVLDTMANSSSSAYGAYFERLYVVQNGTIMYQGGRGPDGYQVSELRTWLERYDAQLRGGQPRRV* 0

>DIO3_bosTau Bos taurus (cow)
0 MLRSLLLHSLRLCSQTASCLVLFPRFLGTAFMLWLLDFLCIRKHLLGRRRRGQPEIEVELNSDGEEVPPDDPPVCVSDDNRLCTLASLRAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQNQHILDYARGNRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAARVLQQGAPECALVLDTMTNSSSSAYGAYFERLYIIQSGTIMYQGGRGPDGYQVSEVRTWLERYDEQLHGPQPRRV* 0

>DIO3_susScr Sus scrofa (pig)
0 MLHSLLLHSLRLCAQTASCLVLFPRFLGTACMLWLLDFLCIRKHLLGRRRRGEPETEVELNSDGDEVPPDDPPICVSDDNRLCTLASLRAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQNQHILDYARGNRPLVLNFGS
CTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAARVLQQGAPECSLVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDQQLHGPQPRRV* 0

>DIO3_loxAfr Loxodonta africana (elephant)
0 MLRCMLLHSLRLCAQTASCIVLFPRFLGTAFMLWLLDFLCIRKHFLGRRRRGQPETEVELNSDGEEVPPDDPPISVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQSQRILDYVRGNRPLVLNFGS
CTuPPFMARMSAFQRLVTKYRRDVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQNGTIVYQGGRGPDGYQVAKLRTWLERYDEQLHSAQTRRV* 0

>DIO3_echTel Echinops telfairi (tenrec)
0 MLRSPVLHSLVLCAQDAYSIVLFPCLLGTAFMQRQDFLCIRKHFLGLRRRGQPEPEVELNSDGEEVPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQSQHILDYARGSRPLVLNFGS
CTuPPFMARMSAFQRLVSKYRGEVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAARVLQQGAPDCALVLDTMTNSSSSAYGAYFERLYVIQNGTVMFQGGRGPDGYQVAELRTWLQQYSEQQHRAQPRRV* 0

>DIO3_monDom Monodelphis domestica (opossum) tga taa
0 MRSRREGDTEEPRKAGKGGGGGMGDEEDGAAVIQSPGLPPDDPPVWVSDSNRLCTLESLKAVWHGQKMDFFKTARKGSLAPNPEVIQLDGLKRLRILDFARGSRPLVLNFGS
CTuPPFMARLRAFQRLASTFLDIADFLLVYIEEAHPSDGWVSSDAAYDIPKHQCLQDRLRAARLLQEGVPGCLLAVDTMDNASSAAYGAYFERLYIIQDARVMYQGGRGPEGYKISELRHWLDQYKDRLQSPEASVVLPV* 0

>DIO3_ornAna Ornithorhynchus anatinus (platypus)
DCLLVYIEEAHPSDGWVSSDAPYDIPKHRCLQDRLRAARLMQRGAPGCRLAVDTMDNAASAAYGAYFERLYVVQDARVVYQGGRGPEGYKISELRLWLEQYRLRLRGPGPATAAAAAVLDV* 0
 
>DIO3_galGal Gallus gallus (chicken)
0 MLHSLGAHTLQLLTQAAACILLFPRFLLTAVMLWLLDFLCIRKKMLTMPTAEEAAGAGEGPPPDDPPVCVSDSNRMFTLESLKAVWHGQKLDFFKSAHVGSPAPNPEVIQLDGQKRLRILDFARGKRPLILNFGS
CTuPPFMARLRSFRRLAADFVDIADFLLVYIEEAHPSDGWVSSDAAYSIPKHQCLQDRLRAAQLMREGAPDCPLAVDTMDNASSAAYGAYFERLYVIQEEKVMYQGGRGPEGYKISELRTWLDQYKTRLQSPGAVVIQV* 0

>DIO3_xenTro Xenopus laevis (frog)
0 MLHCAGPHTGKLVKQVAACCLLLPRFLLTGLMLWLLDFQCIRRRVLLTAREESTAEHEDPPLCVSDSNRMCTVESLRAVWHGQKLDYFKSAHLGCSAPNTEVVMLEGRRLCKILDFSQGKRPLVVNFGS
CTuPPFMARLQAYRRLAAQHVGIADFLLVYIEEAHPSDGWLSTDASYQIPQHQCLQDRLAAAQLMLQGAPGCRVVVDTMDNSSNAAYGAYFERLYIVLEGKVVYQGGRGPEGYKISELRMWLEQYQQGLMGTKGSGQVVIQV* 0

>DIO3_ranCat Rana catesbiana (bullfrog)
0 MLPAPHTCCRLLQQLLACCLLLPRFLLTVLLLWLLDFPCVRRRVIRGAKEEDPGAPEREDPPLCVSDTNRMCTLESLKAVWYGQKLDFFKSAHLGGGAPNTEVVTLEGQRLCRILDFSKGHRPLVLNFGS
CTuPPFMARLQAYQRLAAQRLDFADFLLVYIEEAHPCDGWLSTDAAYQIPTHQCLQDRLRAAQLMLQGAPGCRVVADTMTNASNAAYGAYFERLYVILDGKVVYQGGRGPEGYKIGELRNWLDQYQTRATGNGALVIQV* 0

>DIO3_neoFor Neoceratodus forsteri (lungfish) 1 exon
0 MYQSSGVHTMNEVLKQAFACFILLPRFLVTALMLWLLDFLCVRRRVLLHMSRRQEASDLPDEPELCVSDSNRMFTLKSLRAVWHDQKLDFFKAAHIGLVAPNTEVIKLEGQRKAKILEFGGGKRPLILNFGS
CTuPPFMARLKAFRGVATQYKDVADFLLIYIEEAHPSDGWVSTDAPYQIPKHQCLEDRLKAAQLMNLEIPGCLVVVDTMDNASNAAYGAFFERLYIVQQERVVYQGGRGPEGYKISELKNWLDQYKSQLQNSSAVVIQV* 0

>DIO3a_danRer Danio rerio (zebrafish)
MEMLQGSAGVQSALKNAAVCVLLLPRFLLAALMLCLLDFLCIRRKLLLKMQEGAFSSPDDPPLRVSDSNKMFTLESLRAVWYGQKLDFFKSARLGGAAPNTEVFPLDGDARAAERILDYARGRRPLILNFGS
CS*PPFMTRLSAFQRVARQYADIADSLLVYIEEAHPSDGWVSSDAPVQIPRHRCLEDRLRAAQMLHREAAGSAVVVDSMQNSCNAAYGAYFERLYIVKDATVVYQGGRGPEGYRIAELRDWLERYRSGLQESAVLHV* 0

>DIO3bb_danRer Danio rerio (zebrafish)
0 MNTGRALKNALVCLLILPRFLVAAFMLWCLDFLCVRKRVLVHLQERAEEEEEDAEEEEEPLCISDSSRMFSWESLKAVFHGHKLDYMKSARLGHAAPDSEVFPLAEPRRGRVLEFARGHRPLVLSFGS
CSuPPFMRRLKAFRRLVLRYADVADALLIYIEEAHPSDGWRSSDAPHQIRRHRSLEERLSAARLMEREAPGCAVVADGMENAANSAYGAYFDRLYIVQDGRVVYQGGRGPEGFRISELRHWLDRYRERLRDAVIPV* 0

>DIO3bbb_danRer Danio rerio (zebrafish) bizarre ... matches opossum yet in both genomes
VHTLGFLTQVAACCFLGPRFLATAVTLWLLDFLCIRKKVLMRHRREGDAEDPDDPPIWVSDSNRFCTIESLKAVWHGQKMDFFKTAHKGSPAPNPEVIQLDGLKRLRILDFARGSRPLVLNFGS
CTuPPFTARLRAFQHLASTFLDIADFLLVYIEEAHPSDGWVTSDAAYDIPKHQCLQDRLRAARLLQEGAPGCLLAVDTMDNASSAAYGAYSERLYIIQEARVMYQGGRGPEGYKISELRHWLDQYKDRLQSPPASVVLPV* 0

>DIO3b_danRer Danio rerio (zebrafish)
0   GALKNALVCLLILTRFLVAAFMLWCLDFLCVRKRVLVHLQERAYAEQEEEPLCISDSSRMFSWESLKAVFHGHKLDYMKSARLGHAAPDSEVFPLAEPRRGRVLEFARGHRPLVLSFGS
CSuPPFMRRLKAFRRLVLRYADVADALLIYIEEAHPSDGWRSSDAPHQIRRHRSLEERLSAARLMEREAPGCAVVADGMENAANSAYGAYFDRLYIVQDGRVVYQ* 0

>DIO3a_takRub Takifugu rubripes (fugu) 1 exon
0 MLDSGGVQMATALKHAALCLMLLPRFLLTAVMLWLLDFLCIRRKVLLQMGQRQKSPDDPPVCVSDSNRMFTLESLGAVWYGQKLDFFKSAHLGSAAPNTEVMLVQERRQVRILDCMKGKRPLILNFGS
CSuPPFMTRLAAFQRVVRQYADIADFLVVYIEEAHPSDGWVSTDAPYQiPKHRCLEDRLRAAQLMLAEVPESNVVVDNMDNSSNAAYGAYFERLYIVRDERVVYQGGRGPEGYRISELRSWLEQYRNDVARSQTAVLHV* 0

>DIO3a_tetNig Tetraodon nigroviridis (pufferfish)
0 MLLPRFLLTAVALWLLDFLCIRRKVLLQMGQRQKSADDPPVCVSDSNKMFTLESLRAVWYGQKLDFLKSAHLGSAAPNTEVMLVQERRQVRILDCMKGKRPLVLNFGS
CS-PPFMTRLAAFQRIVRQYADIADFLVVYIEEAHPSDGWVSTDAPHQIPKHRCLEDRLRAAQLMLAEVPESNVVVDNMDNSSNAAYGAYFERLYILRDERVVYQGGRGPEGYRISELRTWLEQYRNDVAESQTAVLHV * 0

>DIO3a_gasAcu Gasterosteus  aculeatus (stickleback)
0 MHDSGGGVRTARALKHAALCLMLLPRFLLAAVMLWLLDFLCIRKVLLKMGERQDSPDDPPLCVSDSNRMFTLESLRAVWYGQKLDCLKAAHLGRAAPNTEVMLVQQRRRVPILDCTKGKRPLILNFGS
CS-PPFMTRLAAFQRVVSQYADIADFLLVYIEEAHPSDGWVSSDAPYQIPKHRCLEDRLRAARLMLAEVPGSDVVVDNMDNSSSAAYGAYFERLYVVRDERVVYQGGRGPEGYRISELRAWLDEYRSELARSQTAVLHV* 0

>DIO3a_oryLap Oryzias latipes (medaka)
0 MDDSSGVQMARALKQAALCLMLLPRFLLAAVMLWLLDFLCIRRKLLLKMGERQDSPDDPPLCVSDSNKMFTLESLRAVWHGQKLDILKTAHLGQTAPNTEVVLVQERRQVRILDCMKGNRPLILNFGS
CSuPPFMMRLAAFQRVVSQYADIADFLVVYIEEAHPSDGWVSSDAPHQIPKHRCLEDRLRAAALMLTEVPGSKVVVDNMDNSSNAAYGAYFERLYVVRDETVVYQGGRGPEEYRISELKTWLQQYRKELLHSQNAVLHV* 0

>DIO3a_pimPro Pimephales promelas (fish)
0 MEMLHGSAGVQTAKALKNAAVCLMLLPRFLLAALTLWLLDFLCIRRKLLMKMRESDIASPDDPPMCVSDSNKMFTLESLRAVWYGQKLDFFKTARLGGAAPNTEVVPLDSTRLRGTRRILDYARGRRPLIVNFGS
CS-PPFMTRLSAFQRVARQYADIADSLLVYIEEAHPSDGWVSSDAPYQIPRHRCLEDRLRAAELMNQKVPECAIVVDTMENSSNSAYGAYFERLYILKDEKVVYQGGRGPEGYRISELRDWLERYRNELEAS* 0

>DIO3a_ictPun Ictalurus punctatus (fish)
0 MPDVQDFVKALKNALVSLMLLPRFLLTAVLLWFLDFVCIRRKVLVKMREREGSSPDDPSVTVSDSNRMFTVESLRAVWYSQKLDFCKTAHLGLTAPNSEVVPLGERKRARILDYARGSRPLILNFGS
CS*PPFMTRLAAFRRVADQYADIADSLLIYIEEAHPSDGWVSTDAPYQIPRHRCIKDRLRAARLMTATVPGSIVVVDTMDNSSNAAYGAYFERLYVVKDEKVVYQGGRGPEGYRIFEL NWLEPYRQKARGSRPIVGHV* 0

>DIO3a_spaAur Sparus aurata (fish)
0 MHDSGGVQMARALKHAALCLMLLPRFLLAAVMLWLLDFLCIRKKVLLKMGERQDGPDDPPVCVSDSNKMFTLESLRAVWYGQKLDFLKSAHLGRTAPNTEVMLVQERRQVRILDCMKGKRPLILNFGS
CSuPPFMTRLAAFQRVVSQYADIADFLVVYIEEAHPSDGWVSSDAPYQIPKHRCLEDRLRAAQLMLAEVPGSNVVVDNMDNSSNAAYGAYFERLYIVRDERVVYQGGRGPEGYRISELRNWLEQYRNGLVNSQTAVLHV* 0

>DIO3a_parOli Paralichthys olivaceus mRNA
MVHDSGGVQMARALKHAALCLMLLPRFLLAAVMLWLLDFLCIRKKVLLKMGERQDGPDDPPVCVSDSNKMFTLESLRAVWYGQRLDFFKSAHLGRAAPNTEVVLVQEGRQVRILDCMKGKRPLILNFGS
CSuPPFMTRLAAFQRVVSQYADIADFLVVYIEEAHPSDGWVSSDAPFQIPKHRCLEDRLRAAQLMLSEVPGGNVVVDNMDNSSNAAYGAYFERLYIVRDERVVYQGGRGPEGYQISGLRDWLEQYRSDLVNSKTPVLHV* 0

>DIO3b_takRub Takifugu rubripes (fugu) 1 exon N's
0 MNTIKSIKNALVFFVLLPRFLVAAVMFWLLDFLCIRKRVFFRMKEQEDDAVDPPLCISDSNRLFTVESLKAVWHGHKLDFLKAAHLGQGAPNTEVVQLEDQRRSRILDYAKDKRPLILNFGS
CTuPPFMARLKAFRGSWSKTQTSRLCSCVIEEAHPSDGWVSTDAPYQIPKHRCLADRLGAAQLMRLEVPGCLIVVDSMENSSNVAYGAYFDRLYILQEGKIVYQGGRGPEGYRITELRDWLVEYRESLKSSNDLVIHL* 0
                                   
>DIO3b_tetNig Tetraodon nigroviridis (pufferfish)
0 MNTMKSIKNALVFFVLLPRFLMAAVMFWLLDFLCIRKRVFFRMKEQEDDAVDPPLCISDSNRLFTVESLKAVWHGHKLDFLKAAHLGQGAPNTEVVQLEDQRRSRILDYAKDKRPLILNFGS
CT-PPFMARLKAFQGVVEQNADIADTVVVYIEEAHPSDGWVSTDAPYQIPKHRCLEDRLSAAQLMRLEVPGCLVVVDSMENSSNAAYGAYFDRLYILQEGKIVYQGGRGPEGYRITELRDWLEKYRKSLETSNNLVIHL* 0

>DIO3b_gasAcu Gasterosteus  aculeatus (stickleback)
0 MNTIKAIKNAIVCLALLPRFLMAAFMFWLLDFLCIRRRVFFRMEQQEGDAIDPPLCMSDSNRLFSLESLKAVWHGHKLDFLKAAHLGHGAPNTEVVHLGDQRRGHILDYAKEKRPLVLNFGS
CTuPPFMARLKAFQGVVQQNADIADSVVVYIEEAHPSDGWMSTDAPYQIPKHRCLEDRLNAAQLMHLEVPGCPLVVDSMENSSNAAYGAYFDRLYILQEGTIVYQGGRGPEGYRVTELRDWLDRYRQTLGKSSQLVMSV** 0

>DIO3b_oryLap Oryzias latipes (medaka)
0  MNTMKALKNAIVCLLLLPRFLLAAVVFWLLDFSCIRKRVFLRMKEQGGDATDPPLCISDSNRLFSVESLKAVWHGHKLDFLKAAHLGRGAPNTEVVHLEDQRCSRILDYAKDKRPLILNFGS
CTuPPFMARLKAFQGVVQQNADIADSLVVYIEEAHPSDGWMSTDAPYQIPKHRRLEDRLNAAQLMHLEVPGCLVVVDSMENSSNAAYGAFFDRLYILQEGKVVYQGGRGPEGYRISELRDWLDQYRKGMEKSNNLVINM* 0

>DIO3_squAca Squalus acanthias (spiny dogfish)
0 MNLLKEAVAVLILLPRFLVTALMLWLLDILCIRKRLLSKSREQTEGNADDPPLCVSDTNRMFTLESLKAIWHGQKLDFFKSAHVGSPAPNPEVVQLQEQRKVRLLDYSRGARPLILNFGS
CTuPPFMARLKAFQRVAIQYADIADFLLVYIEEAHPSDGWVSTDAPYDIPRHRCLEDRLKAARLMHKENPSCLVVADTM
 
>DIO3_calMil Callorhinchus milii (elephantfish)
0 MKLLKEAMAVLVLLPRFIVTALTLWLLDILCIRKRLLCKLRARYEE  AAELLGPGAGPPAPD HRMLTAEGMQVVWQSFLLDALKQVKVGLEAPNSAVARLDGGAPCRLLDFASRDRPLVVNFGS
ATuPPFVSRLPAFRQMVERYAEVADFLLVYVDEAHPSDGWALRSRFQLRRHRSQEERCSAAGLLAREFGLPAACGVVADLMDNNANRAYGVAFERLCVVQSQKIAYLGGKGPFFYNLNGVREWLERHSGQRWG* 0

>DIO3_braFlo Branchiostoma floridae (amphioxus) taa 1 exon closer match to DIO3
0 MGCGHHSSLSPHSVERPNAGMLCEAVRPLKALAAPILFVVLCFAFFLLKTASIVLTFVAPAKKEDLLAKLLGREEQEQTSSFQYDIDEFLTWNSLKGLVYSQMVGIMKRARHGKPAPDPTLVLLDSVTEKTLLSFAVADRPLFVNFGS
YTuPPFVPDLAVFDEIVAEFGERVDFLLVYIEEAHPTDGWSFRAGPDITSHQSIGQRCTAARAMLANRTVLYNVAVDSMTNSANYQYGAFPDRVYIIRHGQVVYEGGKGPFKYSLQEARTWLQENIGQQKN* 0

SELH: 25 vertebrate sequences

>SELH_homSap Homo sapiens (human) NP_734467 Selenoprotein H exons chr11 80% identity musMus
0 MAPRGRKRKAEAAVVAVAEKREKLANGGEGMEEATVVIEHC 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNPTKPRRGSFEVTLLRPDGS 1
2 SAELWTGIKKGPPRKLKFPEPQEVVEELKKYLS* 0

>SELH_ponPyg Pongo pygmaeus (orang_sumatran)
0 MAPRGRKCKAEATVVAVAEKrEKLTNGGEGMEEATIVIEHC 2
1 TSuRVYGRNAAALSQVLCLEAPELPVKVNPTKPRRGSFEVTLLRPDGS 1
2 SVELWTGIKKGPPCKLKFPEPQEVVEKLKKYLS* 0

>SELH_macMul Macaca mulatta (rhesus) 
0 MAPRGRKRKAEAAMVAAAEKQEKLANSGEGMEETTVVIEHC 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNPSKPRRGSFEVTLLRPDGS 1
2 SAELWTGIKKGPPRKLKFPEPQEVVEELKKYLS* 0

>SELH_micMur Microcebus murinus (mouse_lemur)
0 MAPRGRKRKAEASVVATAEKREKLENGGEAVEEATVVIEHC 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNPAKPRRGSFEVTLQRPDGS 1
2 SAELWTGIKKGPPRKLKFPEPQVVVKELKKYL.* 0

>SELH_tupBel Tupaia belangeri (tree_shrew)
0 MAPRGRKRKAEAAVVATAEKQEKLQNGGEGVKEASIVIEHC 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNSAKPRRGSFEVTLLRPDGS 1
2 SVELWTGIKKGPPRKLKFPEPQEVVEELKKYLS* 0

>SELH_musMus Mus musculus (mouse)
0 MAPHGRKRKAGAAPMETVDKREKLAEGATVVIEHC 2
1 TSuRVYGRHAAALSQALQLEAPELPVQVNPSKPRRGSFEVTLLRSDNS 1
2 RVELWTGIKKGPPRKLKFPEPQEVVEELKKYLS* 0

>SELH_ratNor Rattus norvegicus (rat) TGA verified 77%
0 MAPLGRKRKAGAAPIESADKREKLAEGAAVVIEHC 2
1 TSuRVYRRHAAALSQALQLEAPEISVQVNRSKPRRGSFEVTLLRPDNS 1
2 RVELWTGIKKGPPRKLKFSEPQEMVEELKKYLS* 0

>SELH_speTri Spermophilus tridecemlineatus (squirrel)
0 MAPRVRKRKAEAAAVSTSEKREKLENGKEQVEEAVVIEHc 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNPSKPRRGSFEVTLLRRDGTS 1

>SELH_oryCun Oryctolagus cuniculus (rabbit)
0 MAPGKRKRKAEAAPVASAEKREKLANGGQGVEEIVIEHc 2
1 tSuRVYGRNAAALSQALRLQAPELPVTVNPSKPRRGSFEVTLLRPDGS 1
2 gAELWTGIKKGPPRKLKFPEPQQVVEELKKYLS* 0

>SELH_ochPri Ochotona princeps (pika)
0 MAPNRRKRKAEAVADAAAEKREKQAKQANGVGGGEEIVIEHc 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNPAKPRRGSFEVTLQRPDGS 1
2 SAELWTGIKKGPPRKLKFPEPQQVVEELKKYLS* 0

>SELH_canFam Canis familiaris (dog)
0 MASRGRKRKAEAAGVAAAEKRDKPASGRKAVEEATVVIEHC 2
1 TSuRVYGRNAAALSQALRLETPELPVEVNPAKPRRGSFEVTLLRPDGS 1
2 SVELWTGIKKGPPRKLKFPEPQEVVKALKQHLS* 0

>SELH_bosTau Bos taurus (cow) TGA verified
0 MASRGRKRKAEAALAAAAEKREKPAGGQEGGVEGPSVVIEHC 2
1 TSuRVYGRNAAALSQALRLQAPELTVKVNPARPRRGSFEVTLLRADGS 1
2 SAELWTGLKKGPPRKLKFPEPHVVLEELKKYLS* 0

>SELH_oviAri Ovis aries (sheep)
0 MASRGRKRKAEAALAAAAEKREKPAGSREGEVAGPSVVIEHC 2
1 TSuRVYGRNAAALSQALRLQAPELAVKVNPSRPRRGSFEVTLLRADGS 1
2 AELWTGLKKGPPRKLKFPEPHVVLEELKKYLS* 0

>SELH_susScr Sus scrofa (pig)
0 MASRGRKRKAETALGAAAEKQETPASGRKGMEEPSVVIEHC 2
1 TSuRVYGRNAAALSQALRVEAPELPVRVNPTKPRRGSFEVTLMRPDGS 1
2 SAELWTGIKKGPPRKLKFPEPQEVVEELKKYLS* 0

>SELH_equCab Equus caballus (horse)
0 MASRGRKRKAEAVVAVAEKREKLTSGGKGVEEVTVVIEHc 2
1 TSuRVYGRNAAALSQALRLEAPELPVKVNPAKPRRGSFEVTLLRPDGS 1
2 sAELWTGIKKGPPRKLKFPEPQEVVEELKKYLS* 0

>SELH_choHof Choloepus hoffmanni (sloth)
0 MASRGRKRKAEAVVLAAAEKQEKVASGKEGEKEAVVVIEHC 2
1 TSuRVYGRNAAALSQALRLEAPEIPVKVNFAKPRRGSFEVTLLRPDGS 1
2 TELWTGIKKGPPRKLKFLQPLKVVEELKKYLS* 0

>SELH_loxAfr Loxodonta africana (elephant)
0 MASRGRKRKAEAAVAAAAAEKREKPVGGQAAAEEVVVIIEHC 2
1 TSuRVYGRNAAALSQALRLEAPELSVKVNPSKPRRGSFEVTLQRPDGS 1
2 GAELWTGIKKGPPRKLKFPEPQEVVEELKKYL* 0

>SELH_monDom Monodelphis domestica (opossum)
0 MAPRGRKRKADVAAAALTEKPEKLAQGGEEGAGEARVVIEHC 2
1 QSuRVYARHAEAVGQALRLARPGLPVLLNPAKPRRSSFEVTLLRPDGS 1
2 RVELWSGIKKGPPRKLKFPEPAQVVEELKARLV* 0

>SELH_ornAna Ornithorhynchus anatinus (platypus) E5DI3CH10F50JR run=R_2008_02_11_18_03_02_ 
0 DAGGAEVGEGLHVVIEHC 2
1 RSuGVYGRRAEALSRALSLAAPDLPVLLNPTKPRRNSFEVTLLRPDGT 1
2 RTELWSGIKKGRP

>SELH_galGal Gallus gallus (chicken) tga confirmed
0 MAPRGRKRAARRPAEPEARADPPEKRPRDEAEGSPGDAGGPRVVIEHC 2
1 RSuRVYGRNAAALSEALRGAVASLAVEINPRQPRRNSFEVSLVKEDGS 1
2 TVQLWSGIGKGPPRKLKFPEPAAVVEALRSSLA* 0

>SELH_danRer Danio rerio (zebrafish) AI877878 BM037625 etc CKSU tga verified exons fused
0 MATRGKSARKRKADSDEKEKLDDAKKEKLEDKDEETGLRVVIEHC 2
1 KSuRVYGRNADVVREALADSHPELKVMINPHKPRRNSFEITLMDGERADVLWSGIKKGPPRKLKFPEPAEVVTALKQALEKE* 0

>SELH_takRub Takifugu rubripes (fugu) tga confirmed, no Ests 3 exons
0 MTPRVLMTGRRGTKRKAEEDEKPKEEKKEKQREDDQGGPRVAIEHC 2
1 KSuRVYGRNAEAVKSALLAAHPGLTVVLNPEKPRRNSFEITLLDEGK 1
2 ETSLWTGIKKGPPRKMKFPQPDVVVTALQEALKTE* 0

>SELH_ictPun Ictalurus punctatus (fish) CB940790 tga verified 
0 MATRAKAGRGAKRKADVIAAAEPVAKQDKGNKGEREDDEGQRVIIEHC 2
1 KSuRVYGRNADAVREALLSAHPELHVVLNPEKPRRNSFEVTLIEGK 1
2 KELVLWTGLKKGPPRKLKFPEPAEVVTALEEALKSK* 0

>SELH_salSal Salmo salar CA043802 tga verified 
0 MASRNKAGRVLKRKASVKEESVEEKRGKGEDDQPEIVTEGRRVVIEHC 2
1 KSuRVYGRNAEGVRVALLAACPDLTVVLNPQKPRSKSFEVILVEGE 1
2 KEVCLWSGIKKGPPRKLKFPEPEVVVSALEKALKTE* 0

>SELH_oryLat Oryzias latipes BJ026077 tga verified 
0 MASKAGRRGTKRKVEAKKEEDKTSTEEKKARGENAHEEAGLKVLIEHC 2
1 KSuRVYGRNAEEVKSALLAARPELTVVCNPEKPRRNSFEITLLDGAK 1
2 ETSLWTGIKKGPPRKLKFPQPDDVVAAFKDALKTE* 0

>SELH_oncMyk Oncorhynchus mykiss Rainbow trout BX312781
0 MASLTKAGRVLKRKVETEESSVEGKRGKGEDDHPEIVTEGQRVVIEHC 2
1 KSuRVYGRNAEGVRVALLAACPDLTVVLNPQKPRSKSFEVILFEGE 1
2 KEVCLWSGIKKGPPRKLKFPEPEVVVSALEKALKTE* 0

SELM: 22 vertebrate sequences

>SELM_homSap Homo sapiens (human) NM_080430 Selenoprotein M (uc003ajq.1) chr22
0 MSLLLPPLALLLLLAALVAPATAATAYRPDWNRLSGLTRARVE 0
0 TCGGUQLNRLKE 0
0 VKAFVTQDIPFY 2
1 HNLVMKHLPGADPELVLLGRRYEELE 0
0 RIPLSEMTREEINALVQELGFYRKAAPDAQVPPEYVWAPAKPPEETSDHADL* 0

>SELM_musMus Mus musculus (mouse)
0 MSILLSPPSLLLLLAALVAPATSTTNYRPDWNRLRGLARGRVE 0
0 TCGGUQLNRLKE 0
0 VKAFVTEDIQLY 2
1 HNLVMKHLPGADPELVLLSRNYQELE 0
0 RIPLSQMTRDEINALVQELGFYRKSAPEAQVPPEYLWAPAKPPEEASEHDDL* 0

>SELM_ratNor Rattus norvegicus (rat)
0 MNILLSPPPLLLLLAALVAPATSITTYRPDWNRLRGLARGRVE 0
0 TCGGUQLNRLKE 0
0 VKAFVTQDIQLY 2
1 HNLVMKHLPGADPELVLLSRNYQELE 0
0 RIPLSQMTRDEINALVQELGFYRKSAPEAKVPPEYLWAPAKPPEDASDRADL* 0

>SELM_canFam Canis familiaris (dog)
0 MRLPLPPPPLLLLLAALAAAVTTFRPDWNRLHGLARARVE 0
0 TCGGUQLNRLKE 0
0 VKAFVTQDIPLY 2
1 HNLVMKHLPGADPELVLLGHHYEELE 0
0 RIPLSEMTREEINELVQELGFYRKAAPDEAVPPEYLRAPARPAEGAPDRADL* 0

>SELM_bosTau Bos taurus (cow)
0 MHLPLPPPPLLLLLAAVAAATTTFRPDWNRLQGLARARVE 0
0 TCGGUQLNRLKE 0
0 VKAFVTQDIPLY 2
1 HNLVMKHLPGADPELVLLGHRFEELE 0
0 RIPLSDMTREEINALVQELGFYRKASPDEPVPPEYLRAPARPAGDAPDHADL* 0

>SELM_susScr Sus scrofa (pig)
0 MHLPPLSLPLLLLLAALAAATTTFRPDWNRLQGLARARVE 0
0 TCGGUQLNRLKE 0
0 VKAFVTQDIPLY 2
1 HNLVMKHLPGADPELVLLGHRFEELE 0
0 RIPLSDMTREEINALVQELGFYRKAAPDDPVPPEYMRAPARPAEGAPDRADL* 0

>SELM_ornAna Ornithorhynchus anatinus (platypus)
0 MLVCPERRSWVLIPPLSLLLLLPGLLAAFQPDWSRLQGLARGKVE 0
0 TCGGuQLNRLKE 0
0 VKAFVTEDIPLY 2
1 HNLVMKHLPGADPELVLLNFRYEELE 0
0 RIPLSHMTRAEINQLVQDLGFYRKAERDAPVPPEFQQAPAKTSDLREKVQPQETPKSEEQNHPDL* 0

>SELM_galGal Gallus gallus (chicken)
0 MRRAALAALLLLLAAAAGIERRPPRGLARGKVE 0
0 TCGGURLSRLPE 0
0 VKAFVSQDIPLY 2
1 HNLEMKHLPGADPELVLLSFRYEELE 0
0 RIPLSDMTREEINQLVQELGFYRKETPEAPVPEEFQFAPAKPLPTLTPRRAPAADGKTLSEQDKKDHPDL* 0

>SELM_botIns Bothrops insularis (snake) PVPDAFQMA
..PLLWLPLLLLGLLSAVAPLRAVQLDRSRLQWLARGKVE 0
0 SCGGURLNRLPE 0
0 VKAFLNEDLPLY 2
1 HNMDLKYLAGADPELILLNIKFEELQ 0
0 RIPLSDMSREEINQLMQELGFYRKDTPDSLFRCFPNGAC* 0 

>SELM_xenTro Xenopus laevis (frog)
0 MWLPLPLLLGLLQLQPILSYQIDWNKLERINRGKVE 0
0 SCGGUQLNRLKE 0
0 VKGFVTEDLPLY 2
1 HNLEMKHIPGADPELVLITSRYEELE 0
0 RIPLSDMKRDEINQLLKDLGFYRKSSPDAPVPAEFKMAPARASGDTKEDL* 0

>SELM_danRer Danio rerio (zebrafish)
0 MWPLIFTALLPSVILTYEVNIEKLSGLARARVE 0
0 TCGGUQLNRMRE 0
0 VKAFVTQDIPLY 2
1 HNLVMKHIPGADPELVLLNHYYEELD 0
0 RIPLSEMTRAEINKLLAELGFYKKDHPEDQVPEEFRFSPAKDSPFEGRQSSTAAPETTEPSDSQHTDL* 0

>SELM_ictPun Ictalurus punctatus (fish)
0 MFSTWPLLWAAFLPCISLAYEVDWKKLDGLARAKVE 0
0 SCGGUQLNRLRE 0
0 VKAFVTQDIPFY 2
1 HNLVMKHIPGADPELVLLNHYYEELD 0
0 RIPLSHMTRTDINGLLEELGFYKKARAEDDVPEEFRFSPAKDSPFKEHHTRRAPANSDLAQEPQPENESPHKDL* 0

>SELM_oncMyk Oncorhynchus mykiss (trout)
0 MWFFFFVSLLNCVSAYDVDLKKLDGLAKAKVE 0
0 SQSCGGUQLNRLRE 0
0 VKAFVTQDIPLY 2
1 HNLVMKHIPGADPELVLLNHYYEELD 0
0 RIALSDMTRSEINELLEKLGFYKKAQAEDQVPEEFRFSPAKDSPFKATPADNASSDSDAEAKHSDL* 0

MSRB1: 17 vertebrate sequences


>MSRB1_homSap Homo sapiens (human) SEPX1 (uc002cng.1) SELX SELR human chr16
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRSEALK 0
0 VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK 1
2 GKETSASQGH* 0

>MSRB1_macMul Macaca mulatta (rhesus)
0 MSFCSFFGGEVFQNHFEP 1
2 GVYACAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRPGALK 0
0 VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK 1
2 GKETSTSQGH* 0

>MSRB1_musMus Mus musculus (mouse) exon break just at stop codon NP_038787
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCSYELFSSHSKYAHSSPWPAFTETIHPDSVTKCPEKNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFVPK 1
2 GKEAAASQGH* 0

>MSRB1_ratNor Rattus norvegicus (rat)
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHEDSVAKCPEKNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 GKEAPASQGD* 0

>MSRB1_cavPor Cavia porcellus (guinea_pig)
0 MSFCSFFGGEVFQNHFES 1
2 GIYVCAKCGYELFSSRSKYAHSSPWPAFTDTIHADSVAKCPEHNRPGALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 DKETSASQGH* 0

>MSRB1_canFam Canis familiaris (dog) frag
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPERNSPEALK 0
0 VSCGKCGNGLGHEFLNDGPKPGKSRFuIFSSSLKFVPK 1
2 GKGTSGSQEA* 0

>MSRB1_bosTau Bos taurus (cow)
0 MSFCSFFGGEIFQNHFEP 1
2 GIYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRPGAIK 0
0 VSCGRCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 AEETSASQGQ* 0

>MSRB1_equCab Equus caballus (horse)
0 MSFCSFFGGEIFQNHFEP 1
2 GIYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFVPK 1
2 GKESSASQGQ* 0

>MSRB1_ loxAfr Loxodonta africana (elephant) 
0 MSFCSFFRSEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVGKHPEHNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 GKETSASQGK* 0

>MSRB1_monDom Monodelphis domestica (opossum) frag
2 GVYVCAKCGYELFSSRSKYHHSSPWPAFTETIHADSVSKRPESGRSEALK 0
0 VSCGKCGNGLGHEFINDGPKKGQSRFuIFSSSLKFVPKG 1

>MSRB1_ornAna Ornithorhynchus anatinus (platypus)
0 1
2 GTYVCARCGYELFSSRSKYEHSSPWPAFTETIHPDSVAKREEPGRPNAFK 0
0 VSCGKCGNGLGHEFLNDGPRRGQSRFuIFSSLKFIPK 1
2 GKDSQAAQDK* 0

>MSRB1_galGal Gallus gallus (chicken)
0 MSFCSFFGGEVFKDHFEP 1
2 GVYVCARCGYELFSSRAKYEHSSPWPAFTETIHEDSVAKRKERPGALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
0 GKSPQEN* 0

>MSRB1_anoCar Anolis carolinensis (lizard)
0 MSFCAFSGGEIYQGHFEA 1
2 GMYVCSKCGFELFSSKSKYAHSSPWPAFTETIHDDSITKYLERPNAFK 0
0 VLCGKCGNGLGHEFINDGPKKGQSRFZIFSSSLKFVPK 1

>MSRB1_xenTro Xenopus tropicalis (frog)
0 MSFCSFFGGEVYKDHFKS 1
2 GIYVCSECNYELFSSRSKYQHSSPWPAFTETVHKDSISKYLERPNAYK 0
0 VSCGKCGNGLGHEFINDGPKKGQSRFuIFSSSLKFIPK 0
0 DKVDGEVQRE* 0

>MSRB1_danRer Danio rerio (zebrafish) tga confirmed
0 MSFCSFSGGEIYKDHFES 1
2 GMYVCAQCGYELFSSRSKYEHSSPWPAFTETIHEDSVSKQEERWGAYK 0
0 VRCGKCGNGLGHEFVNDGPKHGLSRFuIFSSSLKFI
0 PKVKNEQQ* 0

>MSRB1_ictFur Ictalurus furcatus (fish)
0 MAFCSFKGGEIFKDHYEP 1
2 GIYVCVKCGYELFSSTSKYKLSSPWPAFTTTIHEDSVSKQEERPGALK 0
0 IRCGKCNNGLGHEFLNDGPKHGLSRFuIFSSSLKFV 0
0 PKDKGGQ* 0

>MSRB1_oncMyk Oncorhynchus mykiss (trout)
0 MSFCSFFGGEVFKDHFKT 1
2 GLYMCAQCGHQLFSSRSKYEHSSPWPAFTETILQDSVSKHEERPGAFK 0
0 VRCGKCGNGLGHEFVGDGPKKGLSRFuIFSSSLKFV 0
0 PKDKVDGQ* 0

>MSRB1_salSal Salmo salar (salmon)
0 MSFCSFFGGEVFKDHFKT 1
2 GLYVCAQCGHQLFSSRSKYEHSSPWPAFTETVLQDSVSKHEERPGAFK 0
0 VRCGKCGNGLGHEFVGDGPKKGLSRFuIFSSSLKFV 0
0 PKDKVDGQ* 0
 


>MSRB2_homSap Homo sapiens (human) CBS-1 PILB 5 exons chr10
0 MARLLWLLRGLTLGTAPRRAVRGQAGGGGPGTGPGLGEA 1
2 GSLATCELPLAKSEWQKKLTPEQFYVTREKGTEP 0
0 PFSGIYLNNKEAGMYHCVCCDSPLFS 2
1 SEKKYCSGTGWPSFSEAHGTSGSDESHTGILRRLDTSLGSARTEVVCKQ 0
0 CEAHLGHVFPDGPGPNGQRFCINSVALKFKPRKH* 0

>MSRB2_musMus Mus musculus (mouse)
0 MARLLRALRGLPLLQAPGRLARGCAGS 1
2 GSKDTGSLTKSKRSLSEADWQKKLTPEQFYVTREKGTEA 0
0 PFSGMYLNNKETGMYHCVCCDSPLFSSEKKYCSGTGWPSF 2
1 SEAYGSKGSDESHTGILRRLDTSLGCPRMEVVCKQ 0
0 CEAHLGHVFPDGPKPTGQRFCINSVALKFKPSKP* 0

>MSRB2_bosTau Bos taurus (cow)
0 MARLLRALRGLTLREAPGWAVRGRADCGGFRAGA 1
2 GSPQAAHPDTFPFHRGSKSEWQKKLTPEQFHVTREKGTEP 0
0 PFSGIYLNNKEPGMYHCVCCDSPLFS 2
1 SEKKFCSGTGWPSFSEAHGTSGSDESNTGILRRADTSLGPARTEVVCKQ 0
0 CEAHLGHVFPDGPGPAGQRFCINSVALRFKPRKH* 0

>MSRB2_canFam Canis familiaris (dog)
0 MGDSSTTLRFLIRKRHSPYDFVVKIDKSKPKEERSLTKHEL 0
0 PLTKSEWQKKLTPEQFYVTREKGTEP 0
0 PFSGVYLNNKESGMYHCVCCDSPLFS 2
1 SEKKYCSGTGWPAFSEAHGTSGSDERDTGILRRVDTSLGLTRTEVVCKQ 0
0 CEAHLGHVFPDGPGPSGQRFCINSVALKFKPRKH* 0

>MSRB2_xenTro Xenopus laevis (frog)
0 MSRLLTGFRLLLRVREISFPSVTAQRLQAWSRVRMAQTGADL 1
2 GSLTRYDDSAVSTDWQKKLTPEQYYVTREKGTEL 0
0 PFSGIYLNNTEKGMYHCVCCSAPLFS 2
1 SEKKYNSGTGWPSFSEAYGAQGADESNTNVLRRLDNSLGSTGTEVICKE 0
0 CDAHLGHVFEDGPPPYGQRFCINSVALTFAPSSM* 0

 
>MSRB3_homSap Homo sapiens (human) ER retention signal KAEL* AY358229
0 MSPRRTLPRPLSLCLSLCLCLCLAAALGSAQS 1
2 GSCRDKKNCKVVFSQQELRKRLTPLQYHVTQEKGTES 2
1 AFEGEYTHHKDPGIYKCVVCGTPLFK 2
1 SETKFDSGS 1
2 GWPSFHDVINSEAITFTDDFSYGMHRVETSCSQ 0
0 CGAHLGHIFDDGPRPTGKRYCINSAALSFTPADSSGTAEGGSGVASPAQADKAEL* 0

>MSRB3_musMus Mus musculus (mouse)
0 MSAFNLLHLVTKSQPVAPRACGLPSGSCRDKK 1
2 NCKVVFSQQELRKRLTPLQYHVTQEKGTES 2
1 AFEGEYTHHKDPGIYKCVVCGTPLFK 1
1 SETKFDSGS 1
2 GWPAFHDVISSEAIEFTDDFSYGMHRVETSCSQ 0
0 CGAHLGHIFDDGPRPTGKRYCINSASLSFTPADSSEAEGSGIKESGSPAAADRAEL*

>MSRB3_canFam Canis familiaris (dog)
0 MSAFNLLHLVTKSQPVALRACGLPSGSCRDKKNCK 1
2 VVFSQQELRKRLTPLQYHVTQEKGTESAFEGEYTHHKDPGIYKCVVCGTPLFRSESKFDSGS 1
2 GWPSFHDVISSDAITFTDDFSYGMHRVETSCSQ 0
0 CGAHLGHIFDDGPRPTGKRYCINSASLAFTPAGGGTQGSSGPGGPAAGGRAEL*

SELO: 5 vertebrate sequences

The terminal exon containing the selenocysteine is evolving very erratically in length, suggesting that is unimportant.

VRRVLKLLETPYHCEAGAATDAEATEADGADGRQR                   SYSSKPPLWAAELCVTuSS*  SELO_homSap Homo sapiens (human)
VRRVLKLLETPYHCESGAATDAEATEANGADGRQR                   SYSSKPPLWAAELCVTuSS*  SELO_panTro Pan troglodytes (chimp)
VRRVLKLLETPYHCEAGAATDAEATEADGADGRQR                   SYSSKPPLWAAELCVTuSS*  SELO_ponPyg Pongo pygmaeus (orang_sumatran)
VRRVLKLLETPYHCEAGAATDAEATEADGADGRQR                   SYSSKPPLWAAELCVTuSS*  SELO_macMul Macaca mulatta (rhesus)
VQRVLKLLETPYDNGGGAAAEPKDGSRAASRRP                     SYSSKPPLWAAELCVTuSS*  SELO_otoGar Otolemur garnettii (bushbaby)
VRRVLKLLESPYHSEEEATGPEAVARSTEEQS                      SYSNRPPLWAAELCVTuSS*  SELO_musMus Mus musculus (mouse)
VRRVLKLLESPYHSEEEATGPEAVARTTDEQS                      SYSSRPPLWAAELCVTuSS*  SELO_ratNor Rattus norvegicus (rat)
VRRVLKLLESPYQHEGEHAEALEVAGPEGAATGASRRP                SYSSKPPLWAAELCVTuSS*  SELO_cavPor Cavia porcellus (guinea_pig)
VRRVLELLETPYHRAEEAARVPEATEPEGASGADSGGH                SYSSKPPLWAAELCVTuSS*  SELO_canFam Canis familiaris (dog)
VRRVLKLLETPYGGEAEAEAAEPAEASEAAEETGGAAGRRR             SYSSKPPLWAAELCVTuSS*  SELO_bosTau Bos taurus (cow)
VRRVLKLLETPYHREGEAAEPAEPEAAEGRL                       SYSSKPPLWAAELCVTuSS*  SELO_susScr Sus scrofa (pig)
VRRVLKLLEAPYHRAEEAAEVSEAAEPEGAGGASGRRR                SYSSKPPLWAAELCVTuSS*  SELO_equCab Equus caballus (horse)
VRRVLKLLEAPYHREGEAAEVLEAAEPEGAGSTAERRR                SYSSKPPLWAAELCVTuSS*  SELO_myoLuc Myotis lucifugus (microbat))
VRRVLKLLETPYPQEREAPEALEAAEPQGAGRQD                    SYSSRPPLWAAELCVTuSS*  SELO_eriEur Erinaceus europaeus (hedgehog)
VRRVLKLLEAPYSREEPTEALELGEAAGAVGLQR                    SYSSRPPPWAAELCVTuSS*  SELO_loxAfr Loxodonta africana (elephant)
VQRVLRLLEKPYGEPWEDDADGLLAAAAAADSGEAESRR               SYGRKPPLWAAELCVTuSS*  SELO_monDom Monodelphis domestica (opossum)
VRKVAKLLEHPYREEEGPDDGEGTADRSVRGL                      AYGGKPPHWAAQLCVTuSS*  SELO_ornAna Ornithorhynchus anatinus (platypus)
VRNVLKLLENPFQETEDSTEMETKEEEATATAAACAQATRSRL           SYCSKPPLWASELCVTuSS*  SELO_galGal Gallus gallus (chicken)
VKRVLQMLENPYQEGESCQSIADKSPEEDVAVAASSVSTNPSRL          PYNSKPPLWATELCVTuSS*  SELO_xenTro Xenopus tropicalis (frog)
VQRVLKVLEKPFSVQEGLEQPGWMGRGGAAIPGERDETEEEGSNSSGAGARGLVPYDSKPPVWANEICVTuSS*  SELO_danRer Danio rerio (zebrafish)
VQEERLRVMEGTNPEFPAWVGGSGEAANQGERDEGEEQQQAMASSSTPRNP  VSYDSKPPAWAGEICVTuSS*  SELO_takRub Takifugu rubripes (fugu)
VHLLQKTLRHPFHKQREAEEA                                 GYSSRPPLWARELRVSCSS*  SELO_calMil Callorhinchus milii (elephantfish)
>SELO_homSap Homo sapiens (human) tga-sel taa-stop
0 MAVYRAALGASLAAARLLPLGRCSPSPAPRSTLSGAAMEPAPRWLAGLRFDNRALRALPVEAPPPGPEGAPSAPRPVPGACFTRVQPTPLR
QPRLVALSEPALALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEVCTATGERWELQLKGAGPTPFSR 2
1 QADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSESTVVRDVFYDGNPKYEQCTVVLRVASTFIR 2
1 FGSFEIFKSADEHTGRAGPSVGRNDIRVQLLDYVISSFYPEIQAAHASDSVQRNAAFFRE 0
0 VTRRTARMVAEWQCVGFCHGVLNTDNMSILGLTIDYGPFGFLDR 2
1 YDPDHVCNASDNTGRYAYSKQPEVCRWNLRKLAEALQPELPLELGEAILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLETMHLT 1
2 GADFTNTFYLLSSFPVELESPGLAEFLARLMEQCASLEELRLAFRPQMDPR 2
1 QLSMMLMLAQSNPQLFALMGTRAGIARELERVEQQSRLEQLSAAELQSRNQGHWADWLQAYR 2
1 ARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFSE 0
0 VRRVLKLLETPYHCEAGAATDAEATEADGADGRQRSYSSKPPLWAAELCVTuSS* 0

>SELO_musMus Mus musculus (mouse) 
0 MAVYRAALGASLAAARLLPLGRCSPSPAPRSTLSGAAMEPAPRWLAGLRFDNRALRALPVEAPPPGPEGAPSAPRPVPGACFTRVQPTPLR
QPRLVALSEPALALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEVCTATGERWELQLKGAGPTPFSR 2
1 QADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSESTVVRDVFYDGNPKYEQCTVVLRVASTFIR 2
1 FGSFEIFKSADEHTGRAGPSVGRNDIRVQLLDYVISSFYPEIQAAHASDSVQRNAAFFRE 0
0 VTRRTARMVAEWQCVGFCHGVLNTDNMSILGLTIDYGPFGFLDR 2
1 YDPDHVCNASDNTGRYAYSKQPEVCRWNLRKLAEALQPELPLELGEAILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLETMHLT 1
2 GADFTNTFYLLSSFPVELESPGLAEFLARLMEQCASLEELRLAFRPQMDPR 2
1 QLSMMLMLAQSNPQLFALMGTRAGIARELERVEQQSRLEQLSAAELQSRNQGHWADWLQAYR 2
1 ARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFSE 0
0 VRRVLKLLETPYHCEAGAATDAEATEADGADGRQRSYSSKPPLWAAELCVTuSS* 0

>SELO_ratNor Rattus norvegicus (rat)
0 MASFRAAFGASLAVARTRPQCVGLELQSSAPWSAWAAAMEPTPRWLARLRFDNRALRALPVETPPPGPEDSLSTPRPVPGACFSRARPAPLR
QPRLVALSEPALALLGLEVSEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEVCTAAGERWELQLKGAGPTAFSR 2
1 QADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSESTVMRDVFYDGNPKYEKCTVVLRIAPTFIR 2
1 FGSFEIFKPPDELTGRAGPSVGRNDIRVQMLDYVISSFYPEIQAAHTCDTDNIQRNAAFFRE 0
0 VTRRTARMVAEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDR 2
1 YDPDHVCNASDNAGRYTYSKQPQVCRWNLQKLAEALEPELPLVLAEAILKEEFDTEFQRHYLQKMRKKLGLVRVEKEDETLVAKLLETMHQT 1
2 GADFTNTFCVLSSFPAEPSDTAEFLTQLTSQCASLEELKLAFRPQMDPR 2
1 QLSMMLMLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSELQSKNRDHWETWLQEYR 2
1 ERLDKEKEGVGDIAAWQAERVRIMHANNPKYVLRNYIAQKAIEAAENGDFSE 0
0 VRRVLKLLESPYHSEEEATGPEAVARTTDEQSSYSSRPPLWAAELCVTuSS* 0

>SELO_galGal Gallus gallus (chicken)
0 MQRSGGVLRRGRADTERGETGGGWLSALRFDNLAMRSLPVDPFEDCAPRAVPGACFARVRPTPLRNPRLVAMSAPALALLGLEAGGPEAERE
AEAALYFSGNRLLPGSEPAAHCYCGHQFGSFAGQLGDGAAIYLGEVRGPRGARWELQLKGAGITPFSR 2
1 QADGRKVLRSSIREFLCSEAMFHLGIPTTRAGTCVTSDSEVVRDIFYDGNPKKERCTVVLRIASTFIR 2
1 FGSFEIFKPPDEYTGRKGPSVNRNDIRIQMLDYVIGTFYPEIQEAHADNSIQRNAAFFKE 0
0 ITKRTARLVAEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFMDR 2
1 YDPEHICNGSDNTGRYAYNRQPEICKWNLGKLAEALVPELPLEISELILEEEYDAEFEKHYLQKMRKKLGLIQLELEEDSKLVSELLETMHLT 1
2 GGDFTNIFYLLSSFSVDTDPSRLEDFLEKLISQCASVEELRVAFKPQMDPR 2
1 QLSMMLMLAQSNPQLFALIGTKANINKELERIEQFSKLQQLTAADLLSRNKRHWTEWLEKYR 2
1 VRLHKEVESISDVDAWNTERVKVMNSNNPRYILRNYIAQNAIEAAENGDFSE 0
0 VRNVLKLLENPFQETEDSTEMETKEEEATATAAACAQATRSRLSYCSKPPLWASELCVTuSS* 0

>SELO_calMil Callorhinchus milii (elephantfish) frag tgt-cys not sel tag-stop? tga-stop? taa-stop
0                 LNFDNLALRSLPVDSSGERSCRRVPGACFSLAGATPVDNPRLVASSR    0
0 QSDGRKVLRSSIREFLCSEAMFHLGIPTTRAGTCVTSDTKVLRDVFYDGNSKHENCTIILRIAPTFLRF 2
1 FGSFEILKPEDELTGRQGPSSNRNDIRIQMLDYVIGTFYPEAQQAHPENQVQRNAAFFRE 0
0 VTQRTARLVAEWQCVGFCHGVLNTDNMSIMGLTIDYGPFGFMDR 2 
1 FDPSYICNASDNRGRYAYNQQPEICKWNLGKLAEVLVPELPLKDSQSIIDEEYDTEFQRHYLQKMRKKLGLLQCEQEDDDKLVSELLDIMYRT 1
2 GADFTNTFYLLSSFPVELESPGLAEFLARLMEQCASLEELRLAFRPQMDPR 2
1 QLSILLMLSQSNPQLFEVIGSKEGIAKELDLIERSSKLQQATAEDIHSNNAKVWTEWLQKYR 2
1 SRLATEAEGVDDVDEQNAERVKVMNLNNPKFILRNYIAQNAIEAAEKGDFSE 0
0 VHLLQKTLRHPFHKQREAEEAGYSSRPPLWARELRVSCSS-R-P* 0

SEPN1: 2 vertebrate sequences

>SEPN1_homSap Homo sapiens (human)
0 MGRARPGQRGPPSPGPAAQPPAPPRRRARSLALLGALLAAAAAAAVRVCARHAEAQAAARQ 0
0 ELALKTLGTDGLFLFSSLDTDGDMYISPEEFKPIAEKLT 1
2 GSTPAASCEEEELPPDPSEETLTIEARFQPLLPETMTKSKDGFLG 0
0 VSRLALSGLRNWTAAASPSAVFATRHFQPFLPPPGQELGEPWWIIPSELSMFTGYLSNNRFYPPPPKGKE 0
0 VIIHRLLSMFHPRPFVKTRFAPQGAVACLTAISDFYYTVMFR 2
1 IHAEFQLSEPPDFPFWFSPAQFTGHIILSKDATHVRDFRLFVPNHR 2
1 SLNVDMEWLYGASESSNMEVDIGYIPQ 0
0 MELEATGPSVPSVILDEDGSMIDSHLPSGEPLQFVFEEIKWQQELSWEEAARRLEVAMYPFKK 0
0 VSYLPFTEAFDRAKAENKLVHSILLWGALDDQSCu 1
2 GSGRTLRETVLESSPILTLLNESFISTWSLVKELEELQ 0
0 NNQENSSHQKLAGLHLEKYSFPVEMMICLPNGTV 0
0 VHHINANYFLDITSVKPEEIESNLFSFSSTFEDPSTATYMQFLKEGLRRGLPLLQP* 0

>SEPN1_petMar Petromyzon marinus (lamprey) tga-sel taa-stop frag
0 0
0 EMALRTLGNDGLFLFTSLDTNMDMQISPEEFRPIVDKII 1
2 GPPPSEYEGTQEADPQGEGLTMLARFEPLLMETMSKSRDGFLG 0
0 VGQSCLAGLRGWKKAEAPSQHFGANQFKVFLPPKSDLELGEAWWLVPNDLNLFTGYLPNSRYYPPPPVAKE 0
0 IIIFKLLSMFHPRPFVKSRFAPQGSVACIRAQSDMYYDIVFR 2
1 VHAEFQLNEPPAFPFWFTPAQFTGHVTIARDSSHVRAFHMFVPNNR 2
1 SLNVDMEWLFGSMDQGNMEVDIGFMPK 0
0 MELVAEGPSVPALIYDENGNAINTSDPDVEPIQFVFENIEWRSEISFQEAYRQMEVAMYPFKK 0
0 IQYHPFTEAFEKAKAEDKLVHSILLWGALDDQSCu 1
2 KVLTQSLKFDFNITSPVLALLSENFISSWSLVKDLEDL 0
0 KQEEQAEHAKWATLHLAKYTFPVEMMIALPNGTV 0
0 VHCINANDFLDATAVKAEDLTPDLPAEFLDPTSTTYLKFLKEGLQKAQTYLQA* 0

SEPHS1: 13 chordate sequences

>SEPHS1_homSap Homo sapiens (human) PHYH- SEPHS1- DUF1172- PRFF18+ FRMD4A-
0 MSTRESFNPESYELDKSFRLTRFTELKG TGCK VPQDVLQKLLESLQENHFQEDEQFLGAVMPRL 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKMTDR 0
0 ERDKVMPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPGATS* 0

>SEPHS1_musMus Mus musculus (mouse)
0 MSTRESFNPETYELDKSFRLTRFTELKGTGCKVPQDVLQKLLESLQENHFQEDEQFLGAVMPRL 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKMTDR 0
0 ERDKVIPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPGATS* 0

>SEPHS1_loxAfr Loxodonta africana (elephant) frag
0 MSVRESFNPESYELDKSFRLTKFTELKGTGCKVPQDVLQKLLESLQENHFQEDEQFLGAVMPRLGM 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGISNKMTDR 0
0 ERDKVMPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2  1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTP 1

>SEPHS1_monDom Monodelphis domestica (opossum)
0 MSVRESFNPESYELDKSFRLTRFTELKGTGCKVPQDVLQKLLESLQENHFQEDEQFLGAVMPRL 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGISNKMTDR 0
0 ERDKVVPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPEMAATALYTLDNNVLTLEQRQFYEDNGFLVIGATS* 0

>SEPHS1_ornAna Ornithorhynchus anatinus (platypus) virtual selenocys
0 MSVRESFNPESYELDKSFRLTRFTELKGTGCKVPQDVLQKLLEALQENHFQEDEQFLGAVMPRL 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGISNKMTDR 0
0 ERDKVMPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPGATS* 0

>SEPHS1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BJUY7 454
0                 AEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQ             0
2                             AKQQRSEVSFVIHNLPIIAKMAAITKACGNRFGLLQGTSSETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNQTARIIDKPRIIEVTPRGAAAPPPDNNSSA

>SEPHS1_galGal Gallus gallus (chicken)
0 MSvREtFNPESYELDKSFRLTRFTELKG TGCK VPQDVLQKLLESLQENHFQEDEQFLGAVMPRL 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKMTDR 0
0 ERDKVMPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPGATS* 0

>SEPHS1_xenTro Xenopus tropicalis (frog) frag
0 MSVRESFNPESYELDKSFRLTRFAELKG TGCK VPQDVLQKLLESLQENHFQEDEQFLGAVMPRL
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKLTDR 0
0 ERDKVMPLIIQGFKDAAEEAGTSVTGGQTVLNPWVVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGSCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVASQNVNPTPGATS* 0

>SEPHS1_danRer Danio rerio (zebrafish) first occurence of threonine form
0 MSVRESFNPESYELDKNFRLTRFTELKG TGCK VPQDVLQKLLESLQENHYQEDEQFLGAVMPRL
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKLTEK 0
0 ERGKVMPLVIQGFKDASEEAGTSVTGGQTVINPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMLNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLARQQRTEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNTTPGATS* 0

>SEPHS1_calMil Callorhinchus milii (elephantfish) frag
0 MSVRETFNPENYELDKNFRLTRFAELKGTGCKVPQDVLHKLLEALQENHYQEDEQFLGAVMPRL 1
2 GIGMDSCVIPLRHGGLSLVQTTDFFYPLVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKMADK 0
0 ERDKVMPLIIQGFKDAADEAGTAVTGGQTVLNPWIILGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMFNMARLNRT 1
2 1
2 GGLLICLPREQAARFCAEIKSPKYGEGYQAWIIGIVEKGNRTARIIDKPRIIEVAPTVATQNVNPTPGATS*

>SEPHS1_petMar Petromyzon marinus (lamprey) cDNA LyEST8076 FD729689 tga-sel tgc-cys tag-stop not so virtual selenocys full
0 MSVHRYFDPEDHDLDKSFRLTKFSELKG*GCKVPQETLLKLLEGLEQDGPYQDEHQQFMGAVMPRL 1
2 GIGMDACVIPLRHGGLSLVQTTDFFYPLVDDPYMM 1
2 GKIACANVLSDLYAMGVTECDNMLMLLAISQKLSEK 0
0 ERDKVIPLMIRGFKDAAEEAGTTVTGGQTVVNPWIVIGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVNAHQWLDM 0
0 PEKWNKIKLVVTQEDVELAYQEAMFNMARLNRTAAGLMHTFNAH 2
1 AATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVIAKMAAISKACGNLFGLLQGTSAETSG 1
2 GLLICLPREQAARFCAEIaKYGEGHQAWIIGIVEKGGRTARIIDKPRIIEVAPRGASPTGVASPDTPTGPPLT* 0

>SEPHS1_eptBur Eptatretus burgeri (hagfish) cdna BJ649814 BJ654390 tga-sel taa-stop full
0 MSVHRYFDPEDHELDKSFRLTKFSDLKGuGCKVPQESLLKLLEGLEQDSPFQDEHQQFMGAVMPRL 1
2 GIGMDSCVIPLRHGGLSLVQTTDFFYPLVDDPYMM 0
0 GKIACANVLSDLYAMGVTDCDNMLMLLGISQKLSEK 0
0 ERDKVVPLMVRGFKDAAEEAGTTVTGGQTVMNPWIIIGGVASTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRTAAGLMHTFNAH 1
2 AATDITGFGILGHAQNLARQQRNEVAFVIHNLPRNEVAFVIHNLPVVAKMAAISKACGNLFGLLQGTSAETSG 1
2 GLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGARTARIIDKPRIIEVAPRGAPPPTVTTPTTPTAPLS* 0

>SEPHS1_bfl Branchiostoma floridae tga confirmed 73% hsa BW738890 BW748096 frag
0 MAAPEQVQQAVRPGRVFDPVAHGLDKSFRLTRFADLK uCCK VPQEVLLNLLEGLQNQTIPEEQGQFLPPHFAPI 1
2 GIGMDSCVVPLRQGGLSLVQTTDFFYPLVDDPYMQ 0
0 GKIACANVLSDLYAMGVKTCDNMLMLLGVSNKMSDK 0
0 ERDTVVPLIIRGFKDLAEEAGTMVQGGQTVLNPWVTIGGVATTVCPPNEFIE 2
1 PGNAVVGDVLVLTKPLGTQVAVNAHQWLDE 0
0 PERWNRIRLVVSEEDVERAYQESMFLMSKLNRT 1
2 AAKLMHKYNAHGATDITGFGLLGHAKNLVQHQKNEVSFTIHNLPVIAKMAAVAKACGNMFQLLQGYSAETS 1
2 * 0

SEPHS2: 8 vertebrate sequences

>SEPHS2_hsa human 1 exon but intronated like SEPHS1 for comparison 
MAEASATGACGEAMAAAEGSSGPAGLTLGRSFSNYRPFEPQALGLSPSWRLTGFSGMKGuGCKVPQEALLKLLAGLTRPDVRPPLGRGLVGGQEEASQEAGLPAGAGPSPTFPAL
GIGMDSCVIPLRHGGLSLVQTTDFFYPLVEDPYMM
GRIACANVLSDLYAMGITECDNMLMLLSVSQSMSEEE
EKVTPLMVKGFRDAAEEGGTAVTGGQTVVNPWIIIGGVATVVCQPNEFIM
PDSAVVGDVLVLTKPLGTQVAVNAHQWLDN
PERWNKVKMVVSREEVELAYQEAMFNMATLNRT
AAGLMHTFNAHAATDITGFGILGHSQNLAKQQRNEVSFVIHNLPIIAKMAAVSKASGRFGLLQGTSAETS
GGLLICLPREQAARFCSEIKSSKYGEGHQAWIVGIVEKGNRTARIIDKPRVIEVLPRGATAAVLAPDSSNASSEPSS*

>SEPHS2_mmu mouse 1 exon 
MAEAAAAGASGETMAALVAAEGSLGPAGWSAGRSFSNYRPFEPQTLGFSPSWRLTSFSGMKGuGCKVPQETLLKLLEGLTRPALQPPLTSGLVGGQEETVQEGGLSTRPGPGSAFPSL
SIGMDSCVIPLRHGGLSLVQTTDFFYPLVEDPYMM
GRIACANVLSDLYAMGITECDNMLMLLSVSQSMSEKER
EKVTPLMIKGFRDAAEEGGTAVTGGQTVVNPWIIIGGVATVVCQQNEFIM
PDSAVVGDVLVLTKPLGTQVAANAHQWLDN
PEKWNKIKMVVSREEVELAYQEAMFNMATLNRT
AAGLMHTFNAHAATDITGFGILGHSQNLAKQQKNEVSFVIHNLPIIAKMAAISKASGRFGLLQGTSAETS
GGLLICLPREQAARFCSEIKSSKYGEGHQAWIVGIVEKGNRTARIIDKPRVIEVLPRGASAAAAAAPDNSNAASEPSS*

>SEPHS2_laf Loxodonta africanus 1 exon
MAEAAATGASGEAMAVAEGSSGPAGFSLGRGFSSYRPFEPQALGLNPSWRLTGFSGMKGuGCKVPQETLLKLLAGLTRPDMRPPLGRTLVEGHEEAIQEAGLPAGSGPSPTLPSL
GIGLDSCVIPLRHGGLSLIQTTDFFYPLVEDPYMM
GRIACANVLSDLYAMGITECDNMLMLLSVSQNMSEE
EREKVTPLMIKGFRDAAEEGGTAVTGGQTVINPWIIIGGVATVVCQPNEFIM
PDSAVVGDVLVLTKPLGTQVAVNAHQWLDN
PERWNKIKMVVSREEVELAYQEAMFNMATLNRT
AAGLMHTFNAHAATDITGFGILGHSQNLARQQRNEVSFVIHNLPIIAKMAAISKASGRFGLLQGTSAETS
GGLLICLPREQAARFCSEIKSSKYGEGHQAWIVGIVEKGNRTARIIDKPRVIEVLPRGTTATTLVPDNFNASSEPTL*

>SEPHS2_mdo opossum first 1 exon
MAAAAAATVGNGACTPATGGSLAGSYRPFEPQALGLSPNWRLTSFSDMKG uGCK VPQETLLTLLAGLTRPEERPPRAPGLNLGFGDVVAEEAAGPAVLEPGPDAGPFLPSAAGPSLPSPTL
GIGMDSCVIPLRHGGLSLVQTTDFFYPLVEDPYMM
GRIACANVLSDLYAMGITECDNMLMLLSVSQKMNEE
EREKVMPLMIKGFRDAAEEGGTSVTGGQTVVNPWIIIGGVATVVCQPNEFIM
PDSAVPGDVLVLTKPLGTQVAVNAHQWLDN
PEKWNKIKLVVTREDVELAYQEAMFSMAMLNRT
AAGLMHTFNAHAATDVTGFGILGHAQNLAKQQRSQVSFVIHNLPIIAKMAAISKASGRFGLLQGTSAETS
GGLLICLPREHAARFCAEIKSPKYGEGHQAWIIGIVEKGNQTARIIDKPGIIEVTPRGATVPLHPNNSHASLEPGS*

>SEPHS2_ornAna Ornithorhynchus anatinus (platypus) tga confirmed
0                 FDPELLGLSPSWRLTGFSELKGuGCKVPQETLLKLLAGLTHPDPRPGAAPPDPQDPPPDPQDRTPHHAGPAPPAL 1
2 GIGMDSCVIPLRHGGLSLVQTTDFFYPLVEDPYMM 0
0 GRIACANVLSDLYALGITECDNMLMLLSVSQKMNEE 0
0 EREKIMPLMIKGFRDAAEEGGTSVTGGQTVVNPWIIIGGVATVVCQPNEFIM 2
0 PDNAVPGDVLVLTKPLGTQVAVNAHQWLDN 0
0 PEKWNKIKLVVSREDVELAYQEAMFSMAMLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRSEVSFVIHNLPIIAKMAAITKACGNRFGLLQGTSSETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNQTARIIDKPRIIEVTPRGAAAPPPDNNSSASPETGS* 0

>SEPHS2_tacAcu Tachyglossus aculeatus (echidna) EUPZL4S02I1W5B 454
0           KLVVSREDVELAYQEAMFSMAMLNRT 1
2 AAGLMHTFNAHAATDI

>SEPHS2_galGal Gallus gallus (chicken) frag
0 MAAPPGPSVPSPTPGPSPGSGCALGPSTSYRPFDPTSLGLDPNWRLTSYSELRGuGCKVPEETLLRLLEGLGGPRSGAAAAAAAAGGGDGEGAEPGGAPTL 1
2 GIGTDCCVIPLRHGGLSLVQTTDFFYPLVDDPYMM 0
0 GRIACANVLSDLYAMGITECDNMLMLLSVSQKMTDE 0
0 ERDKVMPLIVRGFRDAAEAAGTSVTGGQTVLNPWVIVGGAASVVCQRQEFIV 2
1 PDSAVPGDVLVLTKPLGTQVAVTAHQWLDN 0
0 PERWNKIKLVVTREEVEAAYREAMFSMATLNRT 1
2 AAGLMHTFNAHAATDITGFGILGMHKNLAKQQRNEVSLLFTSLPVLAKNGCCQMAAVSKGLWQYCLGLYSLELVPETS 1
2 GGLLICLPREQAARFCAE

>SEPHS2_xenTro Xenopus tropicalis (frog) 8 exons in genome
0 maAPGASHLSAACPFFPGYRPFDPVSVGLEASFRLTSFSDLKGuGCKVPRETLLRLLTGLTEEEAAVSGGAIRDPDVPCAGQDGGQNRL 1
2 GIGLDSCVIPLRHRGLSLVQTTDFFYPSVEDPYMQ 0
0 GRIACANVLSDLYAMGITECDNMLMLLSVSQKMTEE 0
0 EREKVTPLMIKGFRDAAEEGGTSVTGGQTVVNPWIIIGGVATVVCQANEFIM 2
1 PQNAVAGDVLVLTKPLGTQVAVNAHQWLDN 0
0 PERWNKIRLVVSREDVELAYQEAHVNMATLNRT 1
2 AAALMHSFNAHAATDVTGFGILGHAQNLAQEQQNEVSFVIHNLPIIAKAAITKACGNRFgLLQGTSPETS 1
2 GGLLICLPREQAARFCAEIKSPKHGEGHQAWIIGIVEKGNRSARIIEKPRiiEVTPRGAaTSDNTA* 0

Reference sets of vertebrate SECIS elements

SEPP2: 5 SECIS sequences

>SEPP2_SECIS_monDom Monodelphis domestica (opossum) chr2:9455773-9456962 COVE: 30 55 bp after stop codon
ACGTGCCGCTGCCCCTCCCTCCCTCCAAGAATGACGCCCACAGTGAAACCCAGAGAACTGGTCCCTGTGGGCTGATGCCCCAGAGGGGAGGAGAGGC

>SEPP2_SECIS_macEug Macropus eugenii (wallaby) SECIS COVE score: 26.38 trace ti|975557399 contains terminal exon
CCTCATTCCTCTTCTCCATGGCATCAGCCCACCATGGCCGGTTCCCCGGGTTTCACTGTGGGCATCATTAATTGAGGGAGGGAAGGGCAATGGGCAAG

>SEPP2_ornAna Ornithorhynchus anatinus (platypus)Ultra131:321101-348720 COVE: 34.28 no match to chicken
TCCCTCGACTCCCGCTCCCGCCTCGCACTCATGACGTCCACGGTGTCAACCGGCCCGCCGGGCACCGTGGACTGACGCCGGTCGAGGCGGAGGGGT

>SEPP2_ornAna Ornithorhynchus anatinus (platypus) Ultra131:321101-348720 COVE: 22.82 no match to chicken
GGAATCAGGAACCCAGTAACATGAGGTCATCTTCGGAAGCCTGTGCCTAGAGGACCAAGATAATGGAAAAAGTGACGGACAAGGGTGTGTAGCTGG

>SEPP2_tetNig Tetraodon nigroviridis (pufferfish) COVE: 17
GCTGGACCCAGGCTGCTGGTGGTCCCGTTGATGACGTCTGCGCTGGTAAACCTGCCTGCAGGAGCCTGTGGACCGACGTGTGTGGACCCACCGGCAG

SEPW1: 13 SECIS sequences

>selW_SECIS_homSap Homo sapiens (human)
agggaccttgacccagcccctctcagcagacgcttcatgataggaaggactgaaaagtcttgtggacacctggtctttccctgatgttctcgtggctgctgttgggggcagagattgacgcccccggtctttgcct

>selW_SECIS_panTro Pan troglodytes (chimp) 
AGGGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAGTCTTGTGGACACCTGGTCTTTCCCTGATGTTCTtGTGGCTGCTGTTGGGGGCAGAGATTGACGCCCCCGGTCTTTGCCT

>selW_SECIS_ponPyg Pongo pygmaeus (orang_sumatran)
AGTCCAGGGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAATCTTGTGGACACCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATTGACGCCCCTGGTCTTTGCCT 

>selW_SECIS_macMul Macaca mulatta (rhesus)
AGcGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAGTCTTGTGGACgCCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATTGACGCCgCtGGTCTTTGCCT

>selW_SECIS_musMus Mus musculus (mouse)
ACTGAAATGTCTTAGACTTGGCCCAGCCCCTCGTGGCAGACGCTTCATGATGGGAAGAACTGAAATGTCTCGTGGACGCCTGGTCTTTCCCTGATGTCCCTGCGACTGCCACGTAGGGGCAGAGACTGATGCCCCTGTGGGTGCCT

>selW_SECIS_ ratNor Rattus norvegicus (rat)
CCTGGCCGGCCTTTCTTGGCAGCCGCTTCATGACAGGAAGGACTGAAATGTCTCAAAGACCTGTGGTCTTTCTTCGATGTTCCTGCGGCCACCAAGTCAGGCCAGAGATGGATTCTGTGTGTGGGTGCCT

>selW_SECIS_oryCun Oryctolagus cuniculus (rabbit)
AGTAACCTTGACCCAGCCCCTTTCATGCCTCAGCCTCGTCTCCATAGGCTAAGACTGGAGAAATGAGTCCCCTGAAGAACTGAAACTGGGGGTAGAGGGTTGGTGTTTTAAGATGTGGATGAGCTGGTCTTTAC

>selW_SECIS_canFam Canis familiaris (dog)
CCAGtGACCTTGgCCCAGCCCCTCgtgGCAGACGCTTCATGATgGGAAGaACTGAAAtGTCTcGTGGACgCCTGGTCTTTCCCTGATGTccctgcgactgccacgtaGGGGCAGAGAcTGAtGCCCCtGGTCTTTGCCT

>selW_SECIS_susScr Sus scrofa (pig)
AGTAACCTTGACCCAGCCCCTTTCATGCCTCAGCCTCGTCTCCATAGGCTAAGACTGGAGAAGTCTTGTGGACGCCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATGGATGAGCTGGTCTTTAC

>selW_SECIS_borAnc Boreoeuthere ancestralis (ancestral)
AGTCCAGCAACCTTGGCCCAGCCCCTCTCAGCAGATGCTTCATGACAGGAAGGACTGAAATGTCTTGTGGACGCCTGGTCTTTCCCTGATGTTCTTGTGGCTGCTGGTTGGGGCAGAGATTGACACCCCTGGTCTTTGCCT

>selW_SECIS_dasNov Dasypus novemcinctus (armadillo)
CCAGCAACCTCAGCCCAGCTGCCCTTGGCAGACGCTTCATGAGGGGAAGGACCTAAATGCGTCGTGGATGCCTGGTCTTTCCCTGATGCTCCTTCACCTGCCAGATGGGGCAGAGGTCATTGCCCCTGGTCTTGGCCT

>selW_SECIS_loxAfr Loxodonta africana (elephant)
GGGACCTTGGCCCAGCCCCTTTCAGCAGACACTTCATGACAGGAGGACTGAAATGTCTCCCAGACGCCTGGCTCTTTCCCTGAATCTGTCGGCTGCAGGACAGGGCAGCGGTTGACTCTCTCGTTTTTTGCAT

>selW_SECIS_echTel Echinops telfairi (tenrec)
AGGCCAGAGACCTTGGCCCAGTCCCTCCATGACAGGCAGAACTGAAATGTCCTCTGGACAAGTGGTCTTTTCCAGAAACCCCAGGGCTGCTGGGCCGGAGCCGAGGCTGACAACCCTGGTCTTTGCCT

DIO1: 29 SECIS sequences

>DIO1_SECIS_homSap Homo sapiens (human) COVE score: 29
ttttaactctgtgtctttacatatttgtttatgatggccacagcctaaagtacacacggctgtgacttgattcaaaagaaaatgttataag

>DIO1_SECIS_ponPyg Pongo pygmaeus (orang_sumatran) COVE score: 29
ttttaactctgtgtctttacatatttgtttatgatggccacagcctaaagtacacacggctgtgacttgattcaaaagaaaatgttataag

>DIO1_SECIS_macMul Macaca mulatta (rhesus) COVE score: 29
tttcaactctgtgtctttacatatttgtttatgatggccacagcctaaagtacacacggctgtgacttgattcaaaagaaaatgttataag

>DIO1_SECIS_cavPor Cavia porcellus (guinea_pig) COVE score: 24
tgttaactctgcttcttttcatatttgttcatgacggtcacagtctaaagtacacacagctgtgacctgatttgaaagaaaatgttttaag

>DIO1_SECIS_canFam Canis familiaris (dog) COVE score: -
ttttaactctgcttcttttcatgtttgtctatgacggccacagcctaaagcacacacagctgtgacttgatttgaaagaaaatgttttaag

>DIO1_SECIS_felCat Felis catus (cat) COVE score:  -
ttttaactctgcttcttttcacgtttgtctatgacggccacagtctaaagtgcacacagctgtgacttgacttgaacgaaaatgttttaag

>DIO1_SECIS_bosTau Bos taurus (cow) COVE score: 28
ttttaactctgcctcttttcatatttgttcatgacggccacagcctaaagtacacacggctgtgacttgatttgaaagaaaatgttttaag

>DIO1_SECIS_sorAra Sorex araneus (shrew) COVE score: 24
cggaaactcagcttctcttcatatttgtttatgacagccccagctgaaagtacacacagctgtggcttgattggaaagaaaatgttttaag

>DIO1_SECIS_eriEur Erinaceus europaeus (hedgehog) COVE score: 26
tttaactctgctttcttctcatatttgcttatgatggtcacagcttaaagtatacacagctgtgacttgattggaaagaaaatattttaag

>DIO1_SECIS_borAnc Boreoeuthere ancestralis (ancestral) COVE score: 29
ttttaactctgcttcttttcatatttgttcatgatggccacagcctaaagtacacacggctgtgacttgatttgaaagaaaatgttttaag

>DIO1_SECIS_dasNov Dasypus novemcinctus (armadillo) COVE score: -
ttttaactctgcttcttttcatatttgtttatgatggccacagtttaaagtacatacagctgtgacttgatatgaaaaagaaatattttaag

>DIO1_SECIS_loxAfr Loxodonta africana (elephant) COVE score: - 
ttttaactctgcttcttttttcatgtatttatgatgggccacagcctaaagtgcacaacagctgtgacttgatttgaaaaacatctttaag

>DIO1_SECIS_monDom Monodelphis domestica (opossum) COVE score: -
tttccatcctgcttctacaaatatttatttatgacaatcacagcctaaagctcagggcagctgggattcgacgggagaaaaagtttgtaag

>DIO1_SECIS_ornAna Ornithorhynchus anatinus (platypus) COVE score: 24
ccccggatccggttccgtgaatattggtttatgagggtcacagtgtaaagcgcatgcagctgtgacttgatctgagaaaatatttctgcggc
 
 
>DIO2_SECIS_homSap Homo sapiens (human) 5,630 bp utr COVE score: 30
cagagatgtgcagagttgaccagtgtgcggatgataactactgacgaaagagtcatcgactcagttagtggttggatgtagtcacattagtttgcctctc

>DIO2_SECIS_macMul Macaca mulatta (rhesus) COVE score: 30
cagagatgtgcagagttgaccagtgtgcggatgataactactgacgaaagagtcatcgactcagttagtggttggatgtagtcacattagtttgcctctc

>DIO2_SECIS_musMus Mus musculus (mouse) COVE score: 29
cggagatgttcagagctcactggtgtgcgaatgataactactgacgaaagagctgtctgctcagtctgtggttggatgtagtcacacgagtctgcctttctgca

>DIO2_SECIS_ratNor Rattus norvegicus (rat) COVE score: 28
ccgagatgttcggagctcactggtgtgcgaatgataactactgacgaaagagtcatctgctcagtctgtggttggatgtagtcacacgagtctgcctctccatc

>DIO2_SECIS_canFam Canis familiaris (dog) COVE score: 27
ctgggatgtgcagaggtgaccagtgtgcgaatgataactactgatgaaagagtcactgactcagttagtggttggatacagtcacattagttttcctct 

>DIO2_SECIS_borAnc Boreoeuthere ancestralis (ancestral) COVE score: 30
ctgggatgtgcagaggtgaccagtgtgcaaatgataactactgatgaaagagtcattgactcagttagtggttggatgtagtcacattagtttgcctctc

>DIO2_SECIS_dasNov Dasypus novemcinctus (armadillo) iodothyronine deiodinase type II
ctgggaagttcagaggctaccagtgtgccaatgataactactgacgaaagaggcatcgactcagttagtggttggatgtagccacattagtttgcctctc
 
 
>DIO3_SECIS_homSap Homo sapiens (human) COVE score: 31
ttgggtgcacaggagccccactgctgatgacgaactatctctaactggtcttgaccacgagctagttctgaattgcaggggcctcaaagcagca

>DIO3_SECIS_macMul Macaca mulatta (rhesus) COVE score: 30 
ttgggtgcacaggagccccactgctgatgacgaactgtctctaactggtcttgaccacgagctagttctgaattgcaggggcctcaaaacagca

>DIO3_SECIS_musMus Mus musculus (mouse) COVE score: 26  
ttgggtgcgctggagccctggctgctgatgacgaaccgcctctaactgggcttgaccacgggtcggctctgaattgcagagaggctcgaaacagc

>DIO3_SECIS_ratNor Rattus norvegicus (rat) COVE score: 26 
ttgggtgcgctggagccctggctgctgatgacgaaccgcctctaactgggcttgaccacgggtcggctctgaattgcagagaggctcgaaacagc

>DIO3_SECIS_canFam Canis familiaris (dog) COVE score: 26
ttgggtgctggcgagccccactgctgatgacgagccgcctctaactggtcttgaccacgagctggttctgagttgcaggggggcttgcagcggc

>DIO3_SECIS_bosTau Bos taurus (cow) COVE score: 30
ttgggtgctcacgagccccactgctgatgaagagctgtctctaactggcctcgaccacgagctggttctgatttgcaggaggctcgcagcagc

>DIO3_SECIS_borAnc Boreoeuthere ancestralis (ancestral) COVE score: 27
ttgggtgctcaggagccccactgctgatgacgaactgtctctaactggtcttgaccacgagctggttctgaattgcagggggctcgcagcagca

>DIO3_SECIS_loxAfr Loxodonta africana (elephant) COVE score: 22
ttcggtgcgctagagccccactgctgatgacgaactgtctctaactggtcttgaccacgagctgattccgaattgcagggaactcgcagcagc

>DIO3_SECIS_echTel Echinops telfairi (tenrec) COVE score: -
ttcggtgctctgcagccccactgctgatgacgaactgcctctcactggtcttgaccacgagctgcttctgaaatgcaggggactcgcagccgca

Basal selenoproteins: platypus (21), echidna (16), elephantshark (9), lamprey (11) and hagfish (7)

Outgroups are exceedingly important in determining ancestral characters along the evolutionary stem leading to placental mammals. The most important species are hagfish, lamprey, coelocanth/lungfish, and platypus/echidna. However only a fraction of the likely selenoprotein repertoire is available because of incomplete data.

These sequencs are current as of 6 May 2008. They derive from cdna at GenBank (both hagfish and lamprey selenoproteins were surprisingly well represented), at NCBI WGS contigs for cartilaginous fish and at the UCSC lamprey genome browser. Hagfish has no genome project underway so its transcripts have been intronated by homology. Platypus has both a genome assembly and a 454 transcrip project; echidna has only the latter. One interesting finding -- in addition to startling differences in degree of conservation of different selenoprotein -- is that SEPHS1 from both hagfish and lamprey have (ancestral) selenocysteine unlike mammals which have cysteine.

Summary of availablity by species:

>DIO1_ornAna Ornithorhynchus anatinus (platypus)
>DIO2_ornAna Ornithorhynchus anatinus (platypus) tga tga then tag
>DIO3_ornAna Ornithorhynchus anatinus (platypus) 
>GPX4_ornAna Ornithorhynchus anatinus (platypus) uc002lrg.1 EG341043
>GPX7_ornAna Ornithorhynchus anatinus (platypus) uc001cue.1 LOC100090031 MVAATVAAAWLLLWAAACAQQEQDFYDFKAVNIRGKLVSLEKY?
>MSRB1_ornAna Ornithorhynchus anatinus (platypus)
>MSRB1_ornAna Ornithorhynchus anatinus (platypus)
>MSRB3_ornAna Ornithorhynchus anatinus (platypus) (uc001ssm.1
>SELH_ornAna Ornithorhynchus anatinus (platypus) E5DI3CH10F50JR run=R_2008_02_11_18_03_02_ 
>SELI_ornAna Ornithorhynchus anatinus (platypus) 53% LOC100086265
>SELK_ornAna Ornithorhynchus anatinus (platypus) EUEMSW407EBWK3 length=246 run=R_2007_08_22_12_11_10_ 64% 
>SELM_ornAna Ornithorhynchus anatinus (platypus)
>SELO_ornAna Ornithorhynchus anatinus (platypus) uc003bjx.1
>SELS__ornAna Ornithorhynchus anatinus (platypus) MERQEESLSARPALETEGLRFL?
>SELT_ornAna Ornithorhynchus anatinus (platypus) EG340137
>SELU1_oan data Ornithorhynchus anatinus (platypus) taa early stop full
>SEP15_ornAna Ornithorhynchus anatinus (platypus) EG339469
>SEPHS1_ornAna Ornithorhynchus anatinus (platypus) virtual selenocys
>SEPHS2_ornAna Ornithorhynchus anatinus (platypus) tga confirmed
>SEPP1_ornAna Ornithorhynchus anatinus (platypus)
>SEPP2_ornAna Ornithorhynchus anatinus (platypus)  Ultra131:348583-348720 Zswim5 synteny last exon uncertain
>SEPW1_ornAna Ornithorhynchus anatinus (platypus) 

>DIO2_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01A7SZJ length=249 run=R_2007_08_23_18_06_11_
>GPX1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BQVNH length=235 xy=0599_0299 region=1 run=R_2007_08_23_18_06_11_
>GPX4_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DM9S9 length=130 run=R_2007_08_23_18_06_11_ 
>GPX5_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DBXSK length=158 run=R_2007_08_23_18_06_11_ 
>GPX7_tacAcu EUPZL4S02GXFL4
>MSRB1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01EKBYH length=230 run=R_2007_08_23_18_06_11_ 
>SELH_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01B810Z length=224 run=R_2007_08_23_18_06_11_ 
>SELK_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DCGU7 length=251 xy=1255_0209 region=1 run=R_2007_08_23_18_06_11_
>SELM__tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01AWAS8 length=259 run=R_2007_08_23_18_06_11_ 
>SELO_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DJRM3 length=241 run=R_2007_08_23_18_06_11_ 
>SELS_tacAcu Tachyglossus aculeatus (echidna) EUEMSW404CHB0V run=R_2007_08_22_12_11_10_ Length = 272 72% 
>SELT_tacAcu Tachyglossus aculeatus (echidna)  EUGXWLM01DXGYB length=235 run=R_2007_08_23_18_06_11_
>SELT_tacAcu Tachyglossus aculeatus (echidna) EUEMSW402BA71P run=R_2007_08_22_12_11_10_ 100% length=231
>SELU1_tacAcu Tachyglossus aculeatus (echidna) EUEMSW405C31QQ run=R_2007_08_22_12_11_10_ Length = 282 (74%) tSASEKK
>SEP15_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BKAWJ EUEMSW404CBHS5 length=262 run=R_2007_08_23_18_06_11_
>SEPHS1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BJUY7 length=168 run=R_2007_08_23_18_06_11_
>SEPHS2_tacAcu Tachyglossus aculeatus (echidna) EUPZL4S02I1W5B
>SEPP1_tacAcu Tachyglossus aculeatus (echidna) EUEMSW408ERBZB length=227 run=R_2007_08_22_12_11_10_ 59% 

>DIO1_calMil Callorhinchus milii (elephantfish) weak match introns ok
>DIO2_calMil Callorhinchus milii (elephantfish)no  
>DIO3_calMil Callorhinchus milii (elephantfish)
>GPX2_calMil Callorhinchus milii (elephantfish) tga-cys tag-stop full
>SELO_calMil Callorhinchus milii (elephantfish) frag tgt-cys not sel tag-stop? tga-stop? taa-stop
>SELU1_calMil Callorhinchus milii (elephantfish) frag
>SEPHS1_calMil Callorhinchus milii (elephantfish) frag
>SEPP2_calMil elephant fish Callorhinchus milii (elephantfish)
>SEPW2_calMil Callorhinchus milii (elephantfish)

>DIO2a_petMar Petromyzon marinus (lamprey) frag
>DIO2b_petMar Petromyzon marinus (lamprey) frag
>GPX2_petMar Petromyzon marinus (lamprey) cDNA LyEST1379 tga-sel tag-stop full
>MSRB3_petMar Petromyzon marinus (lamprey) cDNA FD728382 tga-sel taa-stop full
>SELO_petMar Petromyzon marinus (lamprey) frag
>SELS_petMar Petromyzon marinus (lamprey) cDNA EB082976 39% identity homSap tga-sel taa-stop full
>SELT_petMar Petromyzon marinus (lamprey) cDNA EE737658 tga-sel tag-stop full
>SELU3_petMar Petromyzon marinus (lamprey) cDNA EE741479 ti|1430987375 tga-sel taa-stop frag
>SELW_petMar Petromyzon marinus (lamprey) cDNA FD700531 tga-sel tag-stop full
>SEP15_petMar Petromyzon marinus (lamprey) cDNA EG333213 tga-sel taa-stop full
>SEPHS1_petMar Petromyzon marinus (lamprey) cDNA LyEST8076 FD729689 tga-sel tgc-cys tag-stop not so virtual selenocys full
>SEPN1_petMar Petromyzon marinus (lamprey) tga-sel taa-stop frag

>GPX1_eptBur Eptatretus burgeri (hagfish) BJ648558 tga-sel tag-stop frag
>GPX4_eptBur Eptatretus burgeri (hagfish) BJ646422 tga-sel frag
>SELT_eptBur Eptatretus burgeri (hagfish) BJ650136 tga-sel tag-stop
>SELT_eptBur Eptatretus burgeri (hagfish) cdna BJ650136 tga-sel tag-stop full
>SELW_eptBur Eptatretus burgeri (hagfish) cdna BJ662449 tgn-sel tag-stop
>SEP15_eptBur Eptatretus burgeri (hagfish) BJ647169 tga-sel taa-stop
>SEPHS1_eptBur Eptatretus burgeri (hagfish) cdna BJ649814 BJ654390 tga-sel taa-stop full

GPX sites in lamprey browser needing evaluation:
GPX	Contig4536:10087-19433
GPX	Contig4536:13614-13712
GPX	Contig4536:17923-18132
GPX	Contig33644:113-3872
GPX	Contig41170:1909-3454
GPX	Contig3803:603-6660
GPX	Contig3036:20426-22172
>SEPHS1_ornAna Ornithorhynchus anatinus (platypus) virtual selenocys
0 MSVRESFNPESYELDKSFRLTRFTELKGTGCKVPQDVLQKLLEALQENHFQEDEQFLGAVMPRL 1
2 GIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGISNKMTDR 0
0 ERDKVMPLIIQGFKDAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPGATS* 0

>SEPHS2_ornAna Ornithorhynchus anatinus (platypus) tga confirmed
0                 FDPELLGLSPSWRLTGFSELKGuGCKVPQETLLKLLAGLTHPDPRPGAAPPDPQDPPPDPQDRTPHHAGPAPPAL 1
2 GIGMDSCVIPLRHGGLSLVQTTDFFYPLVEDPYMM 0
0 GRIACANVLSDLYALGITECDNMLMLLSVSQKMNEE 0
0 EREKIMPLMIKGFRDAAEEGGTSVTGGQTVVNPWIIIGGVATVVCQPNEFIM 2
0 PDNAVPGDVLVLTKPLGTQVAVNAHQWLDN 0
0 PEKWNKIKLVVSREDVELAYQEAMFSMAMLNRT 1
2 AAGLMHTFNAHAATDITGFGILGHAQNLAKQQRSEVSFVIHNLPIIAKMAAITKACGNRFGLLQGTSSETS 1
2 GGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNQTARIIDKPRIIEVTPRGAAAPPPDNNSSASPETGS* 0

>SELU1_oan data Ornithorhynchus anatinus (platypus) taa early stop full
0 MPLPPDLGLFNLGMWSVGVGALGAAAVGLLLANTDLLLTKPEKATLEYLEDTELKTLGK 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDRLGVPLYAVVKEKIGTEVEDFQPYFKGEIFLDER 0
0 KKFYGPHKRKMLFLGFIRLGVWQNFLRARNRGFSGNLEGEGLILGGVYVLGAGKQ 0
0 GILLEHREREFGDKVSPASVLEAAQRIKPQPL* 0

>SEPP1_ornAna Ornithorhynchus anatinus (platypus)
0 MWQGLGLALALCLLPGGGAESQSASSHCKEAPRWQIRDQDPMLNSLGTVTVVALLQAS*YLCILQASR 2
1 LEDLRVKLENEGYSNISYIIVNHQGMPSQLNHKTLKEKVSEHIPVYQQDEKQTDVWSTLKGNKDDFLIYDR 2
1 CGRLVYHLSLPYTFLSFSYVEDSIKTTYCEQNCGNCSYT 0
0 MPEAEEFCTNTSSAAKEKATEAPLPHNDRPHHHHHHHHHHGHKPHPSGTEQAPADPDGPLRSPAPQGLHKRLRPAGQPRQGQGGSREAAEGRGEELPSPRKKA*RKGNASCQNQLL*DWHKRSGPAPSS*
C*HCRHLLFGSKATATAL*QCRDA LPALCS*QGRQSGEDVIES*Q*RSPLPA*PPAAQLPSPSPTDPNAA*K*ENTAGM*K*PTR* 0 
 
>SEPP2_ornAna Ornithorhynchus anatinus (platypus)  Ultra131:348583-348720 Zswim5 synteny last exon uncertain
0 MAGSGLLGPALTLATLLAAAGALPDLENGTRICQPAPRWTVNGVAPMEGTEGQVIVVALLKASuHFCLKQAAR 2
1 LAGLRERLAGHGAGNVSFLIVNQRDPTAQLLHTELERHAPPGVPVYAQDGPDPDVWSVLGGDKDDFFVYDR 2
1 CGRLTFHIQLPFSFLHFPYVEAAVRFTHRRDFCGNCSYYFPQVGT 0
0 VNDTTTQESELEKSPGAPGEEPEGSPVREPDRPQSQDPTGPFSGVLLQGKENKIIPWKTPLQAAPRKPSHPPGAHD* 0

>SEPW1_ornAna Ornithorhynchus anatinus (platypus) 
0 MASLEAFPRGVVPVHVVYC 2
1 GAuGYKPK 0
0 FLQLKKKLENEFPGQVEI 0
0 SGEGTPQVTGWFEVTVAGKLVHSKK 0
0 EGDGFVDSESKFAKIRMAIKAALVPGY* 0

>SELH_ornAna Ornithorhynchus anatinus (platypus) E5DI3CH10F50JR run=R_2008_02_11_18_03_02_ 
0 DAGGAEVGEGLHVVIEHC 2
1 RSuGVYGRRAEALSRALSLAAPDLPVLLNPTKPRRNSFEVTLLRPDGT 1
2 RTELWSGIKKGRP

>SELM_ornAna Ornithorhynchus anatinus (platypus)
0 MLVCPERRSWVLIPPLSLLLLLPGLLAAFQPDWSRLQGLARGKVE 0
0 TCGGuQLNRLKE 0
0 VKAFVTEDIPLY 2
1 HNLVMKHLPGADPELVLLNFRYEELE 0
0 RIPLSHMTRAEINQLVQDLGFYRKAERDAPVPPEFQQAPAKTSDLREKVQPQETPKSEEQNHPDL* 0

>MSRB1_ornAna Ornithorhynchus anatinus (platypus)
0 1
2 GTYVCARCGYELFSSRSKYEHSSPWPAFTETIHPDSVAKREEPGRPNAFK 0
0 VSCGKCGNGLGHEFLNDGPRRGQSRFuIFSSLKFIPK 1
2 GKDSQAAQDK* 0


>MSRB1_ornAna Ornithorhynchus anatinus (platypus)
               DWKKKLTPEQYCVTRGKGTEP 0
0 PFSGIHLNNTERGMYHCLCCDMPLFS 2
1 SEKKFCSGTGWPSFLEAHGTQGTDESQTGILRRPDHSLGLARTEVVCKR 0
0 CEAHLGHVFDDGPPPTGQRFCINSVALTFKPS* 0

>MSRB3_ornAna Ornithorhynchus anatinus (platypus) (uc001ssm.1
0 MSAFNLLHLVTKSQPGSLQACGLPSGSRRDKRSCKVVFSQEELRKRLTPLQYHVTQEKGTESAFEGEFTHHKAPGTYKCVVCGTPLFKSETKFDS
GSGWPSFYDVIGSDAITLTDDFSYGMHRVETSCSQCGAHLGHIFEDGPRPTGKRYCINSASLSFAAVEENIDKENGSGDSPIQPGKTEL*

>DIO1_ornAna Ornithorhynchus anatinus (platypus)
0 1
2 GSRPLVLTFGSCTuPSFLFKFDQFNQLVQDFNSIADFLIIYIEEAHPT 1
2 DGWAFANNVDIPSHRSLQERQEAARRLLARGPRCPVVVDTMDNASSRQYAALPERLYLLREGKVVYK 0
0 GGPGPWNYNPGEVRAVLEKLS 0*

>DIO2_ornAna Ornithorhynchus anatinus (platypus) tga tga then tag
0 MGLLSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWNSFLMDAYKQ 0
0 VKLGGDAPNSSVVHVANANGETSGGNSPKWKNFSGRYGAECHLLDFASSERPLVVNFGSATuPPFISQLPAFSKLVEEFSAVADFLLVYIDEAHPSDGWAV
PGEFSLPFEVRKHQNQEDRCAAAHQLLERFSLPPQCQVVADCMDNNANVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLQEVRLWLEQNFSKRuNPD* 0

>DIO3_ornAna Ornithorhynchus anatinus (platypus) 
DCLLVYIEEAHPSDGWVSSDAPYDIPKHRCLQDRLRAARLMQRGAPGCRLAVDTMDNAASAAYGAYFERLYVVQDARVVYQGGRGPEGYKISELRLWLEQYRLRLRGPGPATAAAAAVLDV* 0
 
>SELK_ornAna Ornithorhynchus anatinus (platypus) EUEMSW407EBWK3 length=246 run=R_2007_08_22_12_11_10_ 64% 
0 MVYISNGHVLNGQNRSLWSLSFIKDFFWGILDFIIMFFKSMIHPNVKRGCRNSSSDSKYDDGRGPPGYPRRGMGRINHSNGPSPPPMAGGGuGR*

>SEP15_ornAna Ornithorhynchus anatinus (platypus) EG339469
0 MAPWLRLFLAATLQAISAYGTELSSEACRDLGFSSNLLCSSCDLLGQFSLTQLDPVCRGCCQEEAQFESRKLYAGAILEVCGuKLGRFPQVQVCSWFRPCAKAAGrEWEHC*

>GPX4_ornAna Ornithorhynchus anatinus (platypus) uc002lrg.1 EG341043
0 MSLFRLARLAKPLLLGGAVAVPALRRTMCASPDDWRCANSIYEFSAEDIDGKLVSLEKYRGKVCIITNVASKuGKTEVNYTQLVDLHAQYAGQGLRILGFP
CNQFGRQEPGTNSEIKEFAAGYNVKFDMFSKVCVNGDEAHPLWKWLKDQPKGKGTLGK* 0

>GPX7_ornAna Ornithorhynchus anatinus (platypus) uc001cue.1 LOC100090031 MVAATVAAAWLLLWAAACAQQEQDFYDFKAVNIRGKLVSLEKY?
KGRVSLVVNVASECGFTDQHYRALQQLQKDLGPSHFNVLAFPCNQFGQQEPDGNKEIESFVRKTYGVSFPMFSKIAVSGAGANAAFKFLTESSGEEPTWNFWKYLVSPDGKVVNSWDSTVSVEEVRPQITALVRKLILLKREDL* 0

>SELT_ornAna Ornithorhynchus anatinus (platypus) EG340137
0 MRLLLLVVVAAATGAAGGRSEASADLGGLPSKRLKMQYATGPLLKFQICVSuGYRRVFEEYMRVISQRYPDIRIEGENYLPQPIYRHIASFLSVFKLVLIGLIIVGKDPFAF
FGMQAPSIWQWGQENKVYACMMVFFLSNMIENQCMSTGAFEITLNDVPVWSKLESGHLPSMQQLVQILDNEMKLNVHMDSIPHHRS* 0

>SELI_ornAna Ornithorhynchus anatinus (platypus) 53% LOC100086265
MAGYEYVSAEQLAGFDKYK
YSALDTNPLSLYVMHPFWNTIVKV
IFPTWLAPNLITFSGFLLLVFNFLLMAYFDPDFYAS 
APGQKHVPDWVWIVVGILNFTAYTL
DGVDGKQARRTNSSTPLGELFDHGLDSWACVYFVVTVYSIFGRGRTGVSVFVLYLLLWVVLFSFILSHWEKYNTGILFLPWGYDMSQV
TISIVYIVTAVVGVEAWYEPFLFNFLYRDLFTTMII
GCAVTVTLPMSLYNF
AYRNKTLKYSSVYETMLPFVSPCLLFTLSTTWIFLSPSNILETHPRLFYFMVGTLFANIT
CQLIVCQMSNTRCQPLNWLLMPLALVILVVHSGLAPHSETFLLYSLTALVTVAHIHYGIRV
VNQLSKHFKILPFSLRKPSSDuLGLEEEKIGL* 0

>SELO_ornAna Ornithorhynchus anatinus (platypus) uc003bjx.1
SRHADGRKVLRSSIREFLCSEAMFHLGIPTTRAGTCVTSESAVIRDVHYDGNPKYEKCAVVLRIASTFLRFGSFEIFKPRDEHTGRQGPSVGRNDIRIRMLDYVIGTFYPEIEEANADDTVRRNAAFFREVPGR

>SELS__ornAna Ornithorhynchus anatinus (platypus) MERQEESLSARPALETEGLRFL?
LFIVGSVLSAYGWYILFGCAVLYLIFQKLSGSLRVMRRRYSDTTGAAIDPEVVVKRQEALAASRLRMQEELNAQAEKYREKQKQLEEAKRRQKIEIWESMQEGKSYKGNSRLQPQQETDPGPSTSSVIPKPKPARKPLRGGYNPLSGEGGGTCSWRPGRRGPSSGGuG*
 

>DIO2_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01A7SZJ length=249 run=R_2007_08_23_18_06_11_
GAECHLLDFASSERPLVINFGSATuPPQPAAGLQQAGEEFSTVADFLLVYI

>GPX1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BQVNH length=235 xy=0599_0299 region=1 run=R_2007_08_23_18_06_11_
NGEKAHPLFAFLRESLPTPSDDPTSLMNDPKFIIWSPVCRNDISWNFEKFLVGPDGVPLRYSRRFETINIKEDIAMLLDQ
  
>GPX4_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DM9S9 length=130 run=R_2007_08_23_18_06_11_ 
LFRLARFAKPLLLGGAVAVPALRRTMCASPDDWRCANSIYDFSAEDIDGNSVSLEKYRGKVCIITNVASKuGKTEVNYTQLVDLHAQYVEQGLRILGFPCNQFGKQEPGTNSEIKEFAAGYNVKFDMFSKVCVNGDEAHPLWKWLKDQPKGKGTLGNAIKWNFTKFLIDREGQVVKRYGPMDEPRVIEKDLPCY

>GPX5_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DBXSK length=158 run=R_2007_08_23_18_06_11_ 
LLGDPTRLFWSPMKTHDIRWNFEKFLVGPDGVPVMRWYHRATVSTVK

>GPX7tacAcu EUPZL4S02GXFL4
QLQKDLGPSHFNVLAFPCNQFGQQEPDSNKEIESFVRKTYGVSFPMFSKIAVSG 163

>MSRB1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01EKBYH length=230 run=R_2007_08_23_18_06_11_ 
FCSFFGGEVFQNHFETGIYVCARCGYELFSSRSKFEHSSPWPAFTETIHPDSVAKREEPGRPGAFKVSCGKCGNGL

>SELH_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01B810Z length=224 run=R_2007_08_23_18_06_11_ 
RALSLAAPHLPILLNPRQPRRNSFEVTLFGPDGTRTELWSGIKKGPPRRLKFPEPEMLADLLRSSLA

>SELK_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DCGU7 length=251 xy=1255_0209 region=1 run=R_2007_08_23_18_06_11_
VMVYISNGHVLNGQNRSPWSLSYIKDFFWGILDFIIMFFKSMIHPNVKRGCRNSSSDSKYDDGRGPPGYPRRGMGRINHSNGPNPPPMAGG

>SELM__tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01AWAS8 length=259 run=R_2007_08_23_18_06_11_ 
LNFRYEELERIPLSHMTRAEINQLVQDLGFYRKADRDAPVPPEFQQAPAK 

>SELO_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01DJRM3 length=241 run=R_2007_08_23_18_06_11_ 
YGGHQFGTWAGQLGDGRAHLIGIYTNRHGEQWELQLKGSGRTPYSRNGDGRAVFRSSVREFLGSEAMHYLRIPTSRA 

>SELS_tacAcu Tachyglossus aculeatus (echidna) EUEMSW404CHB0V run=R_2007_08_22_12_11_10_ Length = 272 72% 
0 1
2 LRVMRQRYSDTTGAAI 1
2 DPEVVVKRQEALAASRLRMQEELNAQAEKYREKQKQ 0
0 LEEAKRRQKIEIWESMQEGKSYKGNSRLRPK 0
0 QETDPGPSTSSVIPKPKPARKPLRGG 1
2 SYNPLSGEGGGTCSWRPGRRGPSSGGuG* 0

>SELT_tacAcu Tachyglossus aculeatus (echidna) EUEMSW402BA71P run=R_2007_08_22_12_11_10_ 100% length=231
VFFLSNMIENQCMSTGAFEITLNDVPVWSKLESGHLPSMQQLVQILDNEMKLNVHMDSIPHHRS

>SELT_tacAcu Tachyglossus aculeatus (echidna)  EUGXWLM01DXGYB length=235 run=R_2007_08_23_18_06_11_
YRHIASFLSVFKLVLIGLIIVGKDPFAFFGMQAPSIWQWGQENKVYACMMVFLLSNMIENQCMSTVAFEITLNDVPVWSKLESGHLPSMQQLVQILDNEMKLLNVHMDSIPHHRS
  
>SELU1_tacAcu Tachyglossus aculeatus (echidna) EUEMSW405C31QQ run=R_2007_08_22_12_11_10_ Length = 282 (74%) tSASEKK
0 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDQLGVPLYAVVKENIGTEVEDFQPYFKGEIFLDER 0
0 KRFYGPHKRKMLFLGLIRLGVWQNFIRARNKGFPPVTWEGEG         0
0 GVLLEHREREFGDKVSPASVLEAAQKIKPQ*SAS 0

>SEPHS1_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BJUY7 length=168 run=R_2007_08_23_18_06_11_
AEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIMPDNAVPGDVLVLTKPLGTQ
AKQQRSEVSFVIHNLPIIAKMAAITKACGNRFGLLQGTSSETSGGLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGNQTARIIDKPRIIEVTPRGAAAPPPDNNSSA

>SEPHS2tacAcu EUPZL4S02I1W5B
QRIRRGKLVVSREDVELAYQEAMFSMAMLNRTAAGLMHTFNAHAATDI

>SEP15_tacAcu Tachyglossus aculeatus (echidna) EUGXWLM01BKAWJ EUEMSW404CBHS5 length=262 run=R_2007_08_23_18_06_11_
LGFSSNLLCSSCDLLGQFSLTQLDPSCRGCCQEEAQFESRKLYAGAILEVCGuKLGRFPQVQAFVRSDKPKLFRGLQIKYVHGSDPVLKLLDESGNIAEELSILKWNTDSVEEFLSEKLERI 

>SEPP1_tacAcu Tachyglossus aculeatus (echidna) EUEMSW408ERBZB length=227 run=R_2007_08_22_12_11_10_ 59% 
GELERHAPPGVPVYAQDGPDPDVWSILGGGKDDFLVYDRCGRLTFHIRLPFSFLHFPYVEAAVLFSHRHDFCGNCSYY
>SELO_calMil Callorhinchus milii (elephantfish) frag tgt-cys not sel tag-stop? tga-stop? taa-stop
0                 LNFDNLALRSLPVDSSGERSCRRVPGACFSLAGATPVDNPRLVASSR    0
0 QSDGRKVLRSSIREFLCSEAMFHLGIPTTRAGTCVTSDTKVLRDVFYDGNSKHENCTIILRIAPTFLRF 2
1 FGSFEILKPEDELTGRQGPSSNRNDIRIQMLDYVIGTFYPEAQQAHPENQVQRNAAFFRE 0
0 VTQRTARLVAEWQCVGFCHGVLNTDNMSIMGLTIDYGPFGFMDR 2 
1 FDPSYICNASDNRGRYAYNQQPEICKWNLGKLAEVLVPELPLKDSQSIIDEEYDTEFQRHYLQKMRKKLGLLQCEQEDDDKLVSELLDIMYRT 1
2 GADFTNTFYLLSSFPVELESPGLAEFLARLMEQCASLEELRLAFRPQMDPR 2
1 QLSILLMLSQSNPQLFEVIGSKEGIAKELDLIERSSKLQQATAEDIHSNNAKVWTEWLQKYR 2
1 SRLATEAEGVDDVDEQNAERVKVMNLNNPKFILRNYIAQNAIEAAEKGDFSE 0
0 VHLLQKTLRHPFHKQREAEEAGYSSRPPLWARELRVSCSS-R-P* 0

>GPX2_calMil Callorhinchus milii (elephantfish) tga-cys tag-stop full
0 MANSRKFYGFSTKLLNGQTLNFSKFKGKVVLIENVASLuGTTARDYTQMNELQSRYSREGFAVLGFPCNQFGFQ 0
0 ENCKNDEILKTLKYVRPGGGYEPNFTMFQKSIVNGDGTHPLFAYLKEKLPYPDDDPVSFMKDPQSINWSPVCRSDISWNFEKFLIGPDGEPFKRYSKKFETIQIEPDIQRLLKVAK* 0

>SEPHS1_calMil Callorhinchus milii (elephantfish) frag
0 MSVRETFNPENYELDKNFRLTRFAELKGTGCKVPQDVLHKLLEALQENHYQEDEQFLGAVMPRL 1
2 GIGMDSCVIPLRHGGLSLVQTTDFFYPLVDDPYMM 0
0 GRIACANVLSDLYAMGVTECDNMLMLLGVSNKMADK 0
0 ERDKVMPLIIQGFKDAADEAGTAVTGGQTVLNPWIILGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMFNMARLNRT 1
2 1
2 GGLLICLPREQAARFCAEIKSPKYGEGYQAWIIGIVEKGNRTARIIDKPRIIEVAPTVATQNVNPTPGATS*

>SEPN1_petMar Petromyzon marinus (lamprey) tga-sel taa-stop frag
0 0
0 EMALRTLGNDGLFLFTSLDTNMDMQISPEEFRPIVDKII 1
2 GPPPSEYEGTQEADPQGEGLTMLARFEPLLMETMSKSRDGFLG 0
0 VGQSCLAGLRGWKKAEAPSQHFGANQFKVFLPPKSDLELGEAWWLVPNDLNLFTGYLPNSRYYPPPPVAKE 0
0 IIIFKLLSMFHPRPFVKSRFAPQGSVACIRAQSDMYYDIVFR 2
1 VHAEFQLNEPPAFPFWFTPAQFTGHVTIARDSSHVRAFHMFVPNNR 2
1 SLNVDMEWLFGSMDQGNMEVDIGFMPK 0
0 MELVAEGPSVPALIYDENGNAINTSDPDVEPIQFVFENIEWRSEISFQEAYRQMEVAMYPFKK 0
0 IQYHPFTEAFEKAKAEDKLVHSILLWGALDDQSCu 1
2 KVLTQSLKFDFNITSPVLALLSENFISSWSLVKDLEDL 0
0 KQEEQAEHAKWATLHLAKYTFPVEMMIALPNGTV 0
0 VHCINANDFLDATAVKAEDLTPDLPAEFLDPTSTTYLKFLKEGLQKAQTYLQA* 0

>SELO_petMar Petromyzon marinus (lamprey) frag
0        GASMCSLYVLPLGRN 1
2 APRQVPGAVFSRVRPSPVERPRVVAISVPALRLLGLRDPEAEAARPEAAEFLSGNRVPPGAQPAAHCYCGHQFGSFAGQLGDGAAMYLGEVQPGPGQRWEVQLKGAGPTPYS 2
1 HSDGRKVLRSSLREFLCSEAMHHLGVPTTRAGSCVTSHSTVLRDVHYDGNARPEQCSVVLRIAPSFLR 0
1 FGSFEIFKSTDKDTGRTGPSAGREDIKVTMLDYVIDTFYPELLEGHGDGASHKYTAFFRE 0
0 VVRRTAHLVAEWQCVGFCHGVLNTDNMSILGLTIDYGPFGFMDR 2
1 GADFTNSFRLLSRLTLEPGSVEELATLLCQQCATVEEMKRAYKPRIDPR 2

>MSRB3_petMar Petromyzon marinus (lamprey) cDNA FD728382 tga-sel taa-stop full
0 MSWAARLLGCAAVRAAGRAFGSVHARRHIALSTTLV 1
2 ASCKDTRTCRVNFAEEELRKKLSPLEFHVTQEKGTEP 2
1 AFTGKFAEHKGKGTYGCVVCQAPLFA 2
1 SQAKYDSGS 1
2 GWPSFYNIIRANAVSHSKDRSHGMHRTEVTCGQ 0
0 CGAHLGHVFDDGPPPTGKRFCINSASLNFNPSAEEEGGAEAVEEAPVKPELuGIGGIV* 0

>SELU3_petMar Petromyzon marinus (lamprey) cDNA EE741479 ti|1430987375 tga-sel taa-stop frag
             VRPDPYIYKALELKMGEDVDKIY 1
2 KSPHVHCGLWWGVARGLWRAMWSESFDFQGDPKQQGGALVLGP 1
2 GGRVLFSHRDEAVLDHTPINRLLAVAGIPPVDFTHKHMVKHVuPRPLTPTG* 0

>SELS_petMar Petromyzon marinus (lamprey) cDNA EB082976 39% identity homSap tga-sel taa-stop full
0 MDPRNEEPAVIRGVSDAVSELLARFGWPLLLFCALLYFTVRRAAPWLRWGRSSSSSTSISNPDLLMNMHSAMERSRIRMQQELDARAAEHQARVKQLEEERQKQ
KMESWERGRSLRPRRDPQSQQEDNSTPSTSLPRTERQRLRDNNYSPLSGGGGPTCLWRPGRRGPASGGGuG* 0

>SELW_petMar Petromyzon marinus (lamprey) cDNA FD700531 tga-sel tag-stop full
0 MPLKIHIVYCGAuGYRSRFHRLKDELETEFPGELEITGEGTPTQTGFLEVQIVGGKLLHSKANGDGFVDSDEKLQKIFSGVEKALKK* 0

>SELT_petMar Petromyzon marinus (lamprey) cDNA EE737658 tga-sel tag-stop full
0 MRTHLFAGPVLKVEYC 2
1 VSuGYRRVFEEYSRVIAERFPDIRVEGDNYLPQPLYR 2
1 YIASFFSVFKLVLIGLVLSGKNLFPMLGVDTPGVWTWSQENK 0
0 LYACLMIFFVSNMVETQCMSTGAFEVSLN 1
2 DVPVWSKLQSGRVPSPQEILQILDNHVKLSGGSAGRMQPS* 0

>SEP15_petMar Petromyzon marinus (lamprey) cDNA EG333213 tga-sel taa-stop full
0 MAAVRPGLLLLLLKAVS 0
0 VYATELTSEACRDLGFSSNLLCSSCDLLSQFGLDQLDPGCKRCCQLEVEESALKL 0
0 YPGAVLEVCGuKLGRFPQVQ 1
2 AFVRSNKPSTFK 0
0 GLTIKYVRGSDPVLKLLDESGNVAEELSITKWNTDSVEEFLSEKLERL

>GPX2_petMar Petromyzon marinus (lamprey) cDNA LyEST1379 tga-sel tag-stop full
0 MKSFYELSAKTLGGELVSFSRYRGKVVLVENVASLuGTTTRDFTQLNELQGRYGAQGLAVLGFPCNQFGHQ 0
0 ENSQNEEILNTLKYVRPGSGYEPNFTMFAKCEVNGKDTHPVFEFLKEKLPLPSDDPISFLQDPKHIIWSPVSRSDIAWNFEKFLVGPDGQPFKRYSKKFQTINIEEDLKFLLKQVK* 0

>SEPHS1_petMar Petromyzon marinus (lamprey) cDNA LyEST8076 FD729689 tga-sel tgc-cys tag-stop not so virtual selenocys full
0 MSVHRYFDPEDHDLDKSFRLTKFSELKG*GCKVPQETLLKLLEGLEQDGPYQDEHQQFMGAVMPRL 1
2 GIGMDACVIPLRHGGLSLVQTTDFFYPLVDDPYMM 1
2 GKIACANVLSDLYAMGVTECDNMLMLLAISQKLSEK 0
0 ERDKVIPLMIRGFKDAAEEAGTTVTGGQTVVNPWIVIGGVATTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVNAHQWLDM 0
0 PEKWNKIKLVVTQEDVELAYQEAMFNMARLNRTAAGLMHTFNAH 2
1 AATDITGFGILGHAQNLAKQQRNEVSFVIHNLPVIAKMAAISKACGNLFGLLQGTSAETSG 1
2 GLLICLPREQAARFCAEIaKYGEGHQAWIIGIVEKGGRTARIIDKPRIIEVAPRGASPTGVASPDTPTGPPLT* 0

>DIO2a_petMar Petromyzon marinus (lamprey) frag
0 MTAELNVSVFVALRILPGFFTNCLFLGLRDVLALLARSTRALFARHVPASCCPCPPEACRRVLTRAGMRAVWRSFLLDARRE 0
0 AQRAGDPAPNPRVALLADARSSSPAAAAAIPCRLQELSREGRPLVINMGSASuPPFVGRLPEFRRVVDDFSHAADFLLVYVEEAHPSDGWAVPGALQVSRARSLENNNSMA

>DIO2b_petMar Petromyzon marinus (lamprey) frag
0 SRPVVDSLLILPGVFSNCLFLALYDAVSFLRRALQASLTHSAKGDAQHPRMLAGQGMLSVWRSYVLDAHKK
VRLGGEAPNSSV RPSPPQPPPPQLRQAPPCRLLDFARAHRPLVVNFGSASuPPFVEQLGEFCDLVRDFADVADFLVVYIEEAHPSDAWPAPGGLEVPRHLALGDRCVAASQLRGLMPPLGRCPVVADAMDNNANIDYGVSYERLYVIQDG
RIRYLGGKGPFFYRVREVKSFLESVKASR* 0

>GPX4_eptBur Eptatretus burgeri (hagfish) BJ646422 tga-sel frag
0                          GTM 0
0 ASNGEWQTAKNMFEFSAMDIDGNNVSLEKYR 2
1 GHVSIVVNVASKuGKTTVNYTQLSAMHAKYADSHGLRILAFPCNQFGKQ 0
0 EPGTDAEIKAFAAGYDVHFDLFSKIMVNGDDAHPLWKWMKSQRYGHGTLG 2
1 NAIKWNFTK 0
0 FLIDKEGQVVKRYGPIDDPV 0
0 VI 

>GPX1_eptBur Eptatretus burgeri (hagfish) BJ648558 tga-sel tag-stop frag
0       SARGIAKSFYELSARNLAGuELVQFSKFRDKVVLIENVASLuGTTSRDFTQMNQLHQRLGVHGLVVLGFPCNQFGHQ 0
0 ENATNEELLQSLKYVRPGRGFEPNFPIFDKCEVNGANAHPLFTFLKEHLPLPSDNPTCFMSDCKSIIWSPVQRSDIAWNFEKFLIAPNGEPFRRYSKLYQTIDLEPDICKLLGL* 0

>SEPHS1_eptBur Eptatretus burgeri (hagfish) cdna BJ649814 BJ654390 tga-sel taa-stop full
0 MSVHRYFDPEDHELDKSFRLTKFSDLKGuGCKVPQESLLKLLEGLEQDSPFQDEHQQFMGAVMPRL 1
2 GIGMDSCVIPLRHGGLSLVQTTDFFYPLVDDPYMM 0
0 GKIACANVLSDLYAMGVTDCDNMLMLLGISQKLSEK 0
0 ERDKVVPLMVRGFKDAAEEAGTTVTGGQTVMNPWIIIGGVASTVCQPNEFIM 2
1 PDNAVPGDVLVLTKPLGTQVAVAVHQWLDI 0
0 PEKWNKIKLVVTQEDVELAYQEAMMNMARLNRTAAGLMHTFNAH 1
2 AATDITGFGILGHAQNLARQQRNEVAFVIHNLPRNEVAFVIHNLPVVAKMAAISKACGNLFGLLQGTSAETSG 1
2 GLLICLPREQAARFCAEIKSPKYGEGHQAWIIGIVEKGARTARIIDKPRIIEVAPRGAPPPTVTTPTTPTAPLS* 0

>SELT_eptBur Eptatretus burgeri (hagfish) cdna BJ650136 tga-sel tag-stop full
0 MKSKMYSGPELLFQYC 2
1 ISuGYRRVFEEYSQALRERYPDIRIEGSNYPPPPLYS 2
1 TCASVLSVLKVMLIVLVVSGRNPFPLLGLDTPNAWNWSQNNK 0
0 IYACLMIFFLTNMIENQCLSTGAFEVVFN 1
2 DVPIWSKLQSGRVPSLPELAQILDNHLAMGGQASPSNTHGPQ* 0

>SEP15_eptBur Eptatretus burgeri (hagfish) BJ647169 tga-sel taa-stop
0         FGTRLGAGSFLALCLVS 0
0 VSAAELSSEVCRDRGFSSGLVCSSCDLLAQFDLHRLDPDCRSCCQPEVEEQEIKR 0
0 YAGAVLEVCGuKLGRFPQVQ 1
2 AFVKSNKPSTFKGLTTK 0
0 YVRGADPVLKLLDQDGNVAEELSITKWNTDSVEEFLSENLERL* 0

>SELW_eptBur Eptatretus burgeri (hagfish) cdna BJ662449 tgn-sel tag-stop
0 MPLKIHVVYC 2
1 GSuGYASK 0
0 FRALKVKLDHEFPGKLEI 0
0 TSEGTPGLTGKFEVQVGEKLVHSKK 0
0 NGDGFVDSPEKLQKIFKAVENALKGQ* 0

>SELT_eptBur Eptatretus burgeri (hagfish) BJ650136 tga-sel tag-stop
0 MKSKMYSGPELLFQYC 2
1 ISuGYRRVFEEYSQALRERYPDIRIEGSNYPPPPLYS 2
1 TCASVLSVLKVMLIVLVVSGRNPFPLLGLDTPNAWNWSQNNK 0
0 IYACLMIFFLTNMIENQCLSTGAFEVVFN 1
2 DVPIWSKLQSGRVPSLPELAQILDNHLAMGGQASPSNTHGPQ* 0