IMGT: Difference between revisions

From genomewiki
Jump to navigationJump to search
Line 30: Line 30:
***Naming is <locus><type><subtype>-<positionOnGenome></OrphonIdentifier>*allele
***Naming is <locus><type><subtype>-<positionOnGenome></OrphonIdentifier>*allele
*** e.g. [http://www.imgt.org/IMGT_GENE-DB/GENElect?query=2+IGHV1-18&species=Homo+sapiens IGHV1-18]
*** e.g. [http://www.imgt.org/IMGT_GENE-DB/GENElect?query=2+IGHV1-18&species=Homo+sapiens IGHV1-18]
*** -18 means that is is the 18th sequence in this locus. 1 is the subtype of the J sequence, as determined by curators, by phylogenetic similarity (??)
*** <positionOnGenome> starts from one and numbers the genes along the chromosome, see [http://genome.ucsc.edu/cgi-bin/hgTracks?insideX=118&revCmplDisp=0&hgsid=202563483&hgt_doJsCommand=&position=chr14%3A106%2C368%2C424-106%2C388%2C490&hgtgroup_map_close=0&hgtgroup_phenDis_close=1&hgtgroup_genes_close=0&hgtgroup_rna_close=0&hgtgroup_expression_close=0&hgtgroup_regulation_close=0&hgtgroup_compGeno_close=0&hgtgroup_neandertal_close=1&hgtgroup_varRep_close=0 genome browser (switch on encode genes)]
*** <positionOnGenome> starts from one and numbers the genes along the chromosome, see [http://genome.ucsc.edu/cgi-bin/hgTracks?insideX=118&revCmplDisp=0&hgsid=202563483&hgt_doJsCommand=&position=chr14%3A106%2C368%2C424-106%2C388%2C490&hgtgroup_map_close=0&hgtgroup_phenDis_close=1&hgtgroup_genes_close=0&hgtgroup_rna_close=0&hgtgroup_expression_close=0&hgtgroup_regulation_close=0&hgtgroup_compGeno_close=0&hgtgroup_neandertal_close=1&hgtgroup_varRep_close=0 genome browser (switch on encode genes)]
*** [http://www.imgt.org/IMGTindex/orphon.html Orphons] [http://www.ncbi.nlm.nih.gov/pubmed/6784929 (old paper)] seem to be (nonfunctional?) copies of receptors on other chromosomes
*** [http://www.imgt.org/IMGTindex/orphon.html Orphons] [http://www.ncbi.nlm.nih.gov/pubmed/6784929 (old paper)] seem to be (nonfunctional?) copies of receptors on other chromosomes

Revision as of 22:20, 15 July 2011

Intro

  • Basic problem: If you have an immune receptor sequence, e.g. antibody or T-Cell receptor, how do you call it?
  • Just like for all other nucleotide sequences: We need
    • submitted sequences (like Genbank)
    • "reference" sequences (like RefSeq)
    • genes (like NCBI Genes), i.e. a collection of reference sequences
    • and people that sort out this whole mess, the IMGT curators
  • IMGT is a 90's generation website, like the genome browser
  • Lots of static webpages, e.g. the new year wishes page

IMGT Structure

  • The main reference database for immune loci is IMGT.org
  • It consists of databases, tools and additional files (e.g. ontology, some notes, etc)
    • LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, or combinations thereof.
    • MHC-DB
      • Also pulls in from Genbank, changes their identifiers
      • Maps Genbank sequences to MHC-"clusters" of sequences
      • E.g. GQ240381 is annotated as a sequence of HLA06329 [1]
    • GENE-DB: A list of categorized VDJC sequences and all their alleles
      • Naming is <locus><type><subtype>-<positionOnGenome></OrphonIdentifier>*allele
      • e.g. IGHV1-18
      • -18 means that is is the 18th sequence in this locus. 1 is the subtype of the J sequence, as determined by curators, by phylogenetic similarity (??)
      • <positionOnGenome> starts from one and numbers the genes along the chromosome, see genome browser (switch on encode genes)
      • Orphons (old paper) seem to be (nonfunctional?) copies of receptors on other chromosomes
    • Repertoire: descriptions of loci, genes and 3D-structures
      • Example human IGH [2]
      • Provides nice lists of a loci, with all alleles, types, etc, e.g. human IGV [3]
    • Web Resources: A mixed bag with explanations on how they curate
      • Actually the best starting point