IMGT: Difference between revisions

From genomewiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 1: Line 1:
* The main reference database for immune-loci
* Basic problem: If you have an immune receptor sequence, e.g. antibody or T-Cell receptor, how do you call it?
* Just like genomes: We need "reference" sequences and people that choose them
* The main reference database for immune loci is IMGT.org
* It consists of databases, tools and additional files (e.g. ontology, some notes, etc)
* It consists of databases, tools and additional files (e.g. ontology, some notes, etc)
** LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, combinations thereof.  
** LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, or combinations thereof.  
*** Seems to be pulling in from Genbank and change their annotations slightly with different keywords
*** Seems to be pulling in from Genbank and change their annotations slightly with different keywords
*** Automatic transformation of manual annotation?
*** Automatic transformation or manual annotation?
*** Looks like they could have a BLAST-based pipeline that annotates the different fragments
*** Annotates Genbank sequences with the CVDJ-numbers
*** Example: Genbank AF139844
*** Example: Genbank AF139844
**** in IMGT: http://imgt.org/cgi-bin/IMGTlect.jv?query=201+AF139844
**** in IMGT: http://imgt.org/cgi-bin/IMGTlect.jv?query=201+AF139844
Line 10: Line 14:
**** Annotations that they add: [http://imgt.org/download/LIGM-DB/ftable_doc.html Ftable]
**** Annotations that they add: [http://imgt.org/download/LIGM-DB/ftable_doc.html Ftable]
** MHC-DB
** MHC-DB
*** Also pulls in from Genbank, but changes their identifiers
*** Also pulls in from Genbank, changes their identifiers
*** E.g. [http://www.ebi.ac.uk/Tools/dbfetch/emblfetch?GQ240381 GQ240381] is annotated as a sequence of [http://srs.ebi.ac.uk/srsbin/cgi-bin/wgetz?-id+2ke121D6Z1q+-e+\[IMGTHLA-ID:HLA06329\]]
*** Maps Genbank sequences to MHC-"clusters" of sequences
*** E.g. [http://www.ebi.ac.uk/Tools/dbfetch/emblfetch?GQ240381 GQ240381] is annotated as a sequence of HLA06329 [http://srs.ebi.ac.uk/srsbin/cgi-bin/wgetz?-id+2ke121D6Z1q+-e+IMGTHLA-ID:HLA06329]
** GENE-DB: A list of categorized VDJC sequences and all their alleles
***Naming is <locus><type><number>-<number>*allel
*** e.g. [http://www.imgt.org/IMGT_GENE-DB/GENElect?query=2+IGHV1-18&species=Homo+sapiens IGHV1-18]
** Repertoire: all loci with descriptions, e.g. human IGH, human IGK, etc.
*** Example human IGH [http://www.imgt.org/textes/IMGTrepertoire/LocusGenes/chromosomes/human/IGH/Hu_IGHchrom14.html]

Revision as of 02:12, 12 July 2011

  • Basic problem: If you have an immune receptor sequence, e.g. antibody or T-Cell receptor, how do you call it?
  • Just like genomes: We need "reference" sequences and people that choose them
  • The main reference database for immune loci is IMGT.org
  • It consists of databases, tools and additional files (e.g. ontology, some notes, etc)
    • LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, or combinations thereof.
    • MHC-DB
      • Also pulls in from Genbank, changes their identifiers
      • Maps Genbank sequences to MHC-"clusters" of sequences
      • E.g. GQ240381 is annotated as a sequence of HLA06329 [1]
    • GENE-DB: A list of categorized VDJC sequences and all their alleles
      • Naming is <locus><type><number>-<number>*allel
      • e.g. IGHV1-18
    • Repertoire: all loci with descriptions, e.g. human IGH, human IGK, etc.
      • Example human IGH [2]