Chrom Alias

From genomewiki
Revision as of 21:31, 6 May 2022 by Hiram

Usage

The chrom alias mechanism works automatically in the genome browser for custom track data and track hub data. It converts chromosome names from alternate naming schemes to the names used by the assembly in the genome browser.

Format

The first line of the chromAlias.txt file begins with the pound symbol # followed by a blank space. The remaining words on this line, separated by tab characters, each identify the source authority for the names in that column of the file. The first column holds the sequence names used by the genome browser for the assembly. Subsequent columns hold alternate naming schemes.

The lines following the column header line are columns of sequence names separated by tab characters. When a sequence has no equivalent name in a given naming scheme, that column is left empty, so the line contains two adjacent tab characters.
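The format described above can be read with a few lines of Python. This is a minimal sketch, not the genome browser's own parser: the function name parse_chrom_alias and the sample rows are illustrative assumptions, and the row named chrHypothetical is invented only to show an empty column.

```python
import io


def parse_chrom_alias(handle):
    """Parse chromAlias.txt-style text into (column names, list of row dicts).

    Hypothetical helper for illustration; not part of any UCSC tool.
    """
    header = handle.readline().rstrip("\n")
    if not header.startswith("#"):
        raise ValueError("first line must begin with '#'")
    # Drop the leading '#' and the blank space after it, then split on tabs.
    columns = header[1:].strip().split("\t")
    rows = []
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue  # ignore blank lines
        fields = line.split("\t")
        # Two adjacent tabs yield an empty string: no name in that scheme.
        fields += [""] * (len(columns) - len(fields))
        rows.append(dict(zip(columns, fields)))
    return columns, rows


# Made-up sample: the second row has no name in the "assembly" column.
sample = (
    "# ucsc\tassembly\tgenbank\n"
    "chr1\t1\tCM000663.2\n"
    "chrHypothetical\t\tXX000000.1\n"
)
columns, rows = parse_chrom_alias(io.StringIO(sample))
print(columns)
print(rows[1])
```

Splitting on the tab character alone, rather than on arbitrary whitespace, is what preserves the empty columns.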

Example

# ucsc  assembly        genbank ncbi    refseq
chr1    1       CM000663.2      1       NC_000001.11
chr10   10      CM000672.2      10      NC_000010.11
chrX    X       CM000685.2      X       NC_000023.11
chrM    MT      J01415.2        MT      NC_012920.1

In this example the columns are:

  1. ucsc - UCSC style chrN names
  2. assembly - names from NCBI file assembly_report.txt
  3. genbank - INSDC names
  4. ncbi - from chr2acc file in assembly_structure/ hierarchy
  5. refseq - names from RefSeq annotations
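
As a rough illustration of the name conversion, the example table can be turned into a lookup from any alternate name to the UCSC name in the first column. This sketch assumes the fields are tab separated as described in the Format section. The function name build_alias_map is hypothetical, and this is not how the genome browser itself implements the conversion.

```python
# The example table from this page, with explicit tab separators.
EXAMPLE = """\
# ucsc\tassembly\tgenbank\tncbi\trefseq
chr1\t1\tCM000663.2\t1\tNC_000001.11
chr10\t10\tCM000672.2\t10\tNC_000010.11
chrX\tX\tCM000685.2\tX\tNC_000023.11
chrM\tMT\tJ01415.2\tMT\tNC_012920.1
"""


def build_alias_map(text):
    """Map every name in a row to that row's first-column (browser) name."""
    alias_to_ucsc = {}
    lines = text.splitlines()
    for line in lines[1:]:  # skip the '# ...' column header line
        fields = line.split("\t")
        if not fields[0]:
            continue  # skip blank or malformed lines
        ucsc_name = fields[0]
        for alias in fields:
            if alias:  # empty columns carry no alias
                # Keep the first mapping seen if an alias repeats.
                alias_to_ucsc.setdefault(alias, ucsc_name)
    return alias_to_ucsc


alias_map = build_alias_map(EXAMPLE)
print(alias_map["NC_000023.11"])
print(alias_map["MT"])
```

The same alias often appears in more than one column of a row, as with 1 in both the assembly and ncbi columns here, so the map keeps one entry per alias.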