Programmatic access to the Genome Browser

From genomewiki
Revision as of 13:18, 6 July 2015 by Max (talk | contribs)

The UCSC API for retrieving and uploading data is RESTful over HTTP. To save computational time on the server, it does not use JSON. Downloading most data formats requires client-side C tools that convert to and from binary files. Data upload uses custom text files.

Here are some common tasks that can be done from scripts with the UCSC Genome Browser. It is assumed that the reader knows the standard Unix command line tools.

Download data stored in a database table

  • use Tools - Table Browser - "Describe schema" to browse the database schema. All fields have a human readable description and the links to other tables are shown.
  • the first column in many tables with genomic coordinates is called "bin" and can be stripped for most applications
  • to access the public MySQL server, use a command like mysql --no-defaults -h genome-mysql.cse.ucsc.edu -u genome -A -e 'select * from pubsBingBlat' -NB > out.txt
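
The points above can be combined into a small shell sketch. It is an illustration under stated assumptions, not an official UCSC script. The function names fetch_table and strip_bin are hypothetical. The mysql client needs a database name to resolve an unqualified table name, and hg19 is only an assumed choice for pubsBingBlat. Check "Describe schema" to confirm that the first column of your table really is "bin" before stripping it.

```shell
#!/bin/sh
# Sketch: dump a table from the public MySQL server and drop the "bin" column.
# fetch_table and strip_bin are hypothetical helper names, not UCSC tools.

# strip_bin: remove the first tab-separated column from standard input.
# Only use this when the schema shows "bin" as the first column.
strip_bin() {
    cut -f2-
}

# fetch_table DB TABLE: print all rows of TABLE as tab-separated text.
# -N suppresses the header row, -B selects batch (tab-separated) output.
fetch_table() {
    mysql --no-defaults -h genome-mysql.cse.ucsc.edu -u genome -A \
        -NB -e "select * from $2" "$1"
}

# Example call (needs network access and the mysql client).
# The database name hg19 is an assumption; substitute the assembly you need.
# fetch_table hg19 pubsBingBlat | strip_bin > out.txt
```

Keeping the column stripping in its own function means it can be applied to any tab-separated dump, including files downloaded earlier, without querying the server again.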

Get the chromosome sequence for a range

Get the "wiggle" (x-y-plot) graph data for a chromosome range

Get a copy of the current Genome Browser image from a script

Upload a custom track and link to the genome browser with the track loaded