Genbank updates

From Genecats

To enable or disable GenBank updates for an assembly on hgwbeta and the RR:

   -> in the source tree, add/remove the assembly name from these files: 
          ~/kent/src/hg/makeDb/genbank/etc/hgwbeta.dbs
          ~/kent/src/hg/makeDb/genbank/etc/rr.dbs
      and commit.  (Don't comment out names; remove them.  Git keeps the file edit history.)
   -> ssh hgwbeta
   -> cd ~/kent/src/hg/makeDb/genbank
   -> git pull
   -> make install-rr
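
The file edit in the first step can be scripted. The following is a minimal sketch, not part of the genbank tooling: `remove_db` is a hypothetical helper, and it assumes each .dbs file lists one assembly name per line with no other syntax. Check the actual file layout before relying on it.

```shell
#!/bin/sh
# Hypothetical helper (not part of the kent tree): remove one assembly
# name from a .dbs list file.
# Assumption: the file holds one assembly name per line.
remove_db() {
    db=$1
    file=$2
    tmp=$(mktemp) || return 1
    # -x matches whole lines only and -F treats the name literally,
    # so removing "hg1" leaves "hg18" and "hg19" alone.
    grep -v -x -F -- "$db" "$file" > "$tmp"
    status=$?
    # grep exits 1 when it prints nothing (the list became empty);
    # only a status above 1 is a real error.
    if [ "$status" -gt 1 ]; then
        rm -f "$tmp"
        return "$status"
    fi
    # Overwrite in place so the file keeps its permissions.
    cat "$tmp" > "$file"
    rm -f "$tmp"
}

# Example use, following the steps above (hg18 is only an example name):
#   remove_db hg18 ~/kent/src/hg/makeDb/genbank/etc/hgwbeta.dbs
#   remove_db hg18 ~/kent/src/hg/makeDb/genbank/etc/rr.dbs
# Then commit the change, and on hgwbeta run:
#   cd ~/kent/src/hg/makeDb/genbank && git pull && make install-rr
```

The whole-line match is deliberate: a plain substring match would also delete assemblies whose names merely contain the one being removed.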

If you are disabling updates and also want to remove the download files that the GenBank process updates (in the bigZips and multiz* directories on hgdownload), ask the GenBank person to remove the assembly from the /cluster/data/genbank/data/ftp directory. The GenBank person can also drop the files from hgdownload.

Some extra notes about Genbank tables

The current list of Genbank tables is located at hgwdev:/cluster/data/genbank/etc/genbank.tbls (also located at hgwbeta:/genbank/etc/genbank.tbls). All tables in the list up to 'gbLoaded' must exist; those after 'gbLoaded' are optional. To get a list of those tables included in a database (using hg18 as an example), do:

 hgsql -N -e 'SHOW TABLES' hg18 | egrep -f /cluster/data/genbank/etc/genbank.tbls   # on hgwdev
 hgsql -N -e 'SHOW TABLES' hg18 | egrep -f /genbank/etc/genbank.tbls   # on hgwbeta
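
The rule that every table up to 'gbLoaded' must exist can be checked with a short script. This is a sketch under stated assumptions: `missing_required` is a hypothetical helper, genbank.tbls is assumed to hold one egrep pattern per line (with optional blank or `#` comment lines), and "up to 'gbLoaded'" is read as including gbLoaded itself.

```shell
#!/bin/sh
# Hypothetical helper: print each required pattern from the table list
# that matches no table in the database.
#   $1 = pattern file (e.g. genbank.tbls), assumed one egrep pattern per line
#   $2 = file with one table name per line (saved output of SHOW TABLES)
missing_required() {
    tbls=$1
    tables=$2
    # Keep lines up to and including the first one that mentions gbLoaded;
    # anything after it is treated as optional.
    sed '/gbLoaded/q' "$tbls" | while IFS= read -r pat; do
        # Skip blank lines and comment lines (an assumption about the format).
        case $pat in
            ''|'#'*) continue ;;
        esac
        # Report the pattern if no table name matches it.
        grep -E -q -- "$pat" "$tables" || printf '%s\n' "$pat"
    done
}

# Example use on hgwdev (hg18 is only an example database):
#   hgsql -N -e 'SHOW TABLES' hg18 > /tmp/hg18.tables
#   missing_required /cluster/data/genbank/etc/genbank.tbls /tmp/hg18.tables
```

Empty output means every required pattern matched at least one table; any printed line names a required pattern with no match.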

The two tables 'gbCdnaInfo' and 'gbStatus' are the main tables; they should contain all entries for a database.