QA scripts: Difference between revisions

From Genecats
Jump to navigationJump to search
(getting started with redmine 3183)
Line 1: Line 1:
==Shorthand Key:==
This is a list of the most frequently-used programs and scripts that QAers use. It does not include track-specific programs, such as chainNetTrio.csh. Some are devised by QA and live in the source tree at kent/src/utils/qa. Some are
  file name flnm
  database name dbnm
  table name tblnm


==Scripts==
=Must know about!=
catchBOT.csh                     check usage of highest users
nice top                     see who is doing what & the time on the server
up time             load average
featureBits dbnm tblnm             compares total amount of data in table compared to the entire assembly (not including gaps)
checkCoverage.csh dbnm tblnm [chrom]     finds non-gap places in table where there is no data and gives you the largest places with browser link to top 3
findLevel.csh dbnm tblnm     searches trackDb hierarchy for your table and corresponding .html file
commTrio.csh                     sorts & compares two files
encodeEmail.pl                     codes email addresses for html pages


countPerChrom.csh dbnm tblnm             number of material on each chrom
==bigPush.csh==
countPerChrom.csh aplCal1 rmsk | awk '{ print $2 "\t" $1}' | grep '^0'
alternative: to see those chroms with data: hgsql -e "select tName from all_mrna;" aplCal1 | sort| uniq -c | sort -rn | less (per Hiram; more efficient)


Pushes tables in list to mysqlbeta and records size.
  Requires sudo access to mypush to run.
 
  Do not redirect output or run in the background,
  as it will require you to type your password in.
  Program will ask you for your password again after
  large tables. If you take too long to re-type in
  the table the script stalled on might not get
  pushed. Double-check that all tables have been
  pushed!
 
  Will report total size of push and write two files:
  db.tables.push -> output for all tables from mypush
  db.tables.pushSize -> size of push
==commTrio.csh==
Sorts and compares two files. 
Counts unique and common records.
    usage:  leftFileName rightFileName [rm]
            optional [rm]: remove the three output files when finished
==compareWholeColumn.csh==
  gets a column from a table on dev and beta and checks diffs.
  reports numbers of rows unique to each and common.
  can compare to older database.
  writes files of everything.
    usage:  database table column [db2]
==compareWholeTable.csh==
  gets an entire table from two machines and checks diffs.
  reports numbers of rows unique to each and common.
  writes files of everything.
  not real-time on RR -- uses genome-mysql.
    usage:  database table [machine1] [machine2]
      (defaults to dev and beta)
=Might also like!=
==checkPushedFiles.csh==
checks to see if files are in place, after a push
usage: website files(s)
website should include the path of the directory where
the files reside, such as:
  http://hgdownload.cse.ucsc.edu/goldenPath/hg19/liftOver/
file(s) is either a single name or a list of names, and can
include items with additional directory structure, like so:
  filename
  dir/filename
  dir/dir/dir/filename
any output other than '200 OK' indicates an error.
==compareTableToFile.csh==
Ensures that a table correlates with its associated file.
Only prints results if there is a diff between table and file.
Works for these file types: narrowPeak, broadPeak, gappedPeak,
                            bedGraph, NRE, BiP, gcf
For wiggle files, you must specify [wig] parameter.
  usage:  database tableName fileName [wig] [verbose]
  fileName includes path of download file
  e.g. /goldenPath/<db>/fileName.gz
  use verbose for more details


[[Category:Browser QA]]
[[Category:Browser QA]]
[[Category:Browser QA Training]]
[[Category:Browser QA Training]]

Revision as of 02:11, 27 September 2011

This is a list of the most frequently-used programs and scripts that QAers use. It does not include track-specific programs, such as chainNetTrio.csh. Some are devised by QA and live in the source tree at kent/src/utils/qa. Some are

Must know about!

bigPush.csh

Pushes tables in list to mysqlbeta and records size.

 Requires sudo access to mypush to run.
 
 Do not redirect output or run in the background,
 as it will require you to type your password in.
 Program will ask you for your password again after
 large tables. If you take too long to re-type in
 the table the script stalled on might not get
 pushed. Double-check that all tables have been
 pushed!
 
 Will report total size of push and write two files:
 db.tables.push -> output for all tables from mypush
 db.tables.pushSize -> size of push

commTrio.csh

Sorts and compares two files.  
Counts unique and common records.
    usage:  leftFileName rightFileName [rm]
            optional [rm]: remove the three output files when finished

compareWholeColumn.csh

 gets a column from a table on dev and beta and checks diffs.
 reports numbers of rows unique to each and common.
 can compare to older database.
 writes files of everything.
   usage:  database table column [db2] 

compareWholeTable.csh

 gets an entire table from two machines and checks diffs.
 reports numbers of rows unique to each and common.
 writes files of everything.
 not real-time on RR -- uses genome-mysql.
   usage:  database table [machine1] [machine2]
     (defaults to dev and beta)


Might also like!

checkPushedFiles.csh

checks to see if files are in place, after a push
usage: website files(s)
website should include the path of the directory where
the files reside, such as:
  http://hgdownload.cse.ucsc.edu/goldenPath/hg19/liftOver/ 
file(s) is either a single name or a list of names, and can
include items with additional directory structure, like so:
  filename
  dir/filename
  dir/dir/dir/filename
any output other than '200 OK' indicates an error.

compareTableToFile.csh

Ensures that a table correlates with its associated file.
Only prints results if there is a diff between table and file.
Works for these file types: narrowPeak, broadPeak, gappedPeak,
                            bedGraph, NRE, BiP, gcf
For wiggle files, you must specify [wig] parameter.
 usage:  database tableName fileName [wig] [verbose]
  fileName includes path of download file 
  e.g. /goldenPath/<db>/fileName.gz
  use verbose for more details