Mm9 multiple alignment: Difference between revisions

From genomewiki
Jump to navigationJump to search
No edit summary
 
No edit summary
Line 1: Line 1:
Mm9 mouse multiple alignment
To avoid artifacts in downstream processing of the UCSC multiple alignments, it is important to be careful on the use of the parameters used in the blastz processing pipeline.  There are a number of steps in the pipeline and a variety of tunable parameters involved.  This page will track the various parameters used in the alignments as they proceed toward the completion of a multiple alignment conservation track on the mm9 mouse (NCBI build 37) assembly
To avoid artifacts in downstream processing of the UCSC multiple alignments, it is important to be careful on the use of the parameters used in the blastz processing pipeline.  There are a number of steps in the pipeline and a variety of tunable parameters involved.  This page will track the various parameters used in the alignments as they proceed toward the completion of a multiple alignment conservation track on the mm9 mouse (NCBI build 37) assembly


blastz alignment parameters
==blastz alignment parameters==


<TABLE BORDER=1>
<TABLE BORDER=1>
<TR><TH>target</TH><TH>query</TH></TR>
<TR><TH>sequence</TH><TH>distance</TH><TH>axtChain<BR>minScore</TH><TH>axtChain<BR>linearGap</TH></TR>
<TR><TH>rat rn4</TH><TD>0.1587</TD><TD>3000</TD><TD>medium</TD></TR>
<TR><TH>human hg18</TH><TD>0.4667</TD><TD>3000</TD><TD>medium</TD><TR>
</TABLE>
</TABLE>
==matrix parameters==
The "medium" gap score matrix, tuned for the mouse-human distance is:
<PRE>
tableSize    11
smallSize  111
position  1  2  3  11  111  2111  12111  32111  72111  152111  252111
qGap    350 425 450  600  900  2900  22900  57900  117900  217900  317900
tGap    350 425 450  600  900  2900  22900  57900  117900  217900  317900
bothGap 750 825 850 1000 1300  3300  23300  58300  118300  218300  318300
</PRE>
The "loose" gap score matrix, tuned for the chicken-human distance is:
<PRE>
tablesize    11
smallSize  111
position  1  2  3  11  111  2111  12111  32111  72111  152111  252111
qGap    325 360 400  450  600  1100  3600  7600  15600  31600  56600
tGap    325 360 400  450  600  1100  3600  7600  15600  31600  56600
bothGap 625 660 700  750  900  1400  4000  8000  16000  32000  57000
</PRE>

Revision as of 17:09, 16 August 2007

To avoid artifacts in downstream processing of the UCSC multiple alignments, it is important to be careful on the use of the parameters used in the blastz processing pipeline. There are a number of steps in the pipeline and a variety of tunable parameters involved. This page will track the various parameters used in the alignments as they proceed toward the completion of a multiple alignment conservation track on the mm9 mouse (NCBI build 37) assembly

blastz alignment parameters

sequencedistanceaxtChain
minScore
axtChain
linearGap
rat rn40.15873000medium
human hg180.46673000medium

matrix parameters

The "medium" gap score matrix, tuned for the mouse-human distance is:

tableSize    11
smallSize   111
position  1   2   3   11  111  2111  12111  32111   72111  152111  252111
qGap    350 425 450  600  900  2900  22900  57900  117900  217900  317900
tGap    350 425 450  600  900  2900  22900  57900  117900  217900  317900
bothGap 750 825 850 1000 1300  3300  23300  58300  118300  218300  318300

The "loose" gap score matrix, tuned for the chicken-human distance is:

tablesize    11
smallSize   111
position  1   2   3   11  111  2111  12111  32111  72111  152111  252111
qGap    325 360 400  450  600  1100   3600   7600  15600   31600   56600
tGap    325 360 400  450  600  1100   3600   7600  15600   31600   56600
bothGap 625 660 700  750  900  1400   4000   8000  16000   32000   57000