RandomPlacement: Difference between revisions

From genomewiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 6: Line 6:
(to be done: add image to demonstrate)
(to be done: add image to demonstrate)


 
*You have a set of non-overlapping items of varying length that are currently sitting on the genome in their initial locations.  These are defined with a bed file.
*You have a set of non-overlapping items of varying length that are
*There is a set of "bounding regions" that define where these items are not allowed to be.  The items are allowed to exist anywhere in the gaps between these bounding regions.  When the randomization simulation is done, these gaps between the bounding regions are where they are allowed to be randomly placed. They always remain non-overlapping as they are randomly placed. These bounding regions are defined with a bed file.
currently sitting on the genome in their initial locations.  These are
*The distance measurement is done between all the placed items and their nearest neighbor bounding region.  They are measured as they initially sit, and after randomized placement to determine if they have a different measurement after randomization.  The distance measured is from either end of the placed item, whichever is closer to one of the bounding region elements.
defined with a bed file.
*As an alternative for distance measurement instead of to the bounding regions, a third bed file can be given that is used as items to measure distance to while still within the limitation of being placed inside the bounding regions.
*There is a set of "bounding regions" that define where these items are not
allowed to be.  The items are allowed to exist anywhere in the gaps between
these bounding regions.  When the randomization simulation is done, these
gaps between the bounding regions are where they are allowed to be randomly
placed. They always remain non-overlapping as they are randomly placed.
These bounding regions are defined with a bed file.
*The distance measurement is done between all the placed items and their
nearest neighbor bounding region.  They are measured as they initially sit,
and after randomized placement to determine if they have a different
measurement after randomization.  The distance measured is from either end
of the placed item, whichever is closer to one of the bounding region
elements.
*As an alternative for distance measurement instead of to the bounding
regions, a third bed file can be given that is used as items to measure
distance to while still within the limitation of being placed inside the
bounding regions.

Revision as of 22:39, 7 April 2006

randomPlacement description

source tree location: src/hg/randomPlacement/

It helps to draw a picture as you read this to see how it works. (to be done: add image to demonstrate)

  • You have a set of non-overlapping items of varying length that are currently sitting on the genome in their initial locations. These are defined with a bed file.
  • There is a set of "bounding regions" that define where these items are not allowed to be. The items are allowed to exist anywhere in the gaps between these bounding regions. When the randomization simulation is done, these gaps between the bounding regions are where they are allowed to be randomly placed. They always remain non-overlapping as they are randomly placed. These bounding regions are defined with a bed file.
  • The distance measurement is done between all the placed items and their nearest neighbor bounding region. They are measured as they initially sit, and after randomized placement to determine if they have a different measurement after randomization. The distance measured is from either end of the placed item, whichever is closer to one of the bounding region elements.
  • As an alternative for distance measurement instead of to the bounding regions, a third bed file can be given that is used as items to measure distance to while still within the limitation of being placed inside the bounding regions.