Cell Browser data flow and architecture: Difference between revisions

From Genecats
Jump to navigationJump to search
 
(3 intermediate revisions by the same user not shown)
Line 18: Line 18:
</pre>
</pre>


2. The output files from <code>cbBuild</code> are placed inside <code>/usr/local/apache/htdocs-cells</code>. Note that the original configuration files and expression matrices inside the dataset directory are converted into either JSON or binary files (BIN). These files are used by the Cell Browser to ....... (help) build a cell browser.
2. The output files from <code>cbBuild</code> are placed inside <code>/usr/local/apache/htdocs-cells</code>. Note that the original configuration files and expression matrices inside the dataset directory are converted into either JSON or binary files (BIN). These files are used by the Cell Browser website to display the visualization. The original files are human readable; whereas, the ones used by the browser are for faster access.


3. Once the dataset is on [https://cells-test.gi.ucsc.edu/ cells-test], the next destination is [https://cells-beta.gi.ucsc.edu/ cells-beta]. You will push the directory and files from <code>htdocs-cells</code> onto <code>/usr/local/apache/htdocs-cells-beta</code> using the command:
3. Once the dataset is on [https://cells-test.gi.ucsc.edu/ cells-test], the next destination is [https://cells-beta.gi.ucsc.edu/ cells-beta]. You will push the directory and files from <code>htdocs-cells</code> onto <code>/usr/local/apache/htdocs-cells-beta</code> using the command:


<pre>
<pre>
cbPush dir-name-1 dir-name-2  
cbPush dir-name
cbPush "dir-name-1 dir-name-2 dir-name-3" * Use this if you would like to push multiple datasets at once to cells-beta
</pre>
</pre>


Note that cbPush requires you to input a directory name. You can paste in multiple names at the end.
Note that cbPush requires you to input a directory name.


A good alias to have in your .bashrc that pushes the current directory you are in onto beta:
A good alias to have in your .bashrc that pushes the current directory you are in onto beta:
Line 33: Line 34:
alias cbPushDir='cbPush "${PWD##*/}"'
alias cbPushDir='cbPush "${PWD##*/}"'
</pre>
</pre>
You could name this alias whatever you prefer.


4. Once your dataset is on beta, you are almost there! Once the dataset is checked over for potential bugs, you will use the command:
4. Once your dataset is on beta, you are almost there! Once the dataset is checked over for potential bugs, you will use the command:
Line 41: Line 44:


You will be prompted to type in a password, use your <code>hgwdev</code> password. Once you do that, the datasets will be built onto the hgw0, hgw1, and hgw2 machines! Voila!
You will be prompted to type in a password, use your <code>hgwdev</code> password. Once you do that, the datasets will be built onto the hgw0, hgw1, and hgw2 machines! Voila!
Important to note that <code>sudo cellsPush</code> pushes out '''ALL''' of the changes that are on beta, so make sure everything is ready to be pushed out. You can use <code>datasetDiffs -r</code> to double check if there are any additional changes that might get pushed out along with your new dataset.


== '''System Architecture Map''' ==
== '''System Architecture Map''' ==

Latest revision as of 22:58, 10 August 2023

How does data flow between the different machines?

650


How does building a cell browser work?

   What files are copied over?
   Which ones are transformed into another format?

1. Data is first deposited in a dataset directory inside /hive/data/inside/cells on hgwdev and then gets built onto cells-test using the command:

# For datasets with no additional subsets
cbBuild -o alpha

# For dataset collections you will use the recursive option "-r"
cbBuild -r -o alpha

2. The output files from cbBuild are placed inside /usr/local/apache/htdocs-cells. Note that the original configuration files and expression matrices inside the dataset directory are converted into either JSON or binary files (BIN). These files are used by the Cell Browser website to display the visualization. The original files are human readable; whereas, the ones used by the browser are for faster access.

3. Once the dataset is on cells-test, the next destination is cells-beta. You will push the directory and files from htdocs-cells onto /usr/local/apache/htdocs-cells-beta using the command:

cbPush dir-name
cbPush "dir-name-1 dir-name-2 dir-name-3" * Use this if you would like to push multiple datasets at once to cells-beta

Note that cbPush requires you to input a directory name.

A good alias to have in your .bashrc that pushes the current directory you are in onto beta:

alias cbPushDir='cbPush "${PWD##*/}"'

You could name this alias whatever you prefer.

4. Once your dataset is on beta, you are almost there! Once the dataset is checked over for potential bugs, you will use the command:

sudo cellsPush

You will be prompted to type in a password, use your hgwdev password. Once you do that, the datasets will be built onto the hgw0, hgw1, and hgw2 machines! Voila! Important to note that sudo cellsPush pushes out ALL of the changes that are on beta, so make sure everything is ready to be pushed out. You can use datasetDiffs -r to double check if there are any additional changes that might get pushed out along with your new dataset.

System Architecture Map

Cb sysarchmap.png