<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>CloVR</title>
	<atom:link href="http://clovr.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://clovr.org</link>
	<description>Automated Sequence Analysis from Your Desktop</description>
	<lastBuildDate>Thu, 16 May 2013 22:29:25 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>CloVR and IonTorrent</title>
		<link>http://clovr.org/2012/10/05/clovr-and-iontorrent/</link>
		<comments>http://clovr.org/2012/10/05/clovr-and-iontorrent/#comments</comments>
		<pubDate>Fri, 05 Oct 2012 15:11:21 +0000</pubDate>
		<dc:creator>Sam Angiuoli</dc:creator>
				<category><![CDATA[Blog]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=1177</guid>
		<description><![CDATA[Great post about using CloVR for microbial genome annotation Microbial Annotation in the Cloud: CloVR and Ion Torrent]]></description>
				<content:encoded><![CDATA[<p>Great post about using CloVR for microbial genome annotation <a href='http://www.edgebio.com/microbial-annotation-cloud-clovr-and-ion-torrent'>Microbial Annotation in the Cloud: CloVR and Ion Torrent</a></p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2012/10/05/clovr-and-iontorrent/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Release clovr-1.0-RC4</title>
		<link>http://clovr.org/2012/05/15/release-clovr-1-0-rc4/</link>
		<comments>http://clovr.org/2012/05/15/release-clovr-1-0-rc4/#comments</comments>
		<pubDate>Tue, 15 May 2012 14:40:32 +0000</pubDate>
		<dc:creator>Sam Angiuoli</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=1168</guid>
		<description><![CDATA[CloVR 1.0-RC4, the fourth release candidate, is available for download. See release notes for more information.]]></description>
				<content:encoded><![CDATA[<p><a href="http://clovr.org/download/">CloVR 1.0-RC4</a>, the fourth release candidate, is available for download. See <a title="Releases" href="http://clovr.org/developers/releases/#clovr-1.0-RC4">release notes</a> for more information.</p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2012/05/15/release-clovr-1-0-rc4/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CloVR 1.0-RC3 available for download</title>
		<link>http://clovr.org/2011/10/26/clovr-1-0-rc3-available-for-download/</link>
		<comments>http://clovr.org/2011/10/26/clovr-1-0-rc3-available-for-download/#comments</comments>
		<pubDate>Wed, 26 Oct 2011 14:37:56 +0000</pubDate>
		<dc:creator>Sam Angiuoli</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=1165</guid>
		<description><![CDATA[The third release candidate of CloVR 1.0-RC3 is available for download. See release notes for more information.]]></description>
				<content:encoded><![CDATA[<p>The third release candidate of CloVR 1.0-RC3 is available for <a href="http://clovr.org/download">download</a>. See <a href="http://clovr.org/developers/releases/#clovr-1.0-RC3" title="Releases">release notes</a> for more information.</p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/10/26/clovr-1-0-rc3-available-for-download/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CloVR 454 and Illumina assembly</title>
		<link>http://clovr.org/2011/08/25/clovr-454-and-illumina-assembly/</link>
		<comments>http://clovr.org/2011/08/25/clovr-454-and-illumina-assembly/#comments</comments>
		<pubDate>Thu, 25 Aug 2011 15:04:24 +0000</pubDate>
		<dc:creator>Cesar Arze</dc:creator>
				<category><![CDATA[Blog]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=286</guid>
		<description><![CDATA[Assembly of genomes in CloVR is currently handled via two separate pipelines; the clovr_assembly_celera pipeline operates on sequences generated via a 454 machine (FLX or Titanium) while the clovr_assembly_velvet pipeline handles sequences generated from Illumina GA machines. This blog post will detail &#8230; <a href="http://clovr.org/2011/08/25/clovr-454-and-illumina-assembly/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Assembly of genomes in CloVR is currently handled via two separate pipelines; the clovr_assembly_celera pipeline operates on sequences generated via a 454 machine (FLX or Titanium) while the clovr_assembly_velvet pipeline handles sequences generated from Illumina GA machines. This blog post will detail the procedures required to run assembly using both of these pipelines. The two datasets used in the this demonstration follow:</p>
<ul>
<li><a href="http://data.clovr.org/d/15/e-coli-454-inputs">Ecoli 454 500k dataset</a></li>
<li><a href="http://data.clovr.org/d/17/e-coli-illumina-inputs">Ecoli 4M Illumina paired-end </a></li>
</ul>
<h2></h2>
<h2><strong>Using the CloVR Dashboard to run assembly pipelines</strong></h2>
<p>The CloVR dashboard is a new graphical interface that makes running pipelines extremely easy. The dashboard can be found by navigating a web-browser to the CloVR VM&#8217;s ip address like so:</p>
<pre>http://[yourclovrvmip]/clovr</pre>
<p>Once loaded the page should look like the following:</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/Screen-Shot-2011-08-24-at-1.49.13-PM1.png"><img title="Dashboard main" src="http://clovr.org/wp-content/uploads/2011/02/Screen-Shot-2011-08-24-at-1.49.13-PM1-550x332.png" alt="" width="550" height="332" /></a></p>
<p>This is the dashboards main page from which we can configure our pipeline. We can start by tagging our data set by hitting the <strong>Add </strong>button found in the lower left corner of the dashboard (Denoted in the figure by the number one). Clicking on the add button presents us with a new dialog window.</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/add_data.png"><img title="Adding data to the CloVR dashboard" src="http://clovr.org/wp-content/uploads/2011/02/add_data-550x296.png" alt="" width="550" height="296" /></a>We can select data found under the <em>/mnt/data/user_data/ </em>folder here via the <strong>Select file from image </strong>button or can upload data from another location on our computer by using the <strong>Browse&#8230;</strong> button. Once a file has been uploaded it can be checked off in the right-hand pane of the dialog box. For this demonstration I will be running the 454 assembly pipeline so the filetype of the file is set to SFF via the drop down <strong>File Type</strong> menu. A name and description should also be given to the dataset. Once the proper data has been entered the dataset can be tagged by clicking the <strong>Tag </strong>button.</p>
<p>We should now see our dataset in the CloVR dashboard dataset left-hand pane and can proceed to configure our pipeline. We run the 454 assembly pipeline through the CloVR microbe pipeline:</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/Screen-Shot-2011-08-24-at-2.00.05-PM.png"><img title="Screen Shot 2011-08-24 at 2.00.05 PM" src="http://clovr.org/wp-content/uploads/2011/02/Screen-Shot-2011-08-24-at-2.00.05-PM-550x332.png" alt="" width="550" height="332" /></a>The dataset we just tagged can be selected from a drop-down menu and we must select the <strong>Assembly Only </strong>option as we are just interested in assembly. Any other settings pipeline settings may also be changed at this point. Once satisfied we can hit the<strong>Validate </strong>button to ensure that our configuration is ok followed by the <strong>Submit</strong> button to submit the pipeline to run.</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/Screen-Shot-2011-08-24-at-4.34.09-PM.png"><img title="Screen Shot 2011-08-24 at 4.34.09 PM" src="http://clovr.org/wp-content/uploads/2011/02/Screen-Shot-2011-08-24-at-4.34.09-PM-550x332.png" alt="" width="550" height="332" /></a>The pipeline steps can be monitored through the dashboard with more fine-grained messages concerning the status of each individual step available by clicking on a specific pipeline that is running.</p>
<p>Once the pipeline has complete successfully output can also be downloaded via the data sets pane on the left-hand side of the dashboard.</p>
<h2><strong>Command-line 454 Assembly in CloVR</strong></h2>
<p>Assembly of sequences generated via a 454 machine are handled via the clovr_assembly_celera pipeline. As the name indicates, this pipeline makes use of the celera assembler software (version 5.4) in assembling sequences.</p>
<p>To demonstrated assembly in CloVR I will be running this pipeline on a an E.coli data set containing 500,000 reads (the data set is included at the bottom of this blog post in the appendix) and executing the run on commodity home/office hardware:</p>
<pre>Intel(R) Core(TM)2 Duo CPU     E6850  @ 3.00GHz (2 Cores)
MemTotal:      3980400 kB (4GB)</pre>
<h3>1.) Preparing input and pipeline configuration</h3>
<p>To start of with we&#8217;ll want to be logged into a running CloVR image. Datasets in CloVR must be &#8216;tagged&#8217; before they are ready for use. In essence tagging a dataset is akin to associating a unique identifier to it which any CloVR component can then call upon to retrieve the data. Tagging is done via the <a href="http://vappio.sourceforge.net/docs/api/datasets.html?highlight=dataset"><strong>vp-add-dataset</strong> </a>command:</p>
<pre>vp-add-dataset --tag-name=454_assembly_test /mnt/data/454_500k.sff -o</pre>
<p><span style="font-family: Consolas, Monaco, 'Courier New', Courier, monospace; line-height: 18px;">If the command has executed you&#8217;ll receive some status information regarding the tagging process:</span></p>
<pre>Task: tagData-1296774703.25 Type: tagData       State: completed     
Num: 1/1 (100%) LastUpdated: 2011/02/03 23:11:43 UTC</pre>
<p><span style="line-height: 18px;">With our data tagged we can move onto configuring our pipeline through the use of a configuration file generated by the <strong><a href="http://vappio.sourceforge.net/docs/api/pipelines_protocols.html#vp-describe-protocols-list-protocols-and-config-options">vp-describe-protocols </a></strong>command:</span></p>
<pre>vp-describe-protocols -p clovr_assembly_celera \
  -c input.INPUT_SFF_TAG=454_assembly_test \
  -c params.OUTPUT_PREFIX=454_500k \
  -c cluster.CLUSTER_NAME=local \
  -c cluster.CLUSTER_CREDENTIAL=local \
  -c cluster.TERMINATE_ONFINISH=false \
  -c pipeline.PIPELINE_DESC="CloVR 454 assembly test" \
   &gt; /mnt/data/clovr_assembly_celera.conf</pre>
<p>The output of this command is a configuration file for the CloVR 454 assembly pipeline configured to use the dataset we tagged above. Options are configured using the <strong>-c</strong> flag and must match a parameter found in the configuration file. Some common parameters are described below; please note that not all of these parameters are required to run the 454 assembly pipeline.</p>
<ul>
<li><strong>input.INPUT_SFF_TAG </strong>- Remember our first step where we tagged our input file (I named mine 454_assembly_test)? That value is entered under this option.</li>
<li><strong>params.OUTPUT_PREFIX &#8211; </strong>The output prefix is used when naming files generated by the assembly pipeline. It is practical here to use the organism name or some other descriptive text such as the <em>ecoli500k</em> value I will be using.</li>
<li><strong>params.TRIM</strong> &#8211; This option controls how stringently we ignore low-quality base pairs past the trim points set by the 454 instrument. The recommended value to use here is <em>chop, </em>which does not use any base pairs past the trim points.</li>
<li><strong>params.CLEAR &#8211; </strong>Controls where the clear range (or trim points) are specified for our input sequence data. <em>454</em> is the recommended setting here</li>
<li><strong>params.LINKER &#8211; </strong>The linker sequence to use. Set to either <em>flx</em> or <em>titanium</em> depending on the instrument used to generate sequence.</li>
<li><strong>params.INSERT_SIZE</strong> &#8211; This option should only be used when dealing with paired end datasets, which this dataset is. The format for this option is <em>&#8216;i d&#8217; </em>where mates are on average i +/- d base pairs apart. Our example dataset will use the value &#8216;<em>8000 1000</em>&#8216;</li>
</ul>
<ul>
<li><strong>cluster.CLUSTER_NAME &#8211; </strong>Set to <em>local</em> if this pipeline is being run locally (on a single machine or a local grid) or to the name of a pipeline created using the <strong><a href="http://vappio.sourceforge.net/docs/api/clusters.html#vp-start-cluster-start-a-new-cluster">vp-start-cluster</a></strong> command. We&#8217;re running this pipeline on our desktop so I will keep this parameter set to the value of <em>local</em></li>
<li><strong>cluster.CLUSTER_CREDENTIAL &#8211; </strong>This parameter should be set to <em>local</em> if this pipeline is being run on a local machine/grid and the value of our<strong> <a href="http://vappio.sourceforge.net/docs/api/credentials.html#vp-add-credential-adding-a-credential">vp-add-credential</a> </strong>command. Keeping this one on <em>local</em> to indicate a local run.</li>
<li><strong>cluster.EXEC_NODES &#8211; </strong>This command controls the number of nodes brought up in our cluster. If this is a local run it is recommended that this number not exceed the number of CPU cores available. If being run on EC2 this will control how many EC2 instances are brought up. For this assembly example I will have this value set to <em>0 as </em>we are running on one machine.</li>
</ul>
<h3>2.) Running the pipeline</h3>
<p>Once all our configuration parameters have been set to the satisfactory options we can start our pipeline by invoking the <strong><a href="http://vappio.sourceforge.net/docs/api/pipelines_protocols.html#vp-run-pipeline-run-a-pipeline">vp-run-pipeline</a> </strong>command:</p>
<pre>vp-run-pipeline --print-task-name \
       --pipeline-config /mnt/data/454_blog/clovr_assembly_celera.conf \
       --overwrite</pre>
<p>If the command is successfully executed a task number should be returned which can be used to monitor the overall status of the pipeline.  We can view our progress of the pipeline using the Ergatis interface that is installed on the CloVR VM. Ergatis display our pipeline in a block-like visualization with each analysis step displayed as a component of the pipeline. In order to view Ergatis we must first obtain the IP address for our cluster using the <a href="http://vappio.sourceforge.net/docs/api/clusters.html#vp-describe-cluster-describe-clusters">vp-describe-cluster</a> command:</p>
<pre>vp-describe-cluster

STATE   running
MASTER  local   clovr-10-90-135-181     running
GANGLIA http://clovr-10-90-135-181/ganglia
ERGATIS http://clovr-10-90-135-181/ergatis</pre>
<p>The URL to the Ergatis interface running off of our image will be listed in the output to the vp-describe-cluster command under the <em>ERGATIS </em>parmeter. Taking this and tossing it into a web-browser should take us to the following page:</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/Screen-shot-2011-02-15-at-3.04.50-PM.png"><img class="aligncenter size-medium wp-image-301" title="Screen shot 2011-02-15 at 3.04.50 PM" src="http://clovr.org/wp-content/uploads/2011/02/Screen-shot-2011-02-15-at-3.04.50-PM-550x343.png" alt="" width="550" height="343" /></a>From here clicking on the <em>clovr </em>link on the left hand navigation will take us to a list of all our current running pipelines. The pipeline we just started should be here and viewable in the component-by-component view Ergatis is capable of displaying.</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/Screen-shot-2011-02-15-at-12.29.10-PM.png"><img class="aligncenter size-medium wp-image-302" title="Screen shot 2011-02-15 at 12.29.10 PM" src="http://clovr.org/wp-content/uploads/2011/02/Screen-shot-2011-02-15-at-12.29.10-PM-550x343.png" alt="" width="550" height="343" /></a>It is also possible to monitor cluster usage during a pipeline run by using the URL provided in the <em>GANGLIA</em> parameter from the <strong>vp-describe-cluster </strong>output:</p>
<p><a href="http://clovr.org/wp-content/uploads/2011/02/Screen-shot-2011-02-15-at-11.39.25-AM.png"><img class="aligncenter size-medium wp-image-303" title="Screen shot 2011-02-15 at 11.39.25 AM" src="http://clovr.org/wp-content/uploads/2011/02/Screen-shot-2011-02-15-at-11.39.25-AM-550x343.png" alt="" width="550" height="343" /></a></p>
<h3>3.) Pipeline Output</h3>
<p>Once our assembly pipeline run has finished CloVR automatically downloads any output back to our VM and deposits it to the output directory defined in the pipeline configuration file. Navigating to this directory should provide a tarball file containing our assembly scaffolds as well as a QC file containing metrics for the assembly (i.e. N50).</p>
<h2><strong>Command-line Illumina Assembly in CloVR</strong></h2>
<p>Illumina assembly in CloVR follows the same workflow as the 454 assembly did save for running the data through a separate pipeline and having to run the pipeline on Amazon&#8217;s EC2 compute cloud.</p>
<h3>1.) Preparing input and pipeline configuration</h3>
<p>This example will make use of an input data set of 4 million paired-end reads (provided in appendix at the bottom of this post) that must be tagged as input just like the 454 example data set.</p>
<pre>vp-add-dataset --tag-name=ecoli_4M_fastq \
                 /mnt/data/illumina_4M_1.fastq \
                 /mnt/data/illumina_4M_2.fastq -o</pre>
<p>Before moving onto generation and modification of the pipeline configuration file it should be noted that due to the size our input dataset it will require a machine with at least 16GB of RAM. During peak processing of this dataset our RAM usage will eclipse 11GB and this will lock up machines that do not have this required amount of memory. Because of the memory situation associated with this dataset we will be making use of Amazon&#8217;s EC2 compute cloud and the m2.xlarge instance type containing 16GB of memory.</p>
<p>In order to do this we must generate an EC2 cluster credential using the <a href="http://vappio.sourceforge.net/docs/api/credentials.html#vp-add-credential-adding-a-credential">vp-add-credential</a> command coupled with the private key and certificate provided with an Amazon AWS account. Instructions to register for an Amazon AWS account and place the subsequent keys can be found <a href="http://clovr.org/docs/amazon-ec2-keys/">here</a>.</p>
<p>Once our keys are in place we can create our cluster credential by using the following command:</p>
<pre>vp-add-credential --cred-name illumina_ec2 --cert /mnt/keys/ec2.cert \
                  --pkey /mnt/keys/ec2.pkey</pre>
<p>With our credential create we can move onto generating our pipeline config by running the <strong>vp-describe-protocols </strong>command with the key difference being that our pipeline name is <em>clovr_assembly_velvet</em>:</p>
<pre>vp-describe-protocols -p clovr_assembly_velvet \
       -c input.SHORT_PAIRED_TAG=ecoli_4M_fastq \
       -c cluster.CLUSTER_NAME=illumina_ec2 \
       -c cluster.CLUSTER_CREDENTIAL=illumina_ec2 \
       -c cluster.MASTER_INSTANCE_TYPE="large" \
       -c cluster.TERMINATE_ONFINISH=false \
       -c pipeline.PIPELINE_DESC="CloVR illumina test run" \
       &gt; /mnt/data/clovr_assembly_velvet_ec2.conf</pre>
<p>An explanation of the parameters used follows:</p>
<ul>
<li><strong>input.SHORT_PAIRED_TAG &#8211; </strong>This parameter tells the pipeline where to find the dataset we created when running the <strong>vp-add-dataset </strong>command. It should be set to whatever was specified in the <em>tag-name </em>flag and for this example will be set to <em>illumina_assembly_test.</em></li>
</ul>
<ul>
<li><strong>cluster.CLUSTER_NAME &#8211; </strong>A unique name given to our cluster. If a local cluster is being run this should be left as the default value of <em>local</em>. For our example I will set this to <em>illumina_ec2_cluster</em>.</li>
<li><strong>cluster.CLUSTER_CREDENTIAL &#8211; </strong>Here we will use the credential we generated above, <em>illumina_ec2</em></li>
<li><strong>cluster.MASTER_INSTANCE_TYPE &#8211; </strong>This parameter controls the type of master node used in an EC2 cluster. Because this Illumina run requires a high memory machine we set the parameter to <em>large </em>to indicate we want to use a machine with 16GB of memory.</li>
</ul>
<p>If we had a local machine with 16GB of memory or more upon which we could run this pipeline on our call to <strong> vp-describe-protocols</strong> would look like this:</p>
<pre>vp-describe-protocols -p clovr_assembly_velvet \
       -c input.SHORT_PAIRED_TAG=ecoli_4M_fastq \
       -c cluster.CLUSTER_NAME=local \
       -c cluster.CLUSTER_CREDENTIAL=local \
       -c cluster.TERMINATE_ONFINISH=false \
       -c pipeline.PIPELINE_DESC="CloVR illumina test run" \
       &gt; /mnt/data/clovr_assembly_velvet_local.conf</pre>
<p>The three parameters different between this and our EC2 run being <strong>cluster.CLUSTER_NAME, cluster.CLUSTER_CREDENTIAL, </strong>and <strong>cluster.MASTER_INSTANCE_TYPE</strong>. Here we do not need a master instance type and our cluster name and cluster credential need to be set to <em>local</em>.</p>
<p>2.) Running the pipeline</p>
<p>With our configuration file in place we can go ahead and launch our pipeline using the<span style="color: #000000;"><strong> vp-run-pipeline </strong>command:</span></p>
<pre>vp-run-pipeline --print-task-name \
  --pipeline-config /mnt/data/illumina_ec2_blog/clovr_assembly_velvet_ec2.conf \
  --overwrite</pre>
<p>Likewise if we wanted to run this pipeline locally we could substitute the <em>clovr_assembly_velvet_ec2.conf</em> file with the <span style="color: #000000;"><em>clovr_assembly_velvet_local.conf </em>generated above.</span></p>
<p>After successfully executing this command we can monitor the pipeline in Ergatis, just as described in the 454 portion of this post, using the <strong> vp-describe-cluster </strong><span style="color: #000000;">command to grab the IP of our cluster. The only </span>difference<span style="color: #000000;"> here is a requirement of our cluster name because we are running on EC2.</span></p>
<pre>vp-describe-cluster --name illumina_ec2_cluster

MASTER ec2 e2-184-72-196-20.compute-1.amazonaws.com running
GANGLIA http://ec2-184-72-196-20.compute-1.amazonaws.com/ganglia
ERGATIS http://ec2-184-72-196-20.compute-1.amazonaws.com/ergatis</pre>
<h3>3.) Pipeline Output</h3>
<p>When the pipeline has been completed pipeline output can be found in the <em>/mnt/output </em>directory.</p>
<h2><strong>Post-assembly</strong></h2>
<p>Although it wasn&#8217;t covered in this blog post, CloVR does support some basic visualization of assemblies through the AMOS packages <a href="http://sourceforge.net/apps/mediawiki/amos/index.php?title=Hawkeye">hawkeye</a> program. This functionality can be enabled by setting the <strong>SKIP_BANK</strong> parameter in the pipeline configuration to 1. Once this is enabled an AMOS bank file is generated and lumped into the output at the conclusion of the pipeline run.</p>
<h2><a href="http://clovr.org/wp-content/uploads/2011/02/Screenshot-Hawkeye.png"><img class="aligncenter size-medium wp-image-316" title="Screenshot-Hawkeye" src="http://clovr.org/wp-content/uploads/2011/02/Screenshot-Hawkeye-550x443.png" alt="" width="550" height="443" /></a><a href="http://clovr.org/wp-content/uploads/2011/02/Contig-View.png"><img class="aligncenter size-medium wp-image-317" title="Contig View" src="http://clovr.org/wp-content/uploads/2011/02/Contig-View-550x292.png" alt="" width="550" height="292" /></a><a href="http://clovr.org/wp-content/uploads/2011/02/Scaffold-View.png"><img class="aligncenter size-medium wp-image-318" title="Scaffold View" src="http://clovr.org/wp-content/uploads/2011/02/Scaffold-View-550x262.png" alt="" width="550" height="262" /></a></h2>
<h2></h2>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/08/25/clovr-454-and-illumina-assembly/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CloVR 1.0-RC2 available for download</title>
		<link>http://clovr.org/2011/08/25/clovr-1-0-rc2-available-for-download/</link>
		<comments>http://clovr.org/2011/08/25/clovr-1-0-rc2-available-for-download/#comments</comments>
		<pubDate>Thu, 25 Aug 2011 14:10:18 +0000</pubDate>
		<dc:creator>Sam Angiuoli</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=381</guid>
		<description><![CDATA[The second release candidate of CloVR 1.0-RC2 is available for download. See release notes for more information.]]></description>
				<content:encoded><![CDATA[<p>The second release candidate of CloVR 1.0-RC2 is available for <a href="http://clovr.org/download">download</a>. See <a href="http://clovr.org/developers/releases/#clovr-1.0-RC2" title="Releases">release notes</a> for more information.</p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/08/25/clovr-1-0-rc2-available-for-download/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CloVR interface demo on Youtube</title>
		<link>http://clovr.org/2011/08/02/clovr-interface-demo-on-youtube/</link>
		<comments>http://clovr.org/2011/08/02/clovr-interface-demo-on-youtube/#comments</comments>
		<pubDate>Tue, 02 Aug 2011 15:52:20 +0000</pubDate>
		<dc:creator>David Riley</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=343</guid>
		<description><![CDATA[A video has been added to the CloVR Youtube Channel that demonstrates using CloVR from image startup to results download in 5 steps. Check it out:]]></description>
				<content:encoded><![CDATA[<p>A video has been added to the CloVR Youtube Channel that demonstrates using CloVR from image startup to results download in 5 steps. Check it out:<br />
<iframe src="http://www.youtube.com/embed/By5RtWZVq-4" frameborder="0" width="300" height="255"></iframe></p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/08/02/clovr-interface-demo-on-youtube/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CloVR interface demo video available</title>
		<link>http://clovr.org/2011/08/02/clovr-interface-demo-video-available/</link>
		<comments>http://clovr.org/2011/08/02/clovr-interface-demo-video-available/#comments</comments>
		<pubDate>Tue, 02 Aug 2011 15:48:59 +0000</pubDate>
		<dc:creator>David Riley</dc:creator>
				<category><![CDATA[Blog]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=342</guid>
		<description><![CDATA[A video has been added to the CloVR Youtube Channel that demonstrates using CloVR from image startup to results download in 5 steps: Start CloVR Virtual Machine Add Data Add Credential Account Run Analysis Download Results Check it out:]]></description>
				<content:encoded><![CDATA[<p>A video has been added to the CloVR Youtube Channel that demonstrates using CloVR from image startup to results download in 5 steps:</p>
<ol>
<li>Start CloVR Virtual Machine</li>
<li>Add Data</li>
<li>Add Credential Account</li>
<li>Run Analysis</li>
<li>Download Results</li>
</ol>
<div><span style="font-size: small;"><span class="Apple-style-span" style="line-height: 24px;">Check it out:</span></span></div>
<p><iframe src="http://www.youtube.com/embed/By5RtWZVq-4" frameborder="0" width="425" height="349"></iframe></p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/08/02/clovr-interface-demo-video-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Release candidate for CloVR 1.0 available for download</title>
		<link>http://clovr.org/2011/07/01/release-candidate-for-clovr-1-0-available-for-download/</link>
		<comments>http://clovr.org/2011/07/01/release-candidate-for-clovr-1-0-available-for-download/#comments</comments>
		<pubDate>Fri, 01 Jul 2011 19:34:54 +0000</pubDate>
		<dc:creator>Sam Angiuoli</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=337</guid>
		<description><![CDATA[The first release candidate of CloVR 1.0 is available for download. Highlights of this release include a New and improved web interface with support for adding data, running and monitoring pipelines, and configuring cloud support; Improved support for the DIAG &#8230; <a href="http://clovr.org/2011/07/01/release-candidate-for-clovr-1-0-available-for-download/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>The first release candidate of CloVR 1.0 is available for <a href="http://clovr.org/download">download</a>.</p>
<p>Highlights of this release include a New and improved web interface with support for adding data, running and monitoring pipelines, and configuring cloud support; Improved support for the <a title="DIAG" href="http://diagcomputing.org">DIAG cloud</a>; Expanded output and published SOPs for the protocols, <a href="http://precedings.nature.com/documents/5887/version/1">CloVR-microbe</a>, <a href="http://precedings.nature.com/documents/5888/version/1">CloVR-16S</a>, <a href="http://precedings.nature.com/documents/5886/version/1">CloVR-metagenomics</a>; Robustness and reliability improvements; Improved API for programmatic access to the VM; VDI and expanding virtual disk support for VirtualBox</p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/07/01/release-candidate-for-clovr-1-0-available-for-download/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CloVR beta-0.6 Available for Download</title>
		<link>http://clovr.org/2011/03/16/clovr-beta-0-6-available-for-download/</link>
		<comments>http://clovr.org/2011/03/16/clovr-beta-0-6-available-for-download/#comments</comments>
		<pubDate>Wed, 16 Mar 2011 20:39:30 +0000</pubDate>
		<dc:creator>David Riley</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=321</guid>
		<description><![CDATA[A new beta release of our virtual appliance, CloVR beta-0.6, is available for download. Highlights from the release include support of the DIAG compute grid, leveraging multiple CPU cores on a local pipeline run and a prototype web interface allowing &#8230; <a href="http://clovr.org/2011/03/16/clovr-beta-0-6-available-for-download/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>A new beta release of our virtual appliance, CloVR beta-0.6, is available for <a href="http://clovr.org/download">download</a>. Highlights from the release include support of the <a href="http://diagcomputing.org">DIAG</a> compute grid, leveraging multiple CPU cores on a local pipeline run and a prototype web interface allowing the launch of CLoVR Search, Microbe, Metagenomics and 16s tracks with just a few clicks.</p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/03/16/clovr-beta-0-6-available-for-download/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cunningham &amp; Autoscaling</title>
		<link>http://clovr.org/2011/02/04/cunningham-autoscaling/</link>
		<comments>http://clovr.org/2011/02/04/cunningham-autoscaling/#comments</comments>
		<pubDate>Fri, 04 Feb 2011 15:25:35 +0000</pubDate>
		<dc:creator>James White</dc:creator>
				<category><![CDATA[Blog]]></category>

		<guid isPermaLink="false">http://clovr.org/?p=287</guid>
		<description><![CDATA[Several CloVR tracks now utilize autoscaling in the cloud to improve performance. Efficient autoscaling can be difficult to achieve and requirements vary for each pipeline. If we underestimate the number of instances needed, the pipeline will take significantly longer. If &#8230; <a href="http://clovr.org/2011/02/04/cunningham-autoscaling/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Several <a href="../methods">CloVR tracks</a> now utilize autoscaling in the cloud to improve performance. Efficient autoscaling can be difficult to achieve and requirements vary for each pipeline. If we underestimate the number of instances needed, the pipeline will take significantly longer. If we overestimate how many instances we need, then we end up wasting resources (and money in the case of Amazon EC2).</p>
<p>In our workflows, some form of BLAST often takes up the majority of cpu time. Fortunately this process can be parallelized fairly easily by partitioning the query dataset. But a few questions arise:</p>
<ul>
<li>How much should we partition the query data?</li>
<li>How many parallelized jobs do we schedule?</li>
<li>How many instances do we need to request without being wasteful?</li>
</ul>
<p>To help answer these questions in CloVR, we&#8217;ve developed a BLAST runtime estimator called <strong>Cunningham</strong>. Cunningham computes statistics about the shared sequence composition between a database and a query dataset in order to estimate how many cpu hours a corresponding BLAST job would take. With this information, each pipeline determines how many instances it will need overall and how to partition the data.</p>
<p>Currently, we support BLAST{N,P,X} runtime estimates against several well known databases including: NCBI COGs, eggNOG, KEGG genes, NCBI-NR, SILVA, &amp; RefSeq (microbial genomes). Cunningham is lightweight and quite fast, typically requiring a few minutes to run. If you&#8217;re interested check out the <a href="http://precedings.nature.com/documents/5593/version/1" target="_blank">Cunningham white paper</a> in Nature Precedings.</p>
<p>The <a href="http://sourceforge.net/projects/cunningham/ " target="_blank">latest version of Cunningham</a> is freely available through SourceForge.</p>
]]></content:encoded>
			<wfw:commentRss>http://clovr.org/2011/02/04/cunningham-autoscaling/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
