{"id":6200,"date":"2012-03-27T10:10:04","date_gmt":"2012-03-27T08:10:04","guid":{"rendered":"http:\/\/www.ehu.es\/sgi\/?p=6200"},"modified":"2012-06-21T01:17:07","modified_gmt":"2012-06-20T23:17:07","slug":"abyss-3","status":"publish","type":"post","link":"https:\/\/www.ehu.eus\/sgi\/scientific-software\/abyss-3","title":{"rendered":"ABySS"},"content":{"rendered":"<h2>General Information<\/h2>\n<p>1.3.2 version of ABySS <strong>(A<\/strong>ssembly <strong>By S<\/strong>hort <strong>S<\/strong>equences). ABySS is a <em>de novo<\/em>, parallel, paired-end sequence assembler that is designed for short reads. ABySS can be executed in parallel.<\/p>\n<p>See also the installed [intlink id=\u00bb6043&#8243; type=\u00bbpost\u00bb]velvet[\/intlink] and comparing both <a title=\"abyss vs velvet\" href=\"http:\/\/www.ehu.es\/ehusfera\/hpc\/2012\/06\/20\/benchmarking-genetic-assemblers-abyss-vs-velvet\/\" target=\"_blank\">we have published article<\/a>.<\/p>\n<h2>How to use<\/h2>\n<p>The executables can be found in <code>\/software\/abyss\/bin<\/code>. To run abyss in a script type in it:<\/p>\n<pre>\/software\/abyss\/bin\/abyss-pe [abyss-pe options]<\/pre>\n<h2>Performance<\/h2>\n<p>See also the installed [intlink id=\u00bb6043&#8243; type=\u00bbpost\u00bb]velvet[\/intlink] and comparing both <a title=\"abyss vs velvet\" href=\"http:\/\/www.ehu.es\/ehusfera\/hpc\/2012\/06\/20\/benchmarking-genetic-assemblers-abyss-vs-velvet\/\" target=\"_blank\">we have published article<\/a>.<\/p>\n<h3>Parallelization<\/h3>\n<p>Some benchmarks has been performed with ABySS. They have been performed using file from an Illumina HiSeq2000 NGS with 100 bp per sequence. In the table 1 we can see an example about how ABySS scales as a function of the number of cores. As we can see ABySS scales very up to 8 cores. The results is valid unless for more than 10e6 sequences.<\/p>\n<table border=\"0\" align=\"center\">\n<caption>Table 1. Execution time of <code>abyss-pe<\/code> in seconds as a function the number of cores<\/caption>\n<tbody>\n<tr>\n<td><strong>cores<\/strong><\/td>\n<td>2<\/td>\n<td>4<\/td>\n<td>8<\/td>\n<td>12<\/td>\n<td>24<\/td>\n<\/tr>\n<tr>\n<td>Time (s)<\/td>\n<td>47798<\/td>\n<td>27852<\/td>\n<td>16874<\/td>\n<td>14591<\/td>\n<td>18633<\/td>\n<\/tr>\n<tr>\n<td>Aceleration<\/td>\n<td>1<\/td>\n<td>1.7<\/td>\n<td>2.8<\/td>\n<td>3.3<\/td>\n<td>2.6<\/td>\n<\/tr>\n<tr>\n<td>Performance(%)<\/td>\n<td>100<\/td>\n<td>86<\/td>\n<td>71<\/td>\n<td>55<\/td>\n<td>21<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Execution time<\/h3>\n<p>We have analized as well the execution time as a function of the size of the data. In the table 2 we\u00a0 observe how from 1 million to 10 millions of sequences the execution time increases by 10 as well. From 10 to 100 millions of sequences the time increases a little more, between 10 t0 20. Therefore, the behavior is more or less lineal.<\/p>\n<table border=\"0\" align=\"center\">\n<caption>Table 2. Execution time in seconds of <code>abyss-pe<\/code> executed in 2, 4 y 8 cores as a function of the number of processed sequences.<\/caption>\n<tbody>\n<tr>\n<td><strong>sequences<\/strong><\/td>\n<td>10e6<\/td>\n<td>10e7<\/td>\n<td>10e8<\/td>\n<\/tr>\n<tr>\n<td>Time in 2 cores (s)<\/td>\n<td>247<\/td>\n<td>2620<\/td>\n<td>47798<\/td>\n<\/tr>\n<tr>\n<td>Time in 4 cores (s)<\/td>\n<td>134<\/td>\n<td>1437<\/td>\n<td>27852<\/td>\n<\/tr>\n<tr>\n<td>Time in 8 cores (s)<\/td>\n<td>103<\/td>\n<td>923<\/td>\n<td>16874<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>RAM memory<\/h3>\n<p>In these kind of programs more important than the execution time, which is reasonable, is the RAM memory usage, which can limit the calculation type. In the table 3 we observe how the RAM increases as a function of the number of sequences. We also show the logarithms of the measured values which has been used for a lineal regression. The jobs has been performed in 12 cores.<\/p>\n<table border=\"0\" align=\"center\">\n<caption>Tabl3 3. RAM memory used by <code>abyss-pe<\/code> as a function of the number of processed sequences. The logarithms of the measured values are also shown.<\/caption>\n<tbody>\n<tr>\n<td><strong>sequences<\/strong><\/td>\n<td>10e6<\/td>\n<td>5*10e6<\/td>\n<td>10e7<\/td>\n<td>5*10e7<\/td>\n<td>10e8<\/td>\n<\/tr>\n<tr>\n<td>RAM (GB)<\/td>\n<td>4.0<\/td>\n<td>7.6<\/td>\n<td>11<\/td>\n<td>29<\/td>\n<td>44<\/td>\n<\/tr>\n<tr>\n<td>log(sequences)<\/td>\n<td>6<\/td>\n<td>6.7<\/td>\n<td>7<\/td>\n<td>7.7<\/td>\n<td>8<\/td>\n<\/tr>\n<tr>\n<td>log(RAM)<\/td>\n<td>0.60<\/td>\n<td>0.88<\/td>\n<td>1.03<\/td>\n<td>1.46<\/td>\n<td>1.65<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>From the values of the table we obtain a fitting of the RAM in GB as a function of the number of sequences (<em>s<\/em>) to the equation<\/p>\n<p>log(RAM)=0.53*log(s)-2.65<\/p>\n<p>o equivalently<\/p>\n<p>RAM=(s^0.53)\/447<\/p>\n<h3>Conclusion<\/h3>\n<p>The memory usage is smaller than in other assemblers like [intlink id=\u00bb6043&#8243; type=\u00bbpost\u00bb]Velvet[\/intlink], see as well the report <a title=\"velvet performance\" href=\"..\/..\/ARCHIVOS\/velvet_performance.pdf\" target=\"_blank\"><em>Velvet performance in the machines of the Computing Service of the UPV\/EHU<\/em><\/a><em><\/em> and comparing both <a title=\"abyss vs velvet\" href=\"http:\/\/www.ehu.es\/ehusfera\/hpc\/2012\/06\/20\/benchmarking-genetic-assemblers-abyss-vs-velvet\/\" target=\"_blank\">we have published article<\/a>. In addition, the parallelization with MPI of ABySS allows to aggregate the RAM memory of several nodes to perform larger calculations.<\/p>\n<h2>More information<\/h2>\n<p><a title=\"ABySS\" href=\"http:\/\/www.bcgsc.ca\/platform\/bioinfo\/software\/abyss\" target=\"_blank\">ABySS<\/a> web page.<br \/>\n[intlink id=\u00bb6043&#8243; type=\u00bbpost\u00bb]Velvet[\/intlink] assembler.<br \/>\n<a title=\"velvet performance\" href=\"..\/..\/ARCHIVOS\/velvet_performance.pdf\" target=\"_blank\"><em>Velvet performance in the machines of the Computing Service of the UPV\/EHU<\/em><\/a> report.<\/p>\n<p><a title=\"velvet performance\" href=\"http:\/\/www.ehu.es\/ehusfera\/hpc\/2012\/06\/20\/benchmarking-genetic-assemblers-abyss-vs-velvet\/\" target=\"_blank\">Velvet and ABySS performance in the machines of the Computing Service of the UPV\/EHU<\/a>, post in the <a title=\"hpc blog\" href=\"http:\/\/www.ehu.es\/ehusfera\/hpc\" target=\"_blank\">hpc blog<\/a>.<\/p>\n<div id=\"_mcePaste\" class=\"mcePaste\" style=\"position: absolute; left: -10000px; top: 228px; width: 1px; height: 1px; overflow: hidden;\"><code>abyss-pe<\/code><\/div>\n","protected":false},"excerpt":{"rendered":"<p>General Information 1.3.2 version of ABySS (Assembly By Short Sequences). ABySS is a de novo, parallel, paired-end sequence assembler that is designed for short reads. ABySS can be executed in parallel. See also the installed [intlink id=\u00bb6043&#8243; type=\u00bbpost\u00bb]velvet[\/intlink] and comparing both we have published article. How to use The executables can be found in \/software\/abyss\/bin. &hellip; <a href=\"https:\/\/www.ehu.eus\/sgi\/scientific-software\/abyss-3\" class=\"more-link\">Seguir leyendo <span class=\"screen-reader-text\">ABySS<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_links_to":"","_links_to_target":""},"categories":[239,196],"tags":[],"_links":{"self":[{"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/posts\/6200"}],"collection":[{"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/comments?post=6200"}],"version-history":[{"count":14,"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/posts\/6200\/revisions"}],"predecessor-version":[{"id":6457,"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/posts\/6200\/revisions\/6457"}],"wp:attachment":[{"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/media?parent=6200"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/categories?post=6200"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ehu.eus\/sgi\/wp-json\/wp\/v2\/tags?post=6200"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}