楼主 | 收藏 | 举报 2018-07-06 00:00   浏览:143   回复:0

Testing SOAPdenovo2 Prerelease – IV

SOAPdenovo2 runs through the following steps -

(i) generate the de Bruijn graph and store it (‘pregraph’ option),

(ii) analyze the de Bruijn to assemble contigs,

(iii) map all reads on to the contigs,

(iv) build scaffolds by combining contigs.

Today, we will explore the second step.

The command to run is -

SOAPdenovo-63mer-v2.04.3 contig -g outPG

where outPG is the pregraph generated from the previous step.

There is very little to write about the module. We typed the command and went to get a coffee. The results were ready, when we came back. I guess we need a next-generation coffee machine

Following files were created:

outPG.contig – This is the contig file and is in fasta format. Here are the detailed stats given by SOAPdenovo2 -

516528 ctgs longer than 100, sum up 538242466bp, with average length 1042
the longest is 48565bp, contig N50 is 2861 bp,contig N90 is 445 bp
4039233 contigs longer than 46 output.

outPG.ContigIndex – A text-based index file that likely links the contigs and edges. We have not spent much time figuring out the details.

outPG.Arc, outPG.updated.edge – Two text output file.

Finally, here is the full set of parameters for users interested in exploring further.

contig -g inputGraph [-R] [-M mergeLevel -D EdgeCovCutoff]

-g inputGraph: prefix of input graph file names

-R (optional) resolve repeats using information generated in pregraph step, works only if -R is set in pregraph step too, [NO]

-M mergeLevel(min 0, max 3): the strength of merging similar sequences during contiging, [1]

-D EdgeCovCutoff: edges shorter than (2*K+1) with coverage no largert than EdgeCovCutoff will be deleted, [1]

打赏
网站首页 | 关于我们 | 联系方式 | 使用协议 | 版权隐私 | 网站地图  |  排名推广  |  广告服务  |  积分换礼  |  网站留言  |  RSS订阅  |  违规举报
 
免责声明:本站有部分内容来自互联网,如无意中侵犯了某个媒体 、公司 、企业或个人等的知识产权,请来电或致函告之,本网站将在规定时间内给予删除等相关处理。