UGENE Forum | |
https://forum.ugene.net/forum/YaBB.pl
General Category >> Help and How-to >> SPAdes - Mac, no tool https://forum.ugene.net/forum/YaBB.pl?num=1422521083 Message started by Leiga on Jan 29th, 2015 at 3:44pm |
Title: SPAdes - Mac, no tool Post by Leiga on Jan 29th, 2015 at 3:44pm
Hi everyone.
I am a fresh user of your software and have problem with genome assembly using SPAdes. As is written in the topic I am using this application on MacBook Pro. Following the instructions I have downloaded external package (with SPAdes for 64-bit MAC), change in the settings location of that folder and tried to run genome assemby. At the end I have this kind of notification: Quote:
SPAdes folder is in folder, but on the list of Preferences/External tools in UGENE software I still do not have anything with "SPAdes" letters. Can you help me? Regards L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Jan 29th, 2015 at 7:17pm
Hi Leiga!
There was an issue in the release UGENE version that is already fixed. Could you please download and use the snapshot UGENE version to run SPAdes? The snapshot package can be downloaded here: http://ugene.unipro.ru/snapshot.html You will need to specify "SPAdes" and "python" external tools in the Application Settings. Sorry for this inconvenience and thanks for your question! |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Jan 30th, 2015 at 5:53am
Dear Olga,
I think I do some steps forward... changed version for mentioned by you, changed location of external package in preferences sheet. I tried to assemble genome: Quote:
I am not sure where is a mistake now (I choose SPAdes, Paired-ends, I left properties in default mode, add equal number of files for left and right reads, dataset - multi cell, running mode - error correction and assembly, k-mer size - auto. These are reads from unknown bacteriophage (dsDNA), not above 200k bp. Regards L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Jan 30th, 2015 at 5:53pm
Dear Leiga,
I was not able to reproduce the issue on our test data on the snapshot UGENE version (on Mac OS X 64-bit). I used, for example, the following sample data: https://dl.dropboxusercontent.com/u/59362392/SRR1617092_1.fastq https://dl.dropboxusercontent.com/u/59362392/SRR1617092_2.fastq Could you please download the data and try to run SPAdes with them? The run may take ~0.5 hour. Note that I used the following versions of the external tools: - python 2.7 - SPAdes 3.1.1 (bin/spades.py file was specified as the tool) Also, could you please send the YAML file (/Biofizyka/UGENE/Bakteriofagi/JD/R15011/datasets.yaml)? And, if it is possible, it would be great if you could send some test files on which the issue is reproduced. By the way, what was the sequencing platform? |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Jan 30th, 2015 at 7:16pm
I think it is just a bad day :)
I run this app using your data: Quote:
System? NextSeq 500 yaml file: Quote:
Hope, solution is easy to find, and it is just my fault. Regards L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Jan 30th, 2015 at 8:44pm
Yeap, this is definitely not the best day :)
Quote:
This "errors" are not actually errors. You can skip this UGENE status. It happens because the log lines are parsed incorrectly. Notice that each of the log lines you posted contains the "error" word. Has the pipeline with the sample data finished successfully anyway? Thanks for the YAML file. We will investigate this problem. Links to the issues in the bug-tracker: * SPAdes error: exception caught while parsing YAML file: https://local.ugene.unipro.ru/tracker/browse/UGENE-3967 * SPAdes log is parsed incorrectly: https://local.ugene.unipro.ru/tracker/browse/UGENE-3966 |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Jan 31st, 2015 at 12:54am
Yes, assembly reach the end of the process and finally I have several contigs from your data.
After that I tried to do the same with my data and have the same log as before but different yaml file: Quote:
Maybe there is a problem with file names? |
Title: Re: SPAdes - Mac, no tool Post by Yuriy Vaskin on Feb 3rd, 2015 at 7:05pm
Dear Leiga,
It looks like there is a bug with interlaced paired-read processing. It will be fixed in the next UGENE release thanks to your help. For now you could try using the paired-end mode with demultiplexed left and right reads. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Feb 3rd, 2015 at 7:34pm
Dear Leiga,
Alternatively, you may also use a new snapshot with the fixture! I will write you as soon as it is ready. I think, it will be tomorrow. Sorry again for these problems! Olga |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Feb 4th, 2015 at 10:17pm
Dear Leiga,
Could you please try the new snapshot (revision must be >= 9049)? Can be downloaded from: http://ugene.unipro.ru/snapshot.html Thank you very much! Kind regards, Olga |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Feb 6th, 2015 at 1:49am
Dear Olga,
below you can find log after new run of app (this new one): Quote:
As you can see it takes some time, and computer was very unstable (i did not see that before, during alignment with bowtie cpu was used in 90-98% for ugene app, but now it was strange, like sea waves, 2-90% for hammer app). Regards, L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Feb 6th, 2015 at 8:58pm
Dear Leiga,
We are not able to reproduce the error on our test data. It seems that now it is a problem in the SPAdes tool (http://bioinf.spbau.ru/spades), integrated into UGENE for assembly of sequencing data. It seems that the error occurs on particular data. As can be seen from the log, the error happened several hours later after you had launched the tool. Could you please try to launch SPAdes from the command line to make sure that this error is indeed inside the SPAdes algorithm? Below are instructions to do it: 1) Create a new folder. 2) Put into the folder: 2.1) the input files ("R15011_S11_L003_R1_001.fastq", etc.). 2.2) "/Biofizyka/UGENE/Bakteriofagi/JD/R15011/datasets.yaml" file. 3) Launch "Terminal" on Mac OS. 4) In the terminal change directory to the created folder. To do it input: Code:
5) Launch SPAdes from the terminal. To do it input: Code:
Kind regards, Olga |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Mar 4th, 2015 at 7:16pm
Log report from newest version of UGENE release and command line is the same....
UGENE 1.16.0 Quote:
and directly from the command line: Quote:
So... problem still exist, maybe it is time to change dataset to another one. Regards L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Mar 4th, 2015 at 7:41pm
Leiga, thanks for testing!
|
Title: Re: SPAdes - Mac, no tool Post by Leiga on Mar 4th, 2015 at 8:24pm
So, I will change fastq files which gives me nice contig in clc software, and will test them with UGENE/SPADES
I think I am not able to test it on windows, there is no tool for that. Am I correct? L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Mar 4th, 2015 at 9:25pm Quote:
Yes, there is currently no SPAdes for Windows. Quote:
Let me sum up it: [list bull-redball] |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Mar 5th, 2015 at 6:31pm
So,
I have change source fastq files (different phage) and after successful assembly using clc I tried to do the same with spades, log below: Quote:
Can you register to basespace (illumina) i think I will be able to share whole project with you, and you will get access to raw data from one of sequenced phages. Regards, L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Mar 10th, 2015 at 2:21pm
Hi, we've just registered to basespace. The account is ugene@unipro.ru.
|
Title: Re: SPAdes - Mac, no tool Post by Leiga on Mar 10th, 2015 at 9:25pm
Hey,
can you check it? I try to share project with you, can you try to download data from basespace? There is another option (transfer ownership - but it is not what I want to do :)) Regards L. |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Mar 20th, 2015 at 7:17pm
Hi Olga,
Were you able to download data and try to solve a problem? Regards L. |
Title: Re: SPAdes - Mac, no tool Post by Olga Golosova on Mar 24th, 2015 at 5:35pm
Hi Leiga,
Yes, we downloaded the data, thank you! We're currently trying to reproduce the issue. We launched SPAdes with the test data about 1.5 days ago and it still does the calculations. I will write you as soon as it is finished. Could you please confirm that all the FASTQ files are paired-end reads from the same experiment? |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Mar 27th, 2015 at 6:05pm
Yes,
I hope you were able to download proper data. What are the names of files which you have used? Regards, L. |
Title: Re: SPAdes - Mac, no tool Post by Ivan Protsyuk on Mar 31st, 2015 at 12:54pm
Hi Leiga,
I'm a member of the UGENE team, and I worked on your issue along with Olga. Indeed, we managed to download your data from the BaseSpace platform. The example of a file name is "ZK1-01_S10_L001_R1_001.fastq.gz". Other file names differ in a couple of numbers. As for SPAdes, we ran it via UGENE on several desktop computers with your data: two iMacs and one Linux machine. And it finished with errors, similar to what you posted previously, in two cases, and didn't finish at all on one of the iMacs during 4 days. Finally, we launched SPAdes on a server equipped with 2 CPUs with 6 cores each and 64 GB RAM, and it finished successfully in 18 hours. So, we inferred that a desktop computer is not powerful enough for running SPAdes with data like yours. If you want, we can share the SPAdes output results using BaseSpace and account name strapag@biol.uni.lodz.pl. |
Title: Re: SPAdes - Mac, no tool Post by Leiga on Apr 1st, 2015 at 1:59am
So,
do you think, it is because of: 1. somethink what is specific for nextseq's data (I mean "new" instrument); 2. Oversequencing... (as you probably see, this is really over sequenced phage genome:)) 3. PC specification only. I will try again to use that tool, now I have more optimsed codition for sequencing small genomes on taht instrument (0,8 - 0,5 Gb of data) Will tell you about results later... And, Yes, please send me your results using illumina account. Regards, L. |
Title: Re: SPAdes - Mac, no tool Post by Ivan Protsyuk on Apr 1st, 2015 at 7:04pm
I suppose that the issues with SPAdes are caused by the hardware configuration, since in the end, it succeded to align your data on a powerful computer. Probably, there would be no problem if the genome wasn't oversequenced at this extent.
Unfortunately, BaseSpace doesn't allow uploading arbitrary files, therefore I added the data produced by SPAdes to Google Drive. You can access it here: http://bit.ly/1C78Eft. Please, let me know if you have any questions regarding it. |
UGENE Forum » Powered by YaBB 2.5 AE! YaBB Forum Software © 2000-2010. All Rights Reserved. |