Welcome, Guest. Please Login or Register
UGENE Bulletin Board
  Welcome to our forum.
  HomeHelpSearchLoginRegister  
 
 
Page Index Toggle Pages: 1
Support for AWK script (Read 11077 times)
Apr 9th, 2012 at 10:30am

kenkl   Offline
Junior Member

Posts: 73
**
 
Hi there,

I'm trying to execute an AWK script to help me to trim sequences, so I created a custom element for that purposes with below execution string:

zapping.awk $infile > $outfile

It works when I execute the script in the command line but when I used it in the UGENE it "hangs" when it status was changed to "Running".

Kindly advice, thanks!

Kenneth
 

zapping.awk (0 KB | )
hang.png (178 KB | )
hang.png
IP Logged
 
Reply #1 - Apr 11th, 2012 at 2:52pm

German   Offline
Full Member

Posts: 118
***
 
Kenneth, how to use your script? What input parameters does it have? What is the format of used input/output files?
 
IP Logged
 
Reply #2 - Apr 12th, 2012 at 7:34am

kenkl   Offline
Junior Member

Posts: 73
**
 
Hi German, the execution sting would be:

zapping.awk $input > $output

where,
$input = FASTA file from LUCY
$output = FASTA sequence file

That's the way! Smiley

Kenneth
 
IP Logged
 
Reply #3 - Apr 16th, 2012 at 3:36pm

German   Offline
Full Member

Posts: 118
***
 
Kenneth, I have tried your script and it works very strange.
When I run "./zapping.awk human_T1.fa > out.fa" from command line then the content of out.fa is:
">human_T1
"
Is it correct?
 
IP Logged
 
Reply #4 - Apr 16th, 2012 at 3:46pm

kenkl   Offline
Junior Member

Posts: 73
**
 
Yes German, it is correct. I had troubleshoot this problem and it seems that zapping.awk can only process output files from LUCY.

If only a sequence is dump for zapping.awk, it will eliminate all sequences. That I think are something to do with some LUCY labelling I guess. But the problem now I'm facing also is that the LUCY element could not work.

It kept saying that it can't launch LUCY.

Kindly advice.

Kenneth
 

IP Logged
 
Reply #5 - Apr 16th, 2012 at 4:01pm

German   Offline
Full Member

Posts: 118
***
 
kenkl wrote on Apr 16th, 2012 at 3:46pm:
Yes German, it is correct. I had troubleshoot this problem and it seems that zapping.awk can only process output files from LUCY.

If only a sequence is dump for zapping.awk, it will eliminate all sequences. That I think are something to do with some LUCY labelling I guess. But the problem now I'm facing also is that the LUCY element could not work.

It kept saying that it can't launch LUCY.

Kindly advice.

Kenneth


I guess that the execution string of the lucy element is not correct. Could you show it please?

Also, I should tell you about the zapping element. Why do you use some output port? "Zapping" doesn't produce any data that can be used in UGENE. I think the scheme should look like:

[some workers...] -> [lucy] -> [Write FASTA]
                                    |
                                    v
                            [zapping]

Am I right?
 
IP Logged
 
Reply #6 - Apr 16th, 2012 at 4:07pm

kenkl   Offline
Junior Member

Posts: 73
**
 
Hi German, I had used the exact execution string in the commandline and it works well. But for your reference, my execution string looks like this:

lucy -v $vec $splice $in_fa $in_qul -o $out_fa $out_qul -debug $debug

Besides, actually after zapping it will produce a trimmed FASTA sequence, does UGENE support that? The reason being the trimmed sequenced from zapping are the best quality sequence for further downstream analysis such as BLAST and many more other workers.

Kenneth
 
IP Logged
 
Reply #7 - Apr 17th, 2012 at 10:20am

kenkl   Offline
Junior Member

Posts: 73
**
 
Hi German,

I realizes that the zapping.awk does not recognizes Fasta sequence from LUCY in UGENE as when I obtain the LUCY resulting sequence from UGENE workflow and uses zapping.awk on the resulting FASTA file in the command line, the output file that I obtained looks like this:

>ITS_GA00108.1.a1.ab1

Can you look into it?

Thanks!

Kenneth
 
IP Logged
 
Page Index Toggle Pages: 1