UGENE Forum
https://forum.ugene.net/forum/YaBB.pl
General Category >> Help and How-to >> How to view large data?
https://forum.ugene.net/forum/YaBB.pl?num=1473213232

Message started by elginong on Sep 7th, 2016 at 8:53am

Title: How to view large data?
Post by elginong on Sep 7th, 2016 at 8:53am
Hi,

I tried to download my fastq file but failed. Is there any software out there that can view very large data?

Elgin
Screenshot__41_.png (120 KB | 386 )

Title: Re: How to view large data?
Post by Olga Golosova on Sep 12th, 2016 at 4:03pm
A FASTQ file, received from a sequencing facility, contains separate short reads - nucleotide sequences, quality information about each base in a read, etc.

Commonly, you don't read a FASTQ file in detail, because there are a lot of reads inside (several thousands or even more).

However, you may overview the general statistics about the data. In UGENE you can do it using a popular tool FastQC, available from "Tools > NGS data analysis > Reads quality control" item in the main menu.

The next step is to assemble the small reads into a single long genome. There are two strategies to do that:

1) If you have an appropriate reference sequence, you can map all reads to it. There are different tools in UGENE to do the mapping (BWA, Bowtie2, etc.). There is also a complex workflow "Raw DNA-Seq data processing" that allows you to do additional pre- and post-processing steps, required to improve the data quality and make the mapping more efficient.
The assembled reads can be viewed in the UGENE Assembly Browser.

2) The alternative way is to assemble reads de novo. This task is more complex and for big data like human genome it requires a lot of resources. In UGENE for small genomes (like bacterial or viral) you may use SPAdes tool to do the task.

UGENE Forum » Powered by YaBB 2.5 AE!
YaBB Forum Software © 2000-2010. All Rights Reserved.