mhahsler/rBLAST

Blast search for multiple fastq files at a time

Closed this issue · 3 comments

Hi,

I have hundreds of fastq files and each file contains multiple sequences. Most of the sequences are DNA extracted from vertebrate bloods. With rBlast, I wonder how I can do a blast search for all these files at once and export blast results including percentage identity, coverage, species ID? Is a database of vertebrate sequences needed as the reference for query?

Thank you,

Gabriel

Hi,

you will need a database. You can write a simple loop in R that loads a sequence at a time and blasts it. Depending on how good your R knowledge is, you might want to ask someone for help with writing the code.

Best,
Michael

Hi Michael,

Thank you for getting back to me promptly. I wonder if you know any database that contains mammals and birds that I can download and use as my reference for local BLAST retrieval. I am not that good at R, can you provide some R-script examples of the simple loop that automatically load and blast a sequence?

Best,
Gabriel

I only worked with bacteria. I do not have R-script examples. You may want to hire someone. It should be straight forward.

Regsrds,
-Michael