IDBac text file analysis

Question

IDBac text file analysis

Closed this issue 5 years ago · 8 comments

Hello,

we tried processing text files containing two columns of masses and intensities (separated by space/tabs, no header), by using the option "Click here to convert txt files".

IDBac detects the files (fig. 2.png) in the target directory and appears to process them successfully (fig. 3.png).

However, when trying to retrieve the analyzed data, only one sample appears in the results table (fig. 4.png, only 1 sample appears in the table ("LMA_Varia_") instead of the 6 supposed to be analyzed).

The procedure has been tested on a Windows machine via the installation with "IDBac.Setup.1.1.10.exe", as well as with the R installation on a Linux machine (ubuntu 16.04), both ways leading to an identical outcome.

We suspect this issues is related to the use of txt files instead of mzXM/mzXML files.

Please let us know if this is an issue of IDBac or else.

Thanks.

Cheers,
Antoine

Answer 1 · 2019-07-09T14:15:15.000Z

Thanks for the thorough details, I'm trying to get an update out be the end of the week and will try to include a fix for this (might be related to #128).

If possible, can you zip the files and attach drag and drop them into a reply here? Or can contact and send them privately through slack: https://join.slack.com/t/idbacapp/shared_invite/enQtNjkwNzYzNzA0MjE1LWI3Y2E3NGNlMjAzZDU4MTc3YjY4YmI3YzUwNWZjOGZjYzhlMmNkZDE3ZDY4Y2RlYTJhOWVlYjM4YzEwZTBjYzg

Answer 2 · 2019-07-09T14:20:44.000Z

Hi Chase, thanks for the reply. Here are the test data we used. Let me know if you need something else. Cheers, Antoine

…

On Tue, Jul 9, 2019 at 4:15 PM Chase Clark ***@***.***> wrote: Thanks for the thorough details, I'm trying to get an update out be the end of the week and will try to include a fix for this (might be related to #128 <#128>). If possible, can you zip the files and attach drag and drop them into a reply here? Or can contact and send them privately through slack: https://join.slack.com/t/idbacapp/shared_invite/enQtNjkwNzYzNzA0MjE1LWI3Y2E3NGNlMjAzZDU4MTc3YjY4YmI3YzUwNWZjOGZjYzhlMmNkZDE3ZDY4Y2RlYTJhOWVlYjM4YzEwZTBjYzg — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#127?email_source=notifications&email_token=ACW4LDTSM4OLGQTILRGZMVDP6SMPJA5CNFSM4H6JUES2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODZQM2BY#issuecomment-509660423>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ACW4LDVURXDOCKKX2LLGQ6LP6SMPJANCNFSM4H6JUESQ> .

--

_______________________ / web: CCBC <https://lnu.se/personal/antoine.buetti/> & USI <https://www.ics.usi.ch/index.php/people-detail-page/253-antoine-buetti-dinh#education> \ \ Skype Name: twincacca / ----------------------- \ \ __ UooU\.'@@@@@@`. \__/(@@@@@@@@@@) (@@@@@@@@) `YY~~~~YY' || || --------------------------------

Answer 3 · 2019-07-09T14:29:04.000Z

Thanks! Looks like you replied via email and GitHub didn't accept the attachment. Can you place it in a reply from GitHub.com? Link: #127

Answer 4 · 2019-07-09T14:31:07.000Z

sure, here it is
test_antoine_20190709.zip

Answer 5 · 2019-07-12T16:43:26.000Z

I don't foresee making the fix this week as things got busy. However it looks like your data is centroid and IDBac requires profile (full-spectrum) data.

As a temporary solution if you are semi-comfortable with R and have profile data in .txt, .csv, or .tab you can:

Point inputDirectory to where your files are
Point ouputDirectory to where mzml files should be created

inputDirectory <- "F:/Chase/IDBac_Dev/example_data/tab/p"
ouputDirectory <- "F:/Chase/IDBac_Dev/abc"

Install MALDIquantForeign if you haven't

install.packages("MALDIquantForeign")

Then run:

source("https://gist.githubusercontent.com/chasemc/9391c733b8412c12ba0bc616a4edc275/raw/d0fbbf9de9202ec3182d135cb558a39154c3629a/convert_text_to_mzml.R")

converter(inputDirectory,
          ouputDirectory)

The resulting mzML files can then be used for IDBac

Answer 6 · 2019-07-15T11:26:10.000Z

Thanks for the suggestion, I could easily convert my data files from the format "txt" to "mzML" by following your instructions.

However, when I try to run IDBac using the mzML files, the same thing happens as when I previously run it with txt files. IDBac analyses only the first of the samples listed in the data containing folder.

Here I attach the mzXM data corresponding to the txt data I attached in my previous message. If you can have a look once you have time, it would be great.

Thanks a lot.

Cheers
Antoine
test_antoine_20190713.zip

Answer 7 · 2019-07-15T16:44:46.000Z

It looks like the data is still centroided (a list of peaks and intensities) which isn't currently supported. Do have the full-spectrum data for these samples?

Answer 8 · 2019-07-15T16:48:50.000Z

OK, I guess it's clear now. So, we need to work with full-spectrum data then. Yes, no problem. Thank you.