tl;dr:
Click any launch binder
badge on this page to get a session all set to step through running a Jupyter notebook demonstrating the script inside your browser with no installation needed. Or click here.
This fork of the official Fasta2Structure repository was set up to take advantage of the MyBinder system to offer a resource to run Fasta2Structure without the user needing to download, install, or maintain any software. Plus, no need to be running it in a typical desktop situation, and therefore the Fasta2Structure script is more convenient to use anywhere at scale. See here for documentation about Binder as deployed at MyBinder.org.
Please see the official Fasta2Structure repository for more information about Fasta2Structure.
Remember, if you are using Fasta2Structure, please cite:
Fasta2Structure: a user-friendly tool for converting multiple aligned FASTA files to STRUCTURE format. Bessa-Silva A. BMC Bioinformatics. 2024 Feb 15;25(1):73. doi: 10.1186/s12859-024-05697-7. PMID: 38365590
improved_Fasta2Structure: A User-Friendly Tool for Converting Multiple Aligned FASTA Files to STRUCTURE Format, that is even more user-friendly because it doesn't need a user to select files in a GUI (Tkinter-based) and can thus run well anywhere, such as on a computer cluster or in Jupyter running remotely or in conjunction with software to make pipelines like Snakemake & NextFlow. For those reasons, the improved script is more user-friendly for those familiar with computation and allows scaling up.
Plus, combined with Jupyter served by the MyBinder service it will make it easier for anyone to try the software or use it without installing anything on their own computer.
To make it more convenient to use, I've modified the Fasta2Structure script to allow more ways to run it to produce improved_Fasta2Structure. It will run on the command line if you supply arguments specifying files as input or fallback to running on the command line if Tkinter cannot connect to a graphical display.
You can still try the original GUI-based (Tkinter) software presently available at https://github.com/AdamBessa/Fasta2Structure without installing anything on your computer. You can go here and find a step-by-step to use a remote virtual desktop to test the Fasta2Structure script. Only you'll find it isn't as convenient as what is provided here and by requiring user interaction in a GUI wouldn't scale well.
The improved script that I've provided here also retains the ability to run the GUI way as well, and therefore with the improved version you have much greater flexibility and it truly is much more convenient to a larger audience of users. (More here and here as to how scientists tend to overvalue GUIs and how a GUI framework can act as a barrier-to-entry for novices.)
Click any launch binder
badge on this page to get a session all set to step through running a Jupyter notebook demonstrating the script inside your browser with no installation needed. Or click here.
I have added a series of extensive tests to make sure the improved_Fasta2Structure gives the same result as running the GUI-based (Tkinter) Fasta2Structure.py
presently available at https://github.com/AdamBessa/Fasta2Structure. The result of these tests are passing if the badge immediately below here is green:
The STRUCTURE software has gained popularity as a tool for population structure and genetic analysis. However, tailoring data to meet STRUCTURE's specific requirements can be challenging and prone to errors, particularly when managing multilocus data. Here, I introduce a graphical user interface (GUI) application designed to simplify the process of converting multiple sequence alignments into a single, cohesive file that is compatible with the STRUCTURE software. The application has been developed using Tkinter for the GUI and Biopython for handling FASTA files. It processes the files, identifies variable sites, and converts the sequences into a binary format. Subsequently, the sequences are concatenated and displayed within the graphical interface's text area, enabling users to review and verify the results. Furthermore, the program saves the concatenated results in a file, thereby providing a ready-to-use input for the STRUCTURE software. This application presents an efficient and reliable solution for transforming multiple aligned FASTA files into a concatenated binary format file, which is compatible with the STRUCTURE software. With its user-friendly graphical interface and error-reduction strategy, this tool proves to be invaluable for researchers engaged in population structure and genetic analysis. This program is designed to convert FASTA files into a binary representation that is then used to identify variable sites. This binary representation is then saved in a structure (.str) file. Variable sites are defined as positions in the sequence where the nucleotides differ among the sequences in the alignment. These sites are identified and flagged, and the resulting data is saved in a specific format that can be used for further analysis. The program allows users to select multiple FASTA files at once and performs the conversion of all selected files in the background. During the conversion, progress is indicated via a progress bar and a text label.
A session will open with all the necessary packages to support running the imporved Fasta2Structure script. The documentation for the original script Fasta2Structure.py
details this more, see there.
I have added a notebook that will open when the MyBinder session starts up that will illustrate using the improved script.
You can still try the original GUI-based (Tkinter) software presently described & available at https://github.com/AdamBessa/Fasta2Structure without installing anything on your computer. You can go here and find a step-by-step to use a remote virtual desktop to test the Fasta2Structure script. Only you'll find it isn't as convenient as what is provided here.
Getting started:
Click any launch binder
badge on this page to get a session all set to step through running a Jupyter notebook demonstrating the script inside your browser with no installation needed. Or click here.