This project provides a script to download all beta-lactamase protein sequences from the Beta-Lactamase Database (BLDB) and create a local BLAST database. The downloaded sequences can be used for various bioinformatics analyses related to antibiotic resistance.
- Linux or macOS (Windows users can use WSL)
Bash
shellPython3
withrequests
andtqdm
libraries installedBLAST+
installed on your system
- To install the required Python libraries, run:
python3 -m pip install requests tqdm
- Installing BLAST+ You can download and install latest BLAST+ from the NCBI BLAST website.
wget -c https://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/ncbi-blast-2.16.0+-x64-linux.tar.gz
wget -c https://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/ncbi-blast-2.16.0+-x64-linux.tar.gz.md5
md5sum -c ncbi-blast-2.16.0+-x64-linux.tar.gz.md5
tar -xvzf ncbi-blast-2.16.0+-x64-linux.tar.gz
Set the path of the bin
folder from ncbi-blast-2.16.0+
in your .bashrc
file.
The main script performs the following steps:
- Create a local directory to store downloaded data.
- Fetch HTTP links for beta-lactamase protein sequences from the BLDB webpage.
- Download the protein sequences using a Python script.
- Combine all downloaded FASTA files into one file.
- Create a local BLAST database from the combined sequences.
Clone this repository (if applicable) or create your own directory:
git clone <repository-url>
cd <repository-name>
Run the Bash script:
./create_local_bldb.sh