The tamilmvScrapper is a Python-based tool designed to scrape HD movie links from "https://www.1tamilmv.nexus". It uses requests and BeautifulSoup to navigate through web pages and extract relevant movie data, with a focus on efficiency and on limiting site load through controlled request intervals.
- Targeted Scraping: Users can specify the name of the movie to fetch relevant HD links efficiently.
- Pagination Handling: Supports scraping across multiple pages to gather comprehensive movie link data.
- Respectful Scraping: Implements a delay between requests to minimize the impact on the host server.
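The pagination and request-delay features above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `iter_pages`, the `?page=N` URL pattern, and the two-second default delay are all assumptions made for the example. The fetch function is passed in so the sketch runs without any network access.

```python
import time
from typing import Callable, Iterator


def iter_pages(
    fetch: Callable[[str], str],
    base_url: str,
    pages: int,
    delay_seconds: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[str]:
    """Yield the HTML of pages 1..pages, pausing between consecutive requests."""
    for page in range(1, pages + 1):
        if page > 1:
            # Wait before every request except the first to limit server load.
            sleep(delay_seconds)
        # Hypothetical URL pattern; the real site may paginate differently.
        yield fetch(f"{base_url}?page={page}")
```

With requests, the `fetch` argument could be something like `lambda url: session.get(url, timeout=10).text`, where `session` is a `requests.Session`.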
- Language: Python
- Libraries: requests, BeautifulSoup (for HTML parsing), tqdm (for progress bars)
Ensure you have Python and pip installed on your system to set up and run this scraper.
- Clone the repository:
  git clone https://github.com/afsalahamed07/tamilmvScrapper.git
  cd tamilmvScrapper
- Install the necessary Python packages:
  pip install requests beautifulsoup4 tqdm
To start scraping movie links, run the script and follow the prompts to enter the movie name and the number of pages you wish to scrape.
python scrapper.py
You will be asked to:
- Enter the name of the movie.
- Specify how many pages you want to scrape.
The script will then display the movie titles along with their respective links in the console.
- Session Initialization: A requests.Session is used to persist certain parameters across requests.
- Fetching Content: Web pages are fetched using the specified user-agent headers to emulate a browser visit.
- Parsing HTML: Extracts relevant movie data from the structured markup using BeautifulSoup.
- Filtering Data: Filters movies by name to ensure relevance to user input.
- Output: Provides a simple, clear output of movie titles and their links in the console.
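The parsing and filtering steps above could look roughly like the sketch below. To stay dependency-free it uses the standard library's `html.parser` in place of BeautifulSoup, so it is not the project's implementation. The names `LinkCollector` and `find_movie_links` are hypothetical, and it assumes each movie appears as an `<a href="...">` element whose text is the title, which may not match the real page markup.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (title, href) pairs from every <a href="..."> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the anchor currently being read, if any
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace so titles print on one clean line.
            title = " ".join("".join(self._text).split())
            if title:
                self.links.append((title, self._href))
            self._href = None


def find_movie_links(html, movie_name):
    """Return (title, href) pairs whose title contains movie_name, ignoring case."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    needle = movie_name.casefold()
    return [(t, h) for t, h in collector.links if needle in t.casefold()]
```

Printing each returned pair gives the title-and-link console output described above; with BeautifulSoup the collection step would typically shrink to a call such as `soup.find_all("a", href=True)`.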
This project is currently maintained in a public repository, accessible for educational and research purposes. It is regularly updated to adapt to changes in the source website's structure and policies.
The tamilmvScrapper is intended for educational and research purposes only. Users should ensure they comply with legal stipulations regarding data scraping and usage in their respective jurisdictions.