- Browse through different sites and pick on to scrape. Check the "Project Ideas" section for inspiration.
- Identify the information you'd like to scrape from the site. Decide the format of the output CSV file.
- Summarize your project idea and outline your strategy in a Juptyer notebook. Use the "New" button above.
- I'm going to scrape [https://pk.indeed.com/jobs?q=data%20science&l=Pakistan&vjk=a0c9f842daa93ed3]
- I'll get a list of Jobs. For each job, I'll get job title, job company, job location and job summary
- For each page, i'll get the 15 jobs in the topic from the job page
- For each job, job title, job company, job location and job summary
- For each job we'll create a CSV file in the following format:
- job_title, job_company, job_location and job_summary