tducret/amazon-scraper-python

Is there a way to capture this info?

Opened this issue · 2 comments

Hi,

I have been searching for Amazon scrapers, and this project is the most effective and efficient one I have found, especially its ability to work with keyword search. Great work!

I forked it and tried to make some changes so that it works with Amazon in my region (amazon.com.au and amazon.co.jp), but unfortunately I do not have the skills to do so.
It would be great if there were a way to change parameters like this.

I also shared the same thought in the other issue thread regarding "Price", and I would also like to get "Seller" and "Stock level" data for a more thorough analysis.

Thanks so much again for this fascinating project.

Kenneth

Hi Kenneth,
Thanks for your warm words :)

I tested amazon2csv on amazon.com.au and amazon.co.jp with the --url option, and it works.
If I search for something in my browser, for example "Python", I get a result page with the url https://www.amazon.com.au/s/ref=nb_sb_noss_2/357-0469114-0585948?url=search-alias%3Daps&field-keywords=python.

In the terminal, I type:
amazon2csv -m 10 -u "https://www.amazon.com.au/s/ref=nb_sb_noss_2/357-0469114-0585948?url=search-alias%3Daps&field-keywords=python"

... and I get proper results :)

That may not be your use case. Perhaps you would like to search with keywords directly.
In that case, you should modify amazonscraper/client.py: wherever "amazon.com" appears, replace it with "amazon.com.au".

Could you test it?
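
To illustrate the idea of swapping the domain, here is a minimal sketch of how a keyword search URL could be built for any regional site. The helper name `build_search_url` and its parameters are hypothetical and are not taken from amazonscraper/client.py; the query parameters are simply copied from the search URL shown above, and the shortened `/s` path is an assumption.

```python
from urllib.parse import urlencode


def build_search_url(keywords, domain="amazon.com"):
    """Hypothetical helper: build a keyword search URL for a given Amazon domain."""
    # Query parameters modeled on the browser URL quoted in this thread.
    # urlencode escapes "=" as %3D and spaces as "+".
    query = urlencode({"url": "search-alias=aps", "field-keywords": keywords})
    # Assumption: the bare "/s" path accepts the same query string.
    return f"https://www.{domain}/s?{query}"


if __name__ == "__main__":
    print(build_search_url("python", domain="amazon.com.au"))
    print(build_search_url("python", domain="amazon.co.jp"))
```

With something like this, only the `domain` argument would have to change per region, instead of editing every occurrence of "amazon.com" by hand.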

Price and other features are more difficult to parse.
There are multiple prices displayed. Which one should we extract?

It does indeed work as you describe, @tducret, but the documentation is a bit unclear on whether it expects just https://www.amazon.com.au/ or the entire search URL. Requiring the entire search URL seems a bit inconsistent, since you don't have to provide one for amazon.com. I think changing url to domain_name (e.g. amazon.com.mx) would make it much clearer and more consistent.
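
As a rough sketch of what a domain_name style option could look like, the function below accepts either a bare domain or a full search URL and reduces both to the domain. The name `extract_domain` is hypothetical and nothing here is part of the project's current code; it only relies on the standard library.

```python
from urllib.parse import urlsplit


def extract_domain(value):
    """Hypothetical helper: reduce a domain or a full URL to e.g. 'amazon.com.au'."""
    # Add a scheme when it is missing so that urlsplit parses the host correctly.
    if "://" not in value:
        value = "https://" + value
    host = urlsplit(value).hostname or ""
    # Drop a leading "www." so both input forms give the same result.
    return host[4:] if host.startswith("www.") else host


if __name__ == "__main__":
    print(extract_domain("amazon.com.mx"))
    print(extract_domain("https://www.amazon.com.au/"))
```

This way users could pass whichever form they have at hand, and the tool would treat them consistently.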