ai-robots-txt/ai.robots.txt

ImageSift AI or not?

Closed this issue · 2 comments

glyn commented

Reading ImageSift's about page, this seems to be an image search site rather than AI. If so, then ImageSift is out of scope for this project.

/cc @jsheard

PS. Apologies for not raising this until now. I've been on holiday for a few days.

Imagesift is an image search site, but their parent company offers genAI products: https://thehive.ai/apis/image-generation

Hive doesn't give any information about their user agents or robots.txt rules, so I think it is safe to assume they are using the Imagesift crawler as a front to gather data for AI training under the pretence of an image search engine.

glyn commented

Ok, thanks @jsheard. Let's assume the crawler is used for AI purposes until such time as the about page explicitly denies that.