alganzory/HaramBlur

feature: Community knowledge base

Closed this issue · 5 comments

suggestion: why should each client process every asset? we can gather knowledge to check for quick verification, and when a client sees a new image his device does the work and sends the result.
It can also be a DNS server. Or a distributed ledger ( blockchain ) as a source of truth to avoid centralization?
with this strategy, we can cluster the load to be able to classify content with high-accuracy models
What do you think?

Salam!

I'm not the creator of this extension, but I am one of its users that is following its development from a detection/AI standpoint. So my understanding does not represent the views of the developer @alganzory

The sheer number of images on the internet is on the order of trillions. There is no way that such a large amount of image content can be stored to cache the detection results. New images would appear every day, people have different device screen sizes, there are thousands of frames in video, etc. There are many factors contributing to the huge number of images. This is not generally how machine-learning based image recognition works.

Furthermore, running a server that can store the detection results from each client so that it can be re-used later in other clients (or the same client), would also require additional resources (that would need to be payed for), which is outside of the extensions running on each client browser.

Salam!

I'm not the creator of this extension, but I am one of its users that is following its development from a detection/AI standpoint. So my understanding does not represent the views of the developer @alganzory

The sheer number of images on the internet is on the order of trillions. There is no way that such a large amount of image content can be stored to cache the detection results. New images would appear every day, people have different device screen sizes, there are thousands of frames in video, etc. There are many factors contributing to the huge number of images. This is not generally how machine-learning based image recognition works.

Furthermore, running a server that can store the detection results from each client so that it can be re-used later in other clients (or the same client), would also require additional resources (that would need to be payed for), which is outside of the extensions running on each client browser.

@man2machine said it better than I would have
That being said, this made me think of enabling some sort of reporting for individual images that didn't get successfully detected, this bank of unsuccessful detections could then be used to train/retrain the models to be more accurate? For example, images of Sheiks or people wearing head covers usually get misclassified as women, if at any stage we train or retrain our own models I would surely want these to go into the training as I doubt that any of the famous models out there use these images for gender classification training or nsfw detection

well i just wanted to share my thoughts. you guys knows better of course

@marwenbk thanks for your suggestion, please keep sharing your thoughts and support <3