dragonflyoss/Dragonfly2

Hugging Face accelerates distribution of models and datasets based on Dragonfly

gaius-qi opened this issue · 3 comments

Distribute Hugging Face's LFS download file request through Dragonfly P2P, refer to https://d7y.io/docs/next/setup/integration/hugging-face.

During the downloading of datasets or models, the file size is large and there are many services downloading the files at the same time. The bandwidth of the storage will reach the limit and the download will be slow.

image

Dragonfly can be used to eliminate the bandwidth limit of the storage through P2P technology, thereby accelerating file downloading.

image

Related comment:

huggingface/huggingface_hub#1780 (comment)
huggingface/huggingface_hub#1780 (comment)
huggingface/huggingface_hub#1780 (comment)

TODO List:

@Wauplin Can you help us push this article to Hugging Face’s Blog or Hugging Face's Twitter or Hugging Face's Wechat public account? Or provide users with an acceleration reference plan in the Hugging Face website?

Hey @gaius-qi what I would suggest you is to write a community blog article. You can find all the instructions on this page: https://huggingface.co/blog-explorers. Once done, your article will be showcased in the Community posts section on https://huggingface.co/blog. Please let me know if you have any questions or if you want a quick review of the article. I think the content from this comment of you would be especially beneficial to explain when this solution is useful (and when it's not).

Hey @gaius-qi what I would suggest you is to write a community blog article. You can find all the instructions on this page: https://huggingface.co/blog-explorers. Once done, your article will be showcased in the Community posts section on https://huggingface.co/blog. Please let me know if you have any questions or if you want a quick review of the article. I think the content from this comment of you would be especially beneficial to explain when this solution is useful (and when it's not).

@Wauplin Thanks! 😊😊😊

I will do this as soon as possible so that more people can understand this efficient data distribution solution of Hugging Face & Dragonfly. 🚀🚀🚀