filecoin-project/filecoin-plus-large-datasets

[DataCap Application] -International Neuroimaging Data-Sharing Initiative (INDI)

Opened this issue · 8 comments

Data Owner Name

Child Mind Institute

What is your role related to the dataset

Data Preparer

Data Owner Country/Region

Hong Kong

Data Owner Industry

Life Science / Healthcare

Website

http://fcon_1000.projects.nitrc.org/

Social Media

http://fcon_1000.projects.nitrc.org/

Total amount of DataCap being requested

12PiB

Expected size of single dataset (one copy)

1.8P

Number of replicas to store

7

Weekly allocation of DataCap requested

800TiB

On-chain address for first allocation

f1gqg7bvgrz7hln7gya3se7sprdjgkgde5a3rgslq

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

We are a data processor with hundreds of storage machines and a 5G bandwidth computer room in Hong Kong that can process large amounts of data.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

This bucket contains multiple neuroimaging datasets that are part of the International Neuroimaging Data-Sharing Initiative. Raw human and non-human primate neuroimaging data include 1) Structural MRI; 2) Functional MRI; 3) Diffusion Tensor Imaging; 4) Electroencephalogram (EEG) In addition to the raw data, preprocessed data is also included for some datasets. A complete list of the available datasets can be seen in the documentation lonk provided below.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

None

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

No response

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

aws s3 ls --no-sign-request s3://fcp-indi/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives

How do you plan to choose storage providers

Slack, Filmine, Big Data Exchange, Partners

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

No response

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

If you answered "Other" in the previous question, enter the details here
No response

If you are a data preparer. What is your location (Country/Region)
None

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?
No response

If you are not preparing the data, who will prepare the data? (Provide name and business)
No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.
No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below
No response

How do you plan to make deals to your storage providers
No response

If you answered "Others/custom tool" in the previous question, enter the details here
No response

Please answer the above questions.

If you are a data preparer. What is your location (Country/Region)
Hong Kong, China.

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?
The data we use is downloaded from the aws cloud, packaged to a suitable size using tar, and converted to a car file using boostx.

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.
no,not yet

If you already have a list of storage providers to work with, fill out their names and provider IDs below
f02096851 jerry 1012562273@qq.com EdgeIPC Indonesia no Anthony
f02828509 jkkts12 John@hs88.com hs88 Korea No jkkts12
f02812781 TVfinance Alma@TVfinance.com TVfinance Vietnam No Alma
f02843151 cheng lingsucheng@yeah.net FBC-Capital Turkey
f02886019 qbit qbit@mercurityfintech.com MFH Canada
We will update the storage provider in the future

How do you plan to make deals to your storage providers
use boost client

Hello, per the filecoin-project/notary-governance#922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be allowed to move forward for additional notary review.

SP List provided:
[{"providerID":"f02096851","City":"XYZ","Country":"Indonesia","SPOrg","EdgeIPC"},
{"providerID":"f02828509","City":"XYZ","Country":"Korea","SPOrg","hs88"},
{"providerID":"f02812781","City":"XYZ","Country":"Vietnam","SPOrg","TVfinance"},
{"providerID":"f02843151","City":"XYZ","Country":"Turkey","SPOrg","FBC-Capital"},
{"providerID":"f02886019","City":"XYZ","Country":"Canada","SPOrg","MFH"},]

Datacap Request Trigger

Total DataCap requested

12PiB

Expected weekly DataCap usage rate

800TiB

Client address

f1gqg7bvgrz7hln7gya3se7sprdjgkgde5a3rgslq

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1gqg7bvgrz7hln7gya3se7sprdjgkgde5a3rgslq

DataCap allocation requested

400TiB

Id

00166635-129c-4baf-9e66-66052a517236