Hongyang449/Anchor

Files missing under data and missing documentation

Thank you for providing the code and software.
I was hoping you could host the datasets under data on Zenodo or somewhere else publicly accessible. Currently none of those files are available, so it is not possible to work out the file formats Anchor expects.

I was also wondering whether you could expand the documentation on the input sequence motifs and genome sequence.
The link just says e.g. Hocomoco v11, but there are many different files to download.

Thank you for your help.
I'm looking forward to using this amazing tool.

Hi,

Thank you for your questions.

The data used in the DREAM challenge are publicly accessible. You can find the "ChIPseq" and "DNASE" folders here:
https://www.synapse.org/#!Synapse:syn6112317

For example, you can download the DNase-seq BAM files from here:
https://www.synapse.org/#!Synapse:syn6176232
and the CTCF ChIP-seq data from here:
https://www.synapse.org/#!Synapse:syn7415240

For the motifs, we used the PWM files. For example, you can search for "CTCF" on the HOCOMOCO website and download the PWM from this link:
http://hocomoco11.autosome.ru/final_bundle/hocomoco11/full/HUMAN/mono/pwm/CTCF_HUMAN.H11MO.0.A.pwm
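
As a rough illustration of the file format (this is not code from Anchor), a PWM file like this could be read in Python as sketched below. The sketch assumes the file has one header line starting with ">" followed by one row per motif position, each holding four whitespace-separated numbers in A, C, G, T order. The function name `parse_pwm` and the toy values are hypothetical; please check the downloaded file against these assumptions.

```python
from typing import List, Tuple


def parse_pwm(text: str) -> Tuple[str, List[List[float]]]:
    """Parse the text of a single-motif PWM file.

    Assumed layout (verify against the real file):
      - one header line starting with '>' that holds the motif name
      - one row per motif position with four numbers,
        assumed to be in A, C, G, T column order
    """
    name = ""
    rows: List[List[float]] = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            name = line[1:].strip()
            continue
        values = [float(x) for x in line.split()]
        if len(values) != 4:
            raise ValueError(
                f"expected 4 columns, got {len(values)}: {line!r}"
            )
        rows.append(values)
    return name, rows


if __name__ == "__main__":
    # Toy input with made-up numbers, only to show the assumed layout.
    toy = ">TOY_MOTIF\n0.1\t-0.2\t0.3\t0.0\n1.0\t2.0\t3.0\t4.0\n"
    motif_name, matrix = parse_pwm(toy)
    print(motif_name, len(matrix), "positions")
```

To read the downloaded file, pass its contents to the function, for example `parse_pwm(open("CTCF_HUMAN.H11MO.0.A.pwm").read())`.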

The hg19 genome can be downloaded from here:
https://www.synapse.org/#!Synapse:syn6184309

Best,
Hongyang