How to get the .bed files relative to the matrix and how to get the picture similar to example you list

Question

How to get the .bed files relative to the matrix and how to get the picture similar to example you list

Dweson opened this issue 5 years ago · 5 comments

I don't how to find the .bed files, what's more, we try to visualize the data with scores as 2d annotation, but the image is ugly and the black spots are too much. thank u

Answer 1 · 2019-12-26T16:36:16.000Z

Hi, you can do that using command peakachu pool, which performs a local clustering algorithm on the original calls and prints single representatives (black dots in the example) for each cluster.

Answer 2 · 2019-12-26T17:58:22.000Z

Adding to Xiaotao's answer, the bed files should be found in the folder specified by the -O option. The folder will be created if it didn't exist already. If there are too many black spots after using pool, then try filtering with a higher threshold (i.e. .95 instead of .9)

Answer 3 · 2019-12-27T02:15:26.000Z

@tariks @XiaoTaoWang Thank you very much, I have solved the second question according to your answers. But what I mean the .bed file is the training input text file, I don't know how to find it. The file I find from GEO database of Tang et al and Mumbach et al is not same as the file in /example dictionary.

Answer 4 · 2019-12-27T18:30:21.000Z

That makes sense. Both example files were derived from excel sheets from each publication's supplemental files, and not from GEO.

Answer 5 · 2019-12-28T04:19:28.000Z

@tariks Thank U, I find the .bed files as you say, but when I score another cooler file from 4DN, I got an error.
for i in models/*pkl; do peakachu score_chromosome -p 4DNFIP3ELSZY.mcool::resolutions/10000 --balance -O scores -m $i; done
ValueError: row, column, and data array must all be the same length
So how can I process the cooler so it can be suitable for peakachu, thanks!