Is the image input during the training process the original image and the binary image?
lisongyu111 opened this issue · 17 comments
Yes, you got that right.
The `img` and `target` parameters to the `full_forward` method used during training are exactly the original image (`img`) and the binary target segmentation (`target`).
Cheers, Konrad
Thanks for answering! Where are the original pictures and binary pictures in the code?
You can download the INRIA data here: https://project.inria.fr/aerialimagelabeling/
Here's the code that reads in the dataset: https://github.com/khdlr/HED-UNet/blob/master/deep_learning/utils/data.py
Thanks for the reply! Which images go into the `AerialImageDataset` and `scenes` folders referenced in the code?
The `AerialImageDataset` folder is the only one being used. `scenes` is not used anymore; I will update the code and delete the `get_batch` function. Thanks for catching my error!
Can you show me how you created the data file? I want to train my own data set.
The dataset was not created by me; I just wrote the code that loads the data at training time.
To train on your own data, you need to implement a `torch.utils.data.Dataset` that returns pairs of images and ground-truth annotations. Then you can change the `get_dataset` function in `data_loading.py` to use your custom dataset instead of the pre-configured `InriaDataset`. Let me know if that helps!
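For concreteness, here is a minimal sketch of that swap (illustrative only: the real `get_dataset` in `data_loading.py` may have a different signature, and `MyCustomDataset` stands in for whatever Dataset class you write):

```python
# data_loading.py -- sketch only, the actual function signature may differ
def get_dataset(*args, **kwargs):
    # Previously this returned the pre-configured InriaDataset;
    # return your own Dataset implementation instead.
    return MyCustomDataset(*args, **kwargs)
```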
I have 256*256 original images and binary images of aerial cities here, but I don’t know how to train this net
Okay, so you'll subclass `torch.utils.data.Dataset` like this:
```python
import torch

class MyCustomDataset(torch.utils.data.Dataset):
    def __init__(self):
        # Do whatever initialization you need:
        self.images = <list_of_image_paths>
        self.targets = <list_of_binary_annotations>

    def __getitem__(self, index):
        # "load_image" can be imageio.imread for example
        image = load_image(self.images[index])
        target = load_image(self.targets[index])
        return image, target

    def __len__(self):
        return len(self.images)
```
Then you can simply change `data_loading.py` to use `MyCustomDataset` instead of `InriaDataset`.
Hope that helps.
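As a quick sanity check (just a sketch, assuming your `__getitem__` returns arrays or tensors that PyTorch's default collation can stack), you can wrap the custom dataset in a `DataLoader` and look at one batch before starting a full training run:

```python
from torch.utils.data import DataLoader

# Inspect a single batch; the batch size here is arbitrary.
loader = DataLoader(MyCustomDataset(), batch_size=4, shuffle=True)
images, targets = next(iter(loader))
print(images.shape, targets.shape)  # exact shapes depend on how load_image returns the data
```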
Thank you
That is odd. I used fairly large pictures for training (768x768), but 256x256 should work.
Can you confirm that your masks are valid? What is the output if you add `print(torch.unique(target))` to the `full_forward` function in `train.py`?
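For reference, assuming `target` is a tensor inside `full_forward`, a clean binary mask should produce output like this (illustrative values):

```python
print(torch.unique(target))
# Expected for a valid binary mask:
#   tensor([0., 1.])
# If you see 0 and 255 instead, the masks still need to be rescaled.
```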
Not entirely sure what you mean by original / two-difference diagrams.
Another issue might be data scaling: what is the output when you add `print('img', torch.min(img), torch.mean(img), torch.max(img))` in `full_forward`?
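If that print shows a maximum around 255, the images are probably still in the raw 8-bit range. A simple fix (an assumption about your loading code, not the repository's preprocessing) is to rescale them in your `__getitem__`:

```python
# In MyCustomDataset.__getitem__, after loading the image as a NumPy array:
image = torch.from_numpy(image).float() / 255.0  # scale pixel values to [0, 1]
```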
As input, you have given the images and their segmented masks. I want to know what ground truth you considered while modeling the edge detection part. It seems a Sobel kernel has been used for edge calculation (yet the paper states that your results are compared against Sobel). Please clarify the edge detection part. Thank you.
At training time, we compute the edge GT by applying Sobel to the segmentation GT for cases where no edge GT is available. As these segmentation masks are valued in {0,1}, we can recover perfect edge masks.
The "Sobel" comparison in the paper is referring to applying Sobel directly to the imagery at test time, which will generate much worse edge masks.