Is it possible to export point clouds with semantic labels?
Lizhinwafu opened this issue · 15 comments
Is it possible to export point clouds with semantic labels or instance labels after instance segmentation?
As for exporting the GARField-based labels, this isn't currently supported.
Can you elaborate what you mean by "instance labels"? (ie do you mean cluster labels, or the grouping features?)
Either way, it's definitely possible. Currently, it should be possible to export the gaussians visible in the viewport if you check the "Export Options" box. This code borrows from the gaussian export function in the original nerfstudio's export_utils. The label data can be added to the pointcloud metadata.
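As a rough illustration of attaching label metadata (this is not the linked exporter code -- the `plyfile` usage and array names below are my own assumptions):

```python
# Rough sketch: write per-point labels as an extra .ply property using the
# `plyfile` package. The arrays here are placeholders for the exporter output.
import numpy as np
from plyfile import PlyData, PlyElement

points = np.random.rand(100, 3).astype(np.float32)           # xyz from the export
labels = np.random.randint(0, 5, size=100).astype(np.int32)  # per-point label

vertex = np.empty(len(points), dtype=[("x", "f4"), ("y", "f4"), ("z", "f4"),
                                      ("label", "i4")])
vertex["x"], vertex["y"], vertex["z"] = points[:, 0], points[:, 1], points[:, 2]
vertex["label"] = labels
PlyData([PlyElement.describe(vertex, "vertex")]).write("labeled_points.ply")
```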
The corresponding function is linked here:
Thank you for your reply.
I want to export the point cloud of each part after grouping (point clouds with different labels are exported separately).
It should be possible to combine this gaussian export code with `cluster_labels`, which should assign every gaussian a cluster ID, and then loop through each of the clusters and export them individually -- if these pointclouds need to be in their original colors, you should look at the states stored in `state_stack` (see `garfield/garfield/garfield_gaussian_pipeline.py`, lines 181 to 190 in a094a0e). A rough sketch of this loop is below.
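For reference, a minimal sketch of that per-cluster loop, assuming the labels and gaussian means/colors have already been pulled out as numpy arrays (variable and function names here are placeholders, not GARField's API):

```python
import numpy as np
import open3d as o3d

def export_clusters(means: np.ndarray, colors: np.ndarray, labels: np.ndarray):
    """Write one .ply file per cluster ID.

    means:  (N, 3) gaussian centers
    colors: (N, 3) RGB in [0, 1], e.g. restored from a state_stack snapshot
    labels: (N,) cluster ID per gaussian, e.g. from cluster_labels
    """
    for cid in np.unique(labels):
        mask = labels == cid
        pcd = o3d.geometry.PointCloud()
        pcd.points = o3d.utility.Vector3dVector(means[mask].astype(np.float64))
        pcd.colors = o3d.utility.Vector3dVector(colors[mask].astype(np.float64))
        o3d.io.write_point_cloud(f"cluster_{int(cid)}.ply", pcd)
```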
Unfortunately, I don't currently have the bandwidth to code this up in the near future, but I'd be happy to review any PRs or contributions!
This result seems comparable to the ones in the README -- I don't know what the scene RGB looks like, but it seems to be a bunch of flowers on a branch/twig. Given this, it seems like GARField successfully clusters the individual flowers/parts together, as well as the larger structures (table, wall, ...).
I'm not sure what you mean by "how I can adjust it to get as good results as the demo" -- is your question,
- Why aren't the flowers and the branches clustered together?
In this case, check if SAM-based masks are able to group the branches and the flowers together, at all. It's possible that the twigs are too thin to be grouped together. Since GARField distills 2D groups into 3D, if the masks fail to generate a desired group, it won't emerge in 3D.
Also, the scale seems small (0.0) here?
Also, SAM might return more masks with different parameters (e.g., increasing `crop_n_layers` or `points_per_side`) -- see the example after this list.
- Why is the ground / left-side of the wall muddled/patchy?
There probably isn't sufficient group supervision there, especially if only a few cameras face this part of the scene.
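For example, with the off-the-shelf `segment-anything` package (the checkpoint path, image path, and parameter values are just examples):

```python
import cv2
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

# Load SAM and run segment-everything with a denser point grid and extra crop
# levels, which can help recover thin structures like twigs.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
mask_generator = SamAutomaticMaskGenerator(
    sam,
    points_per_side=64,  # default is 32; more point queries per image
    crop_n_layers=1,     # default is 0; also run SAM on zoomed-in crops
)
image = cv2.cvtColor(cv2.imread("frame.jpg"), cv2.COLOR_BGR2RGB)
masks = mask_generator.generate(image)  # list of dicts with "segmentation", etc.
```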
I'm not sure if your method will work on my plants. I want to segment each branch of the plant.
It does seem to separate the individual branches a bit (at least based on the PCA visualization you provided), but I'm with you in that I'm also not sure how GARField would perform on these plants.
I do think SAM is the bottleneck here, after running SAM's segment-everything mode on your RGB image (the web version). Across multiple views, individual branches will probably be grouped independently at some point (like the bottommost branch in the attached image). However, there's no guarantee all the branches will be grouped independently like this. Segment-everything uses point queries, and the thin structures are adversarial for that.
It seems that GARField relies heavily on the segmentation results of SAM. Is it possible to use box prompts with GARField?
It's not possible. For a group to exist in 3D, it must be generated in 2D.
Also, to clarify, GARField's selection/clustering isn't generating groups using prompts -- it uses 2D masks to supervise the 3D grouping features, which then can be filtered/grouped using their affinities. The "clicking" demo is a simple thresholding of the affinity, not a SAM-like point prompt fed into a decoder.
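To give a sense of what that thresholding looks like, here's a conceptual sketch (the feature dimensions, the cosine-style affinity, and the threshold value are all illustrative, not GARField's actual code):

```python
# Conceptual sketch of the "click" selection: compare grouping features and
# threshold the affinity. No SAM-like decoder is involved at this stage.
import torch
import torch.nn.functional as F

feats = F.normalize(torch.randn(10000, 64), dim=-1)  # per-gaussian grouping features
click_feat = feats[1234]                             # feature at the clicked gaussian
affinity = feats @ click_feat                        # similarity to the click
selected = affinity > 0.9                            # simple threshold -> group mask
```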
FYI, if you have another segmentation model that can generate these desired instance labels in 2D, you can add it to `img_group_model.py`.
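As a rough sketch of what that could look like (the class/method names and the TorchScript loading below are hypothetical -- the actual interface in `img_group_model.py` may differ; the key requirement is producing one binary mask per instance):

```python
import numpy as np
import torch

class CustomGroupModel:
    """Hypothetical wrapper: any 2D instance segmentation model works, as long
    as it returns one boolean (H, W) mask per detected instance."""

    def __init__(self, weights_path: str, device: str = "cuda"):
        self.model = torch.jit.load(weights_path).to(device).eval()
        self.device = device

    @torch.no_grad()
    def get_masks(self, image: np.ndarray) -> list[np.ndarray]:
        # (H, W, 3) uint8 image -> normalized (1, 3, H, W) float tensor
        inp = torch.from_numpy(image).permute(2, 0, 1)[None].float() / 255.0
        preds = self.model(inp.to(self.device))  # assumed to return (K, H, W)
        return [(m > 0.5).cpu().numpy() for m in preds]
```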
Yeah, I do want to train a 2D instance segmentation model myself to obtain the masks, but I'm not familiar with how to do it.
Hi, I have another question.
Why is the point cloud generated by nerfstudio not the same size as the original target size?
How can I restore the point cloud to its original size?
Hi, how can I call the `_export_visible_gaussians` function from the command line?
I have always had a question: why can't 3D Gaussian splatting directly export a point cloud, instead of exporting a Gaussian splat model?
This functionality isn't supported on the command line, but you should see an "Export Visible Gaussians" button once you check the "Export Options" checkbox (for `garfield-gauss`).
Yeah, I can see the "Export Options" checkbox (for `garfield-gauss`). But when I click "Export Visible Gaussians", there isn't any response. Does the "Export Visible Gaussians" button actually work when clicked?
It should work (I believe it writes a ply file to the current directory).
If the export code doesn't work, or there are issues with the code, please feel free to open up a PR!
I have always had a question. Current segmentation methods for NeRF or 3DGS first find masks in 2D, then map those masks to 3D to segment the 3D target.
This introduces occlusion issues from the 2D images, rather than segmenting directly in 3D space as one would with point clouds.
Many models now combine 2D and 3D. I would like to ask: why not perform segmentation directly on the generated 3D GS model? What's the difficulty?
The generated 3D GS model retains both spatial information and high image-level fidelity. I think achieving segmentation directly in 3D space would truly demonstrate its advantages over point clouds.
Hi, have you successfully exported point clouds with semantic labels?
I've been testing other models recently, but I haven't looked into this model again.
I am also trying to export point clouds with semantic labels -- maybe we can discuss this if possible.