field_interpolation

Field interpolation using finite elements and sparse linear least squares

Field interpolation using the finite difference method

A method for interpolating sparse and/or noisy data in one or several dimensions. Can be used to generate a signed distance field from a point cloud.

Code structure

The library is in field_interpolation/. src/ contains an example app.

Description

Introduction

Say we want to approximate some function f(x) = y. We have some data about the function: value data (e.g. f(0) = 4 and f(5) = 2) and (optionally) gradient data (e.g. f′(0) = 1 and f′(5) = -1). We can also make assumptions about the model (how the function behaves). We can, for instance, assume that the function is smooth: f″(x) = 0.

Let us now approximate the function f as values on a lattice (i.e. a grid). For this example, let us use six lattice points in the inclusive range [0, 5]. We can approximate the gradient of the field as f′(x) ≈ f(⌊x⌋+1) - f(⌊x⌋). The smoothness constraint thus becomes f(n + 1) - f(n) = f(n + 2) - f(n + 1) for each lattice point n where all three values exist. Let's write down all our constraints for the above example:

Value constraints:
	f(0) = 4
	f(5) = 2

Gradient constraints:
	f(1) - f(0) = 1
	f(5) - f(4) = -1

Smoothness constraints:
	f(1) - f(0) = f(2) - f(1)
	f(2) - f(1) = f(3) - f(2)
	f(3) - f(2) = f(4) - f(3)
	f(4) - f(3) = f(5) - f(4)

This is a set of linear equations - let us write it in matrix form, using the notation xₐ = f(a):

	        A           ·  x   =   b

	| 1  0  0  0  0  0|          | 4|
	| 0  0  0  0  0  1|   |x₀|   | 2|
	|-1  1  0  0  0  0|   |x₁|   | 1|
	| 0  0  0  0 -1  1|   |x₂|   |-1|
	| 1 -2  1  0  0  0| · |x₃| = | 0|
	| 0  1 -2  1  0  0|   |x₄|   | 0|
	| 0  0  1 -2  1  0|   |x₅|   | 0|
	| 0  0  0  1 -2  1|          | 0|

Obviously this is overdetermined, which means that there probably won't be any perfect solution to it. We instead have to solve it approximately, trying to minimize the error. The natural choice is, of course, linear least squares. (You will also notice that A is very sparse - it is mostly zeros. This can be taken advantage of to save memory and CPU time when the lattice becomes large.)
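For illustration, here is a minimal, self-contained sketch of setting up and solving exactly this little system with Eigen's sparse module. This is not this library's API; it just solves the normal equations directly, which is fine for a toy problem:

#include <Eigen/Sparse>
#include <cstdio>
#include <utility>
#include <vector>

int main() {
	std::vector<Eigen::Triplet<double>> triplets; // (row, column, coefficient) entries of A
	std::vector<double> rhs;                      // entries of b

	auto add_row = [&](std::vector<std::pair<int, double>> coeffs, double b) {
		const int row = static_cast<int>(rhs.size());
		for (const auto& c : coeffs) { triplets.emplace_back(row, c.first, c.second); }
		rhs.push_back(b);
	};

	add_row({{0, 1}}, 4);             // f(0) = 4
	add_row({{5, 1}}, 2);             // f(5) = 2
	add_row({{0, -1}, {1, 1}}, 1);    // f(1) - f(0) = 1
	add_row({{4, -1}, {5, 1}}, -1);   // f(5) - f(4) = -1
	for (int n = 0; n <= 3; ++n) {    // smoothness: f(n) - 2·f(n+1) + f(n+2) = 0
		add_row({{n, 1}, {n + 1, -2}, {n + 2, 1}}, 0);
	}

	Eigen::SparseMatrix<double> A(static_cast<int>(rhs.size()), 6);
	A.setFromTriplets(triplets.begin(), triplets.end());
	Eigen::VectorXd b = Eigen::Map<Eigen::VectorXd>(rhs.data(), static_cast<int>(rhs.size()));

	// Least squares: solve the normal equations AᵀA·x = Aᵀb.
	Eigen::SparseMatrix<double> AtA = A.transpose() * A;
	Eigen::VectorXd Atb = A.transpose() * b;
	Eigen::SimplicialLDLT<Eigen::SparseMatrix<double>> solver(AtA);
	Eigen::VectorXd x = solver.solve(Atb);

	for (int i = 0; i < 6; ++i) { std::printf("f(%d) = %.3f\n", i, x[i]); }
}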

The result will look something like this:

[Image: 1D field]

Here the red points and lines are our data (values and gradients), and the black line is the piecewise linear interpolation of the result of the above least squares solution. We can note that there is some error in both the gradients and the point values, but overall it looks OK-ish. Let us now increase the lattice resolution and try again:

[Image: 1D field with many lattice points]

We can now see we get a much closer approximation of the input data and a very smooth and plausible interpolation between them.

Noisy data

Having the solution be a least squares approximation has the advantage that we can use this method to model noisy data. We can also weight the different rows in the equation system by simply multiplying them with a number. For instance, if we multiply all the smoothness constraints by 10, we are in effect saying "we believe the curve to be smooth, no matter what the data says". If we instead multiply the data constraints by 10, we are saying "trust the data". This means the weights act like Bayesian priors. Here is some noisy data (red points, no gradients this time) with interpolations where the smoothness constraint goes from weak to strong:

[Images: weak smoothness, medium smoothness, strong smoothness]

Another way to view the above images is that how much we trust the data decreases from left to right.
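Concretely, weighting an equation just means scaling both its coefficients and its right-hand side before adding it to the system. A tiny sketch (the triplet/row representation here is illustrative, not this library's API):

#include <Eigen/SparseCore>
#include <utility>
#include <vector>

// Scaling an equation by `weight` makes its squared residual count weight²
// times as much in the least-squares objective, i.e. the row is trusted more.
void add_weighted_row(std::vector<Eigen::Triplet<double>>* triplets, std::vector<double>* rhs,
                      const std::vector<std::pair<int, double>>& coeffs, double b, double weight)
{
	const int row = static_cast<int>(rhs->size());
	for (const auto& c : coeffs) {
		triplets->emplace_back(row, c.first, c.second * weight);
	}
	rhs->push_back(b * weight);
}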

Data interpolation

In the initial example the data constraints happened to lie perfectly on lattice coordinates. What if they don't?

Let's say we have the constraint f(3.4) = 10 on an integer lattice. The way we add this is by using linear interpolation: 0.6 · f(3) + 0.4 · f(4) = 10. In other words: we want f(3.4) = 10 when we linearly interpolate the solution values at f(3) and f(4).

For the gradient we do something even simpler: nearest neighbors. For f′(3.1) = -12 we simply add the equation f(4) - f(3) = -12. This is simple, yet works well in practice.
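Here is a minimal 1D sketch of how both kinds of off-lattice constraints turn into equation rows (the Row struct and helper names are made up for illustration, not taken from this library):

#include <cmath>
#include <utility>
#include <vector>

// One row of the system: a few (lattice index, coefficient) pairs and a right-hand side.
struct Row {
	std::vector<std::pair<int, double>> coeffs;
	double rhs;
};

// f(x) = value, via linear interpolation between the two surrounding lattice points.
Row value_row_1d(double x, double value) {
	const int    i = static_cast<int>(std::floor(x));
	const double t = x - i;                         // e.g. x = 3.4 gives i = 3, t = 0.4
	return Row{{{i, 1.0 - t}, {i + 1, t}}, value};  // (1-t)·f(i) + t·f(i+1) = value
}

// f'(x) = gradient, as a finite difference between the two nearest lattice points.
Row gradient_row_1d(double x, double gradient) {
	const int i = static_cast<int>(std::floor(x));  // e.g. x = 3.1 gives f(4) - f(3) = gradient
	return Row{{{i, -1.0}, {i + 1, 1.0}}, gradient};
}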

Extending it to multiple dimensions

The method can be trivially extended to multiple dimensions. For data value constraints we use bilinear interpolation in 2D, trilinear interpolation in 3D, etc. For the gradients we again pick the two nearest neighbors along each dimension and constrain the difference between them along that dimension. For instance, in 2D:

∇f(2.1, 5.8) = [-1, 3] becomes:

	f(3, 6) - f(2, 6) = -1
	f(2, 6) - f(2, 5) =  3

This works surprisingly well.

We do something similar for the lattice smoothness constraints. In 3D:

	f(x, y, z) - 2·f(x+1, y,   z)   + f(x+2, y,   z)   = 0
	f(x, y, z) - 2·f(x,   y+1, z)   + f(x,   y+2, z)   = 0
	f(x, y, z) - 2·f(x,   y,   z+1) + f(x,   y,   z+2) = 0

As the number of dimensions goes up, so does the number of constraints - but not by much, and the number of values in each equation stays low, meaning the resulting equation system remains sparse, and thus fast to solve.
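As a sketch, generating these smoothness rows for a 3D lattice could look like this (triplet-based, as in the earlier Eigen example; the helper is illustrative, not this library's API):

#include <Eigen/SparseCore>
#include <vector>

// Adds the second-difference (f'' = 0) smoothness rows along each axis of an
// nx × ny × nz lattice, scaled by `weight`. Each row touches only 3 unknowns.
void add_smoothness_rows(int nx, int ny, int nz, double weight,
                         std::vector<Eigen::Triplet<double>>* triplets, int* num_rows)
{
	auto index = [&](int x, int y, int z) { return x + nx * (y + ny * z); };
	const int strides[3] = {1, nx, nx * ny};  // one step along x, y and z respectively
	const int sizes[3]   = {nx, ny, nz};

	for (int z = 0; z < nz; ++z)
	for (int y = 0; y < ny; ++y)
	for (int x = 0; x < nx; ++x) {
		const int coords[3] = {x, y, z};
		for (int d = 0; d < 3; ++d) {
			if (coords[d] + 2 >= sizes[d]) { continue; }  // need three points along axis d
			const int i   = index(x, y, z);
			const int row = (*num_rows)++;
			triplets->emplace_back(row, i,                   1.0 * weight);
			triplets->emplace_back(row, i + strides[d],     -2.0 * weight);
			triplets->emplace_back(row, i + 2 * strides[d],  1.0 * weight);
			// The corresponding entry of b is zero for these rows.
		}
	}
}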

Relationship to existing methods

The method described in this article is similar to a steady-state Finite Difference Method (FDM). In FDM, the data constraints correspond to boundary conditions.

However, in this library we apply FDM to noisy data to produce an overdetermined equation system. Adding weights to the equations allows for a linear least squares solution which approximates the field.

Example use: surface reconstruction from point samples

A common problem in 3D scanning is reconstructing a mesh (surface) from a set of noisy surface points. We can use the methods described in this article for that by setting up a system so that:

  • Smoothness constraint: f″(x) = 0
  • For each particle:
    • f(x) = 0 (the field is zero at the surface point)
    • ∇ f(x) = n (the gradient of the field at the surface point is the point normal)

The solution will then be a smooth approximation of a signed distance field. One can then apply e.g. Marching Cubes to the resulting field to produce a mesh. The field will be smooth and accurate near the zero iso-surface f(x) = 0 (the mesh boundary), but depending on the quality of the input points (their noise level) the distances will likely be quite off further away from the surface. However, for mesh reconstruction it only needs to be accurate around the zero iso-surface.
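In code, the setup could look roughly like the snippet below. add_value_constraint is used later in this README; add_gradient_constraint and the exact signatures are assumptions for the sake of illustration:

for (const auto& sample : surface_samples) {
	// f(pos) = 0: the field is zero at the surface point.
	add_value_constraint(&field, sample.pos, 0.0f, data_weight);
	// ∇f(pos) = normal: assumed helper, by analogy with add_value_constraint.
	add_gradient_constraint(&field, sample.pos, sample.normal, normal_weight);
}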

Making it work without point normals

Interestingly, surface reconstruction can be made to work even if your point data lacks normals, as long as there is some indication of outside/inside. For instance, if there is some area of the field (e.g. the boundary) that you know to be outside of the mesh, you can add weak constraints that these places have a distance equal to the distance to the nearest data point. It doesn't have to be exact, just a helper for the solver. The reason this works is that the data points (f(x) = 0) act like pivots for the smoothness constraints (which try to make the field "straight"). For instance, if you feed the solver a smoothness constraint and the data f(0) = 10, f(10) = 0, then it will be able to figure out that f(20) = -10. This still works for noisy particles in 2D and 3D. Another approach is to find extreme points (topmost, leftmost, etc.) and add simple gradient constraints for these (down, right, etc.).

Differences with Smooth Signed Distance Surface Reconstruction:

The most similar work in this area is SSD. Differences include:

  • SSD uses octrees to solve for fewer unknowns.
  • SSD uses a different (and more expensive) smoothness (regularization) constraint.

In my work the smoothness constraint is that the difference in gradient between neighboring edges (edges sharing a vertex and pointing along the same axis) should be zero. This adds D·V equations with 3 unknowns in each, where D is the dimensionality and V is the number of unknowns (voxels).

SSD instead uses the three-dimensional gradient difference between neighboring voxels. In SSD the gradient is estimated from the eight voxel corners, so each such constraint touches 16 unknowns. SSD also enforces this constraint along all three dimensions (not just along the direction to the neighbor), so we end up with three constraints for each pair of neighbors, i.e. D²·V constraints with 16 unknowns each. That is three times the number of equations, each touching more than five times as many unknowns, which produces a more expensive system of equations.

The solver

Solving a sparse linear least squares problem is well researched, which means there are many robust and fast solvers for it, both exact and approximate. This library uses solvers in Eigen with some improvements. In particular, a fast approximate solver is employed for solving large multidimensional lattices. This solver works like this:

  • The problem is down-scaled to a coarser level, and solved exactly. This coarse solution is then up-scaled to the original lattice size again.
  • The problem is broken up into non-overlapping tiles and solved individually. At the boundaries between tiles, the approximate solution from the downscaled solver is used. The solutions for the tiles are re-assembled in the original lattice.
  • The solution from the tiled solver is used as a starting guess to an iterative, approximate Conjugate gradient solver.

There are probably plenty of improvements that could be made to this.

Speeding it up

There are several strategies for speeding up the solving of a big system, all of which can be combined. What they have in common is that they trade precision for speed. What to try first depends on where the problem lies.

My resolution is huge

Try solving the system at a lower resolution, then scale it up (you can use the upscale_field helper function). Remember to multiply the field by the factor you scale it up with.
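As an illustration of what that step does (this is not the library's upscale_field, just a 1D sketch of the idea):

#include <algorithm>
#include <vector>

// Linearly interpolate a coarse 1D solution onto a finer lattice, and multiply
// by the scale factor, since the stored distances grow with the lattice resolution.
std::vector<float> upscale_1d(const std::vector<float>& coarse, int factor)
{
	const int coarse_size = static_cast<int>(coarse.size());
	const int fine_size   = (coarse_size - 1) * factor + 1;
	std::vector<float> fine(fine_size);
	for (int i = 0; i < fine_size; ++i) {
		const float x = static_cast<float>(i) / factor;             // position in coarse coordinates
		const int   j = std::min(static_cast<int>(x), coarse_size - 2);
		const float t = x - j;
		fine[i] = ((1.0f - t) * coarse[j] + t * coarse[j + 1]) * factor;
	}
	return fine;
}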

Now you have an approximate solution - use it as a starting guess for solving the full system using an iterative solver (you can use solve_sparse_linear_with_guess for this).

I have a lot of data points, far more than unknowns

Try using the nearest-neighborhood ValueKernel and GradientKernel to make your system more sparse.

You can also try...

Reducing the data points

If you have a lot of data points the solver can be slow. For an approximate solution you can use only every Nth point (e.g. every third). To compensate, multiply the weight by √N. Why?

Consider the case of N equations with equal weight W. Since we are minimizing the square of the error, they contribute to the energy function as E = N·W². So if you want to replace all these equations with just one equation, it should have the weight √N·W, so that E = (√N·W)² = N·W².

If your data points have individual weights Wᵢ in the range [0, 1], you can stochastically sample them by letting the probability of keeping a point be Wᵢ². That is, replace this:

for (const auto& point : points) {
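	// One value constraint per point, weighted by its individual weight times the global weight.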
	add_value_constraint(&field, point.pos, point.value, point.weight * global_data_weight);
}

with this:

for (const auto& point : points) {
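	// Keep each point with probability weight²; kept points get the full global weight.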
	if (random_in_range(0,1) < point.weight * point.weight) {
		add_value_constraint(&field, point.pos, point.value, global_data_weight);
	}
}

To understand this, consider N points with (equal) individual weights W, and global_data_weight = G. In the first example, the energy would be E = N·(W·G)² = N·W²·G². With the sampling method above we would expect to keep N·W² of the points, which keeps the total energy at E = N·W²·G².

Todo

Algo

  • Sparse lattices

Speeding up

  • Coarse first, scale up, re-run on elements with large errors, keep smooth fixed (sparse)
  • Take resolution into account to adjust smoothness constraints.

Gui

  • Use 1D to verify iso-surface positioning is perfect
  • Split back-projected error into model and data constraints.
  • Add several saved configs for distance field tab
  • Show circumference comparison
  • Add slider for keeping gradient_smoothness slider same as C1 smoothness slider

Code

  • Proper build system