/metaballsWebgl

fast gpu metaballs for webgl

Primary LanguageJavaScript

metaballsWebgl

summary

metaballsWebgl is a GPU implemenmtation of metaballs for running under WebGL or WebGL2, using optimized marching cubes (see below). It uses three.js.

  • Input is a generally texture containing sphere positions, and optionally colours.
  • Output is a rendered image as part of a three.js scene.

It is designed to be fast for dynamic data; for example it can handle 10,000 spheres at a grid resolution of 100x100x100 at 60 fps using 33% of a 1080 gpu. It recomputes everything every render, so is inefficient for static data.

A test case can be viewed at https://sjpt.github.io/metaballsWebgl/marchtestlocal.html

The implicit function marching cubes part can be used independently from the metaball code. Choose function from the dropdown (modified from taken from http://mikolalysenko.github.io/Isosurface/). You can also type in your own function using the code box. We intend to add sliders to take advatage of the real-time aspect of these functions.

basic usage

marching = new Marching(isWebGL2);
scene.add(marching.three);
marching.updateData(datatexture, sphereScale);
  • datatexture is an n wide, 1 high texture of xyz values.
    • datatexture may contain a 4th channel of colour: rgb packed into a float
  • sphereScale is the inverse of the minimum/maximum xyz size, to scale to -1..1 range.

This is illustrated in marchtest.js, run using marchtestlocal.html or marchtestremote.html. These are the same except they access the necessary files in different ways.

more control

A structure marching.X provides more control.

    rad: 0.1,           // radius of spheres
    radInfluence: 1.5,  // how far infludence spreads
    sphereScale: 1,     // scale of sphere positions (TODO; something cheap but more complete)
    sphereYin: false,   // true for y based input texture (1 wide, n high)
    npart: 1000,        // number of particles
    ntexsize: 1000,     // size of texture holding particles
    spatdiv: 25,        // numnber of spat phase subdivisions if equal for x,y,z
    doshade: true,      // control final shading phase
    dowire: false,      // control final wireframe phase
    dopoints: false,    // control final points phase
    funtype: 2,         // 0 cubic, 1 quadratic, 2 exp
    useboxnorm: true,   // true to compute normals in box phase
    instancing: true,   // do we use instancing
    loops: 1,           // number of repititions of loop to apply each render cycle
    trivmarch: false,   // true for trivial version of marching cubes pass, performance test to estimate overheads
    xnum: 100, ynum: undefined, znum: undefined,     // numbers for voxel boundary count (ynum and znum default to xnum)
    yinz: 0,            // manual override value used to split y in 3d/2d lookup, needed where xnum*ynum > maxtexsize
    threeShader: true   // true to use customized three shader, false for trivial shading

tracking

Colours

Colours may be added as a fourth channel in the input sphere definition. They are tracked through the system, being blended as apporpriate. The blending tends to obscure computational inaccuracies arising from the gridded nature of the algorithm.

ids

The id (sphere humber in the input array) can be tracked through the system, and the result used for colouring or to control other shader attributes. In the default test setup different ids are mapped to random different colours.

Blending is not appropriate for ids. The system attempts to keep track of the closest (most influential) sphere at any point. The nature of the gridding means that this does not give clean edges between the id regions.

shaders

The system provides a very basic shader, or can operate by inserting necessary patches to the three standard material.

Additionally, the user can provide additional patches. For example we have a patch that provides 3d texturing (along the lines of various generations of Organic Art). This is not currently publically available.

algorithm

metaballsWebgl has been derived from standard marching cubes, with several optimizations. The optimizations are mainly designed to mitigate the inherent inefficiencies of GPU coding in Webgl.

It operates in four passes:

  1. spatial subdivision determines which spheres are active in which super-voxel
  2. grid potential fill, fills the voxel corners with potential data, and associated colour/id data if relevant.
  3. voxel relevance pass, flag which voxels are likely to contribute, and how
  4. marching cubes pass, generate the marching cubes triangles

The first three passes are rendered using three.js onBeforeCompile(), the final pass as part of the 'real' three scene.

The relatively rigid limitations of shader programming, especially in Webgl, mean there are some inherent inefficiencies.

The spatial subdivision pass outputs a bitmap for each super-voxel, indicating whether each sphere does or does not contribute to that super-voxel. The bitmap is saved in RGBA float texture, 24 bits per float, 96 bits per texture position. We typically use 25x25x25 super-voxels. This means that the fill pass just has to process the bit for spheres that are not active in the super-voxel, but does not have to lookup the sphere position and compute its contribution to the potential.

The final marching phase generates 5 triangles for every voxel. Most voxels have no surface and all the triangles are 'dummy' triangles with three 'dummy' vertices; almost none of the voxels use all 5 triangles. The shaders are written to minimize time spent handling dummy vertices and triangles; the purpose of the voxel relevance pass is to assist that. Nevertheless the intrinsic overhead (outside the shaders) of handling vertices and triangles meanss that nearly 50% of the gpu time goes into handling these dummy triangles.

The optimizations mean that the flow is highly non-uniform, and likely to behave very poorly on older gpus, and gpus in many tablets and phones.

lego/minecraft

The very helpful https://0fps.net/2012/07/12/smooth-voxel-terrain-part-2/ by Mikolaly Senko comments how surface net mesh points can be coerced to the centres of their voxel rather than finding a good surface point. This makes the smooth surface into a lego surface with only axial faces. The 'lego' option enables this.

We have applied a similar idea to marching cubes. Each edge crossing is forced to the centrpoint of the voxel edge. This is not as neat as the lego. Rather than axial faces all faces have normals (A,B,C) where A,B,C are -1, 0 or +1.

three.js note

three.js is an excellent framework, but we have had some issues with this project, which is somewhat outside the normal use of three.js.

attribute-less coding

Most of the code is inherently attribute-less; in particular vertex positions are arrived at procedurally, and in WebGL2 can be derived from gl_VertexID and gl_InstanceID. The calling code needs to identify the number of vertices and instances, but no specific vertex or instance attributes are needed.

three.js does not currently support this; it requires a position vertex attribute and at least on instance attribute. This is not difficult (and in necessary in WebGL1), but does involve a little extra code in the application that should be unnecessary.

mrdoob/three.js#19430 mrdoob/three.js#19451

You can get round that by preparing a position attribute in javascript to help three.js identify the number of vertices. However, if you do not use that attribute within the shader recent versions of three.js detect this, ignore the attribute, and do not know what size to use.

corruption of NaN values

Creating three.js vector objects with NaN values does not respect the NaN's but corrupts them to 0. eg new THREE.Vector3(NaN).x gives 0.

Some of this code can use NaN values in the shaders and intermediate texture buffers to 'kill' redundant trianges; future versions may well rely on them. Reading back (e.g. for debug) such buffers with NaN values works OK, but organising those as arrays of three.js vectors corrupts them.

This was easily fixed in our code by replacing new THREE.Vector3(x,y,z) with new THREE.Vector3().set(x,y,z).