A mirror of the original Fortran BELLHOP/BELLHOP3D underwater acoustics simulators, with numerical properties and robustness improved and bugs fixed. These changes were made in support of the multithreaded C++/CUDA version of BELLHOP/BELLHOP3D: bellhopcxx/bellhopcuda.
Copyright (C) 2021-2023 The Regents of the University of California
Marine Physical Lab at Scripps Oceanography, c/o Jules Jaffe, jjaffe@ucsd.edu
Copyright (C) 1983-2022 Michael B. Porter
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.
How edge cases are handled seems like it should only affect a small fraction of
results in any program. However, in BELLHOP, the step size for the numerical
integration is typically set to a large value, and is then reduced so that the
step lands on the nearest "boundary". (Here, this refers not only to the ocean
surface and floor, but to any "plane" within the ocean where the SSP changes,
where the bottom has a corner/vertex in its definition, and so on.) Thus, for
typical runs, every step of every ray in BELLHOP
lands on an edge case.
Depending on the exact set of floating-point operations done and the specific
inputs, these steps "randomly" land just before or just after the boundary. This
means the step "randomly" ends up in different segments, which leads to further
differences down the line. While these differences typically have a very small
effect on the overall ray trajectory, there are several mechanisms (see
"Detailed information on edge case issues" section below) by which the error can
be amplified and lead to significantly different results later. This makes
comparing the results to the C++/CUDA
version very difficult, and may
in some cases make reproducing results between runs of the Fortran program
difficult. This has been fixed by overhauling the system which moves the
numerical integration to the boundary to give predictable behavior. This also
involved changes to reflections (landing on the top or bottom boundary); now
consistently only two identical points are generated at each reflection, one with the incoming tangent and one with the outgoing tangent.
In InfluenceGeoGaussianCart, a ray is considered to be at a shallow angle if
its angle is less than or equal to 60 degrees, and a steep ray otherwise. The
handling of these two cases is completely different, i.e. there is a
discontinuity as the ray angle passes 60 degrees. However, some environment
files trace rays at round number angles, including exactly 60 degrees.
Differences in the set of floating-point operations used can cause these rays
to be considered on one side or the other of this boundary, leading to different
results. This cannot be fully fixed without changing the physics of the simulation (i.e. removing the discontinuity), so instead the discontinuity has been moved slightly away from the round number to increase the chances of consistent results.
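A minimal sketch of the adjusted check (the function name, variable names, and the exact offset are illustrative, not the values used in the code):

```fortran
LOGICAL FUNCTION IsShallowRay( alpha )
   ! alpha is the ray angle in radians. The cutoff is nudged off the round
   ! 60-degree value so that rays traced at exactly 60 degrees land
   ! consistently on one side of the discontinuity.
   IMPLICIT NONE
   REAL (KIND=8), INTENT(IN) :: alpha
   REAL (KIND=8), PARAMETER  :: DegRad = 3.1415926535897932D0 / 180.0D0
   IsShallowRay = ABS( alpha ) <= ( 60.0D0 - 1.0D-3 ) * DegRad   ! was exactly 60 degrees
END FUNCTION IsShallowRay
```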
There are multiple strange behaviors in the code for when rays escape the region where altimetry and bathymetry are defined in 3D (GetTopSeg3D and GetBotSeg3D):
- If the ray escapes the box, a warning is printed, but segment information (e.g. IsegTopx, xTopSeg, Top_tri_n) is not updated. But these values are not fully initialized between rays, meaning that it's possible for the first step of one ray to use values from the last step of the previous ray.
- If the ray escaped the box to the negative side in either dimension, or the results are NaN (see below), the ray is then terminated in TraceRay without the current step. However, if the ray escaped to the positive side, the ray is terminated as part of the normal stopping conditions, including the current step.
- There is a check for the depth being NaN, which sets the segment to 0 (invalid). But it is not clear what code catches that condition, or if the errors arising from the invalid segment are at all better than the NaN (which could only have arisen from an error in the first place).
- The original logic allowed for steps to the edge of the same segment they are in, which has been generally changed as discussed above to step into the next segment according to the tangent.
- In 2D, the altimetry / bathymetry are extended to very large values so these conditions never get hit, whereas in Nx2D and 3D they are not, so almost all rays escape the box.
These functions have been rewritten as follows:
- There is an override so that being at the outer edge of the last segment is OK, just for one step.
- If the position is actually outside the segment, a warning is printed and the position is placed in the nearest valid segment.
- The stopping condition was changed to stop if the ray is on the boundary and pointing outwards.
- All values are initialized between rays.
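A minimal sketch of the clamping behavior in the second bullet, with illustrative names (the real GetTopSeg3D / GetBotSeg3D also handle the triangle diagonals and the one-step edge override):

```fortran
SUBROUTINE ClampSegment( x, xGrid, NSeg, iSeg )
   ! Put a position that has escaped the altimetry/bathymetry box into the
   ! nearest valid segment, with a warning, instead of leaving stale indices.
   IMPLICIT NONE
   INTEGER,       INTENT(IN)    :: NSeg
   REAL (KIND=8), INTENT(IN)    :: x, xGrid( NSeg + 1 )   ! position and segment boundaries
   INTEGER,       INTENT(INOUT) :: iSeg
   IF ( x < xGrid( 1 ) ) THEN
      iSeg = 1
      PRINT *, 'Warning: ray escaped the box where the boundary is defined'
   ELSE IF ( x > xGrid( NSeg + 1 ) ) THEN
      iSeg = NSeg
      PRINT *, 'Warning: ray escaped the box where the boundary is defined'
   END IF
END SUBROUTINE ClampSegment
```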
The code ir = MAX( MIN( INT( ( r - Pos%Rr( 1 ) ) / Pos%Delta_r ) + 1, Pos%NRr ), 1 ) or similar is used in various Influence functions. The INT( ) operation here is another edge case which steps often land exactly on. For example, suppose r is 3000 after a step to a particular segment boundary, Pos%Rr( 1 ) == 0, and Pos%Delta_r == 500. The argument to INT( ) may end up being 5.99999999999998 or 6.000000000000002 depending on the exact value of r, but this would result in different values of ir and therefore substantially different results. This code pattern has been replaced with the function RToIR, which "snaps to" each integer value of the argument, to ensure these cases are handled consistently.
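A sketch of the snapping idea, assuming inputs like those in the expression above (the actual RToIR may differ in its threshold and argument list):

```fortran
FUNCTION RToIR( r, Rr1, Delta_r, NRr ) RESULT( ir )
   ! Replacement for INT( ( r - Pos%Rr( 1 ) ) / Pos%Delta_r ) + 1 which snaps
   ! values within a few ULPs of an integer onto that integer, so that
   ! 5.99999999999998 and 6.000000000000002 yield the same receiver index.
   IMPLICIT NONE
   REAL (KIND=8), INTENT(IN) :: r, Rr1, Delta_r
   INTEGER,       INTENT(IN) :: NRr
   INTEGER                   :: ir
   REAL (KIND=8)             :: t
   t = ( r - Rr1 ) / Delta_r
   IF ( ABS( t - NINT( t ) ) < 1.0D3 * SPACING( t ) ) t = REAL( NINT( t ), 8 )
   ir = MAX( MIN( INT( t ) + 1, NRr ), 1 )
END FUNCTION RToIR
```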
ApplyContribution (3D) in arrivals mode wrote to rayt, and computed it as the non-normalized ray tangent (the difference between two consecutive ray points). In the two Cartesian influence functions, rayt holds the normalized ray tangent and is computed once per step. So that value would be clobbered by the first arrival, and would be incorrect when computing arrivals at any other receivers in the same step. This has been fixed.
For 3D runs with multiple sources, the arrivals matrix was never cleared between sources, so receivers had all the arrivals from earlier sources plus new arrivals from the new sources. This has been fixed. This did not affect Nx2D because it overwrites slices of the 3D arrivals matrix from the 2D one, which ends up overwriting all parts of the 3D matrix.
The code that applies the factor of 2*pi for Gaussian beams to semi-coherent or incoherent runs in the 3D ApplyContribution only checked if the beam was Gaussian Cartesian ('B'), not Gaussian ray-centered ('b'), so the latter did not get the 2*pi factor. This has been fixed.
In InfluenceCervenyRayCen, rnV was negated for image 2 and again for image 3. If Nimage was set to 2, this means rnV would change sign every step. mbp had implemented this differently (without the bug) in InfluenceCervenyCart, so that implementation was brought to this function.
In InfluenceGeoHatRayCen (2D), the original code did IncPhaseIfCaustic (or rather, the non-function equivalent of it) within the Stepping loop, after checking and skipping the step if irA == irB. Which steps meet this condition depends on Pos%Rz, which is receiver information and has nothing to do with the ray/beam itself. If q goes from negative to positive on one step, the caustic will go unnoticed until the next step where irA != irB. However, if there are TWO caustics during this time (i.e. during a run of skipped steps where irA == irB), q will have gone from negative to positive and back to negative, and the phase change will be completely missed. This means field results at one receiver depend on where other receivers are, which is non-physical.
mbp had a comment suggesting this be precomputed, which is how it is done in the corresponding 3D influence functions, and is the solution which has been applied here.
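For reference, a sketch of the once-per-step caustic update (names follow the text above; the exact placement of the equals signs varies between the original influence functions, as noted later):

```fortran
SUBROUTINE IncPhaseIfCaustic( q, qOld, phase )
   ! A caustic is a sign change of q along the ray; each one adds pi/2 to the
   ! accumulated phase. Doing this once per step, from the ray quantities alone,
   ! removes any dependence on where the receivers happen to be.
   IMPLICIT NONE
   REAL (KIND=8), INTENT(IN)    :: q, qOld
   REAL (KIND=8), INTENT(INOUT) :: phase
   REAL (KIND=8), PARAMETER     :: pi = 3.1415926535897932D0
   IF ( ( q <= 0.0D0 .AND. qOld > 0.0D0 ) .OR. &
        ( q >= 0.0D0 .AND. qOld < 0.0D0 ) ) phase = phase + pi / 2.0D0
END SUBROUTINE IncPhaseIfCaustic
```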
The algorithm in AddArr
(both 2D and 3D) does not properly handle the case
where the current arrival is the second step of a pair, but the storage is full
and the first step of the pair got stored in some slot other than the last one.
It will only consider the last ray in storage as a candidate for the first
half of the pair. This means that whether a given pair is successfully paired or
not depends on the number of arrivals before that pair, which depends on the
order rays were computed, which is arbitrary and non-physical.
Fixing the bug would not be difficult, but it would require iterating over all arrivals to check for pairs, which would slow things down. It also hypothetically could merge arrivals with previous arrivals which are not part of the same pair but happened to have similar delay/phase, though it's not clear if this would actually be a problem or not.
In the C++/CUDA version, the
incorrect behavior was emulated for single-threaded mode, in order to be able to
compare arrivals results to BELLHOP/BELLHOP3D. However, for multithreaded
mode or CUDA, the code was changed to just write the first MaxNArr
arrivals
and discard any future ones. It's not clear if it's possible to write a lock-
free algorithm for "keep the N arrivals with the highest amplitude", due to all
the other arrival data besides the amplitude which needs to be written and could
not be written atomically. A mutex-based approach would hurt the multithreaded
performance and completely destroy the CUDA performance.
The computation for Ratio1 in SGB (2D) was SQRT( COS( alpha ) ), which will crash (square root of negative real) for rays shot backwards. All other influence functions use SQRT( ABS( COS( alpha ) ) ), so the missing ABS has been added. Also, the way the loop over range works assumes the ray always
travels towards positive R, which is not true for certain bathymetries or for
rays shot backwards. This loop has not been fixed.
In BELLHOP, iSegz and iSegr are initialized (to 1) only once globally. In
some cases (a source on a boundary), in conjunction with other subtle issues
(see Case Study below), the missing initialization caused the initial state of
one ray to be dependent on the final state of the previous ray, which is
obviously non-physical and makes reproducing certain results impossible. This
has been fixed by reinitializing them before every ray.
The beam parameters Nimage, iBeamWindow, and Component were not initialized in code, and not all BELLHOP-provided test .env files initialize Component. The values of uninitialized variables in Fortran are not defined. Reasonable initializations have been added to these parameters.
Step3D phase 2 had RayNormal_unit using ray2%phi, which is uninitialized memory (or left over from the previous ray). This was changed to ray1%phi. Also, it used RayNormal_unit despite this being for cases where the tangent was already normalized, but computed the normalized tangent immediately before. So this was changed to RayNormal.
Codepaths for several possible settings do not set output values in PickEpsilon, leading to undefined behavior if environment files use these settings. For 2D, HalfWidth and epsilonOpt are unset for:
- Cerveny beam-width Cerveny beams (BeamType( 2 : 2 ) == 'C', in addition to BeamType( 1 : 1 ) being 'C' or 'R' for Cerveny beams)
- Geometric hat beams specified as '^' (or now in 2022, ' ') instead of 'G'

For 3D / Nx2D, epsilonOpt is unset for:
- Cerveny beam-width Cerveny beams
- WKB beams

For both 2D and 3D, the outputs are also unset for beams not matching any of the recognized beam types or beam width types, though this leads to other errors as well.
These issues went undetected because the results of PickEpsilon
are actually
only used in conjunction with Cerveny beams, the Cerveny 3D influence function
Influence3D
was removed from BELLHOP3D
before the earliest version we have,
and none of the 2D environment files use the Cerveny beam-width Cerveny beams.
InfluenceSGB
had code to support eigenrays, but the condition to check the
distance from the receiver was commented out, meaning that every ray would be
an eigenray. Furthermore, it assumed that any run other than eigenrays was
coherent TL, ignoring incoherent, semi-coherent, and arrivals. This function
was reworked to use ApplyContribution
to support all of these run types.
Also, the maximum distance from the receiver (RadMax / RadiusMax) was
uninitialized on the Nx2D codepath (though in the original code, checking it
was commented out so this didn't matter). It has been set to the same value as
used for 2D SGB and Cerveny influence.
The two Cerveny influence functions (ray-centered and Cartesian in 2D; 3D Cerveny is referenced in commented out code but not provided) do not support eigenray and arrivals runs. This is now checked, instead of silently failing.
In Reflect2D
(Nx2D only version), for Beam%Type( 4 : 4 ) == 'S'
by "Seongil",
the check for si == 0
preventing a divide-by-zero is missing compared to the
2D version. It has been readded.
In 3D SSP Hexahedral, SSP%Nz is assigned to SSP%NPts and SSP%Seg%z to SSP%z. However, the latter has a maximum of MaxSSP depth values, whereas the former did not have a check for exceeding this size. A check has been added.
The handling of input data from the environment file or other input files which
is not monotonically increasing is inconsistent. In some cases, such as source
and receiver coordinates, the input data is sorted, so providing out-of-order
input data is fully supported. In some cases, BELLHOP(3D)
checks to ensure the
input data is monotonically increasing. However, for SSP coordinates and
reflection coefficients, BELLHOP(3D)
assumes the input data is monotonically
increasing without checking it. If it is not, this gives other errors later or
garbage results. Checks have been added in these cases.
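A sketch of the kind of check that was added (illustrative; the actual code reports a more specific error for each input):

```fortran
LOGICAL FUNCTION IsStrictlyIncreasing( v, N )
   ! Returns .FALSE. if an input vector (e.g. SSP depths or reflection
   ! coefficient angles) is not monotonically increasing.
   IMPLICIT NONE
   INTEGER,       INTENT(IN) :: N
   REAL (KIND=8), INTENT(IN) :: v( N )
   INTEGER                   :: i
   IsStrictlyIncreasing = .TRUE.
   DO i = 2, N
      IF ( v( i ) <= v( i - 1 ) ) THEN
         IsStrictlyIncreasing = .FALSE.
         RETURN
      END IF
   END DO
END FUNCTION IsStrictlyIncreasing
```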
In a few places throughout the codebase, TINY( ) is used to get a small number to replace zero, in the sense of IF ( ABS( a - b ) < 10.0 * TINY( a ) ) to replace IF ( a == b ), which might fail due to floating point inaccuracies. However, this is probably the wrong function for this job. TINY gives the smallest positive number of the given type (strictly, that would be a denormalized number, but gfortran gives the smallest positive normalized number): TINY( 1.0 ) is about 1E-38 and TINY( 1.0D0 ) is about 2E-308. These values are much smaller than typical errors: either a will exactly equal b, or if it's one float larger or smaller, the difference will be SPACING( a ), which is about 1E-7 if a == 1.0 or about 2E-16 if a == 1.0D0. So, if the original values being compared are known, as in this example where we know a and b, the best choice would be something like 10.0 * SPACING( a ). If the magnitude of the values which lead to the maybe-zero value is not known, as in IF ( ABS( c ) < ??? ), a good choice would be EPSILON( c ). This has been changed in InfluenceCervenyRayCen, ReadRayElevationAngles, and ReadRayBearingAngles. The uses of TINY in Reflect / Reflect3D for manipulating the SQRT branch cut are not an issue and have not been changed.
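A small program illustrating the magnitudes involved (the printed values are approximate and assume IEEE-754 single / double precision):

```fortran
PROGRAM ToleranceScales
   IMPLICIT NONE
   PRINT *, TINY( 1.0 ),    TINY( 1.0D0 )      ! ~1E-38 and ~2E-308: far below typical rounding error
   PRINT *, SPACING( 1.0 ), SPACING( 1.0D0 )   ! ~1E-7  and ~2E-16 : one ULP at 1.0
   PRINT *, EPSILON( 1.0 ), EPSILON( 1.0D0 )   ! ~1E-7  and ~2E-16 : relative spacing near 1
END PROGRAM ToleranceScales
```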
pi was previously defined to be 3.1415926535898D0. The closest double precision floating point number (REAL( KIND=8 )) to the true value of pi is 0x400921FB54442D18, or about 3.1415926535897932D0, so the definition has been replaced with this. The value which pi was previously defined to be was in error by about -7e-15, or 16 ULPs, which led to divergence in some cases.
In CurvatureCorrection2 in Reflect3DMod, even if there is no curvature correction (R1, R2, and R3 are all zero), rotating p into and out of the ray's coordinate system introduces slight floating-point error, which accumulates over every step and can trigger edge case differences later. This has been improved by keeping p constant when there is no curvature correction.
In TraceRay2D
(Nx2D), the conditions comparing the ray position to the beam
box were inexplicably commented out in the 2022 revision. These conditions are
active in 2D and 3D. The result of running a ray without these conditions is
that when it gets to the boundary of the beam box, the step size is reduced to
zero and then forced to a small number. The ray takes 50 of these small steps
(millimeters over the boundary) and then terminates due to iSmallStepCtr. This behavior is not useful compared to just ending at the boundary, so the conditions have been restored. Furthermore, all of the box conditions have been changed from > to >=; this is so that a ray which steps to the box will then
terminate on that step, instead of taking another infinitesimal step over the
boundary and then terminating.
In ReduceStep3D and StepToBdry3D, the threshold for crossing segments in X was EPSILON( h1 ) but in Y was 1000.0 * EPSILON( h1 ). mbp had a comment questioning this. The 1000 has been removed.
TopGlobalx, TopGlobaly, BotGlobalx, and BotGlobaly were never deallocated. This has been fixed.
In both 2D altimetry and bathymetry, when the vectors are echoed, the condition for writing the last element after skipping some will never be satisfied due to the loop bounds. This condition has been fixed.
Typos in comments and other minor issues have been fixed. See the commit logs if you want a full list.
Some features of BELLHOP / BELLHOP3D are probably not intended, but they
were not difficult to emulate in the C++/CUDA version, so their behavior was
left alone. Or, there is code which has no effect and was removed in the
C++/CUDA version.
The code updating phase
and computing the final value of phaseInt
in the
various influence functions has been moved to the functions IncPhaseIfCaustic
and FinalPhase
respectively. There are three differences in how this code
operates between versions, which do not seem to have any mathematical basis and
therefore may be bugs.
First, the step of the ray whose phase is read--X for ray2D( X )%Phase (or the same for 3D)--is iS - 1 in all cases, except for 2D Gaussian Cartesian where it is iS. Given that the code then updates the phase based on whether the caustic has yet occurred during the step, it seems that starting from the initial phase of the step (rather than the final phase of the step) would be correct. So, the 2D Gaussian Cartesian version may be a mistake.
Second, in all the 2D influence functions, but none of the 3D ones, if the caustic has occurred, the ray phase is discarded, and only the accumulated phase from previous caustics (plus the new pi/2) is used. This seems like it may have been a typo which was copied-and-pasted to all the 2D functions, but then fixed in the 3D functions.
Finally, when checking for caustics, the position of the equals sign in the
greater than / greater-than-or-equal-to / etc. symbols differs. In 2D SGB, the
equal signs are on qOld, whereas in all the other cases they are on q. Since
this is just detecting a change from positive to negative or vice versa, neither
choice is more correct than the other. But it's not clear why this should differ
between different influence functions.
- rA is computed as the distance between the start of the ray and the source, but the ray always starts at the source so this is always zero. ir is then initialized to the minimum index of receivers at r > rA. This will be 1 if there are no receivers at 0, or 2 if there are.
- On the first step only, if the step is not a duplicate, there is a check for whether the "ray is moving outward in range", which it always is because it always starts from range 0 (rA is 0, rB is a norm so it must be >= 0, and if it was 0 it would be a duplicate step and not get there). This sets ir to 1.
- Therefore, if the first step is a duplicate, receivers at 0 will be skipped, but if the first step is not a duplicate, receivers at 0 will be included.

This is unlikely to be desired behavior. Just initialize rA = 0.0 and ir = 1 and remove the other code.
PCHIP SSP is not supported in 3D, despite being very similar to cubic, which is supported in 3D.
The check for whether the receiver is inside the beam window is commented out in
both 3D Gaussian influence functions. There is code involving MaxRadius
which
may have a similar purpose earlier in that function in ray-centered, but not in
Cartesian, and it's not clear why this condition should be omitted in the 3D but
not the 2D Gaussian beams. Also, 2D beams consider the receiver to be outside
the beam if it is on the edge, whereas 3D beams consider the receiver to be
inside the beam if it is on the edge (e.g. write if a < BeamWindow
for 2D but
a <= BeamWindow
for 3D). Neither of these is more correct than the other, but
there does not seem to be any reason to have different behavior.
There is a similar set of inconsistencies for the detection of duplicate points.
All cases use a dynamic threshold, 1.0D3 * SPACING( ray%x( 1 ) ), except for 3D Cartesian ray-centered, which uses a static threshold of 1e-4. The dynamic threshold is commented out in this code. Also, again, in 3D, the point is considered a duplicate if its change in position is <= the threshold, whereas in 2D it is < the threshold.
Geoacoustics are supported for altimetry (which doesn't really make sense), but writing the header for geoacoustics is only supported for bathymetry.
Both hat and Gaussian 3D ray-centered store m
and delta
from B
to A
after every step, as if the last step's value was going to be reused. But the
last step's value is recomputed every step anyway, as is needed because steps
can be skipped, making the old A
value stale.
Gaussian 3D ray-centered also initializes deltaA, irA, and mA before the
stepping loop, but their values are immediately overwritten. It also sets irA
to zero with a comment that this is a flag, but there is no code to check this
condition.
In WriteArrivals (ASCII/binary, 2D/3D) there is inconsistent conversion to 32-bit float when writing A and Phase. This was partly cleaned up in the
2022 changes, but there are still some inconsistencies. In all but 2D ASCII,
Phase
is converted to degrees in 64-bit, whereas in 2D ASCII it is converted
to degrees in 32-bit; and in all but 3D ASCII, the actual value written is
32-bit, whereas in 3D ASCII, the value written (in text) is 64-bit.
The numerical integration in BELLHOP/BELLHOP3D operates by taking a trial
step along the ray with a large step size, and then reducing the step size to
land on the first "interface or boundary crossing". Then it evaluates the field
parameters at this trial location, and adjusts the angles for the trial step and
tries again. Finally, it interpolates between the two trials based on the field
parameters and commits a step in that resulting direction. Steps are reduced to
land on whichever of the following comes first (along the ray):
- Plane where SSP changes along Z
- Crossing to outside the beam box (the maximum bounds for the whole simulation) in R or Z (2D) or in X, Y, or Z (3D) (new in 2022)
- Crossing and reflecting from the top (ocean surface)
- Crossing and reflecting from the bottom (ocean floor)
- Top segment crossing (i.e. where the top definition changes, either due to a vertex in its height, or changed parameters; this rarely occurs for the top, but often for the bottom), in R (2D) or in X or Y (3D)
- Bottom segment crossing, in R (2D) or in X or Y (3D)
- Plane where SSP changes along R (2D) or X or Y (3D)
- 3D only: crossing the diagonal of the top XY segment (i.e. crossing from one of the triangles to the other, which may have different slopes)
- 3D only: crossing the diagonal of the bottom XY segment
Each of these boundaries produces a step size, and the smallest one is used.
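As a sketch of that selection, assuming each candidate crossing has already been converted to a distance along the ray (names are illustrative, not the actual ReduceStep2D / ReduceStep3D interface):

```fortran
FUNCTION FirstCrossingStep( hDefault, hCrossings, N, hMin ) RESULT( h )
   ! Take the default (large) step unless some interface, boundary, or segment
   ! crossing comes first along the ray; never step less than the failsafe hMin.
   IMPLICIT NONE
   INTEGER,       INTENT(IN) :: N
   REAL (KIND=8), INTENT(IN) :: hDefault          ! default step size
   REAL (KIND=8), INTENT(IN) :: hCrossings( N )   ! distance along the ray to each candidate crossing
   REAL (KIND=8), INTENT(IN) :: hMin              ! enforced minimum step size (~1e-5)
   REAL (KIND=8)             :: h
   h = MIN( hDefault, MINVAL( hCrossings ) )
   h = MAX( h, hMin )
END FUNCTION FirstCrossingStep
```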
Then this step size is used to update the position (and other parameters). Due
to floating-point numbers having limited precision, in practice it is "random"
whether the resulting position is slightly before or slightly after the actual
boundary. For example, if there is a depth SSP change at 1000 m, the position
may end up being 999.999999994 m or 1000.0000000002 m. This often occurs even
though, in this example (as is often the case for values provided in a config
file), the "correct" value (1000.0) is precisely expressible as a floating-
point number. This is not actually non-deterministic behavior--BELLHOP/BELLHOP3D will produce binary identical outputs for identical input files.
However, for some set of rays shot toward the same boundary, some set of them
will land on each side of the boundary, without any discernible pattern to this
behavior. And a version of BELLHOP/BELLHOP3D built with a different compiler
or on a different machine, or of course our C++/CUDA
version, will produce a
different pattern.
Despite this behavior only causing an infinitesimal difference in the resulting
ray position, this difference gets amplified through multiple interacting
mechanisms in BELLHOP/BELLHOP3D. The amplified errors can then show up as
substantial errors or inconsistencies in the output.
First, a ray which lands on the near side of the boundary will take its next step, hit the same boundary again, but cross it due to the enforced minimum step size. By itself, this issue immediately causes inconsistent results in ray mode: there is a different number of superfluous near-duplicated points, and therefore a different number of steps in the ray. In addition, the difference in its position (compared to a ray which happens to land on the other side of the boundary) has now been amplified from the scale of about 1e-16 from the floating-point error, to the minimum step size of about 1e-5. This larger error increases the chances of inconsistent behaviors later.
The inconsistency in which side of a boundary a step lands on also directly causes
another issue. Since environment files use round numbers, receivers being
exactly on boundaries is very common. Whether the step is slightly before or
slightly after the boundary (receivers) determines whether the incoming step or
the outgoing step influences the receivers. Often the change between these two
is small, but sometimes it is large. For example, if the outgoing step is an
infinitesimal step, a great deal of precision is lost when computing the delay
at the receiver, which can substantially change the phase. (For example, in one
case we looked at in detail, the delay was 32.473 instead of 32.467, an error of about 0.01% due to this effect. Due to the frequency omega = 314.159, this shifted the phase by about a quadrant, causing the final results at this
receiver to be unrecognizable compared to the original.) Or, if this boundary is
the stopping condition for the ray, there is no outgoing step, so the receivers
will be influenced or not influenced at all.
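(As a rough check of the delay example above: the phase shift is the frequency times the delay error, and 314.159 * ( 32.473 - 32.467 ) is about 1.9 radians, or roughly 108 degrees, which is indeed about a quadrant.)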
Second, error can accumulate just due to the integration, even without any of the "hard edge" cases discussed next. A small error in the initial trajectory of a ray can lead the ray to be off by a noticeable amount after traveling 100 km. Also see the "Curvature correction rotation divergence" section above for an example where very small error is compounded every step.
Finally, and most importantly, the physics of BELLHOP/BELLHOP3D is full of
"hard edges" or discontinuities, where moving a ray by a very small amount
creates a completely different result for the future propagation of the ray.
Some of these discontinuities are unavoidable, for example:
- If a ray is curving up towards the surface, the difference between curving back down just before it reaches the surface, and reflecting, is a discontinuity. Even if the trajectory is almost flat in both cases, the phase will be inverted if it is reflected.
- If a ray approaches a vertex of the bottom where the slope changes, whether it hits just before or just after that vertex will completely change the reflection trajectory.
Nevertheless, some of these discontinuities are purely artificial; see for
example the "Shallow angle range" and "RToIR
step-to-edge-case issue" sections
above. The most common example of this is that in TL, eigenrays, or arrivals, a
ray being considered close enough to a receiver to influence it is a hard edge
for certain influence types. For runs with tens of thousands each of rays and
receivers, even if each ray position is only off by 1e-5 meters, it is not rare
that a ray will be close enough to a receiver to "count" in one case, and too
far in the other case. Usually the influence near the edge of the beam is low,
but sometimes this ray will be the only ray to influence a particular receiver.
In this case, if the ray hits the receiver in the Fortran version and does not
hit it in the C++/CUDA version, the relative error is 100%; but even worse, if
the ray does not hit the receiver in the Fortran version but does hit it in the
C++/CUDA version, the relative error is infinite!
There is no one fix for all of these circumstances, but a fix has been implemented which generally solves the inconsistency about which side of a boundary a step lands on. First, a function StepToBdry2D/3D is added which corresponds to ReduceStep2D/3D. This function ensures that after the step,
the resulting position is exactly on the boundary--and if the value is not
precisely representable in floating-point, that it is the same value as already
stored in memory for the boundary. Second, the functions which update which
segment a position is in have been changed to use the ray tangent to resolve
edge cases, instead of choosing between less-than-or-equal-to versus less-than
and such. The ray is always placed into the later segment: the segment such that
it can move forward a nontrivial distance and remain in the same segment. The
idea is, since every step must be an edge case, it is put exactly on the edge,
and then that exactly-on-the-edge position is handled in a consistent manner.
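A sketch of the tangent-based choice for a single dimension (illustrative names; the real code also handles the 3D triangle diagonals discussed next):

```fortran
SUBROUTINE PickSegment( x, t, seg, NSeg, iSeg )
   ! x is exactly on a segment boundary after StepToBdry; use the tangent
   ! component t to place the ray in the segment it is about to travel through.
   IMPLICIT NONE
   REAL (KIND=8), INTENT(IN)    :: x, t
   INTEGER,       INTENT(IN)    :: NSeg
   REAL (KIND=8), INTENT(IN)    :: seg( NSeg + 1 )   ! segment boundaries, increasing
   INTEGER,       INTENT(INOUT) :: iSeg
   DO WHILE ( iSeg > 1 .AND. ( x < seg( iSeg ) .OR. ( x == seg( iSeg ) .AND. t < 0.0D0 ) ) )
      iSeg = iSeg - 1   ! on the lower edge and moving down: use the segment below
   END DO
   DO WHILE ( iSeg < NSeg .AND. ( x > seg( iSeg + 1 ) .OR. ( x == seg( iSeg + 1 ) .AND. t > 0.0D0 ) ) )
      iSeg = iSeg + 1   ! on the upper edge and moving up: use the segment above
   END DO
END SUBROUTINE PickSegment
```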
This gets more complicated for crossing the diagonals in 3D (and Nx2D), as the
boundary does not have a single, fixed value in one dimension. In this case, a
set of flags Top_td_*
and Bot_td_*
are used to store information about which
side of the boundary the ray is on while its position is nearly on top of the
boundary. The stepping process takes these into account and produces behavior
which is generally consistent for large steps to and then from the boundary. It
also attempts to not interfere with cases where a ray is traced along this
diagonal boundary, or lands on the boundary due to stepping to a different
boundary.
Things are even worse for Nx2D, where a 2D ray has to step to 3D boundaries, in
a way that survives a transformation to 2D and then back to 3D. To handle this,
a system (OceanToRayX) was put in place which attempts to step to the exact
value in whatever dimension is relevant to the boundary being stepped to. If it
is not able to find a floating-point value which will remain exactly on the
boundary after the 3D - 2D - 3D transformation, it uses the next floating-point
value past the boundary in the direction of the ray tangent, so the ray will be
in the next segment and will not add a superfluous infinitesimal step.
This particular issue, and the insight gained from debugging it, is what
initially led to the creation of this modified BELLHOP/BELLHOP3D repo.
In BELLHOP, iSegz and iSegr are initialized (to 1) only once globally.
This means their initial state for each ray is their final state from the
previous ray. Normally, this shouldn't matter, because the segment indices are
recomputed if they are wrong; but...
SSP checks whether the current depth and range are within the current segment,
using an interval closed on both sides (including both endpoints). If the
segment is incorrect, the correct segment is searched for, but this search uses
a half-open interval (including the lower endpoint but excluding the upper one).
It often happens that the depth or range value is exactly on an endpoint of a
segment, because usually both source positions and depth segment boundaries are
artificially set to nice round numbers in the environment files. In these cases,
this algorithm will use the lower segment this endpoint belongs to if the
current segment index (iSegz or iSegr) is the lower segment, and it will use
the upper segment if the segment index was anything else and it had to search
for the correct segment. So, in these cases, the segment the ray starts in
depends on the previous state of iSegz or iSegr, i.e. the segment the
previous ray finished in. Normally, this shouldn't matter, because the position
is exactly on the boundary of these two segments, so it should be equally valid
in either of them; but...
The numerical integration step algorithm takes a trial step forward of a default step size. If this step puts the position into a new segment, it reduces the step size so that it falls on the segment boundary instead. In the cases above, if the position is on the lower boundary of the upper segment, this step will remain in the same segment, but if it's on the upper boundary of the lower segment, this step will cross into the upper segment and therefore be reduced to land on the boundary again, for a step size of zero. There is a failsafe which ensures a minimum step size, placing the ray slightly inside the upper segment. This creates an extra, very small (~1e-5 m), step, or not, depending on which segment the previous ray happened to end in. Normally, this shouldn't matter, because one extra infinitesimally small step should not substantially change the field results; but...
The SGB (Bucker's Simple Gaussian Beams) transmission loss influence algorithm has a bug, acknowledged by mbp in code comments, where every step is assumed to be of the full, maximum step size, which is often 500 m or larger. Thus, one extra infinitesimally small step is actually one extra full-size step, which substantially impacts the field results from this beam. Again, this change happens "randomly" based on the final state of the previous ray/beam traced.
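A sketch of the segment search behavior described in this case study (simplified, with illustrative names):

```fortran
SUBROUTINE FindDepthSegment( z, zGrid, NPts, iSegz )
   IMPLICIT NONE
   INTEGER,       INTENT(IN)    :: NPts
   REAL (KIND=8), INTENT(IN)    :: z, zGrid( NPts )
   INTEGER,       INTENT(INOUT) :: iSegz
   INTEGER                      :: iz
   ! Closed-interval check: if z is exactly on either endpoint of the current
   ! segment (whatever the previous ray left behind), that segment is kept.
   IF ( z < zGrid( iSegz ) .OR. z > zGrid( iSegz + 1 ) ) THEN
      ! Half-open search: a depth exactly on an interior boundary always ends
      ! up in the upper of the two adjacent segments.
      DO iz = 2, NPts
         IF ( z < zGrid( iz ) ) THEN
            iSegz = iz - 1
            EXIT
         END IF
      END DO
   END IF
END SUBROUTINE FindDepthSegment
```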
iSegz
and iSegr
have been re-initialized to 1 before every ray/beam is
traced, removing this "random" inconsistent behavior. The SGB bug still stands,
as fixing it would mean changing the physics, but at least it produces
reproducible results now.
See index.htm for information from the original repo.
Code initially retrieved 12/17/21 from http://oalib.hlsresearch.com/AcousticsToolbox/ , the version labeled 11/4/20. In late 2022, the diff between mbp's newer 4/20/22 and 11/4/20 releases was computed, and the changes were applied to this code (with appropriate changes to integrate them).
Files pertaining to the other simulators (Krakel, Kraken, KrakenField, Scooter) have been removed.
The Makefile has been set up to build with gfortran by default (Linux, or Windows with MinGW, etc.).