mie-lab/trackintel

ENH: location generation improvements

hongyeehh opened this issue · 0 comments

Our location_generation relies on the DBSCAN algorithm and supports generating locations at the user or dataset level. Locations are generated from the core points of the DBSCAN results. When the input epsilon is small and a location consists of only 1 or 2 staypoints (sps), we buffer the original point or line geometry by epsilon to obtain a valid polygon geometry for the extent column. This makes the location results inconsistent. For example, with epsilon=20 and num_samples=2 we get the following locations:

  • a. locations created from more than 2 sps: the sps are directly connected, without a buffer
  • b. locations created from 2 sps: the connecting line is buffered by epsilon

I suggest also buffering the case a geometry so that all extents are constructed consistently.
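A minimal sketch of the suggested behaviour, using shapely directly rather than the actual trackintel code. The helper name `location_extent` and the `buffer_all` flag are hypothetical, and the sketch assumes projected coordinates in metres so that epsilon can be used as a buffer distance as-is:

```python
from shapely.geometry import MultiPoint


def location_extent(points, epsilon, buffer_all=True):
    """Build a polygon extent from the staypoint coordinates of one location.

    points: list of (x, y) tuples, assumed to be in a projected CRS (metres).
    epsilon: buffer distance, in the same units as the coordinates.
    buffer_all: hypothetical switch. True buffers every extent (the
        suggestion in this issue); False buffers only point and line
        geometries (cases a and b as described above).
    """
    # The convex hull is a Point for 1 sp, a LineString for 2 sps (or
    # collinear sps), and a Polygon otherwise.
    hull = MultiPoint(points).convex_hull
    if buffer_all or hull.geom_type != "Polygon":
        return hull.buffer(epsilon)
    return hull


two_sps = location_extent([(0, 0), (30, 0)], epsilon=20)
three_sps = location_extent([(0, 0), (30, 0), (0, 30)], epsilon=20)
three_sps_unbuffered = location_extent(
    [(0, 0), (30, 0), (0, 30)], epsilon=20, buffer_all=False
)
```

With `buffer_all=True`, every extent extends epsilon beyond its staypoints, regardless of how many sps the location contains.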

In addition, we occasionally observe huge locations in dense sp regions that overlap with many other locations (case c in the figure). We need to check how these locations are generated and propose ways to split or avoid them.
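As a starting point for that check, a rough diagnostic could flag extents that are both large and intersect many other extents. This is only a sketch: the function name and both thresholds are hypothetical, it works on a plain list of shapely geometries, and the pairwise loop is quadratic, so a spatial index would be needed for large datasets:

```python
from shapely.geometry import Point, box


def flag_oversized(extents, max_area, max_overlaps):
    """Return indices of extents that are large and overlap many others.

    extents: list of shapely polygons (e.g. the extent geometries).
    max_area: hypothetical area threshold, in squared coordinate units.
    max_overlaps: hypothetical threshold on the number of other extents
        a flagged extent intersects.
    """
    flagged = []
    for i, geom in enumerate(extents):
        if geom.area <= max_area:
            continue
        # Count the other extents that intersect this one.
        n_overlaps = sum(
            1
            for j, other in enumerate(extents)
            if j != i and geom.intersects(other)
        )
        if n_overlaps > max_overlaps:
            flagged.append(i)
    return flagged


# Toy data: one large extent covering three small ones, plus one far away.
toy_extents = [
    box(0, 0, 100, 100),
    Point(10, 10).buffer(5),
    Point(50, 50).buffer(5),
    Point(90, 90).buffer(5),
    Point(500, 500).buffer(5),
]
suspicious = flag_oversized(toy_extents, max_area=1000, max_overlaps=2)
```

The flagged indices could then be inspected against the DBSCAN labels to see which core points produced each oversized location.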

[Figure: example locations illustrating cases a, b, and c]