SAET

SHORELINE ANALYSIS AND EXTRACTION TOOL

INDEX

INTRODUCTION
WORKFLOW
REQUIREMENTS
FOLDER STRUCTURE
RUNNING SAET
OUTPUTS
CONSIDERATIONS
INSTALLATION

1. INTRODUCTION (index)

SAET is a software for the extraction and analysis of shorelines using satellite images from the Sentinel-2 series (levels 1C and 2A) and Landsat 8 and 9 (collection 2, levels 1 and 2). It is primarily focused on studying the impact of coastal storms, allowing the determination of the shoreline position before and after the storm to evaluate its effects. Although this is its main function, SAET can also be used for temporal analysis and to study the evolution of any event (natural or anthropogenic). The main features are as follows:

Direct access to official satellite image servers: ESA and USGS.
Download and processing of entire scenes, allowing coverage of large areas affected by storms.
Numerous configuration parameters (different types of water indices, segmentation thresholds, etc.) that make SAET a highly flexible software capable of being adapted to different scene conditions.
Sub-pixel algorithm for shoreline extraction.
Visualization of QUICKLOOK images in HTML format.
Activation of offline Sentinel-2 images.
Output of the shoreline positions in shapefile format (point and line) in the WGS84 spatial reference system (EPSG: 4326).

This software has been developed as part of the H2020 EU project ECFAS (a proof-of-concept for the implementation of a European Copernicus Coastal Flood Awareness System, GA n° 101004211) by the Geo-Environmental Cartography and Remote Sensing Group (CGAT) at the Universitat Politècnica de València, Spain. It contains the core algorithm for shoreline extraction at a sub-pixel level. For detailed information on the tool and the algorithm, please refer to the following papers:

Palomar-Vázquez, J.; Pardo-Pascual, J.E.; Almonacid-Caballer, J.; Cabezas-Rabadán, C. Shoreline Analysis and Extraction Tool (SAET): A New Tool for the Automatic Extraction of Satellite-Derived Shorelines with Subpixel Accuracy. Remote Sens. 2023, 15, 3198. https://doi.org/10.3390/rs15123198
Pardo-Pascual, J.E., Almonacid-Caballer, J., Ruiz, L.A., Palomar-Vázquez, J. Automatic extraction of shorelines from Landsat TM and ETM multi-temporal images with subpixel precision. Remote Sensing of Environment, 123, 2012. https://doi.org/10.1016/j.rse.2012.02.024
Pardo-Pascual, J.E., Sánchez-García, E., Almonacid-Caballer, J., Palomar-Vázquez, J.M., Priego de los Santos, E., Fernández-Sarría, A., Balaguer-Beser, Á. Assessing the Accuracy of Automatically Extracted Shorelines on Microtidal Beaches from Landsat 7, Landsat 8 and Sentinel-2 Imagery. Remote Sensing, 10(2), 326, 2018. https://doi.org/10.3390/rs10020326
Sánchez-García, E., Palomar-Vázquez, J. M., Pardo-Pascual, J. E., Almonacid-Caballer, J., Cabezas-Rabadán, C., & Gómez-Pujol, L. An efficient protocol for accurate and massive shoreline definition from mid-resolution satellite imagery. Coastal Engineering, 103732, 2020. https://doi.org/10.1016/j.coastaleng.2020.103732
Cabezas-Rabadán, C., Pardo-Pascual, J. E., & Palomar-Vázquez, J. Characterizing the relationship between the sediment grain size and the shoreline variability defined from sentinel-2 derived shorelines. Remote Sensing, 13(14), 2829. 2021. https://doi.org/10.3390/rs13142829
CGAT: https://cgat.webs.upv.es
ECFAS: https://www.ecfas.eu
SAET ECFAS E-learning module

Copyright and License

This open-source software is distributed under the GNU license and is the copyright of the UPV. It has been developed within the framework of the European ECFAS project by the following authors: Jesús Palomar Vázquez, Jaime Almonacid Caballer, Josep E. Pardo Pascual, and Carlos Cabezas Rabadán.

This project has received funding from the Horizon 2020 research and innovation programme under grant agreement No. 101004211

Please note that this software is designed specifically for the automatic extraction of shorelines in pre-storm and post-storm events and is not intended for massive extraction purposes.

How to Cite SAET

To cite SAET in your research, please use the following reference:

J. Palomar-Vázquez, J. Almonacid-Caballer, J.E. Pardo-Pascual, and C. Cabezas-Rabadán (2021). SAET (V 1.0). Open-source code. Universitat Politècnica de València. http://www.upv.es

2. WORKFLOW (index)

In this image, we can see the general workflow of the algorithm. The parameters can change depending on the expected results (see section 5).

3. REQUIREMENTS (index)

The tool uses Sentinel 2, Landsat 8 and Landsat 9 images as input. In this way, the first thing needed is to have username and password from Copernicus Scihub and USGS Landsat Explorer servers:

Access USGS Landsat Explorer service: In this case, you need to do two things: to register on the Landsat Explorer website and to make a request to access the service “machine to machine” (m2m). For the first requirement, you must register on the website https://ers.cr.usgs.gov/register. Once you have your credentials, access the website https://earthexplorer.usgs.gov, and go to your profile settings. Click on the button “Go to Profile” and finally, on the option “Access Request”. There you can make a new request to the m2m service by filling out a form. Once you have your credentials for both data source providers you can edit the file “saet_config.py” (see structure section) by changing the asterisks with your own credentials:

os.environ['USER_ESA'] = os.getenv('USER_ESA', '********')
os.environ['PASS_ESA'] = os.getenv('PASS_ESA', '********')
os.environ['USER_USGS'] = os.getenv('USER_USGS', '********')
os.environ['PASS_USGS'] = os.getenv('PASS_USGS', '********')

4. FOLDER STRUCTURE (index)

The folder SAET contains the following files and subfolders:

saet_run.py. Main script. It must be executed in command line mode.
saet_tools.py. Module that contains all functions needed to run the algorithm.
polynomial.py. Module with extra functions for surface interpolation.
saet_config. py file that contains several configuration variables (credentials for the satellite imagery servers, folder structure for results, etc.).
examples_of_use.txt. Text file that contains several examples of use of the tool.
aux_data. Folder containing:
- beaches.shp. Shapefile with all the European areas classified as beaches. Based on “Coastal uses 2018” dataset (https://land.copernicus.eu/local/coastal-zones/coastal-zones-2018). This shapefile contains the field named “BEACH_CODE” that will be copied to the final shoreline shapefile. This file is used to focus the shoreline extraction into those areas.
- landsat_grid.shp. Shapefile including Landsat-8 footprints (https://www.usgs.gov/media/files/landsat-wrs-2-descending-path-row-shapefile). This shapefile has two fields named “PATH” and “ROW” that will be used to select a scene by its identifier.
- sentinel2_grid.shp. Shapefile including Sentinel-2 footprints (based on the .kml file https://sentinels.copernicus.eu/web/sentinel/missions/sentinel-2/data-products).
- map_director.qgz. Project file for QGIS with the three previous .shp files. It is useful for planning the process of downloading scenes.
- SAET.pdf. Document explaining the tool.
- roi.geojson. File in geojson format containing an example of ROI (region of interest) to be used for searching scenes. To make easier the creation of this file (if it is needed) you can visit this website: https://geojson.io
search_data. Folder containing the file for the result of the algorithm in “searching” mode. This file will have the format indicated in the configuration file (saet_config.py). In this way, the possible formats are html, txt or json.
landsatxplore2. This folder contains the needed files for a modified version of the landsatxplore API to access to the Landsat imagery from the USGS server. This have been necessary to update this library to the new Collection 2 of Landsat, which includes Landsat-9 product.

Configuration file

The configuration file (saet_config.py) contains some parameters that allow controlling the access to the imagery servers and modify the algorithm workflow. Normally it will not be needed to modify this file apart from the credential values, but if you want to do it, you must take in account this explanation about each section:

Section “credentials”. Change the asterisks characters by your own credentials to properly run SAET (see section 3).
Section “home folder”. It represents the path where SAET will be installed. All other subfolders will depend on it by employing relative paths.
Section “auxiliary data”. Relative path to the auxiliar data needed to SAET. The name of each shapefile can be changed if it is required.
Section “logging”. This section should be changed only by expert users. It controls the level of messages (severity) that SAET can return. For testing and debugging purposes, set this level to 10.
Section results. It controls how the user can see the searching results. The rest of parameters that control SAET are exposed as command line parameters (see section 5)

5. RUNNING SAET (index)

One way to do this is by opening a PowerShell window or a command window (cmd). In Windows, go to the search bar and type "powershell" or "cmd". Run the script saet_run.py with parameters:

python saet_run.py --parameter=value

Note: whether using one or the other method (powershel or cmd), it would be a good idea to open these tools as administrator.

Parameters

Parameter	Description	Required	Usage / examples	Default value
--help	Shows the tool help message.
--rm	Run mode. “os” means only searching; “dp” downloading and processing; “od” only downloading; “op” only processing (for previous downloaded images); “or” offline S2 retrieval. If you want just only search for images, use the “os” mode. Note: “dp” mode must be used along with the parameter --np (number of products). “or” mode must be used along with the parameter –oa (offline S2 activation)	Yes	--rm=os --rm=dp --rm=od --rm=op --rm=or	os
--fp	Footprint for scene searching. This parameter has three ways to be used: - path to the ROI file (see structure section). - bounding box in latitude and longitude coordinates separated by “comma” with the format min_long, min_lat, max_long, max_lat. - using NONE value to avoid searching by ROI. In this case, we can activate the searching by scene or tile identifiers (parameters --ll and --sl).	Yes except “op” mode	fp=c:\data\roi.geojson. fp= 0.28,39.23,0.39,39.33 fp = NONE	NONE
--sd	Start date for searching scenes in format (YYYYMMDD).	Yes except “op” mode	--sd=20211001
--cd	Central date for searching scenes in format (YYYYMMDD). It is assumed to be the central date of the storm.	Yes except “op” mode	--cd=20211001
--ed	End date for searching scenes in format (YYYYMMDD).	Yes except “op” mode	--ed=20211001.
--mc	Maximum percentage of cloud coverage in the scene. It must be a number between 0 and 100.	Yes except “op” mode	--mc=10
--lp	Product type for Landsat scenes. By default, the tool uses the Collection 2 (level 1) to search Landsat-8 images (landsat_ot_c2_l1), but it also can search Landsat 8 and Landsat 9 images from Collection 2 at level 2 (landsat_ot_c2_l2). This parameter can be set up to “NONE”, to avoid Landsat scenes processing. Note: - L8 and L9 products are only available from Collection 2. Collection 1 are not available anymore as of December 30, 2022.	Yes except “op” mode	--lp= landsat_ot_c2_l1 --lp= landsat_ot_c2_l2 --lp=NONE
--ll	Scene list identifiers for Landsat images. It must be a list of numbers of 6 digits. If there is more than one identifier, they must be separated by the “comma” character. The default value is NONE (which means that the search by ROI will have priority).	Yes except “op” mode	--ll=198032 --ll=198032,199031 --ll=NONE	NONE
--sp	product type for Sentinel-2 scenes (S2). The tool uses 1C (S2MSI1C) and 2A (S2MSI2A) products. The product by default is 1C. This parameter can be set up to “NONE”, to avoid S2 scenes processing.	Yes except “op” mode	--sp= S2MSI1C --sp= S2MSI2A --sp=NONE	NONE
--sl	Scene list identifiers for S2. It must be a list of alphanumeric characters (named “tile” identifier) composed of two numbers and three capital letters. If there is more than one identifier, they must be separated by the “comma” character. The default value is NONE (which means that the search by ROI will have priority).	Yes except “op” mode	--sl=30TYJ --sl=30TYJ,30TYK --sl=NONE	NONE
--bc	Beach code list to filter the extraction process for a group of beaches. This code is related to the “BEACH_CODE” field in the shapefile “Beaches.shp”. The default value is NONE, which means that all beaches in the image will be processed. In case you want process a group of beaches, you must indicate a list of codes, separated by “comma”. If some code in the list is not correct, it will not be considered. If all codes are incorrect, all beaches will be processed.	False	--bc=1683 --bc=1683,2485,758 --bc=NONE	NONE
--of	Output data folder. It indicates the folder where the images and results will be stored	No	--of=c:\data (windows) --of=/data(linux)	SAET installation folder
--wi	Water index type. SAET supports these indices: aweinsh, aweish, mndwi, kmeans (K-means it is not a water index, but also leads to a classification mask. In this case it is not needed a threshold value).	No	--wi=aweinsh --wi=aweish --wi=mndwi --wi=kmeans	aweinsh
--th	Threshold method to obtain the water-land mask from the water index. SAET supports three methods: standard 0 value, Otsu bimodal and Otsu multimodal with three classes. These methods are applied for all type of index except kmeans.	No	--th=0 (standard 0 value) --th=1 (Otsu bimodal) --th=2 (Otsu multimodal)	0
--mm	Morphological method. To generate the shoreline at pixel level (SPL) from the water-land mask. SAET can apply two methods: erosion and dilation.	No	--mm=erosion --mm=dilation	dilation
--cl	Cloud masking severity. This parameter controls what kind of clouds will be used to mask the SPL. SAET supports three levels of severity: low (SAET don’t use cloud mask), medium (only opaque clouds) and high (opaque clouds, cirrus, and cloud shadows). Note: Landsat 8-9 Collection 2 and Sentinel-2 use algorithms to classify clouds. SAET uses these classification layers. You must assume that sometimes this classification can fail. This will directly affect the result.	No	--cl=0 (low) --cl=1 (medium) --cl=2 (high)	0
--ks	Kernel size. The main algorithm for shoreline extraction uses a kernel analysis over each pixel in the SPL. Users can control this size, choosing between 3 or 5 pixels.	No	--ks=3 --ks=5	3
--np	Number of products. List of products to be processed in both “d” and “p” modes. The list must consist of the identifiers of the products to be processed. These identifiers can be obtained in the searching mode. This parameter supports three different formats: list of identifiers, range of identifiers or all identifiers.	No	--np=0,2,5,3 (list) --np=5-10 (range) --np=* (all)	NONE
--oa	Offline Sentinel-2 activation. Some S2 products could be in offline mode. So, before downloading them, they must be activated (from offline to online mode). See “considerations” section.	No except “or” mode	--oa=check --oa=activate	check

This is the text of help that appears when you run SAET with the --h parameter (python saet_run.py --h):

usage: saet_run.py [-h] --rm {os,dp,od,op,or} --fp FP --sd SD --cd CD --ed ED --mc [0-100] --lp {landsat_ot_c2_l1,landsat_ot_c2_l2,NONE} --ll LL --sp {S2MSI1C,S2MSI2A,NONE} --sl SL [--bc BC] [--of OF]
                   [--wi {aweish,aweinsh,mndwi,kmeans}] [--th {0,1,2}] [--mm {erosion,dilation}] [--cl {0,1,2}] [--ks {3,5}] [--np NP] [--oa {check,activate}]

optional arguments:
  -h, --help            show this help message and exit
  --rm {os,dp,od,op,or}
                        Run mode (only search [s] / download and process [dp] / only donwload [od] / only process [op] / offline S2 retrieval [or]). --rm=os / --rm=dp / --rm=od / --rm=op / --rm=or.
                        Default: os
  --fp FP               path of the roi file for searching scenes (fp=c:\data oi.geojson), coordinates long/lat in this format: fp=long_min,lat_min,long_max,lat_max. Default: NONE
  --sd SD               Start date for searching scenes (YYYYMMDD). --sd=20210101. Default:20200101
  --cd CD               Central date for storm (YYYYMMDD). --sd=20210101. Default:20200102
  --ed ED               End date for searching scenes (YYYYMMDD). --sd=20210101. Default:20200103
  --mc [0-100]          maximum cloud coverture for the whole scene [0-100]. --mc=10
  --lp {landsat_ot_c2_l1,landsat_ot_c2_l2,NONE}
                        Landsat 8 product type. landsat_ot_c2_l1 or landsat_ot_c2_l2 or NONE. Default: landsat_ot_c2_l1
  --ll LL               List of scenes for Landsat 8 (number of 6 digits). --ll=198032,199031. Default: NONE
  --sp {S2MSI1C,S2MSI2A,NONE}
                        Sentinel 2 product type (S2MSI1C / S2MSI2A). --s2=S2MSI1C / --s2=S2MSI2A / NONE. Default: S2MSI1C
  --sl SL               List of scenes for Sentinel 2 (string of 5 characters). --sl=31TCF,30TYK. Default: NONE
  --bc BC               beach code filter list. --bc=520,548 Default: NONE
  --of OF               output data folder. --of=c:\data (windows) --of=/data. Default: SAET_HOME_PATH
  --wi {aweish,aweinsh,mndwi,kmeans}
                        Water index type (aweish, aweinsh,mndwi,kmeans). --wi=aweinsh. Default: aweinsh
  --th {0,1,2}          Thresholding method (0: standard 0 value, 1: Otsu bimodal, 2: Otsu multimodal 3 classes). --th=0. Default: 0
  --mm {erosion,dilation}
                        Morphological method (erosion, dilation). --mm=dilation, Default: dilation
  --cl {0,1,2}          Cloud mask level (0: no masking, 1: only opaque clouds, 2: opaque clouds + cirrus + cloud shadows). Default: 0
  --ks {3,5}            Kernel size for points extraction. Default: 3
  --np NP               List of number of products for download (only if --rm=d and --rm=p). [0,2,5,3] / [*] / [5-10]. Default: NONE
  --oa {check,activate}
                        Offline S2 activation (only if --rm=or). "check" / "activate". Default: "check"

Examples of use

Searching for all Sentinel-2 (level 1C) scenes inside an area of interest (tile 30SYJ), with less than 15% of cloud coverage and within the date range from 01-04-2023 to 30-04-2023 (central date or storm peak 15-04-2023):

python saet_run.py --rm=os --fp=NONE --sd=20230401 --cd=20230415 --ed=20230430 --mc=15 --lp=NONE --ll=NONE --sp=S2MSI1C --sl=30SYJ

2023-06-28 16:29:02,542 INFO Starting SAET algorithm...

2023-06-28 16:29:02,880 INFO Searching for S2 images...

2023-06-28 16:29:04,066 INFO Found 6 products
Scene: S2B_MSIL1C_20230420T104619_N0509_R051_T30SYJ_20230420T125145 Cloud coverage: 9.2 availability: online
Scene: S2B_MSIL1C_20230417T103629_N0509_R008_T30SYJ_20230417T123856 Cloud coverage: 0.05 availability: online
Scene: S2A_MSIL1C_20230412T103621_N0509_R008_T30SYJ_20230412T155912 Cloud coverage: 0.0 availability: online
Scene: S2B_MSIL1C_20230407T103629_N0509_R008_T30SYJ_20230407T124138 Cloud coverage: 0.0 availability: online
Scene: S2A_MSIL1C_20230405T105031_N0509_R051_T30SYJ_20230405T160934 Cloud coverage: 0.01 availability: online
Scene: S2A_MSIL1C_20230402T103621_N0509_R008_T30SYJ_20230402T160048 Cloud coverage: 0.0 availability: online


[0] Scene: S2B_MSIL1C_20230420T104619_N0509_R051_T30SYJ_20230420T125145 Cloud coverage: 9.2% 5 days Online
[1] Scene: S2B_MSIL1C_20230417T103629_N0509_R008_T30SYJ_20230417T123856 Cloud coverage: 0.05% 2 days Online
[*******] Central date:20230415
[2] Scene: S2A_MSIL1C_20230412T103621_N0509_R008_T30SYJ_20230412T155912 Cloud coverage: 0.0% -3 days Online
[3] Scene: S2B_MSIL1C_20230407T103629_N0509_R008_T30SYJ_20230407T124138 Cloud coverage: 0.0% -8 days Online
[4] Scene: S2A_MSIL1C_20230405T105031_N0509_R051_T30SYJ_20230405T160934 Cloud coverage: 0.01% -10 days Online
[5] Scene: S2A_MSIL1C_20230402T103621_N0509_R008_T30SYJ_20230402T160048 Cloud coverage: 0.0% -13 days Online


2023-06-28 16:29:06,192 INFO Time passed: 0hour:0min:3sec

2023-06-28 16:29:06,192 INFO SAET algorithm have finished successfully.

Results show a first list of products with its availability (online or offline). For every image, this last list contains the identifier (number of order in the list), the name, the cloud coverage percentage, and the difference in days between the central date and de image date. Besides, depending on the values of the variable “QUICKLOOK” (configuration file), an .html file with the quicklook images is opened automatically in the default browser.

Note: sometimes firefox may experiment problems showing quicklooks. If this is the case, try chrome as default browser.

Once the searching results have been obtained, if we want to download and process just only the closest products to the central date (products 1 and 2), the suitable command line will be the following:

python saet_run.py --rm=dp --fp=NONE --sd=20230401 --cd=20230415 --ed=20230430 --mc=15 --lp=NONE --ll=NONE --sp=S2MSI1C --sl=30SYJ --np=1,2

In case we want to reprocess previous downloaded images, for example by using other water index or thresholding method, we only have to change these parameters and repeat the same command line but changing the run mode (--rm) to “op”. In this case, the images previously downloaded will be reprocessed.

python saet_run.py --rm=op --wi=mndwi
2023-06-28 17:11:26,536 INFO Starting SAET algorithm...

List of scenes in the data folder:

[0] S2A_MSIL1C_20230405T105031_N0509_R051_T30SYJ_20230405T160934
[1] S2B_MSIL1C_20230420T104619_N0509_R051_T30SYJ_20230420T125145

Number of images to be reprocessed?: 0,1

SAET will display a list of images already stored in the output data folder and will ask for the images to be processed. You must use the same syntax as the --np parameter.

Note: more examples can be found in the file “examples_of_use.txt”.

Run mode election

Next picture shows the workflow to run SAET in the most convenient way. The recommendation is:

Select your area of analysis and product of interest. The file map_director.qgz (QGIS) will be very useful to decide which scene (Landsat) or tile (Sentinel-2) will be used.
Always start with the "only searching mode".
If you are going to analyse just a few images (for example, one or two images before and after the storm peak date), you can follow with the "downloading and processing" run mode.
In case you want to analyse a time series and in order to prevent connection problems form the serves, it is encouraged to use "only download" run mode instead "downloading and processing".
Anyway, if you want to reprocess any image previously downloaded, you can use the "only processing" run mode. This mode will allow you to reprocess the images with other parameters (water index, threshol method, etc.) without having to download them again.
Only in case you want to analyse Sentinel-2 images and some of them are in "offline" mode, you can use the "Offline S2 activation" run mode, along with the parameter "--oa", first time with the value "--oa=activate" to activate the product, and from time to time, with the value "--oa=check", to check if the products are online. Note: Only S2 online products can be downloaded. Once a product has been activated, it remains in online mode just for a few days. Finally, if you prefer, you also can do the activation process from the Landsat Earth Explorer platform.

6. OUTPUTS (index)

After running the tool, a new structure of folders will be created inside the SAET installation folder. Every time SAET is run, new folders are added to the structure. This structure can be similar as follows:

The “ouput_data” folder will be created if it does not exist. Inside, “data”, “sds” and “search_data” folders will be created. “data” folder contains subfolders to download and process every scene.
Every type of image (L8, L9 or S2) is stored in its own folder, which is named as the name of the original image (scene folder). The scene folder contains all needed bands and auxiliary files (metadata, cloud mask, etc.).
The “temp” folder is where all intermediate output files will be stored.
Results (shorelines) will be stored into the “sds” folder, inside of scene folders, and will contain two versions of the shoreline in shapefile format: line format (_lines.shp) and point format (_points.shp). Shapefile shorelines are stored in the World GEodetic System 1984 (WGS84 - EPSG:4326).
“search_data” folder contains an .html file with the different thumbnails corresponding with the found images.

The “temp” folder contains some intermediate files that may be interesting review in case we do not obtain the expected results:
- bb300_r.shp: shapefile containing the beaches file (in WGS84 geographic coordinates) reprojected to the coordinate reference system of the scene.
- clip_bb300_r.shp: previous shapefile cropped by the scene footprint.
- bb300_r.tif: previous file converted to binary raster (pixels classified as beach have the code 1).
- scene_footprint.shp: shapefile containing the footprint polygon of the scene.
- *_wi.tif: raster file containing the computed water index.
- *_cmask.tif: raster file containing the binary mask of the cloud coverage (pixels classified as clouds, cirrus or cloud shadows have the code 1).
- pl.tif: raster file containing the binary mask representing the extracted shoreline at pixel level (pixels classified as shoreline have the code 1).
- *_B11.shp (for Sentinel-2) or *_B6.shp (for Landsat 8-9): shapefile containing the extracted shoreline in point vector format, without having been processed by the cleaning algorithm.
- *_cp.shp: shapefile containing the extracted shoreline in point vector format, once it has been processed by the cleaning algorithm. This folder will be copied to the "SDS" folder by changing the prefix "_cp" to "_points", in both shapefile and json format.
- *_cl.shp: shapefile containing the extracted shoreline in line vector format, once it has been processed by the cleaning algorithm. This folder will be copied to the "SDS" folder by changing the prefix "_cl" to "_lines", in both shapefile and json format.

7. CONSIDERATIONS (index)

This tool downloads one or more L8, L9 or S2 scenes, and it downloads the whole scene. In the case of L8-9, due to the server restrictions all the bands are downloaded (it does not allow single band request or clipping), althought it is a reasonably fast process. In the case of S2, the download process can be a bit slow, but in this case the script only downloads the bands that are needed for the shoreline extraction (the server does allow that feature).
L8 and L9 products are only available from Collection 2. In USGS servers, Collection 1 are not available anymore as of December 30, 2022 (https://www.usgs.gov/landsat-missions/landsat-collection-1).
The algorithm uses the cloud mask information. For L8-9, this information is stored in a .tif file, whereas for S2, it depends on the product (.gml format for the product 1C, and .tif format for the product 2A). This situation can change in the next months and some changes may be needed (see https://sentinels.copernicus.eu/web/sentinel/-/copernicus-sentinel-2-major-products-upgrade-upcoming).
The shapefiles inside the folder “aux_data” are mandatory to make the tool work. If modifications (removing, updating) in the “beaches.shp” file are needed do not forget to maintain the field “BEACH_CODE” with unique identifiers.
The final shoreline is provided in two versions (line and point) and it has the field “BEACH_CODE” to facilitate the subsequent analysis by comparing the same beach section on different dates.
One good way to begin using the tool is trying to see what are the L8-9 or S2 scenes that we are interested in. For this goal, we can use the grid shapefiles for L8-9 or S2 (“aux_data” folder) and other online viewers, like “OE Browser” (https://apps.sentinel-hub.com/eo-browser). In this website we can see the needed products, their footprints, and their available dates and cloud coverage. Once we know this information, we can use the “searching by scenes mode” in the tool. Anyway, if you decide to use the “quicklook” visualization, the .html file will help you to decide the best images for your purposes. On the contrary, if we use the “searching by footprint mode”, the algorithm will search for the L8-9 or S2 nearest to the central data provided. This mode, sometimes, especially in S2 scenes can download more scenes than needed due to the fact that the ROI can overlap with more than one S2 footprint, which leads to a longer downloading time.
To avoid wasting time, can start using the tool firstly in “searching” mode and also in “searching by scenes mode”.
If we request the Sentinel-2 last images, could be possible that we only have access to the 1C product (2A product is not immediately available, and it is needed to spend some time to have access to this product). On the other hand, we also need to consider that 1C product has a cloud mask of lower quality than 2A product.
Sentinel-2 products older than 18 months (level 2A) or 12 months (level 1C) are not directly available as they appear in offline mode. So, if you want to download these products you must active them before. You can do this directly from the Copernicus Open Access Hub platform (https://scihub.copernicus.eu/), following the instructions that appear on “data restoration” section. (https://scihub.copernicus.eu/userguide/DataRestoration). You can do the same directly using SAET. Follow these steps:
1. Run SAET in only search mode (--rm=os)
2. Maybe some products are in offline mode
3. Run SAET in “offline S2 retrieval” mode (--rm=or). You must use the parameter “--oa" (offline S2 activation) along with the run mode “or”. Use the value “--oa=check" to see which products are offline.
4. Run SAET again in “offline S2 retrieval” mode (--rm=or), but this time using the value “--oa=activate”.

8. INSTALLATION (index)

SAET has been developed in python and has been tested for the python version 3.9.7 (64 bits). You can install this version from by installing the file “Windows installer (64-bit)” form the link https://www.python.org/downloads/release/python-397. SAET needs some extra libraries to work. In the file “requirements_readme.txt” we can see the minimum versions of these libraries and some notes about the GDAL library:

Package	Version	Description
Python-dateutil	2.8.2	Functions to extend the standard datetime module
Numpy	1.21.2	Numeric package
Matplotlib	3.4.3	Visualization library
GDAL	3.3.1	Geospatial Data Abstraction Library for raster geospatial data formats.
Sentinelsat	1.1.0	Library to access the Copernicus Open Data Hub servers
Shapely	1.7.1	Library to manage shapefiles
Pyshp	2.1.3	Library to manage shapefiles
Scikit-image	0.18.3	Image processing library
Scikit-learn	1.0.2	Library for data analysis and classification
Scipy	1.7.1	Scientific computing library
Networkx	2.6.2	Library for managing and analysing networks

The easier way to install SAET to avoid conflicts with other libraries already installed in your python distribution is to create a virtual environment. Virtual environments are used to isolate the installation of the needed libraries for each project, avoiding problems among different versions of the same library. Therefore, is the most recommended method.

Virtual environment creation and installation of SAET (recommended)

Note: You can find a detailed document in this step-by-step pdf

Once you have installed python (for example in “c:\python397_64”), follow the next steps (on Windows):

Open a command prompt window.
In this window, install the library “virtualenv” by typing 'pip install virtualenv'.
Close the command prompt window.
Create a new folder called "SAET_installation" (the name does not matter) in whatever location (for example 'c:\SAET_installation').
Copy all SAET files into this folder.
Open a new command prompt window and change the current folder to the SAET installation folder (type 'cd C:\SAET_installation')
In the command prompt type: 'c:\Python397_64\Scripts\virtualenv env' ("env" is the name of a new virtual environment). This will create a new folder named “env”.
Activate the new virtual environment by typing: 'env\Scripts\activate'.
Install all needed libraries one by one typing 'pip install -r requirements_windows.txt' (for windows), or 'pip install -r requirements_linux.txt' (for linux).
Change your credentials in the file “saet_config.py”.

To check if SAET has been correctly installed, type the next sentence in the command prompt window:

python saet_run.py --h

If you have any problems with this way of installation, remove the virtual environment, create it again and try to install the libraries one by one manually. In the file “requirements_readme.txt” we can see the versions of these libraries and some notes about the GDAL library. To remove the virtual environment, follow the next steps:

If you have any problems with this way of installation, remove the virtual environment, create it again and try to install the libraries one by one manually. In the file “requirements_readme.txt” we can see the minimum versions of these libraries and some notes about the GDAL library. To remove the virtual environment, follow the next steps:

Close your command prompt window
Delete the folder containing the virtual environment (in this case, the folder “env”)
Repeat the steps 7 and 8 to create and activate again the virtual environment.
Try the manual installation of each single library typing pip install (library_name)==(library_version). Example: pip install numpy==1.21.2. It is recommendable to do the manual installation in the same order as you can see in the table of libraries in the section 8.

Important note for manual installation:

GDAL installation with pip command can be problematic. If errors occur during the installation of GDAL, so try to install the corresponding wheel file, according to your operative system (Windows or Linux). These files are in the folder "gdal_python_wheels":

Windows: GDAL-3.3.3-cp39-cp39-win_amd64.whl
Linux: GDAL-3.4.1-cp39-cp39-manylinux_2_5_x86_64.manylinux1_x86_64.whl

The installation can be done using the pip command. The example for Windows would be like that:

pip install ./gdal_python_wheels/GDAL-3.3.3-cp39-cp39-win_amd64.whl

Virtual environment creation and installation of SAET on Linux

Open a new terminal
Type 'pip3 install virtualenv'
Close the terminal
Go to the SAET installation folder
Open a new terminal in this folder
Type 'virtualenv env' (“env” is the name of the virtual environment).
Activate this new virtual environment typing 'source env/bin/activate'
Install the libraries typing 'pip3 install -r requirements_linux.txt'
Change your credentials in the file “saet_config.py”
Type 'python3 saet_run.py --h'

simonweppe/SAET_master