This Github repository contains data, protocols, and analyses to support the associated manuscript:
Kendall-Bar, J.M., Mukherji, R., Lopez, C., Nichols, J., Lozano, D.L., Pitman, J.K., Holser, R.H., Beltran, R., Schalles, M., Field, C.L., Johnson, S.P., Vyssotski, A.L., Costa, D.P., Williams, T.M. Eavesdropping on the brain at sea: the development of a noninvasive technique to maximize the detection of weak electrophysiological signals from wild animals. Journal of Animal Biotelemetry (2022): https://doi.org/10.1186/s40317-022-00287-x
Here is a table of contents to find what you are looking for:
- /data 🗁: folder with raw, processed, and summarized data
- /scripts 🗁: folder with .R scripts and JMP reports
- /figures 🗁: folder with figures generated from R and/or edited Illustrator
- /additional_files 🗁: folder with supplementary figures and statistics to support the manuscript
This research compendium has been developed using the statistical programming language R. To work with the compendium, you will need installed on your computer the R software itself and optionally RStudio Desktop.
You can download the compendium as a whole or clone this repository. After unzipping: - open the Eavesdropping-on-the-Brain-at-Sea.Rproj file in RStudio - run devtools::install() to ensure you have the packages this analysis depends on (also listed in the DESCRIPTION file). Finally, run the R scripts (in scripts 🗁) according to the steps below:
Text and Figures: CC-BY-4.0
Run this first to get metadata for all seals.
Run this to get paired and binned summary stats for signal quality data.
Run this to get plots out of raw data excerpts provided in data folder.
Column description of important data files:
Metadata for all seals. All Excel Date Times are provided in the following format: "mm/dd/yyyy hh:mm:ss". Row descriptions:
- Test # : recording # (includes test recordings between deployments)
- Animal : portable logger deployment number (incorporated into Nickname)
- Name : long name
- Nickname : unique animal code
- Recording ID : recording type including location (WILD vs. CAPTIVE), age estimate (i.e. 2mo = 2 months old), and age class (weanling, yearling, juvenile)
- Methods_Paper_SEALID : recording number (to match to Table 1 in MS)
- Sex : visually determined sex (M/F)
- Age : estimated age interval in years
- Age estimate : verbal description of age estimate
- Version : tag iteration used (V1/V2/V3)
- Deployment : deployment number
- Seal ID : Resight Seal ID for Ano Nuevo Research database: https://www.anonuevoresearch.com/
- Pressed Start Logger: Excel Date Time for pressing start
- Logger Start: Excel Date Time for actual logger start
- Start from Real Time Clock: Excel Date Time for time derived by real time clock utility (implemented in 2021).
- Start for EDF Files: start time used for EDF files.
- ON ANIMAL: time heart beats first detected in ECG channel (coincides with instrument attachment)
- OFF ANIMAL: time heart beats last detected in ECG channel (coincides with instrument detachment)
- Duration_ON_ANIMAL_h: hours logger was attached until either was removed or device stopped recording.
- Logger Stop: time logger turned off (if applicable).
- Device Failure: indicates whether logger was in ON or OFF state when recovered.
- Standard Length: straight length of animal in centimeters (nose to tail)
- Curved Length: curved length of animal in centimeters (nose to tail along body)
- Ax Girth: circumference of animal behind pectoral flippers
- Mass animal_kg: mass of animal in kilograms
- Flipper tag 1: ID listed on flipper tag 1 (including G to denote green color)
- Position: flipper tag position 1
- Flipper tag 2: ID listed on flipper tag 2 (including G to denote green color)
- Position: flipper tag position 2
- Birth date: verbal description of birth date estimate
- Animal ID: unique animal ID for elephant seal database: https://www.anonuevoresearch.com/
- Deploy ID: unique deployment ID for TOPP Bird & Mammal Database: http://lml-research-app-1.ucsc.edu/web/entryform/
- TOPP ID: unique animal ID for TOPP Bird & Mammal Database: http://lml-research-app-1.ucsc.edu/web/entryform/
- Deploy Latitiude: latitude where instrument was attached to animal
- Deploy Longitude: longitude where instrument was attached to animal
- Hematocrit: blood hematocrit level (if known)
- Ultrasound skull depth_cm: skull depth estimated from ultrasound images
- Recording Duration_s: time logger was recording in seconds
- Recording Duration_days: time logger was recording in days
- Begin Calm in Water for ICA: Excel Date Time for start of ICA training dataset
- End Calm in Water for ICA: Excel Date Time for end of ICA training dataset
- Duration for ICA: length of ICA training dataset hh:mm:ss
- Best EOG EMG EEG: channels that provided best EOG, EMG, L EEG, and R EEG signals
- ICA Decomposition Quality: subjective assessment of ICA decomposition
- ICA Component Maximal Brain: IC# that expressed maximal brain activity
- ICA Component Maximal Brain: IC# that expressed maximal brain activity
- Pruned with ICA Components: ICs that were removed from EOG, EMG, and EEG signals for visual and quantitative analysis
Signal quality data for each observation (a 30-sec time period around each comment- See Cmt Text). Column descriptions:
- Seconds.On.Animal: Seconds since instrument attachment
- Date Time: Excel Date Time for each observation
- Seal.ID: Nickname from S5_00_Sleep_Study_Metadata.xlsx
- Version: Version from S5_00_Sleep_Study_Metadata.xlsx
- AGE : age from S5_00_Sleep_Study_Metadata.xlsx
- Wild v. Captive: WILD or CAPTIVE
- Phase: Mode of categorical location denoting current location (LAND vs. WATER) and then the phase number (i.e. LAND02 denotes second time on land).
- Date : Excel date of recording
- Sel Start: Start time of observation hh:mm:ss
- Sel End: End time of observation hh:mm:ss
- Sel Duration: selection duration (all 30s)
- Pressure_mean : mean pressure for selection
- Pressure_SD : standard deviation of pressure for selection
- REEG2_Raw_Ch7_Mean
- LEEG3_Raw_Ch8_Mean
- EEG_ICA5_Mean
- pitch_Mean
- roll_Mean
- EEG_ICA_DELTA
- EEG_Pruned_DELTA
- EEG_Raw1_DELTA
- EEG_Raw1_DELTA_SD
- EEG_Raw2_DELTA
- EEG_Raw2_DELTA_SD
- EEG_ICA_DELTA2
- EEG_ICA_DELTA_SD
- BEST_EEG_DELTA
- BEST_EEG
- Cmt Text : Comment placed during scoring (includes: Instrument ON Animal, SWS1, REM, SWS2, Heart Patterns Scorable, Sleep State Scorable, Eye Movement, Muscle Twitch, LS (light sleep), Animal Enters Water, Animal Exits Water)
Signal quality data summarized per day per animal. Column descriptions:
- Observation #
- Day: Day since instrument attachment
- Seal.ID: Nickname from S5_00_Sleep_Study_Metadata.xlsx
- Mean: Mean SWS δ/REM δper day
- sd: Standard deviation SWS δ/REM δper day
- Max: Maximum SWS δ/REM δper day (for a single sleep cycle)
- Min: Minimum SWS δ/REM δper day (for a single sleep cycle)
- Mean_SWS: Mean SWS δper day
- sd_SWS: Standard deviation SWS δper day
- Mean_REM: Mean REM δper day
- sd_REM: Standard deviation REM δper day
- Version: Version from S5_00_Sleep_Study_Metadata.xlsx
- Phase: Mode of categorical location denoting current location (LAND vs. WATER) and then the phase number (i.e. LAND02 denotes second time on land).
- Percent.Obs.Water: # of sleep cycles in water for that day / total sleep cycles that day
- Deployment: deployment number from S5_00_Sleep_Study_Metadata.xlsx
- Seal.Number : Methods_Paper_SEALIDfrom S5_00_Sleep_Study_Metadata.xlsx.
- AGE : age from S5_00_Sleep_Study_Metadata.xlsx
Signal quality data summarized per sleep cycle. Column descriptions:
- Observation #
- PairLabel: Sleep cycle number (for paired SWS and REM observations)
- Day: Day since instrument attachment
- MinSec: Seconds on animal before first observation (SWS)
- MeanSec: Mean seconds on animal between SWS and REM observations
- Standardized: SWS δ/REM δfor each observation (paired SWS/REM sleep cycle)
- Seal.ID: Nickname from S5_00_Sleep_Study_Metadata.xlsx
- Location: animal location (LAND v WATER)
- Version: Version from S5_00_Sleep_Study_Metadata.xlsx
- Phase: Mode of categorical location denoting current location (LAND vs. WATER) and then the phase number (i.e. LAND02 denotes second time on land).
- AGE : age from S5_00_Sleep_Study_Metadata.xlsx
- Deployment: deployment number from S5_00_Sleep_Study_Metadata.xlsx
- Seal.Number : Methods_Paper_SEALIDfrom S5_00_Sleep_Study_Metadata.xlsx.
- SWS: SWS δfor best EEG channel
- REM: REMδfor best EEG channel
- Days.Elapsed: Mean days on animal between SWS and REM observations
1-min excerpts of raw signals in different settings. Data can be plotted using R script 06_SignalQuality_Excerpts_Plot.R in code repository. Column descriptions:
- SecElapsed : seconds since logger start
- Date : Excel date of recording
- ECG: raw timeseries data for ECG
- LEOG: raw timeseries data for left EOG
- REOG: raw timeseries data for right EOG
- LEMG: raw timeseries data for left EMG
- REMG: raw timeseries data for right EMG
- LEEG1: raw timeseries data for left EEG (frontal region)
- REEG2: raw timeseries data for right EEG (frontal region)
- LEEG3: raw timeseries data for left EEG (parietal region)
- REEG4: raw timeseries data for right EEG (parietal region)
- Acc X/Acc Y/Acc Z : unprocessed accelerometer timeseries data
- HeartRate: output for automated peak detection
- Seconds: seconds since start of each excerpt (0 to 60)
- Comment: channel with event markers for each identified heart beat
- SealID: Nickname from S5_00_Sleep_Study_Metadata.xlsx
- Wild v. Captive: WILD or CAPTIVE
- Active v SWS v REM: denoting whether excerpt is of active behavior (galumphing on land or swimming in water), slow-wave sleep (SWS), or rapid-eye-movement (REM) sleep
- Location: LAND or SHALLOW (water)
- Activity: Galumphing (land), Swimming (water), Stationary (land or on the ocean floor), or Drifting (water).
1-min excerpts of challenges and solutions to signal recording obstacles. Data can be plotted using R script 06_SignalQuality_Excerpts_Plot.R in code repository. Column descriptions:
- SecElapsed : seconds since logger start
- Date : Excel date of recording
- ECG: raw timeseries data for ECG
- LEOG: raw timeseries data for left EOG
- REOG: raw timeseries data for right EOG
- LEMG: raw timeseries data for left EMG
- REMG: raw timeseries data for right EMG
- LEEG1: raw timeseries data for left EEG (frontal region)
- REEG2: raw timeseries data for right EEG (frontal region)
- LEEG3: raw timeseries data for left EEG (parietal region)
- REEG4: raw timeseries data for right EEG (parietal region)
- Acc X/Acc Y/Acc Z : unprocessed accelerometer timeseries data
- HeartRate: output for automated peak detection
- Seconds: seconds since start of each excerpt (0 to 60)
- Comment: channel with event markers for each identified heart beat
- SealID: Nickname from S5_00_Sleep_Study_Metadata.xlsx
- Wild v. Captive: WILD or CAPTIVE
- Active v SWS v REM: denoting whether excerpt is of active behavior (galumphing on land or swimming in water), slow-wave sleep (SWS), or rapid-eye-movement (REM) sleep
- Location: LAND or SHALLOW (water)
- Activity: wet (headcap had significant water intrusion), dry (headcap had no water intrusion), VHF BAD (VHF on land), VHF GOOD (VHF in water where signals were attenuated), with pings (satellite pings present), without pings (satellite pings removed), HR BAD (HR signals messier with poor wire fortification), HR GOOD (HR signals better with good wire fortification).