Surfer: Export your personal data in one click

Table of Contents

How it works
Getting Started
Roadmap
License
Contact
Acknowledgements

Demo (click to view)

Surfer is a digital footprint exporter, designed to aggregate all your personal data from various online platforms into a single folder.

Currently, your personal data is scattered across hundreds of platforms and the companies operating these platforms have no incentive to give this data back to you. Surfer solves this problem by navigating to websites and scraping data from these websites.

We believe that personal data aggregation is the key to enabling truly useful, universal personal assistants.

Currently Supported Platforms

Twitter
LinkedIn
GitHub
YouTube
Notion
ChatGPT
Gmail
iMessages (coming soon!)
Twitter Bookmarks (coming soon!)
Reddit (coming soon!)

How it works

Click on "Export" to initiate the data extraction process.
The app waits for the target page to load completely.
The system checks if the user is signed in to the platform being scraped.
If not signed in, the user is prompted to sign in.
If signed in, the process continues.
Once signed in, the app interacts with the platform's user interface.
The app then scrapes the user's data from the platform.
Finally, the extracted data is exported and saved to your local storage.

Sample Exported Data

  "platform_name": "X Corp",
  "name": "Twitter",
  "runID": "twitter-001-1724267514217",
  "timestamp": 1724267623318,
  "content": [
    "Twitter Post 1",
    "Twitter Post 2",
    "Twitter Post 3",
    ...
  ]
}

Getting Started

To download the app, head over to https://surfsup.ai. Or you can go to the releases page.

For instructions on setting up the app locally and contributing to the project, please refer to the Contributing Guidelines, Helper Functions Documentation, and Guide to Adding New Platforms.

See the open issues for a full list of proposed features (and known issues).

Analytics

We use Supabase to collect analytics. We ONLY collect the number of installs, the number of updates, and the success or failures of runs in Surfer. All data is anonymized.

Roadmap

Short-Term

Data being maintained/updated everyday
Scheduled exports
Obtain a code signing certificate for Windows
Replace setTimeout with await for script execution to ensure elements exist before scraping
Implement robust error handling for the scraping process
Add support for more online platforms
Add verbosity to runs

Medium to Long-Term

Implement concurrent scraping to allow for multiple scraping jobs to run simultaneously
Adding knowledge graphs, chatting with data, visualizations, etc
Adding sub-tasks within platforms (i.e. Twitter Bookmarks, LinkedIn Connections Data, etc)
Integrate with other agentic frameworks like LangChain for advanced personal AI assistants
Explore integration with wearable devices for enhanced personal data tracking and acknowledgment