/tweets-to-json

Save your tweets into a JSON file using Twitter API

Primary LanguageJavaScript

Save your tweets into a JSON file using Twitter API. You can run it programatically (for example with GitLab or GitHub CI) to automatically keep a JSON archive of your tweets.

This was originally built for an easy way to provide external data to Hugo data templates.

Note: this script is not intended for huge datasets, it handles all tweets in memory. According to user_timeline docs it "can only return up to 3,200 of a user's most recent Tweets". YMMV. Read also about Twitter's rate limits.

Configuration

  1. Get Bearer token key for your app from Twitter Developer Portal and add it to TWITTER_BEARER_TOKEN environment variable (or to .env file in the root of your project)
  2. Create tweets-to-json-config.js with searchParams and exportFn (see example below)

Usage

When you run tweets-to-json first time, a tweets.json file (can be overrided) is created in the same directory and all your tweets are saved into it. Subsequent invocations will check the file, take the latest tweets ID, and only query for tweets after that ID. The tweets are saved in order, latest tweet first.

If you run this from CI, you might find --fail-when-zero flag handy as it returns error code 1 when there are no tweets.

Usage: tweets-to-json -o <filepath> [options]

Options:
  -o, --output-file <filepath>  specify where to output the tweets (default: "./tweets.json")
  --fail-when-zero              return exit status 1 if no new tweets are found
  -V, --version                 output the version number
  -h, --help                    display help for command

Example configuration

Full example of tweets-to-json.config.js:

const dayjs = require('dayjs')

module.exports = {
  searchParams: {
    screen_name: 'uninen',
    exclude_replies: true,
    trim_user: true,
    tweet_mode: 'extended',
    count: 100,
  },
  exportFn: (result) => {
    const tags = []
    const mentions = []
    const media = []
    const urls = []

    if (result.entities.hashtags && result.entities.hashtags.length > 0) {
      for (const tag of result.entities.hashtags) {
        tags.push(tag.text)
      }
    }
    if (
      result.entities.user_mentions &&
      result.entities.user_mentions.length > 0
    ) {
      for (const user of result.entities.user_mentions) {
        mentions.push(user.screen_name)
      }
    }
    if (result.entities.media && result.entities.media.length > 0) {
      for (const obj of result.entities.media) {
        media.push({
          url: obj.media_url_https,
          tco_url: obj.url,
        })
      }
    }
    if (result.entities.urls && result.entities.urls.length > 0) {
      for (const url of result.entities.urls) {
        urls.push({
          tco_url: url.url,
          url: url.expanded_url,
        })
      }
    }

    return {
      id: result.id_str,
      text: result.full_text,
      urls: urls,
      tags: tags,
      media: media,
      mentions: mentions,
      timestamp: dayjs(result.created_at).unix(),
    }
  },
}

Contributing

All contributions are welcome! Please follow the code of conduct when interacting with others.

This project lives on GitLab and is mirrored on GitHub.

Follow @Uninen on Twitter.