/SgEExt

:octopus: a Simple gemoji Emoji Extractor

Primary LanguagePythonMIT LicenseMIT

SgEExt — a Simple gemoji Emoji Extractor

Pronounced [sgɪksˈt]

Introduction

Let's have a talk about an usual situation.
You run a self-hosted Jekyll blog, with the jemoji plugin loaded so as to "enhance" your content with some emojis.
You'd have noticed that by default the plugin renders your static pages with <img> whose src attributes are pointing to a GitHub's (Microsoft's) location.
The latter means that a third-party is able (note that it may not be the case) to log people who consult your website, and thus is a privacy issue.

So, at this step, you may have tried to extract images to serve it yourself, but the operation is only possible from a macOS system ('cause Apple emojis are present within the system font, and... gemoji is mainly using the "Apple's emoji character palette").

Now that is said, there are your options :

  • Buy an Apple device
  • Kindly ask a friend with an Apple device to extract and send you those PNGs (cc @Naernon)
  • Change your underneath emojis module / plugin / whatever
  • Wait for the gemoji project to use another set of emoji
  • Download PNGs directly from GitHub, and that is definitely what this project is about !

Dependencies

  • python3 (>= 3.6)
  • python3-requests

Installation

Manually

git clone https://github.com/HorlogeSkynet/SgEExt.git

Usage

python3 sgeext.py --help

# Careful, running without any arguments would download the whole emojis palette !
# Resulting files will be set under `emoji/unicode/` (automatically created).
# Microsoft GitHub's emojis (images) will be set under `emoji/`.
python3 sgeext.py

# This directory structure will be created if it does not exist.
python3 sgeext.py -l joy -d emojis/images/

# Wanna force re-download of existing files ? Sure.
python3 sgeext.py -l joy relaxed sunglasses -d emojis/images/ -f

# Wanna save "real" emojis under their "real" name ? Sure.
python3 sgeext.py -l ok_hand -n

# Wanna download GitHub added "emojis" (mostly images) ? Sure.
python3 sgeext.py -l bow relaxed octocat trollface --verbose

# 'octocat' & 'trollface' would be ignored.
python3 sgeext.py -l bow relaxed octocat trollface --only-emojis

# Handle duplicates ('uk' and 'gb' refer to the same emoji).
python3 sgeext.py -l fr gb us jp uk jp gb it

# Wanna download the emojis currently being used in your (Jekyll) blog ? Sure.
python3 sgeext.py -l $(grep -hREo ':[a-z+-]+[a-z1-9_-]+:' /path/to/your/blog/_posts/*.md | sort | uniq | cut -d ':' -f 2) -d /path/to/your/blog/images/emojis/

How does it work ?

The script... :

  • ... (after having tried locally) fetches the emojis database hosted on GitHub (don't click if you are on mobile)
  • ... iterates through the elements and extracts their unicode value as hexadecimal
  • ... uses the above result to download them from GitHub

As githubassets.com is actually a CDN, it does not look rate-limited.
Anyway, I cannot take any responsibility for any inappropriate usage of this software.

Compatibility

Although SgEExt has been developed with cross-platform constraints, it has not been tested on Windows yet.
Feedbacks are welcome, as usual.

Apple users, you don't have to go this way.