Thesaurus your text automatically. Now you too can be one thesaurical motherfucker, just like Herman Melville, that most thesaurical of all thesaurical motherfuckers.
This program takes some text and uses a thesaurus to look up the longest synonym for each word, ignoring details such as part of speech or context. Here's an example:
Input - 224 characters
Call me Ishmael. Some years ago—never mind how long precisely—having little or no money in my purse, and nothing particular to interest me on shore, I thought I would sail about a little and see the watery part of the world.
Output - 516 characters
Compulsory military service me Unacceptable person. Some years ago—never collective unconscious how de longue haleine precisely—having small indefinite quantity or no unregistered bank account in my unregistered bank account , and matter of indifference classificational to mental acquisitiveness me on geological formation , I higher cognitive process I would present no difficulties about a small indefinite quantity and have information about the infirm of purpose supporting character of the steady-state universe .
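The core idea is easy to sketch. This is a minimal illustration rather than the server's actual code: the hard-coded lookup table stands in for a real thesaurus, and real input needs smarter tokenization than splitting on whitespace.

// Minimal sketch of the algorithm. The thesaurus here is a tiny
// hard-coded stand-in; the real server uses a full thesaurus.
const thesaurus = {
  call: ['compulsory military service', 'phone', 'summon'],
  ishmael: ['unacceptable person', 'outcast']
};

function longestSynonym(word) {
  const synonyms = thesaurus[word.toLowerCase()] || [];
  // Pick the longest synonym, ignoring part of speech and context.
  const best = synonyms.reduce((a, b) => (b.length > a.length ? b : a), '');
  return best || null;
}

function translate(text) {
  // Naive whitespace tokenizer; the server's internal tokenizer also
  // handles punctuation (note the standalone '.' tokens below).
  return text
    .split(/\s+/)
    .map((word) => longestSynonym(word) || word)
    .join(' ');
}

console.log(translate('Call me Ishmael'));
// => compulsory military service me unacceptable person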
See this post for more examples.
Server code is stored in the master branch. Site code is on the gh-pages branch.
To run the server:
$ npm install
$ node server.js
You are free to use the API, but don't be a jerk. I can't guarantee its availability or uptime, so you are probably better off hosting the server yourself. Any simple Node host will do.
The API is hosted at http://one-thesaurical-motherfucker.azurewebsites.net
Translate a set of tokens.
Request
http://one-thesaurical-motherfucker.azurewebsites.net/api/tokens
{
  "tokens": [
    { "token": "Call", "id": 0 },
    { "token": "me", "id": 1 },
    { "token": "Ishmael", "id": 2 },
    { "token": ".", "id": 3 },
    ...
  ],
  OPTIONS
}
tokens
- Array of between zero and 2000 tokens. Each token is either a string or an object with a token property. If you use an object, you can also pass an optional id property that will be returned on the result tokens. An id is not required and is not used internally for translation. The two forms can be mixed, as the snippet below shows.
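Since the string/object choice is made per token, a mixed array like this should be valid (the id value is arbitrary):

"tokens": ["Call", { "token": "me" }, { "token": "Ishmael", "id": 7 }]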
See below for explanation of options.
Response
{
  "tokens": [
    { "token": "Call", "synonym": "Compulsory military service", "id": 0 },
    { "token": "me", "id": 1 },
    { "token": "Ishmael", "synonym": "Unacceptable person", "id": 2 },
    { "token": ".", "id": 3 },
    ...
  ]
}
All input tokens are returned as part of the response, but only translated tokens will have the synonym property.
Errors
Invalid mode
Too many tokens
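A minimal sketch of calling this endpoint from Node 18+ (run it as an ES module so top-level await works). The POST method and JSON content type aren't stated above, so treat them as assumptions:

const res = await fetch(
  'http://one-thesaurical-motherfucker.azurewebsites.net/api/tokens',
  {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      tokens: [
        { token: 'Call', id: 0 },
        { token: 'me', id: 1 },
        { token: 'Ishmael', id: 2 },
        { token: '.', id: 3 }
      ]
    })
  }
);
const { tokens } = await res.json();
// Tokens without a synonym property came back untranslated.
for (const t of tokens) console.log(t.id, t.synonym || t.token);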
Translate some text, using the internal tokenizer.
Request
http://one-thesaurical-motherfucker.azurewebsites.net/api/text
{
  "text": "Call me Ishmael.",
  OPTIONS
}
text
- String of between zero and 30000 characters.
See below for explanation of options.
Response
{
  "tokens": [
    { "token": "Call", "synonym": "Compulsory military service" },
    { "token": "me" },
    { "token": "Ishmael", "synonym": "Unacceptable person" },
    { "token": "." },
    ...
  ]
}
Errors
Invalid mode
Too much text
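The text endpoint works the same way. Again a sketch, with the same POST/JSON assumptions:

const res = await fetch(
  'http://one-thesaurical-motherfucker.azurewebsites.net/api/text',
  {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ text: 'Call me Ishmael.' })
  }
);
const { tokens } = await res.json();
// Rejoin naively with spaces; see the example output above for the result.
console.log(tokens.map((t) => t.synonym || t.token).join(' '));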
Options

What translation method should be used. Choices are:
'longest' - Always choose the longest word.
'random' - Choose a random word.
Should only whole words be returned?
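The option names themselves aren't spelled out above, so the following is guesswork: the 'Invalid mode' error suggests the translation-method option is called mode, and the OPTIONS placeholder in the request examples suggests options ride alongside text or tokens as top-level properties. Under those assumptions, a random-mode request body might look like:

{
  "text": "Call me Ishmael.",
  "mode": "random"
}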