diy-onic-converter

Coding challenge to make your own (DIY) “bionic reading converter.”

Notes:

The naive initial implementation discards all HTML and operates on the string returned by node.textContent. I have left it in the source, but its usage is commented out.
The HTML-aware implementation of this function will process the DOM tree and generate a queue of replacements for each text node.
- NOTE: This implementation is messier than I would like, but I encountered an unexpected issue where utilizing node.replaceWith on my text nodes would mess up the parent's childNodes iterator and the remaining nodes would fail to get processed. By que-ing up all of the replaceWith actions and performing the replacement at the end, I was able to achieve what I needed, but I normally do not like mutating function props, and the code structure is less than ideal. Given more time I would refactor this to have processNode return a queue of actions, rather than mutating the provided array.
I am splitting words based on simple regex of \w+. I initially split on whitespace, but found that this was capturing and highlighting a lot of punctuation and symbols (, °, etc. I see in the example output for the exercise that this is expected, but I but I made a delibarate choice to omit these. As a result, hyphenated words (like "gold-plated") have the first three letters emboldened on each segment as well.
As per the suggested "above and beyond" scope suggestions:
- I have altered the default behavior to apply this transformation to all of the children of the provided container, not just <p> tags.
- I have added a charsToHighlight parameter to alter the number of characters to make bold in each word (default is 3).
- I have ensured that this can be run on arbitrary webpages and included instructions below.
- I have ensured that HTML will be preserved (with the above-mentioned caveats).
- I have described the limitations of this implementation and how I might go about improving things if I had more time.

Limitations:

Currently the function does not take into account words that have multiple segments in adjacent nodes. For instance:
```
<span style="color: blue;">H</span>ello World!
```
- In this case, ideally, the H within the span and el within the adjacent text node would be made bold, but currently these nodes are treated as separate words.
- To accomplish this, I may make note of whether the last-processed word was less than the length of the highlight range, and note whether there exists no whitespace or block-level element between them to continue the highlight action across two nodes.
Currently punctuation and unicode characters are not properly recognized. The μ symbol, for instance ought to be recognized as part of a word, but we're currently naively matching on /\w+/

How to use this:

As per the exercise instructions, run npm start and visit localhost: 8080. Open the developer console and run the following:

diyOnicConverter('body');

To run this on another arbitrary website:

Visit a webpage like https://en.wikipedia.org/wiki/Aardvark
Highlight and copy the entire contents of public/diy-onic-converter.js into your clipboard.
Open the developer console on the webpage.
Paste the contents of the diy-onic-converter into the console and hit Enter.
Type diyOnicConverter('body') and hit Enter again.
Observe the changes to the website.

I have tested this on several wikipedia articles and it seems to work as expected.

mikegreiling/diy-onic-converter

diy-onic-converter

Notes:

Limitations:

How to use this:

To run this on another arbitrary website: