CottageLabs/OpenArticleGauge

Odd behaviour on license statement cleanup

Opened this issue · 0 comments

There are appears to be something slightly odd happening with the cleanup of linebreaks in license statements. In the Sage case, the registered license statement:

This article is distributed under the terms of the Creative Commons Attribution 3.0 License (<a href="http://www.creativecommons.org/licenses/by/3.0/">http://www.creativecommons.org/licenses/by/3.0/</a>) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified

...does not match the statement found in the html of http://dx.doi.org/10.1177/0265691413479085 which contains a line break and additional whitespace:

This article is distributed under the terms of the Creative Commons Attribution 3.0 License (<a href="http://www.creativecommons.org/licenses/by/3.0/">http://www.creativecommons.org/licenses/by/3.0/</a>) which permits any use, reproduction and distribution of the work without further permission provided the original work is
                     attributed as specified

However the non-commercial license statement which looks similar on the surface is matched just fine.

Registered license statement:

This article is distributed under the terms of the Creative Commons Attribution-Non Commercial 3.0 License (<a href="http://www.creativecommons.org/licenses/by-nc/3.0/">http://www.creativecommons.org/licenses/by-nc/3.0/</a>) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified

HTML from http://dx.doi.org/10.1177/1742395312466903

This article is distributed under the terms of the Creative Commons Attribution-Non Commercial 3.0 License (<a href="http://www.creativecommons.org/licenses/by-nc/3.0/">http://www.creativecommons.org/licenses/by-nc/3.0/</a>) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original
                     work is attributed as specified