rufuspollock-okfn/bibserver

chardet dependency

Closed this issue · 3 comments

I note /bibserver/parsers/BibTexParser.py
is importing chardet
I presume this is http://pypi.python.org/pypi/chardet

I note that the chardet homepage
http://chardet.feedparser.org/
is returning a 410 Gone response which is not a good sign. Seems like we should be curating our own version of chardet
if the original developer is no longer supporting it.

epoz commented

We are modifying the parsers to improve unicode handling. The character set detection is being moved in to the parser base class. Looking into using the detection code from http://www.crummy.com/software/BeautifulSoup/.

epoz commented

I suspect the disappearance of chardet is related to Mark Pilgrim's decision to remove his presence on the Internet (in so far as that is possible)

Etienne Posthumus reply@reply.github.com wrote:

I suspect the disappearance of chardet is related to Mark Pilgrim's decision to remove his presence on the Internet (in so far as that is possible)

Fascinating. More about this at http://en.wikipedia.org/wiki/Mark_Pilgrim_(software_developer)
Looks like he has taken down a very substantial amount of free documentation, e.g.
"Dive into Greasemonkey" http://diveintogreasemonkey.org/ a free book on Greasemonkey.
Raises some serious issues about the meaning of "publication" of such material.

--Jim