/hoedown-mweb

Standards compliant, fast, secure markdown processing library in C

Primary LanguageCISC LicenseISC

Hoedown

缘起

近期使用markdown做笔记。希望找一款软件,取代 evernote 笔记功能。
evernote转为「资料收集库」。
而 markdown 软件,用于后续深入的 整理、总结、输出

按这个需要出发,大概需要几个特点:

  1. 同时支持 mac和IOS;
  2. 文档库管理,多级目录、tag、检索等等;
  3. 对markdown扩展支持比较丰富,比如代码、表格、TOC等等;

最后选择了 mweb

使用过程中,对一些语法解析不太满意。
和作者 @oulvhai 联系之后,得知 MWeb 使用的解析库 是 Hoedown,自告奋勇修改该库。

Hoedown 的原始代码地址:https://github.com/hoedown/hoedown

mweb的问题列表: issue:https://github.com/oulvhai/MWeb-issues

主要修改几个新的功能:

  • 列表块之前,不需要强制 空行
  • 代码块之前,不需要强制 空行
  • 列表里面,自然换行,也可以解析为<br>
  • bug修复:代码块中的 语法,被误以为 footnote;
  • 其他小细节;

其实更好的办法是换库:github的cmark库。目前活跃度很高,代码质量也很高。
https://github.com/github/cmark/tree/master/src

Hoedown 本身解析的过程,是「一行行硬解析」,没有使用AST。
还好 Markdwowm 不算复杂,这个框架勉强还能应付的过去。
后面的扩展和维护,潜力小,比较麻烦。如果要做的完全符合标准,估计还需要加一堆「补丁」。

Build Status

Hoedown is a revived fork of Sundown, the Markdown parser based on the original code of the Upskirt library by Natacha Porté.

Features

  • Fully standards compliant

    Hoedown passes out of the box the official Markdown v1.0.0 and v1.0.3 test suites, and has been extensively tested with additional corner cases to make sure its output is as sane as possible at all times.

  • Massive extension support

    Hoedown has optional support for several (unofficial) Markdown extensions, such as non-strict emphasis, fenced code blocks, tables, autolinks, strikethrough and more.

  • UTF-8 aware

    Hoedown is fully UTF-8 aware, both when parsing the source document and when generating the resulting (X)HTML code.

  • Tested & Ready to be used on production

    Hoedown has been extensively security audited, and includes protection against all possible DOS attacks (stack overflows, out of memory situations, malformed Markdown syntax...).

    We've worked very hard to make Hoedown never leak or crash under any input.

    Warning: Hoedown doesn't validate or post-process the HTML in Markdown documents. Unless you use HTML_ESCAPE or HTML_SKIP, you should strongly consider using a good post-processor in conjunction with Hoedown to prevent client-side attacks.

  • Customizable renderers

    Hoedown is not stuck with XHTML output: the Markdown parser of the library is decoupled from the renderer, so it's trivial to extend the library with custom renderers. A fully functional (X)HTML renderer is included.

  • Optimized for speed

    Hoedown is written in C, with a special emphasis on performance. When wrapped on a dynamic language such as Python or Ruby, it has shown to be up to 40 times faster than other native alternatives.

  • Zero-dependency

    Hoedown is a zero-dependency library composed of some .c files and their headers. No dependencies, no bullshit. Only standard C99 that builds everywhere.

  • Additional features

    Hoedown comes with a fully functional implementation of SmartyPants, a separate autolinker, escaping utilities, buffers and stacks.

Bindings

You can see a community-maintained list of Hoedown bindings at the wiki. There is also a migration guide available for authors of Sundown bindings.

Help us

Hoedown is all about security. If you find a (potential) security vulnerability in the library, or a way to make it crash through malicious input, please report it to us by emailing the private Hoedown Security mailing list. The Hoedown security team will review the vulnerability and work with you to reproduce and resolve it.

Unicode character handling

Given that the Markdown spec makes no provision for Unicode character handling, Hoedown takes a conservative approach towards deciding which extended characters trigger Markdown features:

  • Punctuation characters outside of the U+007F codepoint are not handled as punctuation. They are considered as normal, in-word characters for word-boundary checks.

  • Whitespace characters outside of the U+007F codepoint are not considered as whitespace. They are considered as normal, in-word characters for word-boundary checks.

Install

Just typing make will build Hoedown into a dynamic library and create the hoedown and smartypants executables, which are command-line tools to render Markdown to HTML and perform SmartyPants, respectively.

If you are using CocoaPods, just add the line pod 'hoedown' to your Podfile and call pod install.

Or, if you prefer, you can just throw the files at src into your project.