Conversions from TEI XML -> LaTeX -> PDF

  • clone with submodules: git clone --recursive
  • update all submodules: git submodule foreach git pull
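
The two commands above can be wrapped in small helper functions. This is only a sketch: the function names, the `run` helper, and the `DRY_RUN` switch are assumptions made for illustration, and the repository URL and target directory are left to the caller.

```shell
#!/bin/sh
# Sketch: clone the repository with its submodules and keep them updated.
# ASSUMPTION: function names, `run`, and DRY_RUN are illustrative only.

# Print the command instead of executing it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# clone_with_submodules URL DIR
# Clones URL into DIR and checks out all submodules in one step.
clone_with_submodules() {
    run git clone --recursive "$1" "$2"
}

# init_submodules DIR
# Fetches the submodules of a clone that was made without --recursive.
init_submodules() {
    run git -C "$1" submodule update --init --recursive
}

# update_submodules DIR
# Runs `git pull` inside every submodule of the clone at DIR.
update_submodules() {
    run git -C "$1" submodule foreach git pull
}
```

Setting `DRY_RUN=1` before calling one of the functions prints the git command without running it, which is a cheap way to check what would happen.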

Basic ideas

  1. Track *.tex files in the file:./TeX/ subdirectory.
  2. The LaTeX files are produced by converting the XML files in file:./SARIT-corpus/ with the XSLT stylesheet in file:./Stylesheets/profiles/sarit/latex/to.xsl.
  3. The LaTeX files have to be compiled with XeLaTeX from an up-to-date TeX Live 2015 installation.
  4. Keep and use scripts in file:./bin/, e.g., file:./bin/convert_sarit_to_tex.sh. They are only meant to give you an idea of the workflow; we won't develop any fancy programs.
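
The steps above can be sketched as a short shell script. This is not the content of bin/convert_sarit_to_tex.sh; it is a minimal illustration under several assumptions: an XSLT 2.0 processor (here Saxon, via a jar whose location is given by the hypothetical `SAXON_JAR` variable), `java` and `xelatex` on the PATH, and the script being run from the repository root. The function names and the file name `example.xml` are made up for the example.

```shell
#!/bin/sh
# Sketch of the XML -> LaTeX -> PDF pipeline.
# ASSUMPTIONS: Saxon is used as XSLT processor; SAXON_JAR, the function
# names, and example.xml are hypothetical and not taken from this repository.

XSL="Stylesheets/profiles/sarit/latex/to.xsl"
SAXON_JAR="${SAXON_JAR:-saxon9he.jar}"

# tex_path_for XMLFILE
# Maps SARIT-corpus/foo.xml to TeX/foo.tex.
tex_path_for() {
    printf 'TeX/%s.tex\n' "$(basename "$1" .xml)"
}

# xml_to_tex XMLFILE
# Converts one TEI file to LaTeX with the SARIT profile stylesheet.
xml_to_tex() {
    java -jar "$SAXON_JAR" -s:"$1" -xsl:"$XSL" -o:"$(tex_path_for "$1")"
}

# tex_to_pdf TEXFILE
# Compiles inside the file's directory; two runs so cross-references settle.
tex_to_pdf() {
    (
        cd "$(dirname "$1")" || exit 1
        xelatex -interaction=nonstopmode "$(basename "$1")" &&
        xelatex -interaction=nonstopmode "$(basename "$1")"
    )
}

# Example call (commented out, since example.xml is hypothetical):
# xml_to_tex SARIT-corpus/example.xml &&
#     tex_to_pdf "$(tex_path_for SARIT-corpus/example.xml)"
```

Keeping the two stages in separate functions makes it easier to tell whether a problem comes from the XSLT transformation or from the LaTeX compilation.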

Improving the TeX output

Please file an issue if you want to improve something.

To make the problem easier to track and solve, please provide information on the following points:

  1. Describe the problem and, if possible, attach (a page of) the PDF showing it.
  2. Where in the .tex file does the problem occur?
  3. What part of the XML file is the problem coming from?
  4. Where in the XSLT stylesheet does the transformation happen?
  5. What does the TeX logfile say, if anything?
  6. What makes the problem go away?

The more information you give, the better.
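
For the question about the TeX logfile, the error lines can be pulled out with grep, since TeX error messages start with "!" at the beginning of a line. A minimal sketch (the function name `log_errors` is an assumption; `-A` is supported by GNU and BSD grep):

```shell
#!/bin/sh
# Sketch: collect the error lines of a TeX log for an issue report.
# ASSUMPTION: the function name is illustrative only.

# log_errors LOGFILE
# Prints every line starting with "!" together with its line number
# and the two lines that follow it (usually the offending input line).
log_errors() {
    grep -n -A 2 '^!' "$1"
}

# Example call (commented out; the log name depends on your .tex file):
# log_errors TeX/example.log
```

The output can be pasted into the issue as is.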

Devanagari fonts

A very incomplete list, with some subjective evaluations:

  1. Chandas/Uttara (http://www.sanskritweb.net/cakram/)
    1. Many ligatures
    2. Poor size ratio of Arabic numerals to Devanagari characters
    3. No bold/italic
    4. GPL licensed
  2. Shobhika
    1. Two weights (regular and bold); over 1600 Devanagari glyphs; also Vedic accents
    2. https://github.com/Sandhi-IITBombay/Shobhika
    3. See http://list.indology.info/pipermail/indology_list.indology.info/2016-November/044671.html
  3. Sanskrit 2003
    1. Many ligatures
    2. Good size ratio of Latin to Devanagari letters
    3. License unclear
  4. Annapurna SIL
    1. Good ratio of Latin to Devanagari letters
    2. Ligatures not so great (ddhya?)
    3. Real bold
  5. Sahadeva & Nakula
    1. No bold italic
    2. Good ligature coverage
  6. Samyak
    1. Not too many ligatures
    2. No bold/italic
  7. Lohit Devanagari
    1. Missing ligatures (e.g., ddhya)
    2. No bold/italic
  8. Noto Serif/Sans Devanagari
    1. SIL Open Font License, Version 1.1
    2. http://www.google.com/get/noto/#sans-deva
    3. http://www.google.com/get/noto/#serif-deva

See also

  1. https://bhusunda.wordpress.com/2015/11/09/comparison-of-devanagari-fonts-on-os-x-using-the-807-documented-ligatures-of-classical-sanskrit-according-to-the-research-of-ulrich-stiehl/
    1. links to: https://spideroak.com/share/MJUHK43VNZSGC/stuff/Users/det/SpiderOak%20Hive/SpiderShare/Sanskrit6.numbers.pdf
  2. http://luc.devroye.org/indic.html
  3. http://www.sanskritweb.net/cakram/
  4. http://www.sanskritweb.net/fonts/index.html