funderburkjim/elispsanskrit

compare to Huet, verbs, present conjugation

Opened this issue · 4 comments

This issue summarizes differences and similarities noticed in the comparison of verb conjugations; the methodology is described in the readme for huetcompare/verbs-prim-prs.

class pada comparison

huet_class_pada.txt summarizes the roots and their conjugation classes and padas as they appear in the SL_roots.xml file from Huet.

Huet assigns the conjugation class 11 to denominatives.

pysan_class_pada.txt summarizes the roots and their conjugation classes and padas as they appear in the
dcpforms-MW-verb.txt of the pysan system. These were
developed ca. 2004 from the Cologne digitization of the Monier-Williams as it then existed. A similar, more current, and in some ways more complete source of roots from MW can be found in this MWvlex file.

The two lists of roots with class and pada assignments are compared in
compare_class_pada.txt.

This comparison is organized into 6 categories:

  • 254 cases of roots which appear in both sources, and with identical class-pada data.
  • 150 cases of roots which appear in both sources, and for which the Huet class-pada data is a
    strict subset of the pysan data
  • 88 cases of roots which appear in both sources, and for which the pysan class-pada data is a
    strict subset of the Huet data
  • 44 cases of roots which appear in both sources, and for which there are other differences in the
    class-pada data from the two sources
  • 1078 cases of roots which appear only in the pysan source
  • 202 cases of roots which appear only in the Huet source. Of these, 165 are denominatives (Huet class 11), and the 37 others are various.

It would be interesting to delve further into these differences, and to make use of the probably more accurate MW data from the MWvlex source mentioned above.

conjugation table comparison

huet_conj_tables.txt contains 4422 conjugation tables derived from Huet's SL_roots.xml.

For each root-class-pada combination, there are 4 conjugation tables for the 4 present tenses,
for which Huet uses theses abbreviations.

  • pr = present tense
  • ip = imperative
  • op = optative
  • im = imperfect

Each of these appears on one line of the file.

One would expect that 4422 would be a multiple of 4, which of course it isn't. I think the reason is that there are certain duplicates, but I'm not sure of this.

I felt that a comparison of the conjugation algorithms of Huet and pysan should be made without taking into account the class-pada differences of the two systems. Thus, for each of the Huet conjugation tables, a computation was done to derive a pysan conjugation, using the give, root, class, pada and tense as appearing in the Huet conjugation table entry. The result is
[pysan_conj_tables.txt(https://github.com/funderburkjim/elispsanskrit/blob/master/huetcompare/verbs-prim-prs/pysan_conj_tables.txt).

compare_conj_tables.txt summarizes the similarities and differences in the corresponding conjugations from the two files just mentioned.
Some statistics resulting from the comparison:

  • 3128 (71% of the 4422) are identical.
    • There is one adjustment that brought about 500 tables into this group. When the spelling of
      a conjugation table entry ends in 'r' in Huet AND ends in 'H' in pysan AND is otherwise the same
      in both systems, then this entry is considered the same. This often occurs in the 3p form of the
      optative.
  • 748 denominative roots (class=11) in Huet have no pysan conjugation.
  • 5 conjugations of non-denominative roots also have no pysan conjugation
  • 541 conjugations differ in one or more other details.

This is all the summary comments for now.

developed ca. 2004 from the Cologne digitization of the Monier-Williams as it then existed. A similar, more current, and in some ways more complete source of roots from MW can be found in this MWvlex file.

Huet and Hellwig both have derived from Cologne at an early date, spokensanskrit.org as well. Interesting to see what (older) issues still are around because of the lack of update on them as well.

  • 748 denominative roots (class=11) in Huet have no pysan conjugation.

This is important. For example the Kochergina dictionary I asked you to compare vs. MW, had the lack of quite many 10th class roots, needed to translate Ramayana. So this is in some way similar.