Parallel corpus as a graph.
Ported from Graph in spraakbanken/swell-editor.
To install parallel-corpus
in the current environment:
pip install parallel-corpus
To add parallel-corpus
to a PDM project:
pdm add parallel-corpus
To add parallel-corpus
manually to pyproject.toml
:
[project]
dependencies = ["parallel-corpus>=0.1.2"]
first = "Jonathan saknades ."
# Initialize graph with source and target equal.
g = graph.init(first)
second = "Jonat han saknades ."
# Update target with new text.
gm = graph.set_target(g, second)
# The graph will now contain a edge from 'Jonathan' and both 'Jonat' and 'han'.
print(f"{gm.edges=}")
This project keeps a changelog.
This project uses conventional commits.
Tools used:
- pdm for project management.
- pre-commit for pre-commit checking
- runs ruff linter
- runs ruff formatter
- checks that commit message is according conventional commits.
- install hooks with
pre-commit install
.
- git-cliff for changelog updates.
- bump-my-version for version bumping.
- syrupy for snapshot testing.