mjpost/sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

PythonApache-2.0

Issues

Should BLEU score be dependent on the order of the sequences?
#272 opened a month ago by davidgonmar
2
Further standardize the layout and packaging
#271 opened 2 months ago by mjpost
3
no published sdist for 2.4.2
#269 opened 2 months ago by dhellmann
4
Removing the change of behavior regarding SIGPIPE from the top-level
#268 opened 2 months ago by jblespiau
2
BLEU and chrF2 on Chinese
#240 opened a year ago by zhhl9101
1
Add BLEU max_ngram_order to signature
#251 opened 10 months ago by BramVanroy
2
Memory leak in spm tokenizers
#264 opened 4 months ago by nkrasner
0
Difference between cli and corpus_score
#259 opened 6 months ago by VarunGumma
1
when use package 'evaluate‘ with 'sacrebleu' calculate metric happend error
#263 opened 4 months ago by TristanShao
1
No references found for test set wmt23/*
#261 opened 5 months ago by kellymarchisio
4
__init__.py divergence in 2.4.0
#255 opened 6 months ago by dustalov
0
Compute BLEU from Python mishandles references
#256 opened 8 months ago by OrianeN
2
Accelerating sentence scores
#254 opened 9 months ago by AmitMY
4
Problems about the tokenizers
#192 opened 2 years ago by DoctorDream
4
Add WMT 23 test sets
#245 opened 9 months ago by mjpost
1
[Feature Request] HTER Implementation
#248 opened 10 months ago by shivanraptor
1
Taking a long time to download the test set
#242 opened a year ago by Phuoc-Hoan-Le
3
Schedule a new release
#241 opened a year ago by daskol
2
What's the difference between setting "--tokenize" to "flores101" and setting it to "flores200"?
#243 opened a year ago by Phuoc-Hoan-Le
1
Working on tokenized pairs?
#244 opened a year ago by MostHumble
1
BLEU and CHRF reports wrong scores when any hypothesis is empty
#239 opened a year ago by SantiagoEG
0
Inconsistent scores between loop and separate check
#237 opened a year ago by 106AbdulBasit
0
TER asian support
#229 opened a year ago by esalesky
1
Discrepancy in docstrings in TER `normalized`
#230 opened a year ago by BramVanroy
2
TER between two empty strings is 100
#228 opened a year ago by BramVanroy
3
GitHub workflows CI tests fail on Python 3.6
#233 opened a year ago by martinpopel
2
AttributeError: module 'sacrebleu' has no attribute '__version__'
#231 opened a year ago by dsj96
0
Remove the flores101 related tokenizer logging message when the selected tokenizer is flores200
#219 opened 2 years ago by hadyelsahar
0
Unicode normalization?
#224 opened 2 years ago by davidweichiang
1
Silent failure with incorrect reference format in Python API
#220 opened 2 years ago by mdarcy220
0
Default params for calculating bleu
#195 opened a year ago by base-y
4
Calculate sentence-level and corpus-level BLEU with tokenizer flores101(or flores200) on GPU
#227 opened a year ago by ElizabethUniverse
2
Empty lines in WMT21/dev Icelandic-English
#225 opened 2 years ago by ZJaume
1
module 'sacrebleu' has no attribute 'corpus_bleu
#222 opened 2 years ago by gongel
2
TER above 100?
#208 opened 2 years ago by JoyeBright
12
lru_cache max size is not set in tokenizer_spm.py
#217 opened 2 years ago by chikiulo
1
Add WMT22 data
#215 opened 2 years ago by mjpost
0
Switch to poetry for build/install
#202 opened 2 years ago by mjpost
4
Installing latest version via PIP results in an error
#209 opened 2 years ago by epwalsh
8
Support for printing significance score as JSON
#207 opened 2 years ago by me-manikanta
0
List test sets available for a given language pair
#210 opened 2 years ago by ZJaume
2
Support test sets from sign language task?
#212 opened 2 years ago by bricksdont
0
Incorrect sample size?
#206 opened 2 years ago by kocmitom
1
How to calculate the chrf2 score?
#193 opened 2 years ago by RamoramaInteractive
2
[Feature Request] Word Error Rate
#199 opened 2 years ago by jonathanmutal
3
tarball is not defined
#201 opened 2 years ago by VanyaBK
1
Why give 0 bleu score when evaluating De ?
#198 opened 2 years ago by Hannibal046
3
more accurate exception types on argument checks
#189 opened 2 years ago by abcdenis
2
Is it possible to add flores dataset directly to dataset?
#191 opened 2 years ago by BrightXiaoHan
0
Is ‘extract_ngrams’ removed in the latest version?
#188 opened 2 years ago by rangehow
2