dataset

Functions to read machine translation evaluation datasets

 read_source_ref (source_path, ref_path)

Read the testset into two arrays and return them

 download_read_set (base_path, source_language_code, target_language_code,
                    test_set_name)

Downloads data set if it is not cached. Return source and reference arrays.

 read_own_set (base_path, source_language_code, target_language_code,
               test_set_name, date='')

Reads already present non-sacrebleu test set. Return source and reference arrays.

 get_translated_test_set (base_path, sourcelang, targetlang, mtengine,
                          test_set_name, test_date, domain='')

Read MT hypothesis translations for specified MT engine

 read_tsv_set (tsv_file)

Reads complete evaluation set from TSV file containing source, hypothesis and reference