Benchmark(config=None, provide=None, share_dependency_objects=False, build=True)
Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.
Modules should provide:
- topics: dict mapping query ids (qids) to queries
- qrels: dict mapping qids to docids and relevance labels
- folds: dict mapping a fold name to training, dev (validation), and testing qids
- If these can be loaded from files in standard formats, they can be specified by setting the corresponding file attributes (fold_file for the folds) rather than by setting the above attributes directly.
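As a rough illustration of the three attributes listed above, the sketch below builds toy versions of them as plain Python dicts and checks that every qid named in a fold has a query and judgments. The nesting, the split names ("train", "dev", "test"), and the helper fold_qids_are_known are assumptions made for this example; the library's actual layout may differ.

```python
# Hypothetical toy data; the exact nesting used by the library may differ.
topics = {
    "q1": "neural ranking models",
    "q2": "query expansion",
    "q3": "learning to rank",
}

# qid -> docid -> relevance label
qrels = {
    "q1": {"d1": 2, "d2": 0},
    "q2": {"d3": 1},
    "q3": {"d1": 0, "d4": 1},
}

# fold name -> split name -> list of qids (split names are assumed here)
folds = {
    "s1": {"train": ["q1"], "dev": ["q2"], "test": ["q3"]},
}


def fold_qids_are_known(topics, qrels, folds):
    """Return True if every qid listed in any fold has a query and judgments."""
    for splits in folds.values():
        for qids in splits.values():
            for qid in qids:
                if qid not in topics or qid not in qrels:
                    return False
    return True


consistent = fold_qids_are_known(topics, qrels, folds)
```

A check of this kind can catch a fold file that refers to queries missing from the topics or judgments before an experiment is run.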
Documents with a relevance label >= relevance_level will be considered relevant. This corresponds to trec_eval's --level_for_rel (and is passed to pytrec_eval as relevance_level).
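The thresholding rule can be sketched in plain Python. This is only an illustration of the ">= relevance_level" comparison, not the library's or trec_eval's implementation; the helper relevant_docids and the toy judgments are hypothetical.

```python
def relevant_docids(qrels, relevance_level=1):
    """Map each qid to the set of docids whose label is >= relevance_level."""
    return {
        qid: {docid for docid, label in judged.items() if label >= relevance_level}
        for qid, judged in qrels.items()
    }


# Hypothetical graded judgments: qid -> docid -> relevance label
example_qrels = {
    "q1": {"d1": 2, "d2": 1, "d3": 0},
    "q2": {"d4": 1},
}

lenient = relevant_docids(example_qrels, relevance_level=1)
strict = relevant_docids(example_qrels, relevance_level=2)
```

With a level of 1, any document with a positive label counts as relevant; raising the level to 2 keeps only the more highly graded documents, so q2 ends up with no relevant documents in this toy data.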