capreolus.benchmark.msmarco

Module Contents

Classes

MSMarcoPassage

Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.

Attributes

logger

PACKAGE_PATH

capreolus.benchmark.msmarco.logger
capreolus.benchmark.msmarco.PACKAGE_PATH
class capreolus.benchmark.msmarco.MSMarcoPassage(config=None, provide=None, share_dependency_objects=False, build=True)

Bases: capreolus.benchmark.Benchmark

Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.

Modules should provide:
  • a topics dict mapping query ids (qids) to queries
  • a qrels dict mapping qids to docids and relevance labels
  • a folds dict mapping a fold name to training, dev (validation), and testing qids

If these can be loaded from files in standard formats, they can be specified by setting topic_file, qrel_file, and fold_file, respectively, instead of setting the attributes above directly.
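As a rough illustration of the three structures described above, the sketch below builds toy topics, qrels, and folds dicts and checks that they agree on query ids. The ids, query text, flat nesting, and the check_benchmark_data helper are all hypothetical. The layout Capreolus uses internally may differ, for example by adding further nesting.

```python
# Toy data only: ids, query text, and the exact nesting are assumptions
# made for illustration, not values from MS MARCO or Capreolus internals.
topics = {"q1": "what is a benchmark", "q2": "how are passages ranked"}
qrels = {"q1": {"d10": 1, "d11": 0}, "q2": {"d20": 1}}
folds = {"s1": {"train": ["q1"], "dev": ["q2"], "test": ["q2"]}}


def check_benchmark_data(topics, qrels, folds):
    """Return messages for qids that lack a topic (hypothetical helper)."""
    problems = []
    # Every judged query should have query text.
    for qid in qrels:
        if qid not in topics:
            problems.append(f"qrels qid {qid!r} has no topic")
    # Every qid named in a fold split should have query text.
    for fold_name, splits in folds.items():
        for split_name, qids in splits.items():
            for qid in qids:
                if qid not in topics:
                    problems.append(
                        f"fold {fold_name!r} {split_name} qid {qid!r} has no topic"
                    )
    return problems


print(check_benchmark_data(topics, qrels, folds))
```

A check of this kind is one way to catch mismatched ids before running an experiment. It is not part of the Capreolus API.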

module_name = 'msmarcopsg'
dependencies
query_type = 'title'
config_spec = []
use_train_as_dev = False
data_dir
qrel_file
topic_file
fold_file
build()
download_if_missing()