`capreolus.benchmark.robust04`¶

Module Contents¶

Classes¶

`Robust04`	Robust04 benchmark using the title folds from Huston and Croft. [1] Each of these is used as the test set.
`Robust04Yang19`	Robust04 benchmark using the folds from Yang et al. [1]
`Robust04Yang19Desc`	Robust04 benchmark using the folds from Yang et al. [1]
`Robust04Huston14`	Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.
`Robust04Huston14Desc`	Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.

Attributes¶

PACKAGE_PATH

capreolus.benchmark.robust04.PACKAGE_PATH[source]¶

class capreolus.benchmark.robust04.Robust04(config=None, provide=None, share_dependency_objects=False, build=True)[source]¶

Bases: capreolus.benchmark.Benchmark

Robust04 benchmark using the title folds from Huston and Croft. [1] Each of these is used as the test set. Given the remaining four folds, we split them into the same train and dev sets used in recent work. [2]

[1] Samuel Huston and W. Bruce Croft. 2014. Parameters learned in the comparison of retrieval models using term dependencies. Technical Report.

[2] Sean MacAvaney, Andrew Yates, Arman Cohan, Nazli Goharian. 2019. CEDR: Contextualized Embeddings for Document Ranking. SIGIR 2019.

module_name = 'robust04'[source]¶

dependencies[source]¶

qrel_file[source]¶

topic_file[source]¶

fold_file[source]¶

query_type = 'title'[source]¶

class capreolus.benchmark.robust04.Robust04Yang19(config=None, provide=None, share_dependency_objects=False, build=True)[source]¶

Bases: capreolus.benchmark.Benchmark

Robust04 benchmark using the folds from Yang et al. [1]

[1] Wei Yang, Kuang Lu, Peilin Yang, and Jimmy Lin. 2019. Critically Examining the “Neural Hype”: Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models. SIGIR 2019.

module_name = 'robust04.yang19'[source]¶

dependencies[source]¶

qrel_file[source]¶

topic_file[source]¶

fold_file[source]¶

query_type = 'title'[source]¶

class capreolus.benchmark.robust04.Robust04Yang19Desc(config=None, provide=None, share_dependency_objects=False, build=True)[source]¶

Bases: Robust04Yang19, capreolus.benchmark.Benchmark

Robust04 benchmark using the folds from Yang et al. [1]

[1] Wei Yang, Kuang Lu, Peilin Yang, and Jimmy Lin. 2019. Critically Examining the “Neural Hype”: Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models. SIGIR 2019.

module_name = 'robust04.yang19.desc'[source]¶

query_type = 'desc'[source]¶

class capreolus.benchmark.robust04.Robust04Huston14(config=None, provide=None, share_dependency_objects=False, build=True)[source]¶

Bases: capreolus.benchmark.Benchmark

Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.

Modules should provide:

a topics dict mapping query ids (qids) to queries
a qrels dict mapping qids to docids and relevance labels
a folds dict mapping a fold name to training, dev (validation), and testing qids
if these can be loaded from files in standard formats, they can be specified by setting the topic_file, qrel_file, and fold_file, respectively, rather than by setting the above attributes directly

module_name = 'robust04.huston14.title'[source]¶

dependencies[source]¶

qrel_file[source]¶

topic_file[source]¶

fold_file[source]¶

query_type = 'title'[source]¶

class capreolus.benchmark.robust04.Robust04Huston14Desc(config=None, provide=None, share_dependency_objects=False, build=True)[source]¶

Bases: Robust04Huston14, capreolus.benchmark.Benchmark

Base class for Benchmark modules. The purpose of a Benchmark is to provide the data needed to run an experiment, such as queries, folds, and relevance judgments.

Modules should provide:

a topics dict mapping query ids (qids) to queries
a qrels dict mapping qids to docids and relevance labels
a folds dict mapping a fold name to training, dev (validation), and testing qids
if these can be loaded from files in standard formats, they can be specified by setting the topic_file, qrel_file, and fold_file, respectively, rather than by setting the above attributes directly

module_name = 'robust04.huston14.desc'[source]¶

fold_file[source]¶

query_type = 'desc'[source]¶

capreolus.benchmark.robust04¶

Module Contents¶

Classes¶

Attributes¶

`capreolus.benchmark.robust04`¶