capreolus.extractor.embedtext

Module Contents

Classes

EmbedText(config=None, provide=None, share_dependency_objects=False, build=True) Base class for Extractor modules. The purpose of an Extractor is to convert queries and documents to a representation suitable for use with a Reranker module.
capreolus.extractor.embedtext.logger[source]
class capreolus.extractor.embedtext.EmbedText(config=None, provide=None, share_dependency_objects=False, build=True)[source]

Bases: capreolus.extractor.Extractor

Base class for Extractor modules. The purpose of an Extractor is to convert queries and documents to a representation suitable for use with a Reranker module.

Modules should provide:
  • an id2vec(qid, posid, negid=None) method that converts the given query and document ids to an appropriate representation
module_name = embedtext[source]
requires_random_seed = True[source]
dependencies[source]
config_spec[source]
pad = 0[source]
pad_tok = <pad>[source]
embed_paths[source]
load_state(self, qids, docids)[source]
cache_state(self, qids, docids)[source]
get_tf_feature_description(self)[source]
create_tf_feature(self, sample)[source]

sample - output from self.id2vec() return - a tensorflow feature

parse_tf_example(self, example_proto)[source]
exist(self)[source]
preprocess(self, qids, docids, topics)[source]
id2vec(self, qid, posid, negid=None)[source]