macavaney veneres commited on
Commit
fef8ff6
1 Parent(s): 6c2eac0

Update wrapup.md (#1)

Browse files

- Update wrapup.md (81b559d2c810543c689a7569e0032f0755967cde)


Co-authored-by: Alberto Veneri <veneres@users.noreply.huggingface.co>

Files changed (1) hide show
  1. wrapup.md +2 -2
wrapup.md CHANGED
@@ -1,6 +1,6 @@
1
  ### Putting it all together
2
 
3
- When you use the document encoder in an indexing pipeline, the rewritting document contents are indexed:
4
 
5
  <div class="pipeline">
6
  <div class="df" title="Document Frame">D</div>
@@ -18,7 +18,7 @@ import pyt_splade
18
  dataset = pt.get_dataset('irds:msmarco-passage')
19
  splade = pyt_splade.SpladeFactory()
20
 
21
- indexer = pt.IterDictIndexer('./msmarco_psg', pretokenized=True)
22
 
23
  indxer_pipe = splade.indexing() >> indexer
24
  indxer_pipe.index(dataset.get_corpus_iter())
 
1
  ### Putting it all together
2
 
3
+ When you use the document encoder in an indexing pipeline, the rewritten document contents are indexed:
4
 
5
  <div class="pipeline">
6
  <div class="df" title="Document Frame">D</div>
 
18
  dataset = pt.get_dataset('irds:msmarco-passage')
19
  splade = pyt_splade.SpladeFactory()
20
 
21
+ indexer = pt.IterDictIndexer('./msmarco_psg', pretokenised=True)
22
 
23
  indxer_pipe = splade.indexing() >> indexer
24
  indxer_pipe.index(dataset.get_corpus_iter())