liujch1998 commited on
Commit
86bc4ef
1 Parent(s): d8d9fba

Add a warning in doc search

Browse files
Files changed (1) hide show
  1. app.py +1 -0
app.py CHANGED
@@ -172,6 +172,7 @@ with gr.Blocks() as demo:
172
  <li>If you query for two or more clauses, and a clause has more than {MAX_CLAUSE_FREQ_FAST_APPROX_PER_SHARD} matches (per shard), we will search within a random subset of all documents containing that clause.</li>
173
  <li>The number of found documents may contain duplicates (e.g., if a document contains your query term twice, it may be counted twice).</li>
174
  </ul>
 
175
  ''')
176
  with gr.Row():
177
  with gr.Column(scale=1):
 
172
  <li>If you query for two or more clauses, and a clause has more than {MAX_CLAUSE_FREQ_FAST_APPROX_PER_SHARD} matches (per shard), we will search within a random subset of all documents containing that clause.</li>
173
  <li>The number of found documents may contain duplicates (e.g., if a document contains your query term twice, it may be counted twice).</li>
174
  </ul>
175
+ <p style="font-size: 16px;">❗️WARNING: Corpus may contain problematic contents such as PII, toxicity, hate speech, and NSFW text. This tool is merely presenting selected text from the corpus, without any post-hoc safety filtering. It is NOT creating new text. This is a research prototype through which we can expose and examine existing problems with massive text corpora. Please use with caution. Don't be evil :)</p>
176
  ''')
177
  with gr.Row():
178
  with gr.Column(scale=1):