Spaces: juliensimon/voice-queries

juliensimon HF staff committed on Jan 3, 2022

Commit 15763b2 • 1 Parent(s): 8be8d39

Download tokenizer

Files changed (1)

app.py CHANGED

@@ -20,6 +20,8 @@ df = pd.read_csv(filename)
 df.drop_duplicates(inplace=True)
 print(f'Number of documents: {len(df)}')
+nltk.download('punkt')
 corpus = []
 sentence_count = []
 for _, row in df.iterrows():
@@ -107,4 +109,4 @@ iface = gr.Interface(
     ],
     allow_flagging=False
 )
-iface.launch()
+iface.launch()
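
The added nltk.download('punkt') call runs every time app.py starts. As a possible variation, not part of this commit, the tokenizer could be looked up first and downloaded only when it is missing. The sketch below assumes NLTK's nltk.data.find and nltk.download; the helper name ensure_punkt and its find/download parameters are hypothetical and exist only to illustrate the idea.

```python
def ensure_punkt(find=None, download=None):
    """Download the 'punkt' tokenizer only if it is not found locally.

    Hypothetical helper: `find` and `download` default to nltk.data.find
    and nltk.download, and can be replaced with stubs for testing.
    Returns True if a download was triggered, False otherwise.
    """
    if find is None or download is None:
        # Imported lazily so the helper can be exercised with stubs alone.
        import nltk
        find = find or nltk.data.find
        download = download or nltk.download
    try:
        # nltk.data.find raises LookupError when the resource is absent.
        find('tokenizers/punkt')
        return False
    except LookupError:
        download('punkt')
        return True
```

In app.py this would stand in for the bare nltk.download('punkt') line, called once before the corpus loop. The resource name 'punkt' is taken from the diff; other NLTK versions may expect a different name.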