---
title: Arabic Wiki
emoji: 📈
colorFrom: purple
colorTo: purple
sdk: gradio
sdk_version: 3.44.4
app_file: app.py
pinned: false
license: apache-2.0
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

# Installation
    pip install -r requirements.txt
# Pre-processing
    wget https://dumps.wikimedia.org/arwiki/latest/arwiki-latest-pages-articles-multistream.xml.bz2
    wikiextractor -o output --json arwiki-latest-pages-articles-multistream.xml.bz2
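
Once the extraction has finished, the articles can be read back in Python. The following is a minimal sketch, not part of this repository's code: it assumes that `wikiextractor --json` wrote files named like `output/AA/wiki_00` containing one JSON object per line with `title` and `text` fields. The helper name `iter_articles` is hypothetical.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_articles(output_dir: str) -> Iterator[dict]:
    """Yield one dict per article from the extracted JSON-lines files.

    Assumption: files are laid out as <output_dir>/<subdir>/wiki_NN,
    with one JSON object per line.
    """
    for path in sorted(Path(output_dir).glob("*/wiki_*")):
        with path.open(encoding="utf-8") as f:
            for line in f:
                line = line.strip()
                if not line:
                    continue  # ignore blank lines
                article = json.loads(line)
                # Skip entries that carry no body text.
                if article.get("text"):
                    yield article


if __name__ == "__main__":
    # Print the title and text length of the first five articles.
    for i, article in enumerate(iter_articles("output")):
        print(article["title"], len(article["text"]))
        if i >= 4:
            break
```

Reading the files lazily with a generator keeps memory use low, which may matter because the full Arabic Wikipedia dump is large.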