Is the training corpus available online
#2
by
viktoroo
- opened
It would be greatly appreciated if you could share more information on training corpus!
The training corpus is not publicly available due to distribution rights. It was crawled from some well-known Arabic press websites such as Aljazeera and Alarabiya and annotated manually. It is made of 378.000 tokens (14.000 sentences)
Thanks for the info!
viktoroo
changed discussion status to
closed