Carlos Rodríguez PRO

crodri

AI & ML interests

NLP, language modelling, NLU, Sentiment Analysis, Knowledge Engineering, Knowledge representation

Organizations

Posts 1

view post
Post
1836
Multilingual RAG optimized models and datasets available from the Language Technologies Unit @ the Barcelona Supercomputing Unit
We are releasing new RAG-optimized multilingual models and dataset, within the AINA project contributions:

projecte-aina/FlorRAG , based on out Bloom Flor6.3b model, capable of RAG in Catalan, Spanish and English

projecte-aina/RAG_Multilingual , a 56K+ instructional dataset with human-like answers created from kernel-of-truth of extractive datasets using a Mixtral8x7b model