AI & ML interests

Conversational AI, Generative AI, LLM, Datasets, Chatbots

Bitext provides NLP/NLG services to 3 of the top 5 companies on NASDAQ. Bitext automates Text Data Services for Multilingual GenAI, covering:

  • Generation of Synthetic Text based on proprietary NLG technology (not generative AI)
  • Automation of Data Annotation and Labelling (DAL) using GenAI models and NLP tools with a human-in-the-loop approach
  • Verticalization of General-Purpose models (GPT, Mistral, OpenELM) in 20 domains (such as Customer Support, Banking, and Travel)
  • Training and Evaluation of General-Purpose models for Conversational AI

We offer hybrid synthetic datasets to fine-tune LLMs like GPT, Mistral, and OpenELM, showcasing domain adaptation in sectors like Retail Banking. Our two-step approach allows clients to create customized LLMs by first fine-tuning a general-purpose model on our dataset and then fine-tuning it further with their own data.
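
The two-step idea can be illustrated with a deliberately tiny sketch. It is not Bitext's pipeline and it does not train an LLM: a toy bag-of-words perceptron stands in for the model, and the class name `TinyIntentModel`, the intents, the example utterances, and the "GoldPlus" product name are all hypothetical. The only point is the order of operations: fine-tune on a domain dataset first, then continue training the same model on client data.

```python
from collections import defaultdict


def featurize(text):
    """Lowercase whitespace tokenization (toy feature extractor)."""
    return text.lower().split()


class TinyIntentModel:
    """Toy stand-in for an LLM: a multi-class perceptron over tokens."""

    def __init__(self):
        # intent -> token -> weight
        self.weights = defaultdict(lambda: defaultdict(float))
        # Intents in first-seen order, so ties break deterministically.
        self.intents = []

    def predict(self, text):
        if not self.intents:
            return None
        tokens = featurize(text)
        scores = {i: sum(self.weights[i][t] for t in tokens) for i in self.intents}
        return max(self.intents, key=lambda i: scores[i])

    def fine_tune(self, examples, epochs=5):
        """Continue training from the current weights (nothing is reset)."""
        for _ in range(epochs):
            for text, intent in examples:
                if intent not in self.intents:
                    self.intents.append(intent)
                guess = self.predict(text)
                if guess != intent:
                    # Standard perceptron update: reward the gold intent,
                    # penalize the wrongly predicted one.
                    for token in featurize(text):
                        self.weights[intent][token] += 1.0
                        self.weights[guess][token] -= 1.0
        return self


# Step 1 data: hypothetical vertical (Retail Banking) examples.
domain_data = [
    ("I want to check my balance", "check_balance"),
    ("show my account balance", "check_balance"),
    ("I lost my card", "block_card"),
    ("please block my credit card", "block_card"),
]

# Step 2 data: hypothetical client-specific examples.
client_data = [
    ("freeze my GoldPlus card", "block_card"),
    ("what is my GoldPlus balance", "check_balance"),
    ("open a GoldPlus savings pot", "open_account"),
]

model = TinyIntentModel()
model.fine_tune(domain_data)   # step 1: domain dataset
before = model.predict("open a GoldPlus savings pot")
model.fine_tune(client_data)   # step 2: client's own data
after = model.predict("open a GoldPlus savings pot")
print(before, "->", after)
```

After step 1 the model only knows the generic banking intents; step 2 adds the client-specific intent while the weights learned from the domain data are kept rather than reset.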

Our technology supports 77 languages (including Arabic, Japanese, Chinese, Hindi, Urdu) and 25 regional variants (like Egyptian Arabic, Canadian French, Indian English). More details can be found in "From General-Purpose LLMs to Verticalized Enterprise Models".