---
license: cc-by-nc-sa-4.0
datasets:
- QCRI/LlamaLens-English
- QCRI/LlamaLens-Arabic
- QCRI/LlamaLens-Hindi
language:
- ar
- en
- hi
base_model:
- meta-llama/Llama-3.1-8B-Instruct
pipeline_tag: text-generation
tags:
- Social-Media
- Hate-Speech
- Summarization
- offensive-language
- News-Genre
metrics:
- accuracy
- f1
- rouge
---
# LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content

## Overview
LlamaLens is a specialized multilingual LLM designed for analyzing news and social media content. It focuses on 18 NLP tasks, leveraging 52 datasets across Arabic, English, and Hindi.

<p align="center">
  <picture>
    <img width="352" alt="LlamaLens avatar" src="./llamalens-avatar.png">
  </picture>
</p>

## Dataset
The model was trained on the [LlamaLens dataset](https://huggingface.co/collections/QCRI/llamalens-672f7e0604a0498c6a2f0fe9).
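
The language-specific subsets listed in the metadata above can be inspected directly with the 🤗 `datasets` library. Below is a minimal sketch; the `"train"` split name is an assumption, so check each dataset card for the actual splits:

```python
from datasets import load_dataset

# Load the English subset; QCRI/LlamaLens-Arabic and QCRI/LlamaLens-Hindi
# work the same way. The "train" split name is an assumption.
dataset = load_dataset("QCRI/LlamaLens-English", split="train")

print(dataset)      # features and number of rows
print(dataset[0])   # first example
```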

## To Replicate the Experiments
The code to replicate the experiments is available on [GitHub](https://github.com/firojalam/LlamaLens).

## Model Inference

To use the LlamaLens model for inference, follow these steps:

1. **Install the Required Libraries**:

Ensure you have the necessary libraries installed (`accelerate` is required for `device_map="auto"` in the next step). You can do this using pip:

```bash
pip install transformers torch accelerate
```
2. **Load the Model and Tokenizer**:
Use the `transformers` library to load the LlamaLens model and its tokenizer:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Define model path
MODEL_PATH = "QCRI/LlamaLens"

# Load model and tokenizer (device_map="auto" places the model on available devices)
device_map = "auto"
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, device_map=device_map)
tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH, trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token
```
3. **Prepare the Input**:
Tokenize your input text:
```python
# Define task and input text
task = "classification"  # Change to "summarization" for summarization tasks
instruction = (
    "Analyze the text and indicate if it shows an emotion, then label it as joy, love, fear,"
    " anger, sadness, or surprise. Return only the label without any explanation, justification, or additional text."
)
input_text = "I am not creating anything I feel satisfied with."
output_prefix = "Summary: " if task == "summarization" else "Label: "

# Define messages for chat-based prompt format
messages = [
    {"role": "system", "content": "You are a social media expert providing accurate analysis and insights."},
    {"role": "user", "content": f"{instruction}\nInput: {input_text}"},
    {"role": "assistant", "content": output_prefix}
]

# Tokenize input, continuing the prefilled assistant prefix ("Label: " or "Summary: ")
input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=False,
    continue_final_message=True,
    tokenize=True,
    padding=True,
    return_tensors="pt"
).to(model.device)
```
4. **Generate the Output**:
Generate a response using the model:
```python
# Generate response (greedy decoding: with do_sample=False, temperature has no effect)
outputs = model.generate(
    input_ids,
    max_new_tokens=128,
    do_sample=False,
    eos_token_id=tokenizer.eos_token_id,
    pad_token_id=tokenizer.eos_token_id
)

# Decode only the newly generated tokens and print the response
response = tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True)
print(response)
```
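
The four steps above can be wrapped into a single helper for repeated calls. The sketch below reuses `model`, `tokenizer`, and `instruction` from the previous steps; the `analyze` function name and its defaults are ours for illustration, not part of the model's API:

```python
def analyze(text, instruction, task="classification"):
    """Run one LlamaLens inference call and return the generated label or summary."""
    output_prefix = "Summary: " if task == "summarization" else "Label: "
    messages = [
        {"role": "system", "content": "You are a social media expert providing accurate analysis and insights."},
        {"role": "user", "content": f"{instruction}\nInput: {text}"},
        {"role": "assistant", "content": output_prefix},
    ]
    input_ids = tokenizer.apply_chat_template(
        messages,
        add_generation_prompt=False,
        continue_final_message=True,
        tokenize=True,
        return_tensors="pt",
    ).to(model.device)
    outputs = model.generate(
        input_ids,
        max_new_tokens=128,
        do_sample=False,
        eos_token_id=tokenizer.eos_token_id,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Strip the prompt tokens and keep only the generated continuation
    return tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True).strip()

# Example: the emotion-classification prompt from step 3
print(analyze("I am not creating anything I feel satisfied with.", instruction))
```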

## Results

Below, we present the performance of **L-Lens: LlamaLens**, where *"Eng"* refers to the English-instructed model and *"Native"* refers to the model trained with native-language instructions. The results are compared against the SOTA (where available) and the **Llama-3.1-8B-Instruct baseline (Base)**. The **Δ** (Delta) column indicates the difference between LlamaLens and the SOTA performance, calculated as (LlamaLens – SOTA).
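
For readers unfamiliar with the metric abbreviations in the tables, the snippet below shows how they map onto standard library calls; the toy labels are ours for illustration, and this is not the project's evaluation script:

```python
from sklearn.metrics import accuracy_score, f1_score

# Toy gold/predicted labels, for illustration only
y_true = ["joy", "anger", "joy", "sadness"]
y_pred = ["joy", "joy", "joy", "sadness"]

print("Acc  :", accuracy_score(y_true, y_pred))
print("Mi-F1:", f1_score(y_true, y_pred, average="micro"))     # micro-averaged F1
print("Ma-F1:", f1_score(y_true, y_pred, average="macro"))     # macro-averaged F1
print("W-F1 :", f1_score(y_true, y_pred, average="weighted"))  # weighted F1

# F1_Pos is the F1 of the positive class in binary tasks, e.g.
# f1_score(y_true, y_pred, pos_label="offensive", average="binary").
# R-2 is ROUGE-2 for summarization (e.g. via the rouge-score package).
```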

---

## Arabic

| **Task** | **Dataset** | **Metric** | **SOTA** | **Base** | **L-Lens-Eng** | **L-Lens-Native** | **Δ (L-Lens (Eng) - SOTA)** |
|:----------------------------------:|:--------------------------------------------:|:----------:|:--------:|:--------:|:--------------:|:-----------------:|:---------------------------:|
| Attentionworthiness Detection | CT22Attentionworthy | W-F1 | 0.412 | 0.158 | 0.425 | 0.454 | 0.013 |
| Checkworthiness Detection | CT24_checkworthy | F1_Pos | 0.569 | 0.610 | 0.502 | 0.509 | -0.067 |
| Claim Detection | CT22Claim | Acc | 0.703 | 0.581 | 0.734 | 0.756 | 0.031 |
| Cyberbullying Detection | ArCyc_CB | Acc | 0.863 | 0.766 | 0.870 | 0.833 | 0.007 |
| Emotion Detection | Emotional-Tone | W-F1 | 0.658 | 0.358 | 0.705 | 0.736 | 0.047 |
| Emotion Detection | NewsHeadline | Acc | 1.000 | 0.406 | 0.480 | 0.458 | -0.520 |
| Factuality | Arafacts | Mi-F1 | 0.850 | 0.210 | 0.771 | 0.738 | -0.079 |
| Factuality | COVID19Factuality | W-F1 | 0.831 | 0.492 | 0.800 | 0.840 | -0.031 |
| Harmfulness Detection | CT22Harmful | F1_Pos | 0.557 | 0.507 | 0.523 | 0.535 | -0.034 |
| Hate Speech Detection | annotated-hatetweets-4-classes | W-F1 | 0.630 | 0.257 | 0.526 | 0.517 | -0.104 |
| Hate Speech Detection | OSACT4SubtaskB | Mi-F1 | 0.950 | 0.819 | 0.955 | 0.955 | 0.005 |
| News Categorization | ASND | Ma-F1 | 0.770 | 0.587 | 0.919 | 0.929 | 0.149 |
| News Categorization | SANADAkhbarona-news-categorization | Acc | 0.940 | 0.784 | 0.954 | 0.953 | 0.014 |
| News Categorization | SANADAlArabiya-news-categorization | Acc | 0.974 | 0.893 | 0.987 | 0.985 | 0.013 |
| News Categorization | SANADAlkhaleej-news-categorization | Acc | 0.986 | 0.865 | 0.984 | 0.982 | -0.002 |
| News Categorization | UltimateDataset | Ma-F1 | 0.970 | 0.376 | 0.865 | 0.880 | -0.105 |
| News Credibility | NewsCredibilityDataset | Acc | 0.899 | 0.455 | 0.935 | 0.933 | 0.036 |
| News Summarization | xlsum | R-2 | 0.137 | 0.034 | 0.129 | 0.130 | -0.009 |
| Offensive Language Detection | ArCyc_OFF | Ma-F1 | 0.878 | 0.489 | 0.877 | 0.879 | -0.001 |
| Offensive Language Detection | OSACT4SubtaskA | Ma-F1 | 0.905 | 0.782 | 0.896 | 0.882 | -0.009 |
| Propaganda Detection | ArPro | Mi-F1 | 0.767 | 0.597 | 0.747 | 0.731 | -0.020 |
| Sarcasm Detection | ArSarcasm-v2 | F1_Pos | 0.584 | 0.477 | 0.520 | 0.542 | -0.064 |
| Sentiment Classification | ar_reviews_100k | F1_Pos | -- | 0.681 | 0.785 | 0.779 | -- |
| Sentiment Classification | ArSAS | Acc | 0.920 | 0.603 | 0.800 | 0.804 | -0.120 |
| Stance Detection | stance | Ma-F1 | 0.767 | 0.608 | 0.926 | 0.881 | 0.159 |
| Stance Detection | Mawqif-Arabic-Stance-main | Ma-F1 | 0.789 | 0.764 | 0.853 | 0.826 | 0.065 |
| Subjectivity Detection | ThatiAR | F1_Pos | 0.800 | 0.562 | 0.441 | 0.383 | -0.359 |

---

## English

| **Task** | **Dataset** | **Metric** | **SOTA** | **Base** | **L-Lens-Eng** | **L-Lens-Native** | **Δ (L-Lens (Eng) - SOTA)** |
|:----------------------------------:|:--------------------------------------------:|:----------:|:--------:|:--------:|:--------------:|:-----------------:|:---------------------------:|
| Checkworthiness Detection | CT24_checkworthy | F1_Pos | 0.753 | 0.404 | 0.942 | 0.942 | 0.189 |
| Claim Detection | claim-detection | Mi-F1 | -- | 0.545 | 0.864 | 0.889 | -- |
| Cyberbullying Detection | Cyberbullying | Acc | 0.907 | 0.175 | 0.836 | 0.855 | -0.071 |
| Emotion Detection | emotion | Ma-F1 | 0.790 | 0.353 | 0.803 | 0.808 | 0.013 |
| Factuality | News_dataset | Acc | 0.920 | 0.654 | 1.000 | 1.000 | 0.080 |
| Factuality | Politifact | W-F1 | 0.490 | 0.121 | 0.287 | 0.311 | -0.203 |
| News Categorization | CNN_News_Articles_2011-2022 | Acc | 0.940 | 0.644 | 0.970 | 0.970 | 0.030 |
| News Categorization | News_Category_Dataset | Ma-F1 | 0.769 | 0.970 | 0.824 | 0.520 | 0.055 |
| News Genre Categorization | SemEval23T3-subtask1 | Mi-F1 | 0.815 | 0.687 | 0.241 | 0.253 | -0.574 |
| News Summarization | xlsum | R-2 | 0.152 | 0.074 | 0.182 | 0.181 | 0.030 |
| Offensive Language Detection | Offensive_Hateful_Dataset_New | Mi-F1 | -- | 0.692 | 0.814 | 0.813 | -- |
| Offensive Language Detection | offensive_language_dataset | Mi-F1 | 0.994 | 0.646 | 0.899 | 0.893 | -0.095 |
| Offensive Language and Hate Speech | hate-offensive-speech | Acc | 0.945 | 0.602 | 0.931 | 0.935 | -0.014 |
| Propaganda Detection | QProp | Ma-F1 | 0.667 | 0.759 | 0.963 | 0.973 | 0.296 |
| Sarcasm Detection | News-Headlines-Dataset-For-Sarcasm-Detection | Acc | 0.897 | 0.668 | 0.936 | 0.947 | 0.039 |
| Sentiment Classification | NewsMTSC-dataset | Ma-F1 | 0.817 | 0.628 | 0.751 | 0.748 | -0.066 |
| Subjectivity Detection | clef2024-checkthat-lab | Ma-F1 | 0.744 | 0.535 | 0.642 | 0.628 | -0.102 |

---

## Hindi

| **Task** | **Dataset** | **Metric** | **SOTA** | **Base** | **L-Lens-Eng** | **L-Lens-Native** | **Δ (L-Lens (Eng) - SOTA)** |
|:----------------------------------:|:--------------------------------------------:|:----------:|:--------:|:--------:|:--------------:|:-----------------:|:---------------------------:|
| Factuality | fake-news | Mi-F1 | -- | 0.759 | 0.994 | 0.993 | -- |
| Hate Speech Detection | hate-speech-detection | Mi-F1 | 0.639 | 0.750 | 0.963 | 0.963 | 0.324 |
| Hate Speech Detection | Hindi-Hostility-Detection-CONSTRAINT-2021 | W-F1 | 0.841 | 0.469 | 0.753 | 0.753 | -0.088 |
| Natural Language Inference | Natural Language Inference | W-F1 | 0.646 | 0.633 | 0.568 | 0.679 | -0.078 |
| News Summarization | xlsum | R-2 | 0.136 | 0.078 | 0.171 | 0.170 | 0.035 |
| Offensive Language Detection | Offensive Speech Detection | Mi-F1 | 0.723 | 0.621 | 0.862 | 0.865 | 0.139 |
| Cyberbullying Detection | MC_Hinglish1 | Acc | 0.609 | 0.233 | 0.625 | 0.627 | 0.016 |
| Sentiment Classification | Sentiment Analysis | Acc | 0.697 | 0.552 | 0.647 | 0.654 | -0.050 |

---

## Paper
For an in-depth understanding, refer to our paper: [**LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content**](https://arxiv.org/pdf/2410.15308).

## License
This model is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license (CC BY-NC-SA 4.0).

## Citation
Please cite [our paper](https://arxiv.org/pdf/2410.15308) when using this model:

```bibtex
@article{kmainasi2024llamalensspecializedmultilingualllm,
  title={LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content},
  author={Mohamed Bayan Kmainasi and Ali Ezzat Shahroor and Maram Hasanain and Sahinur Rahman Laskar and Naeemul Hassan and Firoj Alam},
  year={2024},
  journal={arXiv preprint arXiv:2410.15308},
  url={https://arxiv.org/abs/2410.15308},
  eprint={2410.15308},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
```