Alessamo
/

CDT-Domain-Tagger

capability-tagging

Model card Files Files and versions

Alessamo commited on Sep 12

Commit

35d27a5

·

verified ·

1 Parent(s): 7805963

Update README.md

Files changed (1) hide show

README.md +67 -3

README.md CHANGED Viewed

@@ -1,3 +1,67 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+language:
+- en
+base_model:
+- Qwen/Qwen2.5-7B
+tags:
+- capability-tagging
+- qwen
+- domain
+---
+# Model Card for CDT-Task-Tagger
+This model is a component of the **Cognition-Domain-Task (CDT) framework**, a comprehensive capability framework for Large Language Models presented in our paper CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task. It has been specifically fine-tuned to classify a given instruction into one of 33 domains as defined by the CDT framework.
+## Model Details
+### Model Description
+This model categorizes any given instruction into one of 33 predefined knowledge domains, pinpointing the subject area of the request.
+- **Model type:** Qwen2ForCausalLM
+- **Language(s) (NLP):** English
+- **License:** Apache 2.0
+- **Finetuned from model:** Qwen2.5-7B-Base
+### Model Sources
+<!-- Provide the basic links for the model. -->
+- **Repository:** https://github.com/Alessa-mo/CDT
+- **Paper Link:**
+### Basic Usage
+Please refer to https://github.com/Alessa-mo/CDT. You can run the following scripts to tag the cognition labels.
+```bash
+cd tag_annotate
+export CUDA_VISIBLE_DEVICES=0
+python annotate.py \
+    --data_path path/to/your/data \
+    --output_dir path/to/output/dir \
+    --model_path CDT-Domain-Tagger \
+    --prompt_file ./prompt/annotation_prompt.jsonl \
+    --cognition_skill_file ./prompt/cognition.json \
+    --domain_skill_file ./prompt/domain.json \
+    --task_skill_file ./prompt/task.json \
+    --tag_type "Domain" \
+    --batch_size 32
+```
+**Note**: Make sure your data is a JSON file and has the following format:
+```json
+[
+    {
+        "messages": [
+            {
+                "role": "user",
+                "content": "xxxx"
+            },
+            {
+                "role": "assistant",
+                "content": "xxxx"
+            }
+        ]
+    },
+]
+```
+## Citation
+If you find this model useful, please cite:
+```bash