Thanos
Collection
Skill-of-Mind-Infused LLM
β’
5 items
β’
Updated
β’
2
π§ Multifaceted Skill-of-Mind | π€ Thanos-1B | π€ Thanos-8B | π» Github | π Arxiv | π PDF
π¨ Disclaimer: All models and dataset are intended to be used for research purposes only.
π¨ Thanos-3B is intended to be used for research purposes only.
This work was supported by a grant of the KAIST-KT joint research project through AI Tech Lab, Institute of convergence Technology, funded by KT [Project No. G01230605, Development of Task-oriented Persona-based Dialogue Generation Combining Multi-modal Interaction and Knowledge Modeling].
If you find the resources in this repository useful, please cite our work:
@misc{lee2024thanosenhancingconversationalagents,
title={Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model},
author={Young-Jun Lee and Dokyong Lee and Junyoung Youn and Kyeongjin Oh and Ho-Jin Choi},
year={2024},
eprint={2411.04496},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2411.04496},
}
Base model
meta-llama/Llama-3.2-3B-Instruct