HuanjinYao
/

Mulberry_llama_11b

Image-Text-to-Text

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mulberry_llama_11b / README.md

HuanjinYao's picture

Update README.md

a984d8f verified 27 days ago

|

history blame contribute delete

635 Bytes

	---
	license: apache-2.0
	pipeline_tag: image-text-to-text
	library_name: transformers
	base_model: meta-llama/Llama-3.2-11B-Vision-Instruct
	---


	# Mulberry

	Mulberry-llama-11b is a step-by-step reasoning model trained on the Mulberry-260K SFT dataset, which was generated through collective knowledge search using CoMCTS.

	For reasoning inference, please refer to our GitHub.

	Paper: https://arxiv.org/abs/2412.18319

	Code: https://github.com/HJYao00/Mulberry


	## More Details
	Base Model: https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct

	Training Framework: LLaMA-Factory

	Hardware: 8x NVIDIA H100