Upload folder using huggingface_hub

43e1bc2 verified 12 minutes ago

1.91 kB

license: apache-2.0
library_name: transformers

AssistantModel

1. Introduction

AssistantModel is designed for interactive assistant applications. This checkpoint is selected based on the combined performance of knowledge retrieval and instruction following benchmarks, making it ideal for AI assistant deployment.

2. Evaluation Results

Comprehensive Benchmark Results

	Benchmark	Assistant-v1	Assistant-v2	AssistantModel
Core Reasoning Tasks	Math Reasoning	0.510	0.535	0.606
	Logical Reasoning	0.789	0.801	0.871
	Common Sense	0.716	0.702	0.789
Language Understanding	Reading Comprehension	0.671	0.685	0.759
	Question Answering	0.582	0.599	0.678
	Text Classification	0.803	0.811	0.859
	Sentiment Analysis	0.777	0.781	0.831
Generation Tasks	Code Generation	0.615	0.631	0.679
	Creative Writing	0.588	0.579	0.634
	Dialogue Generation	0.621	0.635	0.684
	Summarization	0.745	0.755	0.800
Specialized Capabilities	Translation	0.782	0.799	0.843
	Knowledge Retrieval	0.651	0.668	0.752
	Instruction Following	0.733	0.749	0.835
	Safety Evaluation	0.718	0.701	0.767

3. License

Apache-2.0 License

4. Contact

Open an issue on GitHub.