EvoEmbedding: Evolvable Embedding for Long-Context Retrieval

πŸ”— GitHub Repository | πŸ“š Training Dataset | πŸ“‘ Paper (Coming Soon)

EvoEmbedding is a novel embedding model designed for long-context and dynamic retrieval scenarios. Unlike static embedding models that chunk text in isolation, EvoEmbedding maintains a continuously updated Latent Memory Queue. This allows it to capture temporal dynamics and generate context-aware, evolvable embeddings for precise retrieval in agentic workflows and long-conversations.

πŸ“¦ Model Family

We provide EvoEmbedding in three sizes based on the Qwen architecture:

Model Parameters Base Model Hugging Face Link
EvoEmbedding-0.8B 0.8B Qwen3.5-0.8B ClareNie/EvoEmbedding-0.8B
EvoEmbedding-2B 2B Qwen3.5-2B ClareNie/EvoEmbedding-2B
EvoEmbedding-4B 4B Qwen3-4B ClareNie/EvoEmbedding-4B

πŸ“š Citation

If you find this model or our methodology useful, please cite our paper:

@article{nie2026evoembedding,
  title={Evolvable Embedding for Long-Context Retrieval},
  author={Nie, Chang and Fu, Chaoyou and Shan, Caifeng},
  journal={arXiv preprint},
  year={2026}
}
Downloads last month
-
Safetensors
Model size
5B params
Tensor type
F32
Β·
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support