ruGPT3xl language model with sparse attention