AscendKernelGen/KernelGen-LM-32B



AscendKernelGen committed on Mar 13

Commit 20d682c · verified · 1 Parent(s): 2a6428e

Update README.md

Files changed (1)

README.md +3 -3


@@ -7,14 +7,14 @@ language:
 # AscendKernelGen/KernelGen-LM-32B
 ![License](https://img.shields.io/badge/License-Apache-yellow)
-[![arXiv](https://img.shields.io/badge/arXiv-2601.07160-b31b1b.svg)](https://arxiv.org/abs/2601.07160)
 KernelGen-LM-32B is a state-of-the-art domain-adaptive large language model specialized for low-level NPU kernel generation, specifically for the Huawei Ascend architecture using the AscendC programming language. Built upon the Qwen3-32B backbone, it is trained on the Ascend-CoT dataset and refined via reinforcement learning with execution feedback. It achieves unprecedented success rates in generating complex, functional hardware kernels, improving compilation success on L2 tasks from 0% (baseline) to 96.5% (Pass@10), while functional correctness achieves
 40.5% compared to the baseline’s complete failure.
-**Other artifacts:**
 * The **AscendKernelGen Technical Report** is published at https://arxiv.org/abs/2601.07160.
-* The **NPUKernelBench** evaluation framework is published at https://git.openi.org.cn/PCL-Benchmark/NPUKernelBench.
 ## Introduction

 # AscendKernelGen/KernelGen-LM-32B
 ![License](https://img.shields.io/badge/License-Apache-yellow)
+<!-- [![arXiv](https://img.shields.io/badge/arXiv-2601.07160-b31b1b.svg)](https://arxiv.org/abs/2601.07160) -->
 KernelGen-LM-32B is a state-of-the-art domain-adaptive large language model specialized for low-level NPU kernel generation, specifically for the Huawei Ascend architecture using the AscendC programming language. Built upon the Qwen3-32B backbone, it is trained on the Ascend-CoT dataset and refined via reinforcement learning with execution feedback. It achieves unprecedented success rates in generating complex, functional hardware kernels, improving compilation success on L2 tasks from 0% (baseline) to 96.5% (Pass@10), while functional correctness achieves
 40.5% compared to the baseline’s complete failure.
+<!-- **Other artifacts:**
 * The **AscendKernelGen Technical Report** is published at https://arxiv.org/abs/2601.07160.
+* The **NPUKernelBench** evaluation framework is published at https://git.openi.org.cn/PCL-Benchmark/NPUKernelBench. -->
 ## Introduction