Add pipeline tag, library name and link to paper

#2
by nielsr HF staff - opened
Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -1,8 +1,12 @@
1
  ---
2
  license: apache-2.0
 
 
3
  ---
 
4
  # Step-Audio-Tokenizer
5
 
 
6
 
7
  Step-Audio LLM is the industry’s first 130-billion parameter hu-manlike unified end-to-end model that integrates multimodal speech un-derstanding and generation capabilities, including singing voice synthesis, tool utilization, role-play and multilingual/dialectal comprehension and synthesis.
8
 
 
1
  ---
2
  license: apache-2.0
3
+ library_name: funasr
4
+ pipeline_tag: feature-extraction
5
  ---
6
+
7
  # Step-Audio-Tokenizer
8
 
9
+ This repository contains the tokenizer model described in the paper [Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction](https://arxiv.org/abs/2502.11946).
10
 
11
  Step-Audio LLM is the industry’s first 130-billion parameter hu-manlike unified end-to-end model that integrates multimodal speech un-derstanding and generation capabilities, including singing voice synthesis, tool utilization, role-play and multilingual/dialectal comprehension and synthesis.
12