File size: 959 Bytes
5925520
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
language:
- en
tags:
- openvino
---

# MIT/ast-finetuned-speech-commands-v2

This is the [MIT/ast-finetuned-speech-commands-v2](https://huggingface.co/MIT/ast-finetuned-speech-commands-v2) model converted to [OpenVINO](https://openvino.ai), for accellerated inference.

An example of how to do inference on this model:
```python
from optimum.intel.openvino import OVModelForAudioClassification
from transformers import AutoTokenizer, pipeline

# model_id should be set to either a local directory or a model available on the HuggingFace hub.
model_id = "helenai/MIT-ast-finetuned-speech-commands-v2-ov"
feature_extractor = AutoFeatureExtractor.from_pretrained(model_id)
model = OVModelForAudioClassification.from_pretrained(model_id)
pipe = pipeline("audio-classification", model=model, feature_extractor=feature_extractor)
result = pipe("https://datasets-server.huggingface.co/assets/speech_commands/--/v0.01/test/38/audio/audio.mp3")
print(result)
```