Delete test_outputs
- test_outputs/README_tests.md +0 -33
- test_outputs/fp16/fp16_duration_predictor.fp16_out0.npy +0 -3
- test_outputs/fp16/fp16_duration_predictor.fp16_out0.wav +0 -0
- test_outputs/fp16/fp16_text_encoder.fp16_out0.npy +0 -3
- test_outputs/fp16/fp16_vocoder.fp16_out0.wav +0 -0
- test_outputs/int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy +0 -3
- test_outputs/int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy +0 -3
- test_outputs/int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy +0 -3
- test_outputs/int8_dynamic/int8_dynamic_vocoder.int8_out0.wav +0 -0
test_outputs/README_tests.md
DELETED
@@ -1,33 +0,0 @@
-# Supertonic Quantized – Test Outputs
-
-This folder was generated automatically by a Colab script that:
-
-1. Downloaded the Hugging Face repo **Shadow0482/supertonic-quantized**
-2. Located all `*.onnx` models (both `fp16/` and `int8_dynamic/`)
-3. Ran each model once with dummy inputs using ONNX Runtime
-4. Saved:
-   - `.wav` files for audio-like tensors (1D or 2D, 1–2 channels, >=16 samples)
-   - `.npy` files for all other outputs
-
-All paths below are relative to the `test_outputs/` directory.
-
-## Per-model results
-
-- `fp16/duration_predictor.fp16.onnx -> fp16/fp16_duration_predictor.fp16_out0.npy`
-- `fp16/text_encoder.fp16.onnx -> fp16/fp16_text_encoder.fp16_out0.npy`
-- `fp16/vector_estimator.fp16.onnx -> FAILED`
-- `fp16/vocoder.fp16.onnx -> fp16/fp16_vocoder.fp16_out0.wav`
-- `int8_dynamic/duration_predictor.int8.onnx -> int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy`
-- `int8_dynamic/text_encoder.int8.onnx -> int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy`
-- `int8_dynamic/vector_estimator.int8.onnx -> int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy`
-- `int8_dynamic/vocoder.int8.onnx -> int8_dynamic/int8_dynamic_vocoder.int8_out0.wav`
-
-## Models that failed to load / run
-
-- `fp16/vector_estimator.fp16.onnx` -> **FAILED**: [ONNXRuntimeError] : 1 : FAIL : Load model from /content/supertonic/supertonic_quantized/fp16/vector_estimator.fp16.onnx failed:Type Error: Type (tensor(float16)) of output arg (/vector_field/main_blocks.3/attn/Cast_output_0) of node (/vector_field/main_blocks.3/attn/Cast) does not match expected type (tensor(float)).
-
-
-> Note:
-> These tests use synthetic dummy inputs. They confirm that the
-> quantized ONNX graphs load and execute, but they are **not**
-> a replacement for real end-to-end TTS quality evaluation.
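
The save rule in step 4 of the deleted README and the file names in its per-model list can be sketched roughly as below. This is only an illustration, not the Colab script itself, which is not part of this commit. The function names `is_audio_like` and `output_path` are hypothetical, treating the smaller axis of a 2D tensor as the channel axis is an assumption, and the tensor shapes in the usage lines are made up.

```python
from pathlib import PurePosixPath


def is_audio_like(shape, min_samples=16, max_channels=2):
    """Return True if a tensor shape matches the README's "audio-like" rule."""
    if len(shape) == 1:
        # 1D tensor: treat it as a single channel of samples.
        return shape[0] >= min_samples
    if len(shape) == 2:
        # Assumption: the smaller axis is the channel axis.
        channels, samples = sorted(shape)
        return 1 <= channels <= max_channels and samples >= min_samples
    # Anything with more than two dimensions is saved as .npy.
    return False


def output_path(model_path, index, shape):
    """Build an output path that follows the pattern in the per-model list."""
    model = PurePosixPath(model_path)
    variant = model.parent.name  # e.g. "fp16" or "int8_dynamic"
    stem = model.stem            # e.g. "vocoder.fp16" (drops only ".onnx")
    ext = "wav" if is_audio_like(shape) else "npy"
    return f"{variant}/{variant}_{stem}_out{index}.{ext}"


# Hypothetical shapes, chosen only to exercise both branches.
print(output_path("fp16/vocoder.fp16.onnx", 0, (1, 31000)))
print(output_path("int8_dynamic/duration_predictor.int8.onnx", 0, (1,)))
```

Working from shapes alone keeps the sketch independent of NumPy and ONNX Runtime. A real script would read each shape from the array returned by the inference session.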
test_outputs/fp16/fp16_duration_predictor.fp16_out0.npy
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:adae1157c6f7966fff1b160279a438f80e8121d75fb1ae7fd23929f75ce7bdf1
-size 132
test_outputs/fp16/fp16_duration_predictor.fp16_out0.wav
DELETED
Binary file (46 Bytes)
test_outputs/fp16/fp16_text_encoder.fp16_out0.npy
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:ee27b7abb5e574f8eb582405d7a0ba92b526420d3bca2e0b89e3394409b73879
-size 10368
test_outputs/fp16/fp16_vocoder.fp16_out0.wav
DELETED
Binary file (61.5 kB)
test_outputs/int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:584b0425a13c8ee9a35808a806fb4bd86264bda3ed799425caa7c9644e5a79dc
-size 132
test_outputs/int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:ee27b7abb5e574f8eb582405d7a0ba92b526420d3bca2e0b89e3394409b73879
-size 10368
test_outputs/int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5856cde54e8cc4b4dff2f218d3cc8834aaa204a38b96ebf0b168ea54ce010040
-size 5888
test_outputs/int8_dynamic/int8_dynamic_vocoder.int8_out0.wav
DELETED
Binary file (61.5 kB)