Hy-MT2 1.8B Β· OpenASR
Hy-MT2 1.8B zhβen translation for OpenASR realtime subtitles β clause-by-clause, fully on-device
Realtime translation pack for the OpenASR runtime β pure-Rust inference, no Python at inference time.
β¨ Highlights
- π Realtime zhβen subtitle translation β drives OpenASR's experimental clause-retranslation lane next to live Chinese ASR
- β‘ Hot-session clause latency in the hundreds of milliseconds β prefix KV-cache reuse keeps warm per-clause retranslation fast on Apple Silicon CPU
- π¦ Byte-identical upstream weights β the Q4_K_M GGUF tensor data is repackaged unmodified at a pinned revision; sha256 and provenance are embedded in the pack
- π§Ύ License + NOTICE embedded β the upstream Apache-2.0 LICENSE.txt and the OpenASR modification NOTICE travel inside the pack metadata
- π― Parity-gated packaging β greedy decode token IDs match llama.cpp on the same prompt before release
- π¦ Native in OpenASR β
.oasrpacks run with no Python at inference, engineered for peak performance on CPU & GPU
π Quickstart
# 1. Install the OpenASR CLI Β· https://openasr.org
# 2. Pull the pack
openasr pull hymt2-1.8b:q4km
Realtime translation is an experimental capability: with this pack installed, realtime sessions can enable clause-by-clause zhβen translation alongside live Chinese ASR transcripts.
π¦ Pack
| Quant | File (.oasr) |
Size |
|---|---|---|
| Q4_K_M | hymt2-1.8b-q4_k_m.oasr |
1.13 GB |
Single q4_k_m build repackaged from the upstream GGUF release. The tensor data is preserved byte-for-byte β no requantization, no weight changes.
π§ About Hy-MT2 1.8B
Hy-MT2 1.8B is Tencent's compact many-to-many translation model from the Hunyuan-MT family.
OpenASR packages the upstream Q4_K_M GGUF release as a local .oasr translation pack and uses it
for the experimental simultaneous-translation MVP: finalized and in-progress Chinese clauses from
realtime ASR are retranslated clause-by-clause into English, with provisional translations
superseded as the source clause stabilizes. The MVP contract is zhβen; the pack runs fully
on-device through OpenASR's pure-Rust GGML runtime with no Python at inference time.
βοΈ How this pack was made
Repackaged from tencent/Hy-MT2-1.8B-GGUF at a pinned revision with the OpenASR importer:
openasr model-pack import-hymt2-gguf-local <src>.gguf <out>.oasr \
--package-id hymt2-1.8b \
--license-file LICENSE.txt --notice-file NOTICE.openasr.txt
The .oasr container is GGUF-backed. The importer verifies the pinned upstream
sha256 and splices provenance, license, and notice metadata into the GGUF KV
section; the model weights are byte-identical to the upstream GGUF artifact.
βοΈ License
This pack inherits the upstream model's license: Apache-2.0
(source). The upstream LICENSE.txt and the OpenASR
modification NOTICE.openasr.txt are embedded in the pack metadata; the only
modification is packaging metadata, not weights.
π Acknowledgements
This pack redistributes tencent/Hy-MT2-1.8B-GGUF in OpenASR's .oasr runtime format. Credit
for the model architecture, training, and original weights belongs to the upstream Tencent Hunyuan
authors. The upstream model is licensed under Apache-2.0; OpenASR packaging retains that
license and attribution. The model weights are byte-identical to the upstream GGUF artifact β
the only modification is packaging metadata, recorded in the embedded NOTICE.openasr.txt.
π Links
- π¦ OpenASR β https://github.com/QuintinShaw/OpenASR
- π Website β https://openasr.org
- π€ Upstream model β tencent/Hy-MT2-1.8B-GGUF