Instructions to use xThr45hx/EmbeddingGemma-300M-Tensor-G4-A17 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- LiteRT
How to use xThr45hx/EmbeddingGemma-300M-Tensor-G4-A17 with LiteRT:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
EmbeddingGemma-300M β Tensor G4 NPU (Android 17 / Beta-SDK recompile)
AOT-compiled EmbeddingGemma-300M for the Google Tensor G4 NPU (darwinn EdgeTPU), built with the official Google Tensor ML SDK (Beta).
β οΈ Android firmware note β why this repo exists. The G4 NPU bytecode (DGC) is compiled against a specific Tensor NPU firmware. Builds compiled against Android 16 firmware fail to load on Android 17 (newer NPU runtime β "Failed to get Darwinn graph" / SB-invocation error). This repo holds an A17-targeted recompile on the current Beta SDK. The older Android 16 build: xThr45hx/EmbeddingGemma-300M-Tensor-G4-NPU.
π§ Status: on-device A17 load verification in progress. This build compiles clean (2265/2265 ops, single partition, DGC0 + rio_a0); confirming it loads + runs on a real Android 17 device is the next step. Provisional until this note is updated.
File
embeddinggemma-300M_seq256_Google_Tensor_G4.tfliteβ seq256 (max 256 tokens in one pass), 768-d output. The efficient RAG workhorse for short chunks/queries. (A seq512 long-form variant may follow.)
How it was compiled
- Input:
embeddinggemma-300M_seq256_mixed-precision.tflitefrom litert-community/embeddinggemma-300m (the plain, non-device-compiled mixed-precision file). - SDK:
ai-edge-litert-nightly+ai-edge-litert-sdk-google-tensor==2.1.5; officialaot_compile(target=[TENSOR_G4]), no flags (mixed-precision path); mandatorygoogle_tensor_backendimport. - Result: 2265 / 2265 ops offloaded to 1 partition (fully fused, no fallback). Output 196,993,056 bytes, markers DGC0 + rio_a0 + tfl3. SHA-256
eec2daf64f07f8cc84a92080c5e2afb00fc6bdf0cb688e00638c0229620b0b4a.
License
Gemma β inherits from EmbeddingGemma. See the base model card for terms.
- Downloads last month
- 31
Model tree for xThr45hx/EmbeddingGemma-300M-Tensor-G4-A17
Base model
litert-community/embeddinggemma-300m