Instructions to use opsbr/eye-grep-deberta-v3-small with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers.js
How to use opsbr/eye-grep-deberta-v3-small with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('token-classification', 'opsbr/eye-grep-deberta-v3-small');
eye-grep tagger β deberta-v3-small
The accuracy-first tagger for eye-grep, a log colorizer that highlights ids, timestamps, IPs and repeated strings in server logs. It is a token classifier that labels each content token of a log line with one of 11 tags, letting a renderer color site-specific id formats it has never seen before.
Fine-tuned from microsoft/deberta-v3-small (SentencePiece). This is the champion model β highest accuracy, larger download β and the eye-grep CLI loads it by default. For an in-browser build use the smaller, distilled opsbr/eye-grep-electra-small.
Tag schema (11 classes)
PUNCT WORD NUM RAND IP DURATION SIZE TIMESTAMP LEVEL URL PATH
RAND is a high-entropy id (uuid / hash / token); NUM, SIZE, DURATION are
numeric values; TIMESTAMP, IP, URL, PATH, LEVEL are self-explanatory;
WORD/PUNCT are ordinary text.
Files
ONNX in the transformers.js layout β onnx/model.onnx (fp32) and
onnx/model_quantized.onnx (int8, the default) β plus the SentencePiece tokenizer.
Usage
eye-grep CLI (this is the default model):
eye-grep --model opsbr/eye-grep-deberta-v3-small app.log
# private repo β export HF_TOKEN first
transformers.js:
import { AutoTokenizer, AutoModelForTokenClassification } from '@huggingface/transformers';
const tokenizer = await AutoTokenizer.from_pretrained('opsbr/eye-grep-deberta-v3-small');
const model = await AutoModelForTokenClassification.from_pretrained('opsbr/eye-grep-deberta-v3-small');
Training data
Fine-tuned on the synthetic, fully-owned opsbr/eye-grep gold set (Apache-2.0) β deterministically generated, no third-party log data.
Notes
- Int8 dynamic quantization costs only ~0.002 usefulness versus fp32.
- The tokenizer mirrors eye-grep's frozen
train/spec.py; the model's subword predictions are aligned back onto content-token spans at inference time.
- Downloads last month
- 8
Model tree for opsbr/eye-grep-deberta-v3-small
Base model
microsoft/deberta-v3-small