Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
stefan-it
/
xlstm-german-wikipedia
like
7
Text Generation
Transformers
Safetensors
German
xlstm
custom_code
License:
cc-by-sa-3.0
Model card
Files
Files and versions
Community
Train
Use this model
ee8442e
xlstm-german-wikipedia
1 contributor
History:
36 commits
stefan-it
readme: fix reference to used dataset
ee8442e
verified
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
Safe
4.12 kB
readme: fix reference to used dataset
2 months ago
brat-logo.png
Safe
57.8 kB
figure: add some new logo :p
3 months ago
config.json
Safe
730 Bytes
config: add mapping for AutoModelForSequenceClassification to own xLSTMForSequenceClassification
3 months ago
configuration_xlstm.py
Safe
3.08 kB
xlstm-config: temporarily introduce new hidden_size parameter
3 months ago
generation_config.json
Safe
69 Bytes
model: add generation confgi
3 months ago
model.safetensors
Safe
445 MB
LFS
model: add re-trained xLSTM model with grouped corpus for pretraining
3 months ago
modeling_xlstm.py
Safe
9.85 kB
modeling: sync xLSTMForSequenceClassification with Patrick's codebase from https://github.com/HallerPatrick/helibrunna/blob/a1b377271867d5f23201ccacb55e017749aba487/model/modeling_xlstm.py
3 months ago
special_tokens_map.json
Safe
551 Bytes
tokenizer: add config and vocab
3 months ago
tokenizer.json
Safe
1.84 MB
tokenizer: add config and vocab
3 months ago
tokenizer_config.json
Safe
957 Bytes
tokenizer: add config and vocab
3 months ago
training-loss.png
Safe
133 kB
figure: add re-trained loss curve for training
3 months ago