File size: 1,575 Bytes
bf0b8dd b96022f bf0b8dd |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 |
---
thumbnail: https://huggingface.co/Norod78/gpt-fluentui-flat-svg/raw/main/train/sample16.svg
license: mit
library_name: transformers
pipeline_tag: text-generation
widget:
- text: <svg
---
# gpt-fluentui-flat-svg
A custom GPT model which was trained upon svg files.
Specifically the flat emoji variants from [Microsoft's FluentUI repo](https://github.com/microsoft/fluentui-emoji).
These svn file only consist of "stand-alone" path elements which should make it simpler to train upon and sample from.
# training and dataset
Both Tokenizer and Model were trained using [aitextgen](https://docs.aitextgen.io/)
The python file which was used for training, the .txt file dataset and a few generated samples can be found [here](https://huggingface.co/Norod78/gpt-fluentui-flat-svg/tree/main/train)
# post processing and extracting .svg files from generated samples
```
# Extract from generated output and into a seperate .svg file all sequences which starts with <svg and ends with:
# A. </svg>
# B. If the sequence does not end with </svg> then find the last > in the sequence and append </svg> to it
```
# generated samples
The generated samples below were also created with [this script](https://huggingface.co/Norod78/gpt-fluentui-flat-svg/blob/main/train/atg_train.py)
![sample16](https://huggingface.co/Norod78/gpt-fluentui-flat-svg/raw/main/train/sample16.svg)
![sample15](https://huggingface.co/Norod78/gpt-fluentui-flat-svg/raw/main/train/sample15.svg)
![sample14](https://huggingface.co/Norod78/gpt-fluentui-flat-svg/raw/main/train/sample14.svg)
|