---
library_name: transformers
tags:
- cryptology
- cipher
datasets:
- asimokby/Turkish-OSCAR-GEC
language:
- tr
base_model:
- google-t5/t5-base
license: apache-2.0
---

This project contains a text-to-text model that decrypts Turkish text encrypted with a monoalphabetic substitution cipher.
In such a cipher, each plaintext letter is replaced by a fixed, unique ciphertext letter.
The model relies on the statistical and linguistic properties of Turkish to infer the letter substitutions
and recover the original plaintext.

It takes the ciphertext as input and outputs the decoded text.
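
To make the cipher concrete, here is a minimal sketch of how a monoalphabetic substitution could be applied to Turkish text. The alphabet (the 29 Turkish letters plus `q`, `w`, `x`, which appear in the example ciphertext on this card) and the helper names `make_key` and `build_tables` are assumptions for illustration; this is not necessarily how the training data was generated.

```python
import random

# Assumed alphabet: the 29 Turkish letters plus q, w, x (hypothetical choice).
# LOWER and UPPER are aligned position by position (ı/I and i/İ are separate pairs).
LOWER = "abcçdefgğhıijklmnoöpqrsştuüvwxyz"
UPPER = "ABCÇDEFGĞHIİJKLMNOÖPQRSŞTUÜVWXYZ"


def make_key(seed=None):
    """Return a random permutation of alphabet positions (the cipher key)."""
    rng = random.Random(seed)
    perm = list(range(len(LOWER)))
    rng.shuffle(perm)
    return perm


def build_tables(perm):
    """Build translation tables for encryption and decryption from a key."""
    shuffled_lower = "".join(LOWER[i] for i in perm)
    shuffled_upper = "".join(UPPER[i] for i in perm)
    enc = str.maketrans(LOWER + UPPER, shuffled_lower + shuffled_upper)
    dec = str.maketrans(shuffled_lower + shuffled_upper, LOWER + UPPER)
    return enc, dec


key = make_key(seed=0)
enc_table, dec_table = build_tables(key)

plain = "Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum."
cipher = plain.translate(enc_table)      # letters substituted, case kept
recovered = cipher.translate(dec_table)  # inverse mapping restores the text
```

Characters outside the alphabet (spaces, punctuation, digits) pass through unchanged, so word boundaries stay visible in the ciphertext.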

Example:

Encoded text: 
**Wn cqwqp fqpöqgbatqj ckjxq çmxmçmvy rafhqça zgqjgaçkxnh.**

Decoded text: 
**Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum.**
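
Because a monoalphabetic cipher maps letters one to one, a decoded output can be sanity-checked against its ciphertext. The sketch below is an illustrative check, not part of the model; `tr_lower` and `infer_mapping` are hypothetical helper names. It derives the cipher-to-plain letter mapping from a pair of strings and returns `None` if the pair is inconsistent with a single substitution.

```python
def tr_lower(text):
    """Turkish-aware lowercasing: handle I/İ before the generic lower()."""
    return text.replace("I", "ı").replace("İ", "i").lower()


def infer_mapping(cipher_text, decoded_text):
    """Return the cipher->plain letter mapping, or None if the pair
    cannot come from a single one-to-one substitution."""
    c, d = tr_lower(cipher_text), tr_lower(decoded_text)
    if len(c) != len(d):
        return None
    forward, backward = {}, {}
    for a, b in zip(c, d):
        if a.isalpha() != b.isalpha():
            return None  # a letter cannot map to a non-letter
        if not a.isalpha():
            if a != b:
                return None  # spaces and punctuation must be unchanged
            continue
        # Each cipher letter maps to one plain letter, and vice versa.
        if forward.setdefault(a, b) != b or backward.setdefault(b, a) != a:
            return None
    return forward


cipher = "Wn cqwqp fqpöqgbatqj ckjxq çmxmçmvy rafhqça zgqjgaçkxnh."
decoded = "Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum."
mapping = infer_mapping(cipher, decoded)
```

A `None` result suggests the model changed the length of the text or broke the one-to-one letter correspondence somewhere in the output.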

Suggested Usage:
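```py
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Load the model and tokenizer (replace the placeholder with this model's Hub repo ID)
model_id = "<this-model-repo-id>"
device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id).to(device)
cipher_text = ""  # Encoded text here!
inputs = tokenizer(cipher_text, return_tensors="pt", padding=True, truncation=True, max_length=256).to(device)
outputs = model.generate(inputs["input_ids"], attention_mask=inputs["attention_mask"], max_length=256)
decoded_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
```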