File size: 1,690 Bytes
0b56ecb
 
 
4afae7b
 
 
 
0b56ecb
4afae7b
ae12a21
 
4afae7b
 
 
 
 
 
f5d2d9e
af63ac0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f5d2d9e
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
---
language:
- en
pipeline_tag: text-generation
tags:
- science fiction
- text generation
---

This model has been fine-tuned on the novels written by H G Wells. H G Wells is a famous author and is well known for his science fiction novels. He is known as the father of science fiction. 

This model can be used to generate text in the style of H G Wells. Since this model has been trained on most novels of the science fiction genre by H G Wells, it produces text of the science fiction genre.

The limitations of this model are that it can only generate text in the style of H G Wells, and not in the style of any other author. It may also be prone to generate text that has the stereotypes that were present at that time. An ethical consideration that needs to be taken into account is that the generated text may have gender biases that were present at the time when H G Wells wrote these novels.

I created my own dataset to train this model. I chose 14 novels written by H G Wells for my dataset. Most of the novels in the dataset are of the genre science fiction. The dataset contains more than 1 million tokens.

The texts included in the corpus are novels written by H G Wells. The novels in the corpus are:

The Time Machine - 37677

In the Days of the Comet - 95299

The Food of the Gods - 90723

Tales of Space and Time - 85850

The World Set Free - 74971

The War of the Worlds - 69530

The First Men in the Moon - 81517

The Invisible Man - 60581

The Island of Doctor Moreau - 52073

The Sleeper Awakes - 91274

The War in the Air - 115573

The Research Magnificient - 131866

The Udying Fire - 52036

The Red Room - 4618

The total number of tokens in the corpus is 1043588