---
license: cc-by-sa-4.0
language:
- en
library_name: transformers
pipeline_tag: text-generation
---
<div style="text-align: right;font-size:.7em;margin-left:50%"><em>Per un attimo Brahe cercò le parole, le immagini, le analogie; pensò perfino i gesti della mano e delle dita, come un attore che si prepari a rendere fisico un sentimento. Ma appena cominciò a dire "come", a dare solidità a ciò che non aveva, a rendere visibile ciò che non lo era, a collocare, nello spazio ciò che era pura probabilità, e a cercare una qualsiasi cosa tra le forme del mondo cui paragonarlo, Epstein lo interruppe.</em><br>Daniele del Giudice, <em>Atlante occidentale</em></div>
*Brahe* is an analytical LLM for multilingual literature, fine-tuned from llama-13B. Given any text, Brahe generates a list of up to roughly twenty annotations. Brahe is intended for use in computational humanities projects, in a role similar to BookNLP.
*Brahe* has been trained on 8,000 excerpts of literature in the public domain and on a set of synthetic and manual annotations. Half of the excerpts are in English and half in other languages (mostly French, German, Italian…).
Thanks to the native multilingual capacity of llama-13B, *Brahe* has been shown to work on languages that were not part of its training corpus, such as the Gascon variety of Occitan.
*Brahe* is the reverse counterpart of *[Epstein](https://huggingface.co/Pclanglais/epstein)*, a generative AI model that creates new literary texts from annotated prompts (English only for now). Both models are named after the protagonists of Daniele del Giudice's philosophical novel *Atlante occidentale*. Brahe is a scientist working at CERN on quantum physics and Epstein is a novelist; the two confront their different views of reality.
## Annotations
In its current version, *Brahe* may generate the following annotations.
* Summary: short summary
* Tone: general tonality of the text (humorous, tragic, scholarly…)
* Speech standard: the specific social/literary level of the text (poetic, dialectal, vulgar…)
* Intertextuality: non-literary writing forms that may be similar to this text (red tape, scientific article, case law…)
* Genre: a specific literary genre that would be used in bookshops such as detective fiction, science-fiction, romance, historical novel, young adult…
* Literary movement: aesthetic movement the text seems to embody (does not work so well)
* Literary form: whether it's the description of a place, a conversation, a stream of consciousness
* Trope: a trope or literary cliché (a fuzzy definition but works surprisingly well)
* Enunciation: who is speaking in the text (first-person narrative, dialogue, third-person narrative, omniscient narrator)
* Narrative arc: how is the action unfolding (suspense, dramatic tension, comic relief…)
* Active character: the list of characters that have an active involvement in the story
* Mentioned characters: the list of characters only mentioned, with no active involvement in the story
* Quoted works: another text mentioned or quoted in the text.
* Absolute place: a precise place with a proper name such as Paris, Sesame Street, Lisbon Airport.
* Fuzzy place: unnamed place where the story happens such as a field, an apartment, a church (does not work so well…)
* Fuzzy time: a nonspecific moment when the action occurs, such as Monday, yesterday, a week after.
* Time setting: historical period where the action seems to occur such as the 1960s, the Renaissance, the Victorian period…
* Diegetic time: very approximate number of minutes/hours/days that have elapsed between the beginning and the end of the text (5 minutes, 35 minutes, 2 hours, 3 days).
* Absolute time: a precise date when the action occurs, such as January 15, 1845, 23rd century…
The annotations are not generated systematically but only whenever the model is confident enough.
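Because any annotation may be absent, downstream code should not assume a fixed set of fields. The following is a minimal sketch of how the output could be read into a dictionary, assuming the raw model output has one `Label: value` pair per line, as in the examples shown in this card. The function name `parse_annotations` is hypothetical and not part of any released tooling.

```python
import re


def parse_annotations(raw: str) -> dict[str, str]:
    """Read 'Label: value' lines into a dict, ignoring lines without a label."""
    annotations: dict[str, str] = {}
    for line in raw.splitlines():
        # Drop a leading Markdown quote marker and a trailing <br>, if any.
        line = line.strip().lstrip(">").strip()
        line = re.sub(r"<br\s*/?>\s*$", "", line)
        # Split on the first colon only, so colons inside the value survive.
        label, sep, value = line.partition(":")
        if sep and label.strip() and value.strip():
            annotations[label.strip()] = value.strip()
    return annotations


sample = "Tone: Anticipatory\nGenre: Literary fiction\nFuzzy time: Evening"
parsed = parse_annotations(sample)
# Use .get() with a default, since a label may simply not have been generated.
print(parsed.get("Tone"))
print(parsed.get("Absolute time", "not generated"))
```

Reading optional fields with `.get()` keeps the calling code valid whether the model emits twelve annotations or all of them.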
## Examples
With the incipit of Henry James' *The Ambassadors*:
>*Strether's first question, when he reached the hotel, was about his friend; yet on his learning that Waymarsh was apparently not to arrive till evening he was not wholly disconcerted. A telegram from him bespeaking a room "only if not noisy," with the answer paid, was produced for the inquirer at the office, so that the understanding that they should meet at Chester rather than at Liverpool remained to that extent sound. The same secret principle, however, that had prompted Strether not absolutely to desire Waymarsh's presence at the dock, that had led him thus to postpone for a few hours his enjoyment of it, now operated to make him feel that he could still wait without disappointment. They would dine together at the worst, and, with all respect to dear old Waymarsh—if not even, for that matter, to himself—there was little fear that in the sequel they should not see enough of each other. The principle I have just mentioned as operating had been, with the most newly-disembarked of the two men, wholly instinctive—the fruit of a sharp sense that, delightful as it would be to find himself looking, after so much separation, into his comrade's face, his business would be a trifle bungled should he simply arrange that this countenance should present itself to the nearing steamer as the first "note," for him, of Europe. Mixed with everything was the apprehension, already, on Strether's part, that he would, at best, throughout, prove the note of Europe in quite a sufficient degree.*
Output of Brahe:
> Summary: Strether is waiting for his friend Waymarsh to arrive at the hotel, but is not completely disappointed when he learns that Waymarsh will not arrive until evening. Strether has a secret principle that led him to postpone meeting Waymarsh at the dock, but now feels that they will see enough of each other. Strether is concerned that his presence in Europe will be enough of a note for his friend.<br>
> Trope: The anticipation of meeting a friend<br>
> Narrative arc: Suspense<br>
> Enunciation: Third-person narrative<br>
> Tone: Anticipatory<br>
> Genre: Literary fiction<br>
> Intertextuality: Travelogue<br>
> Speech standard: Standard<br>
> Literary form: Stream of consciousness<br>
> Literary movement: Realism<br>
> Active character: Strether, Waymarsh<br>
> Time setting: Contemporary<br>
> Fuzzy time: Evening<br>
> Diegetic time: A few hours<br>
> Absolute place: The hotel<br>
> Fuzzy place: The hotel
With a passage from Proust's *Swann's Way*:
> Combray at a distance, from a twenty-mile radius, as we used to see it from the railway when we arrived there every year in Holy Week, was no more than a church epitomising the town, representing it, speaking of it and for it to the horizon, and as one drew near, gathering close about its long, dark cloak, sheltering from the wind, on the open plain, as a shepherd gathers his sheep, the woolly grey backs of its flocking houses, which a fragment of its mediaeval ramparts enclosed, here and there, in an outline as scrupulously circular as that of a little town in a primitive painting. To live in, Combray was a trifle depressing, like its streets, whose houses, built of the blackened stone of the country, fronted with outside steps, capped with gables which projected long shadows downwards, were so dark that one had, as soon as the sun began to go down, to draw back the curtains in the sitting-room windows; streets with the solemn names of Saints, not a few of whom figured in the history of the early lords of Combray, such as the Rue Saint-Hilaire, the Rue Saint-Jacques, in which my aunt's house stood, the Rue Sainte-Hildegarde, which ran past her railings, and the Rue du Saint-Esprit, on to which the little garden gate opened; and these Combray streets exist in so remote a quarter of my memory, painted in colours so different from those in which the world is decked for me to-day, that in fact one and all of them, and the church which towered above them in the Square, seem to me now more unsubstantial than the projections of my magic-lantern; while at times I feel that to be able to cross the Rue Saint-Hilaire again, to engage a room in the Rue de l'Oiseau, in the old hostelry of the Oiseau Flesché, from whose windows in the pavement used to rise a smell of cooking which rises still in my mind, now and then, in the same warm gusts of comfort, would be to secure a contact with the unseen world more marvellously supernatural than it would be to make Golo's 
acquaintance and to chat with Geneviève de Brabant.
Output of Brahe:
> Summary: The narrator reflects on their childhood memories of Combray, a town with dark and depressing streets.<br>
> Trope: The town as a character<br>
> Narrative arc: Reflective<br>
> Enunciation: First-person narrative<br>
> Tone: Nostalgic<br>
> Genre: Memoir<br>
> Intertextuality: Historical accounts of Combray<br>
> Speech standard: Poetic<br>
> Literary form: Description of a place<br>
> Literary movement: Symbolism<br>
> Active character: The narrator, the aunt<br>
> Time setting: Childhood<br>
> Absolute place: Combray<br>
> Fuzzy place: Combray
With the incipit of *Nueit de junh* by Joan Francés Blanc, written in the Gascon dialect of Occitan (a language that is not part of *Brahe*'s training corpus):
> Que voleré morí's. Non pas tant entà pèrder la vita, l'alen, tà deishà's escapar l'arma. Mes tà disaparéisher. Tà desbrembà's l'amor vadut aquiu, e tant viste perdut. Que voleré cercar solaç dens quauque ostau amic. Mes l'amic, lo sol, qui n'avèn pas hrèita de parlà's tà compréne's, que moriscó. E adara que vira a casa las pajas deus líbers shens legi'us. Que minja shens trobar lo gost ni brembà's lo nom deus plats. Que's passeja shens véder las caras deu monde qui tròba suu camin. No's bremba pas tanpauc deu tribalh, qui'u hè passar totun bèths moments en tot ocupar l'esperit a causas bestiaments materiaus : quin empachar los eslhèves de dromí's. Quin despertar la soa intelligéncia, que n'an ua, mes dens aqueth vegarau de París, n'ei pas aquò qui ajuda a vàder respectat... Òras e òras passadas (pergudas ?) en talhèrs de pedagogia. Sajar de dar idèas, de préner iniciativas, entà cargar un pauc mei l'aso de tribalh. Tà qui posqui pas tròp pensar a l'ivèrn d'avants. Decémer. Lo hred que cadè d'un còp sus la vila. Que passè de dètz degrès a mensh cinc dens la nueit. Que vedó lo lendoman lo mond qui's torran qui s'an desbrembat d'escotar la ràdio. Vita shens nat chepic que lo tribalh quotidian. Puish lo telefòne, dens la nueit, d'islas de luenh, tà díser la mort de l'amic. Que partiva tau tribalh, dens l'autò crompada dab quauques milèrs de quilomètres. E los travaders qui ne'u vòlen pas créder. Lo camion en fàcia. Lo néguer. La hemna qui quitè de vàder hòla. Òras au telefòne. Amics mei disponibles, qui pòden deishar lo tribalh tà gahar lo purmèr avion. Nau òras dinc a Pointe-à-Pitre. E eth, esperar lo ser, cada ser, avants o après lo telefòne, tà plorà's. Shens masclor. O dab la masclor deu gojat qui plorèva dab la casa sancèra aus enterraments. Preparar lo viatge. Esperar un mes tàd'aquò. Aver lo temps de crompar un disc de Lluís Llach. Gira la roda del temps... i si no fós absurd el mes enllà... Que pensa a la soa mort. 
Non se vei víver shens aquera preséncia de l'amic. Qu'èra partit luenh, sovent. Lille, a bisa. E après, qu'èra partit eth, tà París. Mes adara, qu'ei un partir definitiu. A maugrat la fe.
Output of Brahe:
> Summary: The text is a series of thoughts and reflections on death, friendship, and the passage of time.<br>
> Trope: The passage of time, the fleeting nature of life<br>
> Narrative arc: Reflective, introspective<br>
> Enunciation: First-person narrative<br>
> Tone: Reflective, melancholic<br>
> Genre: Poetry<br>
> Intertextuality: Poetry<br>
> Speech standard: Poetic, elevated language<br>
> Literary form: Stream of consciousness<br>
> Literary movement: Existentialism<br>
> Time setting: Contemporary<br>
> Fuzzy place: Unnamed village, Paris, Pointe-à-Pitre
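The three outputs above do not contain the same set of annotations, which illustrates that the model only emits a label when it is confident enough. As a small sketch, the labels observed in each example can be compared against their union. The label sets below are transcribed from the outputs in this card; the helper name `missing_labels` is hypothetical, and "missing" here means missing relative to these three examples only, not to the full annotation list.

```python
# Labels that appear in all three example outputs.
COMMON = {
    "Summary", "Trope", "Narrative arc", "Enunciation", "Tone", "Genre",
    "Intertextuality", "Speech standard", "Literary form",
    "Literary movement", "Time setting", "Fuzzy place",
}

# Labels emitted per example, transcribed from the outputs in this card.
emitted = {
    "The Ambassadors": COMMON | {
        "Active character", "Fuzzy time", "Diegetic time", "Absolute place",
    },
    "Swann's Way": COMMON | {"Active character", "Absolute place"},
    "Nueit de junh": set(COMMON),
}


def missing_labels(emitted: dict[str, set[str]]) -> dict[str, list[str]]:
    """For each text, list the labels seen elsewhere but not emitted for it."""
    all_labels = set().union(*emitted.values())
    return {
        title: sorted(all_labels - labels)
        for title, labels in emitted.items()
    }


for title, missing in missing_labels(emitted).items():
    print(f"{title}: {len(emitted[title])} annotations, missing: {missing}")
```

A coverage check of this kind can help decide which annotations are reliable enough to use as features across a whole corpus.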