Narratives in LLM Pretraining Data Collection Models & datasets from Characterizing Narrative Content in Web-Scale LLM Pretraining Data (NarraDolma & NarraBERT) • 7 items • Updated 1 day ago • 2
Characterizing Narrative Content in Web-scale LLM Pretraining Data Paper • 2606.19468 • Published 4 days ago • 1
Narratives in LLM Pretraining Data Collection Models & datasets from Characterizing Narrative Content in Web-Scale LLM Pretraining Data (NarraDolma & NarraBERT) • 7 items • Updated 1 day ago • 2
Narratives in LLM Pretraining Data Collection Models & datasets from Characterizing Narrative Content in Web-Scale LLM Pretraining Data (NarraDolma & NarraBERT) • 7 items • Updated 1 day ago • 2
Narratives in LLM Pretraining Data Collection Models & datasets from Characterizing Narrative Content in Web-Scale LLM Pretraining Data (NarraDolma & NarraBERT) • 7 items • Updated 1 day ago • 2
Narratives in LLM Pretraining Data Collection Models & datasets from Characterizing Narrative Content in Web-Scale LLM Pretraining Data (NarraDolma & NarraBERT) • 7 items • Updated 1 day ago • 2