topic_modelling/funcs/bertopic_vis_documents.py

Commit History

Can split passages into sentences. Improved embedding, LLM representation models, improved zero-shot capabilities.
55f0ce3

seanpedrickcase committed on

Reduce outliers is now more efficient and relabels with the correct vectoriser. Default topic labels are now tidier. Hierarchical topics outputs are more useful for joining to df afterwards. Switched the low-resource reduction algorithm to UMAP, as the default is not good.
e1c1f68

Sonnyjim committed on

Allowed for loading in external topic labels. A few visualisation modifications.
b27bab2

Sonnyjim committed on

Lots of general fixes. New visualisations, fixed hierarchical vis for zero shot. Added calc all probabilities.
b4510a6

Sonnyjim committed on

Split off LLM representation, visualisation, and reduce outliers from main function. Added hierarchical visualisation and logs.
5d87c3c

Sonnyjim committed on

More efficient embeddings save and representations load/process. Added custom visualisation hover option, formatting improvements. Version 0.1?
ffe5eb2

Sonnyjim committed on