Commit History

Fix logging messages in predict script
4d4de75

Joshua Lochner commited on

Only consider spoken words when calculating metrics
9f15397

Joshua Lochner commited on

Ensure event duration is non-negative
2439d9a

Joshua Lochner commited on

Remove zero-width spaces from text
884d564

Joshua Lochner commited on

Fix classifier train command
90d506c

Joshua Lochner commited on

Add support for mute action type and remove videos with full action type
1286fe5

Joshua Lochner commited on

Initialize logging in each script
c4f250e

Joshua Lochner commited on

Do not allow predictions to miss start of video
aa018be

Joshua Lochner commited on

Fix `--no_cuda` argument for preprocessing
87b2dec

Joshua Lochner commited on

Revert model input size back to 512 tokens
721bf64

Joshua Lochner commited on

Fix conflicting `--no_cuda` argument
09cabec

Joshua Lochner commited on

Use correct logger per script
e3d3d3f

Joshua Lochner commited on

Update preprocessing script to use logging module
cfbd4d5

Joshua Lochner commited on

Add `no_cuda` argument to not use GPU
de9c8c4

Joshua Lochner commited on

Update README to include installation instructions
776c8b2

Joshua Lochner commited on

Fix button colour on dark theme
921fb1d

Joshua Lochner commited on

Remove redundant calls to change device
8981122

Joshua Lochner commited on

Add `output_as_json` argument for inference
52340fc

Joshua Lochner commited on

Adjust tokenizer input size based on model input size
9604abd

Joshua Lochner commited on

Fix typo in prediction command
39f6f81

Joshua Lochner commited on

Add transcript option to streamlit app and visual improvements
8a55e13

Joshua Lochner commited on

Show message if predictions returned, but all ignored due to filters/settings
8326048

Joshua Lochner commited on

Update README.md
bfb080b

Joshua Lochner commited on

Remove unused utilities
0e18e8c

Joshua Lochner commited on

Move `load_datasets` to train script
086ca93

Joshua Lochner commited on

Improve how transcripts are stored and how manual transcripts are segmented
583f4cf

Joshua Lochner commited on

Add boilerplate code to detect whether segment was split due to length
df35612

Joshua Lochner commited on

Revert evaluation script to use `processed_file` by default
8fc746d

Joshua Lochner commited on

Fix segmentation using binary search
de9c264

Joshua Lochner commited on

Add fallback for old transcript version
c445f1a

Joshua Lochner commited on

Fix `num_tokens` key in words
83dc695

Joshua Lochner commited on

Optimize segment generation and extraction
4b4c9f0

Joshua Lochner commited on

Abstract inference code
8b71088

Joshua Lochner commited on

Remove duplicated methods from streamlit app
a9123fa

Joshua Lochner commited on

Improve caching and downloading of classifier for predictions
fb87012

Joshua Lochner commited on

Create `ClassifierLoadError`
02e576a

Joshua Lochner commited on

Download classifier and vectorizer if not present
d7a594b

Joshua Lochner commited on

Raise ModelLoadError if model does not exist
dffef09

Joshua Lochner commited on

Update errors
0b7cd5a

Joshua Lochner commited on

Assign default model for predictions
8f0e2d8

Joshua Lochner commited on

Create LICENSE
c78435b

Joshua Lochner commited on

Update README.md
0e3177b

Joshua Lochner commited on

Create FUNDING.yml
4f980b5
unverified

Joshua Lochner commited on

Add `do_process_database` option to preprocessing script
d7b6d7f

Joshua Lochner commited on

Use `get_model_tokenizer` method from streamlit app
9a5d9ed

Joshua Lochner commited on

Hide previous output on run
e926596

Joshua Lochner commited on

Improve exceptions thrown while obtaining transcripts
0b48a99

Joshua Lochner commited on

Fix import error
dbf7b4c

Joshua Lochner commited on

Remove unused imports
bdfb4b1

Joshua Lochner commited on

Fix YouTube ID regex
df05196

Joshua Lochner commited on