Commit History

Add language preference list
62ea1e5

Joshua Lochner commited on

Fix logging messages in predict script
4d4de75

Joshua Lochner commited on

Only consider spoken words when calculating metrics
9f15397

Joshua Lochner commited on

Ensure event duration is non-negative
2439d9a

Joshua Lochner commited on

Remove zero-width spaces from text
884d564

Joshua Lochner commited on

Add support for mute action type and remove videos with full action type
1286fe5

Joshua Lochner commited on

Initialize logging in each script
c4f250e

Joshua Lochner commited on

Do not allow predictions to miss start of video
aa018be

Joshua Lochner commited on

Fix `--no_cuda` argument for preprocessing
87b2dec

Joshua Lochner commited on

Revert model input size back to 512 tokens
721bf64

Joshua Lochner commited on

Fix conflicting `--no_cuda` argument
09cabec

Joshua Lochner commited on

Use correct logger per script
e3d3d3f

Joshua Lochner commited on

Update preprocessing script to use logging module
cfbd4d5

Joshua Lochner commited on

Add `no_cuda` argument to not use GPU
de9c8c4

Joshua Lochner commited on

Remove redundant calls to change device
8981122

Joshua Lochner commited on

Add `output_as_json` argument for inference
52340fc

Joshua Lochner commited on

Adjust tokenizer input size based on model input size
9604abd

Joshua Lochner commited on

Remove unused utilities
0e18e8c

Joshua Lochner commited on

Move `load_datasets` to train script
086ca93

Joshua Lochner commited on

Improve how transcripts are stored and how manual transcripts are segmented
583f4cf

Joshua Lochner commited on

Add boilerplate code to detect whether segment was split due to length
df35612

Joshua Lochner commited on

Revert evaluation script to use `processed_file` by default
8fc746d

Joshua Lochner commited on

Fix segmentation using binary search
de9c264

Joshua Lochner commited on

Add fallback for old transcript version
c445f1a

Joshua Lochner commited on

Fix `num_tokens` key in words
83dc695

Joshua Lochner commited on

Optimize segment generation and extraction
4b4c9f0

Joshua Lochner commited on

Abstract inference code
8b71088

Joshua Lochner commited on

Improve caching and downloading of classifier for predictions
fb87012

Joshua Lochner commited on

Create `ClassifierLoadError`
02e576a

Joshua Lochner commited on

Download classifier and vectorizer if not present
d7a594b

Joshua Lochner commited on

Raise ModelLoadError if model does not exist
dffef09

Joshua Lochner commited on

Update errors
0b7cd5a

Joshua Lochner commited on

Assign default model for predictions
8f0e2d8

Joshua Lochner commited on

Add `do_process_database` option to preprocessing script
d7b6d7f

Joshua Lochner commited on

Improve exceptions thrown while obtaining transcripts
0b48a99

Joshua Lochner commited on

Fix import error
dbf7b4c

Joshua Lochner commited on

Remove unused imports
bdfb4b1

Joshua Lochner commited on

Add support for entering YouTube URL into textbox
94ad7ba

Joshua Lochner commited on

Add `--channel_id` parameter to evaluation script to run evaluation on a channel
537f2b7

Joshua Lochner commited on

Fix the reduction of overlapping segments
2782b0c

Joshua Lochner commited on

Output auto-submission link for missing segments
183ba5e

Joshua Lochner commited on

Improve output of evaluation script
a294fb2

Joshua Lochner commited on

Remove unused code in training script
31d605f

Joshua Lochner commited on

Improve preprocessing and segmentation
b27b0d5

Joshua Lochner commited on

Create `get_model_tokenizer` helper method for loading model and tokenizer
bce5ce9

Joshua Lochner commited on

Move `CATGEGORY_OPTIONS` to shared
fca2a61

Joshua Lochner commited on

Move `seconds_to_time` to shared
bb58e90

Joshua Lochner commited on

Use classifier category if transformer generates unknown category
ad7fc61

Joshua Lochner commited on

Remove redundant count
a6de017

Joshua Lochner commited on

Add compatibility for python 3.6+
7781f10

Joshua Lochner commited on