Afonso Marques
commited on
add demo to readme + small improvements
Browse files
README.md
CHANGED
|
@@ -2,9 +2,30 @@
|
|
| 2 |
|
| 3 |
A multilingual automatic speech recognition and video captioning tool using faster whisper on cpu.
|
| 4 |
|
| 5 |
-
|
| 6 |
|
| 7 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
+ [ImageMagick](https://imagemagick.org/script/download.php)
|
| 9 |
+ [Python (3.11)](https://www.python.org/downloads/release/python-3116/)
|
| 10 |
|
|
@@ -20,23 +41,23 @@ pipenv install
|
|
| 20 |
|
| 21 |
Note that this assumes a proper Git installation and ssh key configuration.
|
| 22 |
|
| 23 |
-
|
| 24 |
|
| 25 |
-
|
| 26 |
|
| 27 |
Run the following code to your example using the CLI. The example is based on a youtube video url (optional):
|
| 28 |
|
| 29 |
```
|
| 30 |
-
pipenv run python
|
| 31 |
```
|
| 32 |
|
| 33 |
Fontsize, Font, Background Color and Text Color arguments are available:
|
| 34 |
|
| 35 |
```
|
| 36 |
-
pipenv run python
|
| 37 |
```
|
| 38 |
|
| 39 |
-
|
| 40 |
|
| 41 |
A FastAPI API is also made available.
|
| 42 |
|
|
|
|
| 2 |
|
| 3 |
A multilingual automatic speech recognition and video captioning tool using faster whisper on cpu.
|
| 4 |
|
| 5 |
+
<video width="400" height="300" src="https://github.com/marquesafonso/multilang-asr-captioner/assets/79766107/fcff8ac1-cdfc-4400-821c-f797d84c2d8a"></video>
|
| 6 |
|
| 7 |
+
## Docker
|
| 8 |
+
|
| 9 |
+
1. clone the repo
|
| 10 |
+
```{bash}
|
| 11 |
+
git clone git@github.com:marquesafonso/multilang-asr-captioner.git
|
| 12 |
+
```
|
| 13 |
+
2. Build and run the container using docker-compose
|
| 14 |
+
|
| 15 |
+
```{bash}
|
| 16 |
+
docker compose up
|
| 17 |
+
```
|
| 18 |
+
Then check the [landing page](http://127.0.0.1:8000).
|
| 19 |
+
|
| 20 |
+
From there you will see the [submit_video endpoint](http://127.0.0.1:8000/submit_video/) and the [documentation](http://127.0.0.1:8000/docs/)
|
| 21 |
+
|
| 22 |
+
Tip: on Linux or Mac localhost will resolve directly to 0.0.0.0 but on windows you will need to change it to 127.0.0.1
|
| 23 |
+
|
| 24 |
+
## Local
|
| 25 |
+
|
| 26 |
+
### Requirements and Instalations
|
| 27 |
+
|
| 28 |
+
To run this tool locally on your computer you will need the following sofware installed:
|
| 29 |
+ [ImageMagick](https://imagemagick.org/script/download.php)
|
| 30 |
+ [Python (3.11)](https://www.python.org/downloads/release/python-3116/)
|
| 31 |
|
|
|
|
| 41 |
|
| 42 |
Note that this assumes a proper Git installation and ssh key configuration.
|
| 43 |
|
| 44 |
+
### Quick start
|
| 45 |
|
| 46 |
+
#### Command Line Interface
|
| 47 |
|
| 48 |
Run the following code to your example using the CLI. The example is based on a youtube video url (optional):
|
| 49 |
|
| 50 |
```
|
| 51 |
+
pipenv run python cli.py --invideo_filename '<your_file_name>' --video_url 'https://www.youtube.com/watch?v=<your_youtube_video>' --max_words_per_line 8
|
| 52 |
```
|
| 53 |
|
| 54 |
Fontsize, Font, Background Color and Text Color arguments are available:
|
| 55 |
|
| 56 |
```
|
| 57 |
+
pipenv run python cli.py --invideo_filename '<your_file>' --video_url 'https://www.youtube.com/watch?v=<your_youtube_video>' --max_words_per_line 8 --fontsize 28 --font "Arial-Bold" --bg_color None --text_color 'white'
|
| 58 |
```
|
| 59 |
|
| 60 |
+
#### API
|
| 61 |
|
| 62 |
A FastAPI API is also made available.
|
| 63 |
|