### 1. Index is not persisted because of a Hugging Face limitation
Easy fix: persist it on Azure Storage (similar to S3).
Re-indexing is not that costly: it uses `text-embedding-3-small`, which costs $0.020 / 1M tokens.
Fix in `step_1_index_documents.py`.
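
To put that price in perspective, the cost of one re-indexing run can be estimated from the token count. This is a minimal sketch: the function name and the 250,000-token corpus are hypothetical, and only the $0.020 / 1M tokens rate comes from the note above.

```python
# Price quoted above for `text-embedding-3-small`, in USD per 1M tokens.
PRICE_PER_MILLION_TOKENS_USD = 0.020


def estimate_embedding_cost(num_tokens: int) -> float:
    """Return the estimated USD cost of embedding `num_tokens` tokens."""
    return num_tokens / 1_000_000 * PRICE_PER_MILLION_TOKENS_USD


# Hypothetical corpus size, for illustration only.
print(f"${estimate_embedding_cost(250_000):.4f}")  # prints "$0.0050"
```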
### 2. Improve Python type hints in functions
### 3. Add error messages and try/except handling
### 4. Add automated tests
### 5. Display steps live
### 6. Be able to test from any step (e.g. from step 3, skipping indexing and comparing, ...)
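
One possible shape for this, as a sketch only: it assumes each step is a function that takes and returns a shared `state` dict, and that the outputs of skipped steps can be supplied up front (for example, loaded from disk). `run_pipeline` and the three step functions are hypothetical stand-ins, not the project's actual code.

```python
from typing import Callable

Step = Callable[[dict], dict]


def run_pipeline(steps: list[Step], state: dict, start_at: int = 1) -> dict:
    """Run `steps` in order, skipping every step before `start_at` (1-based)."""
    if not 1 <= start_at <= len(steps):
        raise ValueError(f"start_at must be between 1 and {len(steps)}")
    for step in steps[start_at - 1:]:
        state = step(state)
    return state


# Hypothetical stand-ins for the real steps.
def step_1_index(state: dict) -> dict:
    return {**state, "index": "built"}


def step_2_compare(state: dict) -> dict:
    return {**state, "discrepancies": ["example"]}


def step_3_report(state: dict) -> dict:
    return {**state, "report": f"{len(state['discrepancies'])} discrepancy(ies)"}


# Start at step 3 with previously saved results, skipping indexing and comparing.
saved = {"index": "built", "discrepancies": ["a", "b"]}
result = run_pipeline([step_1_index, step_2_compare, step_3_report], saved, start_at=3)
print(result["report"])  # prints "2 discrepancy(ies)"
```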
### 7. Add the time taken by each step
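
A small context manager is one way to record this. The sketch below uses only the standard library; the `timed` helper, the `timings` dict, and the step name are hypothetical.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(step_name: str, timings: dict[str, float]):
    """Record the wall-clock duration of the enclosed block, in seconds."""
    start = time.perf_counter()
    try:
        yield
    finally:
        # Stored even if the step raises, so failed steps are timed too.
        timings[step_name] = time.perf_counter() - start


timings: dict[str, float] = {}
with timed("step_1_index_documents", timings):
    time.sleep(0.01)  # placeholder for the real step

for name, seconds in timings.items():
    print(f"{name}: {seconds:.2f}s")
```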
### 8. Add price for each step
### 9. Make it clearer which file each discrepancy belongs to
Put the file names here, and where in the file:
```
text += f'### Discrepancy {i}:\n\n'
```
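
A possible version of that heading with the file names and location included, as a sketch: `discrepancy_heading`, `file_a`, `file_b`, and `location` are hypothetical names, and the exact layout is only a suggestion.

```python
def discrepancy_heading(i: int, file_a: str, file_b: str, location: str) -> str:
    """Build a Markdown heading naming both files and where the discrepancy is."""
    return f'### Discrepancy {i}: {file_a} vs {file_b} ({location})\n\n'


text = ''
text += discrepancy_heading(1, 'contract_v1.pdf', 'contract_v2.pdf', 'page 2')
print(text)  # prints "### Discrepancy 1: contract_v1.pdf vs contract_v2.pdf (page 2)"
```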