Commit History

Improvements with embeddings load and file save
650da6e

seanpedrickcase commited on

added a few more requirements for aws load
58d3f97

seanpedrickcase commited on

Added transformers to requirements for AWS
0444b49

seanpedrickcase commited on

Updated Dockerfile and requirements files to create a smaller container
91bd588

seanpedrickcase commited on

Minor bug fix to connections parameter function
b1c3d49

seanpedrickcase commited on

Explicitly installing curl and exposing port in Dockerfile
f6d59b2

seanpedrickcase commited on

Cognito authorisation option added to app, some other minor changes.
759001a

seanpedrickcase commited on

When running on cloud now checks for relevant header details on load
f9e3451

seanpedrickcase commited on

Now accepts .zip file as inputs. Moved semantic search option bar. Minor API mode changes.
7f029b5

seanpedrickcase commited on

Changed embedding model to MiniLM-L6 as faster. Compressed embeddings are now int8. General improvements to API mode
ea0dd40

seanpedrickcase commited on

Minor changes to function outputs. Attempted Python downgrade to 3.10 to address xlsx output issues
2806807

seanpedrickcase commited on

General code improvements and refinements.
a95ef9f

seanpedrickcase commited on

Set bm25 in functions explicitly. Some API updates. Now can get connection params on startup.
2393537

seanpedrickcase commited on

Some package updates and minor changes
2754a2b

seanpedrickcase commited on

embedding files now write to output folder
1dc162b

seanpedrickcase commited on

Changed all intermediate file outputs to save to output folder
fea085c

seanpedrickcase commited on

Allowed for custom output folder, returned Dockerfile to work under user account and port 7860
d3ff2e2

seanpedrickcase commited on

Testing not running Dockerfile as user for Lambda compatibility
3b77fe5

seanpedrickcase commited on

Modified Dockerfile to run app on port 8080, which Lambda uses
f20dd46

seanpedrickcase commited on

Added Lambda web adapter to Dockerfile. Updated Gradio version
fcd826b

seanpedrickcase commited on

Updated Dockerfile to create output folder correctly and enable regional AWS STS endpoint
2b16125

seanpedrickcase commited on

Now checks for output folder before saving. Minor code cleaning
2089141

seanpedrickcase commited on

Fixed cleaning for semantic search. Handles text with backslashes in (if cleaned). Updated packages. requirements file for only keyword search added.
8466e45

seanpedrickcase commited on

Assigned AWS bucket name to environmental variable
7bdc986

seanpedrickcase commited on

Added additional password auth for AWS-based files. Changed 'Clean' default to no
651ef78

seanpedrickcase commited on

AWS credentials no longer a requirement for app to work
30b5dc1

seanpedrickcase commited on

Gradio 4.21. Limitations on file size and creating embeddings. Added AWS integration
e0fe055

seanpedrickcase commited on

Removed GRADIO_ROOT_PATH from the Dockerfile as it is not necessary for API Gateway access
7e9dd76

Sean-Case commited on

Updated to latest Gradio version that hopefully solved root_url issues. Added ssl_verify=False to launch
813f823

Sean-Case commited on

Set SDK version to 4.20.0 on huggingface. Added GRADIO_SERVER_PORT to Dockerfile
13cab70

Sean-Case commited on

set gradio version to 4.18.0, before root path issue reappears
8ba34be

Sean-Case commited on

Changed dockerfile gradio_root_path variable
745a159

Sean-Case commited on

String query not correctly specified in fuzzy search, changed this
ff8dfa3

Sean-Case commited on

ECS not allowing me to save files so increasing container privileges in Dockerfile
9c0a094

Sean-Case commited on

set Dockerfile Gradio_root_path environmental variable
29ce26a

Sean-Case commited on

Now loads in embedding model locally in Dockerfile
3034296

Sean-Case commited on

edited Dockerfile slightly
55b0541

Sean-Case commited on

Added small line to dockerfile start. Added dockerignore file for local builds
d99f440

Sean-Case commited on

Improved code for cleaning and outputting files. Added Dockerfile
4ee3470

Sean-Case commited on

Improved xlsx output formatting. Deals better with cleaning data then analysing in same session.
352c02a

Sean-Case commited on

Added highlight search term functionality to keyword search output
36a404e

seanpedrickcase commited on

Updated to Gradio 4.16.0. Now works correctly with BGE embeddings
2bcd818

seanpedrickcase commited on

Upgraded to Gradio 4.16.0. Added Spacy fuzzy search functionality.
4ce2224

Sean-Case commited on

Changed intro text
8c115b3

Sean-Case commited on

Changed requirements to download correct spacy model version
4e77f14

Sean-Case commited on

Cut out semantic search temporarily while issues with Jina gated model resolved. Improved error/progress tracking and messaging. Placeholder for Spacy fuzzy search.
739b386

seanpedrickcase commited on

Switch back to Gradio version 3.50.2 as 4 is not working with pyinstaller app creation yet
c6dc87d

Sean-Case commited on