Remove FastAPI implementation and update max_new_tokens in model config 9331285 scott12355 commited on Mar 20
Refactor generateFromChatHistory to use a GPU-enabled function and remove unnecessary comments ec3c542 scott12355 commited on Mar 20
Refactor GPU initialization and device management for Hugging Face Spaces compatibility 237f784 scott12355 commited on Mar 20