Spaces: yusufs/vllm-inference (Paused)

  • 1 contributor
History: 46 commits
yusufs
feat(runner.sh): using MODEL_ID only
490e6a3 4 months ago
  • .gitignore
    19 Bytes
    feat(download_model.py): remove download_model.py during build, it causing big image size 5 months ago
  • Dockerfile
    1.32 kB
    feat(runner.sh): using runner.sh to select llm in the run time 4 months ago
  • README.md
    1.73 kB
    feat(add-model): always download model during build, it will be cached in the consecutive builds 5 months ago
  • download_model.py
    700 Bytes
    feat(add-model): always download model during build, it will be cached in the consecutive builds 5 months ago
  • main.py
    6.7 kB
    feat(parse): parse output 5 months ago
  • openai_compatible_api_server.py
    24.4 kB
    feat(dep_sizes.txt): removes dep_sizes.txt during build, it not needed 5 months ago
  • poetry.lock
    426 kB
    feat(refactor): move the files to root 5 months ago
  • pyproject.toml
    416 Bytes
    feat(refactor): move the files to root 5 months ago
  • requirements.txt
    9.99 kB
    feat(first-commit): follow examples and tutorials 5 months ago
  • run-llama.sh
    1.53 kB
    feat(seed): Random seed for reproducibility. 4 months ago
  • run-sailor.sh
    1.85 kB
    feat(seed): Random seed for reproducibility. 4 months ago
  • runner.sh
    952 Bytes
    feat(runner.sh): using MODEL_ID only 4 months ago