Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
yusufs
/
vllm-inference
Paused

App Files Files
Fetching metadata from the HF Docker repository...
vllm-inference
Ctrl+K
Ctrl+K
  • 1 contributor
History: 57 commits
yusufs's picture
yusufs
feat(runner.sh): DeepSeek-R1-Distill-Qwen-32B d66bcfc2f3fd52799f95943264f32ba15ca0003d
148829b 3 months ago
  • .gitignore
    19 Bytes
    feat(download_model.py): remove download_model.py during build, it causing big image size 5 months ago
  • Dockerfile
    1.44 kB
    feat(Dockerfile): install gcc 3 months ago
  • README.md
    1.73 kB
    feat(add-model): always download model during build, it will be cached in the consecutive builds 5 months ago
  • download_model.py
    700 Bytes
    feat(add-model): always download model during build, it will be cached in the consecutive builds 5 months ago
  • main.py
    6.7 kB
    feat(parse): parse output 5 months ago
  • openai_compatible_api_server.py
    24.4 kB
    feat(dep_sizes.txt): removes dep_sizes.txt during build, it not needed 5 months ago
  • poetry.lock
    426 kB
    feat(refactor): move the files to root 5 months ago
  • pyproject.toml
    416 Bytes
    feat(refactor): move the files to root 5 months ago
  • requirements.txt
    9.99 kB
    feat(first-commit): follow examples and tutorials 5 months ago
  • run-llama.sh
    1.51 kB
    fix(runner.sh): --enforce-eager not support values 4 months ago
  • run-sailor.sh
    1.83 kB
    fix(runner.sh): --enforce-eager not support values 4 months ago
  • runner.sh
    2.04 kB
    feat(runner.sh): DeepSeek-R1-Distill-Qwen-32B d66bcfc2f3fd52799f95943264f32ba15ca0003d 3 months ago