Skip to content

fix(ci): update vllm to 0.21.0 for Dockerfile #4934

Merged
qinxuye merged 72 commits into
xorbitsai:mainfrom
zwt-1234:main
May 21, 2026
Merged

fix(ci): update vllm to 0.21.0 for Dockerfile #4934
qinxuye merged 72 commits into
xorbitsai:mainfrom
zwt-1234:main

Conversation

@zwt-1234
Copy link
Copy Markdown
Collaborator

No description provided.

zwt-1234 and others added 30 commits October 1, 2025 08:43
Added wetext installation and set LD_LIBRARY_PATH.
Removed 'gptqmodel' from the requirements file.
zwt-1234 added 22 commits March 30, 2026 17:11
Set environment variables for NVM and Node.js version.
Removed pre-release installation of torch, torchvision, and torchaudio, retaining only triton and torchcodec installations.
Removed installation of pre-release PyTorch packages and adjusted the installation order.
Removed installation of torchcodec from Dockerfile.
Change docker build command to use --load instead of --push for aarch64 image.
Added security options to the Docker build for ARM64.
Removed the security option from the Docker build command for aarch64.
@XprobeBot XprobeBot added this to the v2.x milestone May 21, 2026
Copy link
Copy Markdown

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the Dockerfile by bumping the base image version, adjusting library paths for Python 3.12, and modifying package dependencies, including the removal of Triton and specific FlashInfer components. However, several critical issues were identified: the specified versions for the base image, Transformers, and the Flash Attention wheel appear to be non-existent or incorrect, which will lead to build failures. Additionally, hardcoding a regional pip mirror is discouraged in favor of the official PyPI index to ensure global accessibility.

I am having trouble creating individual review comments. Click here to see my feedback.

xinference/deploy/docker/Dockerfile (46)

high

The version transformers==5.5.0 does not exist on PyPI (the current latest stable version is in the 4.x range). This will cause the Docker build to fail. Please verify the intended version.

xinference/deploy/docker/Dockerfile (52)

high

The wheel URL references cu130 (CUDA 13.0) and torch2.11. Neither CUDA 13.0 nor PyTorch 2.11 have been released yet. This URL is likely invalid and will cause the build to fail.

xinference/deploy/docker/Dockerfile (1)

medium

The tag v0.21.0 for vllm/vllm-openai does not appear to be an official release tag (vLLM is currently in the v0.7.x range). Please verify if this is a typo or if it refers to a specific non-official image.

xinference/deploy/docker/Dockerfile (29)

medium

Hardcoding the PIP_INDEX to a regional mirror is not recommended for a general-purpose Dockerfile as it may be inaccessible or slower for users outside of China. It is better to keep the official PyPI index as the default and allow users to override it via a build argument.

ARG PIP_INDEX=https://pypi.org/simple

@qinxuye qinxuye changed the title feat:Update Dockerfile for vllm and dependency versions fix(ci): update vllm to 0.21.0 for Dockerfile May 21, 2026
@XprobeBot XprobeBot added the bug Something isn't working label May 21, 2026
@qinxuye qinxuye merged commit 3423183 into xorbitsai:main May 21, 2026
7 of 13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants