UI-TARS本地部署

UI-TARS是一个用于本地部署的交互控制模型，支持桌面应用的自动化操作。部署步骤包括：首先，从官方GitHub仓库下载项目源码和模型checkpoint；其次，使用vLLM框架进行本地模型部署，启动API服务；最后，通过客户端调用API实现功能。部署过程中可能遇到pynvml模块或模型配置问题，提供了相应的解决方案。此外，还支持通过UI-TARS-Desktop客户端进行本地桌面应用的交互控制。

DYF-AI

874人浏览 · 2025-05-19 00:03:57

DYF-AI · 2025-05-19 00:03:57 发布

UI-TARS本地部署

UI-TARS 论文（arXiv）
UI-TARS 官方仓库：包含部署指南、模型下载链接及示例代码。
UI-TARS-Desktop 客户端：支持本地桌面应用的交互控制。
模型部署框架：vLLM本地部署

1.下载项目源码

git clone https://github.com/bytedance/UI-TARS.git

2.下载模型checkpoint

# 使用huggingface镜像源
export HF_ENDPOINT=https://hf-mirror.com
# 以2B模型为例（太穷了7B没显存）
huggingface-cli download --resume-download ByteDance-Seed/UI-TARS-2B-SFT --local-dir ./UI-TARS-2B-SFT

3.本地模型部署

启动 API 服务

#python -m vllm.entrypoints.openai.api_server --served-model-name ui-tars --model <模型路径>
python -m vllm.entrypoints.openai.api_server --served-model-name ui-tars --model /mnt/n/model/GUI-model/UI-TARS-2B-SFT
# --trust-remote-code
python -m vllm.entrypoints.openai.api_server --served-model-name ui-tars --model /mnt/n/model/GUI-model/UI-TARS-2B-SFT --trust-remote-code

若报错：

# 报错1
AttributeError: module 'pynvml' has no attribute 'nvmlDeviceGetCudaComputeCapability'
# 解决1
 pip install --force-reinstall --ignore-installed nvidia-ml-py
 # 报错2：
 ValueError: size must contain 'shortest_edge' and 'longest_edge' keys.
# 解决2：
https://www.modelscope.cn/models/bytedance-research/UI-TARS-7B-DPO/feedback/issueDetail/27680
preprocessor_config.json增加：
  "size": {
    "max_pixels": 2116800,
    "min_pixels": 3136,
    "shortest_edge": 3136,
    "longest_edge": 2116800
  },
  "temporal_patch_size": 2,
  "shortest_edge": 3136,
  "longest_edge": 2116800

4. 客户端调用示例

from openai import OpenAI
client = OpenAI(base_url="http://localhost:8000/v1", api_key="empty")
response = client.chat.completions.create(
    model="ui-tars",
    messages=[{"role": "user", "content": "搜索今日天气"}]
)

print(response.choices[0].message.content)

5. 安装UI.TARS-0.1.2.Setup.exe

配置 UI-TARS 客户端
打开 UI-TARS：启动 UI-TARS Windows 客户端。
进入模型配置界面：在客户端中找到模型配置相关的功能区域，通常在设置或者模型管理模块。
添加模型配置：
模型名称：为模型设置一个便于识别的名称，例如 local-vlm-model。
API 基础 URL：输入 vLLM 服务的基础 URL，默认情况下为 http://localhost:8000/v1。
认证信息：若服务需要认证，需填写相应的认证信息；若无需认证，可留空。

2048 AI社区

有“AI”的1024 = 2048，欢迎大家加入2048 AI社区

更多推荐

UFW防火墙安全指南

UFW（Uncomplicated Firewall）是Ubuntu/Debian系统中简化防火墙管理的工具，通过直观命令帮助用户有效控制网络流量，提升系统安全性。文章详细介绍了UFW的基本命令，包括启停防火墙、添加规则、限制连接速率和日志配置等操作，并提供了安全最佳实践，如默认拒绝策略、IP地址限制和服务级规则管理。同时，还涵盖高级配置技巧，例如多网络接口设置、规则优先级调整、IPv6支持及与f