Head size 160 is not supported by PagedAttention.

Supported head sizes are: [64, 80, 96, 112, 128, 256].

Error message:

Head size 160 is not supported by PagedAttention. Supported head sizes are: [64, 80, 96, 112, 128, 256].
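The wording of the message suggests a simple membership test of the model's head size against a fixed list. A minimal sketch of that kind of check follows; it is not vLLM's actual code, and the names `SUPPORTED_HEAD_SIZES` and `check_head_size` are hypothetical, introduced only for illustration.

```python
from typing import List

# Hypothetical stand-in for the list named in the error message.
SUPPORTED_HEAD_SIZES: List[int] = [64, 80, 96, 112, 128, 256]


def check_head_size(head_size: int,
                    supported: List[int] = SUPPORTED_HEAD_SIZES) -> None:
    """Raise ValueError if head_size is not in the supported list."""
    if head_size not in supported:
        raise ValueError(
            f"Head size {head_size} is not supported by PagedAttention. "
            f"Supported head sizes are: {supported}."
        )
```

With the default list, `check_head_size(160)` raises a `ValueError` carrying the message above, while passing a list that includes 160 lets the check through. The sketch covers only this Python-side check.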

Solution:

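Open paged_attn.py in the installed vLLM package (the path below is from a conda environment named myvllm) and add 160 to the list returned by get_supported_head_sizes():

vim /root/miniconda3/envs/myvllm/lib/python3.10/site-packages/vllm/attention/ops/paged_attn.py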

---
    def get_supported_head_sizes() -> List[int]:
        # 160 added to the original list [64, 80, 96, 112, 128, 256]
        return [64, 80, 96, 112, 128, 160, 256]
---
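The path above is specific to one machine. As a hedged sketch, the file can be located in whatever environment is active with the standard library's `importlib.util.find_spec`; the helper name `find_module_file` is hypothetical, and the module name `vllm.attention.ops.paged_attn` is assumed from the file path in this post.

```python
import importlib.util
from typing import Optional


def find_module_file(module_name: str) -> Optional[str]:
    """Return the source file path of module_name, or None if it is not installed."""
    try:
        # For a dotted name, find_spec imports the parent packages first.
        spec = importlib.util.find_spec(module_name)
    except ModuleNotFoundError:
        # A parent package (for example vllm itself) is missing.
        return None
    return spec.origin if spec is not None else None
```

Running `print(find_module_file("vllm.attention.ops.paged_attn"))` inside the environment where vLLM is installed should print the file to edit. Because the parent packages are imported during the lookup, the call may take a few seconds.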
