Update utils_llama.py

Allen, 1 year ago
Parent
Current commit
ae85ea9ffe
1 file changed, with 1 insertion and 0 deletions
research/long-context-llama/H2O/utils_llama.py (+1, -0)

@@ -205,6 +205,7 @@ class H2OLlamaAttention(nn.Module):
         past_key_value: Optional[Tuple[torch.Tensor]] = None,
         output_attentions: bool = False,
         use_cache: bool = False,
+        cache_position: Optional[torch.LongTensor] = None,
     ) -> Tuple[torch.Tensor, Optional[torch.Tensor], Optional[Tuple[torch.Tensor]]]:
 
         bsz, q_len, _ = hidden_states.size()
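
The commit message does not say why `cache_position` was added. A plausible reading is that the caller passes `cache_position` as a keyword argument, so a `forward` without that parameter would fail with a `TypeError`. The sketch below illustrates that failure mode under this assumption. `AttentionBefore`, `AttentionAfter`, `call_layer`, and `accepts_cache_position` are hypothetical stand-ins and not part of `utils_llama.py`; plain lists replace tensors so that the example runs without `torch`.

```python
from typing import Optional, Sequence


class AttentionBefore:
    """Hypothetical stand-in for the forward signature before this commit."""

    def forward(self, hidden_states, past_key_value=None,
                output_attentions: bool = False, use_cache: bool = False):
        return hidden_states


class AttentionAfter:
    """Hypothetical stand-in for the forward signature after this commit."""

    def forward(self, hidden_states, past_key_value=None,
                output_attentions: bool = False, use_cache: bool = False,
                cache_position: Optional[Sequence[int]] = None):
        # The argument is accepted so the call succeeds; this sketch ignores it.
        return hidden_states


def call_layer(attn, hidden_states):
    # Assumed caller behaviour: cache_position is always passed by keyword.
    positions = list(range(len(hidden_states)))
    return attn.forward(hidden_states, use_cache=True, cache_position=positions)


def accepts_cache_position(attn) -> bool:
    """Return True if the call with cache_position succeeds, False on TypeError."""
    try:
        call_layer(attn, [0.1, 0.2, 0.3])
        return True
    except TypeError:
        return False


if __name__ == "__main__":
    print("before:", accepts_cache_position(AttentionBefore()))
    print("after:", accepts_cache_position(AttentionAfter()))
```

Because the new parameter defaults to `None`, callers that never pass `cache_position` are unaffected by the change.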