Min-p Sampling

Amid the recent flurry of competing decoding strategies, another simple and effective one has emerged: Min-p sampling.

Its authors argue that existing truncation methods have the following drawbacks:

  • Top-p struggles to balance generation quality and diversity, especially when the top few tokens carry high probability and the rest carry low probability
  • Top-k sometimes discards high-quality tokens or admits too many low-quality ones
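The top-p issue can be seen with a toy sketch (the probability values below are made up for illustration): with a peaked distribution the p = 0.9 nucleus is tiny, while with a flat distribution it swallows nearly the whole vocabulary.

```python
def nucleus_size(probs, top_p=0.9):
    # Smallest set of tokens whose cumulative probability reaches top_p
    total, count = 0.0, 0
    for p in sorted(probs, reverse=True):
        total += p
        count += 1
        if total >= top_p:
            return count
    return count

peaked = [0.85, 0.10, 0.02, 0.01, 0.01, 0.01]
flat = [0.20, 0.18, 0.17, 0.16, 0.15, 0.14]

print(nucleus_size(peaked))  # 2: the nucleus is tiny
print(nucleus_size(flat))    # 6: the nucleus spans the whole toy vocabulary
```

A fixed p therefore keeps too little in one regime and too much in the other, which is exactly the tension Min-p tries to resolve.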

The core idea of Min-p is to scale the minimum probability threshold dynamically according to the maximum token probability.
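Concretely, with a base value `min_p` the cutoff becomes `min_p * max(probs)`, so it tightens when the model is confident and loosens when the distribution is flat. A minimal sketch with a hypothetical distribution:

```python
probs = [0.50, 0.25, 0.12, 0.06, 0.04, 0.03]  # hypothetical next-token distribution
min_p = 0.2

# The cutoff scales with the most likely token: 0.2 * 0.50 = 0.10
threshold = min_p * max(probs)
kept = [p for p in probs if p >= threshold]
print(kept)  # [0.5, 0.25, 0.12]
```

If the same distribution were flatter (say a maximum of 0.2), the threshold would drop to 0.04 and more tokens would survive.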

Min-p is also very simple to implement:

```python
import torch


class MinPLogits:
    def __init__(self, min_p: float, filter_value: float = -float("Inf"), min_tokens_to_keep: int = 1):
        self.min_p = min_p
        self.filter_value = filter_value
        self.min_tokens_to_keep = min_tokens_to_keep

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        probs = torch.softmax(scores, dim=-1)
        # Maximum probability in each row
        top_probs, _ = probs.max(dim=-1, keepdim=True)
        # Dynamic probability threshold: min_p * top_probs
        scaled_min_p = self.min_p * top_probs
        # Any token whose probability falls below the threshold is removed
        tokens_to_remove = probs < scaled_min_p

        sorted_indices = torch.argsort(scores, descending=True, dim=-1)
        sorted_indices_to_remove = torch.gather(tokens_to_remove, dim=-1, index=sorted_indices)
        # Keep at least min_tokens_to_keep tokens
        sorted_indices_to_remove[..., : self.min_tokens_to_keep] = False

        indices_to_remove = sorted_indices_to_remove.scatter(1, sorted_indices, sorted_indices_to_remove)
        scores_processed = scores.masked_fill(indices_to_remove, self.filter_value)
        return scores_processed
```
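To see the effect on real logits, here is a hedged usage sketch: a stripped-down functional version of the same filtering (omitting `min_tokens_to_keep` for brevity; the tensor values are made up) applied to a toy batch.

```python
import torch

def min_p_filter(scores: torch.Tensor, min_p: float = 0.1,
                 filter_value: float = -float("inf")) -> torch.Tensor:
    # Functional equivalent of the processor above, without the keep-at-least-k guard
    probs = torch.softmax(scores, dim=-1)
    threshold = min_p * probs.max(dim=-1, keepdim=True).values
    return scores.masked_fill(probs < threshold, filter_value)

logits = torch.tensor([[4.0, 3.0, 1.0, 0.0, -2.0]])
filtered = min_p_filter(logits, min_p=0.1)
# Surviving tokens keep their logits; the rest become -inf and get zero
# probability after the next softmax
print(torch.softmax(filtered, dim=-1))
```

Here the top token has probability ≈ 0.70, so the cutoff is ≈ 0.07 and only the top two tokens survive; sampling then proceeds over the re-normalized distribution.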
Licensed under CC BY-NC-SA 4.0