测验程序中基于预设二进制答案数组的匹配：如何自动为差异化问题加权？

阿华AIGC实验室

2026-4-27

解决方案：自动计算题目权重，基于预设答案的区分度

你的核心痛点非常清晰：当前的等权重匹配没有区分「哲学家分歧大的题目」和「所有人答案一致的题目」，导致部分匹配结果随机。要实现客观的自动权重计算，关键是量化每道题的「区分能力」——也就是这道题在所有预设答案中的不一致程度，不一致程度越高，权重越大。

核心思路：用「熵（Entropy）」或「答案分散度」计算权重

我们可以通过统计每道题的答案分布，自动计算其权重：

如果某道题所有哲学家答案完全一致（比如全是1），那它的区分度为0，权重也为0（因为这道题无法帮我们区分任何哲学家）
如果某道题的答案分布越分散（比如一半哲学家选0，一半选1），区分度越高，权重也越高

最科学的方式是用信息熵来计算区分度，它能精准衡量答案的分散程度；如果追求简单，也可以用「1 - 最常见答案的占比」来快速计算。

具体实现步骤与代码修改

1. 预先计算每道题的权重

我们先写一个函数，遍历所有哲学家的答案，计算每道题的权重（只需要计算一次，不用每次匹配都重复计算）：

const philosophers = [
  { name: "Nietzsche", answers: [1, 0, 1, 1, 0, 1, 1, 0, 0, 1, 1, 0, 1, 0, 1] },
  { name: "Diogenes", answers: [1, 0, 1, 1, 0, 1, 1, 0, 0, 1, 1, 0, 0, 1, 1] },
  { name: "Aquinas", answers: [0, 1, 0, 0, 1, 0, 1, 1, 1, 0, 0, 1, 0, 1, 0] }
];

// 计算每道题的权重（基于信息熵）
function calculateQuestionWeights(philosophers) {
  const numQuestions = philosophers[0].answers.length;
  const weights = new Array(numQuestions).fill(0);

  for (let questionIdx = 0; questionIdx < numQuestions; questionIdx++) {
    // 统计当前题目的0和1的数量
    let count0 = 0;
    let count1 = 0;
    philosophers.forEach(phil => {
      phil.answers[questionIdx] === 0 ? count0++ : count1++;
    });

    const total = philosophers.length;
    const p0 = count0 / total;
    const p1 = count1 / total;

    // 计算信息熵：熵越高，区分度越高
    // 处理log2(0)的特殊情况（当所有答案一致时，熵为0）
    let entropy = 0;
    if (p0 > 0) entropy -= p0 * Math.log2(p0);
    if (p1 > 0) entropy -= p1 * Math.log2(p1);

    // 也可以用简单版本：1 - 最常见答案的占比
    // weights[questionIdx] = 1 - Math.max(p0, p1);

    weights[questionIdx] = entropy;
  }

  return weights;
}

// 预先计算权重（只执行一次）
const questionWeights = calculateQuestionWeights(philosophers);

2. 修改匹配函数，使用加权得分

原来的findMatch函数是统计正确题目的数量，现在改成统计加权得分——如果用户答案和哲学家一致，就加上该题的权重：

function findBestMatch(userAnswers) {
  let bestMatchName = "";
  let highestWeightedScore = -Infinity;

  philosophers.forEach(phil => {
    let currentScore = 0;
    for (let questionIdx = 0; questionIdx < userAnswers.length; questionIdx++) {
      if (userAnswers[questionIdx] === phil.answers[questionIdx]) {
        // 加上当前题目的权重，而不是加1
        currentScore += questionWeights[questionIdx];
      }
    }

    // 更新最佳匹配
    if (currentScore > highestWeightedScore) {
      highestWeightedScore = currentScore;
      bestMatchName = phil.name;
    }
  });

  return bestMatchName;
}