如何在三分类情感分析数据集内计算指定单词的情感得分？现有方案合理性验证及相关研究建议

阿华AIGC实验室

2026-4-30

Hey there! Let's break down your two proposed approaches for calculating sentiment scores for your target nouns, and also share some better-aligned methods and research references to help you out.

方案可行性分析

方案一：基于共现标签的加权平均得分

This approach is fully feasible and serves as a classic baseline for entity-level sentiment analysis. Here's how it typically works:

Assign numerical values to sentiment labels (e.g., positive=1, neutral=0, negative=-1)
Count how many sentences containing your target word X fall into each label category (let's call these counts P, N, Neu)
Calculate the weighted score using a formula like:
```
Score(X) = (P*1 + N*(-1) + Neu*0) / (P + N + Neu)
```
Or you can use proportional weighting if you want to prioritize label distribution over raw counts.

Pros & Caveats

Pros: Super straightforward, low computational cost, no model training required—great for getting quick initial insights.
Caveats: Fails to account for contextual ambiguity (e.g., the word "apple" might refer to a fruit or a company with totally different sentiment contexts) and can be skewed if the number of sentences containing X is too small. Also, it assumes the entire sentence's sentiment is directed at X, which isn't always true (e.g., "I love the new iPhone, but the charger is terrible"—if X is "charger", the sentence label might be mixed, but the target sentiment is negative).

Relevant Research

The Harvard General Inquirer project pioneered this co-occurrence-based approach, using large text corpora to map words to emotional categories via statistical frequency.
Domain-specific sentiment lexicon studies like Building a Sentiment Lexicon for Financial News use this exact method to calculate sentiment scores for financial entities by counting their co-occurrence with labeled news articles.

方案二：基于微调BERT的单词情感得分

This approach is also feasible, but you'll need to adjust your method to get meaningful results—just feeding a single word into a fine-tuned BERT won't work well, since BERT is a contextual model that relies on surrounding text to generate accurate representations.

How to Do It Right

First, fine-tune your BERT model on your labeled dataset for sentence-level sentiment classification.
For each target word X:
- Collect all sentences containing X from your dataset.
- For each sentence, modify the input to explicitly mark X (e.g., add special tokens like [TGT] around X: "I love [TGT]iPhone[TGT]")
- Extract the hidden state representations of X's tokens from BERT, or use the model's attention weights to measure how much X contributes to the sentence's sentiment prediction.
- Alternatively, use template-based inputs (e.g., "The [X] is [MASK]") and use the model's prediction of sentiment-related words to calculate a score.

Pros & Caveats

Pros: Captures contextual nuances, so it can handle ambiguous words and distinguish sentiment directed specifically at X from the overall sentence sentiment.
Caveats: Higher computational cost, requires enough labeled data to fine-tune BERT effectively, and requires more implementation work to target specific entities.

Relevant Research

BERT for Aspect-Based Sentiment Analysis: A Simple but Effective Approach: This paper shows how to adapt BERT for aspect/entity-specific sentiment analysis by marking target entities in the input, which is exactly the use case you're targeting.
Target-Dependent Sentiment Classification with Attentional Encoder Networks: While this uses an LSTM instead of BERT, the core idea of focusing attention on target entities to calculate their sentiment score is foundational for modern contextual approaches.

更推荐的优化思路

If your goal is to get precise sentiment scores for your target nouns, Aspect-Based Sentiment Analysis (ABSA) is the dedicated field for this task—far more tailored than your initial two approaches:

For small datasets: Optimize your first approach by filtering out sentences where X isn't the focus of the sentiment (e.g., use dependency parsing to check if X is the object of a sentiment-bearing verb). You can also combine it with pre-built sentiment lexicons (like VADER) to weight sentences based on how strongly sentiment words are associated with X.
For larger datasets: Use an ABSA-specific model (like BERT-ABSA) that's explicitly trained to predict sentiment for target entities within sentences. This directly outputs a sentiment score for each occurrence of X, which you can average to get an overall score for the word.

内容的提问来源于stack exchange，提问作者Dipto Das