Claim Extraction Skill
Extract all substantive claims from AI research content. A claim is any assertion that:
- •States something as true about AI capabilities, limitations, or progress
- •Predicts future developments
- •Hints at unreleased work
- •Expresses a positioned opinion on the field's direction
- •Critiques others' claims or work
Extraction Schema
For each claim, extract:
1. claimText
The claim in clear, standalone form. Paraphrase if needed for clarity.
2. claimType
- •
fact: Assertion about current state ("GPT-4 can do X") - •
prediction: Forward-looking ("By 2026, we'll have...") - •
hint: Implies unreleased work ("We've been seeing interesting results with...") - •
opinion: Positioned take ("I think scaling is/isn't sufficient") - •
critique: Challenges others ("Marcus is wrong because...") - •
question: Genuine uncertainty expressed ("I'm not sure if...")
3. topic
Primary topic category:
- •
scaling: Scaling laws, compute, training efficiency - •
reasoning: LLM reasoning, chain-of-thought, planning - •
agents: AI agents, tool use, autonomy - •
safety: AI safety, alignment, control - •
interpretability: Mechanistic interpretability - •
multimodal: Vision, audio, video models - •
rlhf: RLHF, preference learning, Constitutional AI - •
benchmarks: Evals, benchmarks, capability measurement - •
infrastructure: Training infra, chips, hardware - •
policy: AI policy, regulation, governance - •
general: General AI commentary
4. stance
- •
bullish: Optimistic about AI progress/capabilities - •
bearish: Skeptical/pessimistic about AI progress - •
neutral: Balanced or factual without clear stance
5. bullishness
Float from 0.0 (maximally bearish) to 1.0 (maximally bullish)
6. confidence
How confident does the author seem? (0.0-1.0)
- •Hedging language: "might", "could", "I think", "possibly" → lower
- •Certainty language: "will", "definitely", "it's clear that" → higher
7. timeframe (for predictions)
- •
near-term: < 1 year - •
medium-term: 1-3 years - •
long-term: 3-10 years - •
unspecified: No clear timeframe - •
null: Not a prediction
8. evidenceProvided
- •
strong: Cites data, papers, or detailed reasoning - •
moderate: Some reasoning but not rigorous - •
weak: Assertion without support - •
appeal-to-authority: "Trust me, I work on this"
9. quoteworthiness
Is this claim notable enough to quote in a digest? (0.0-1.0)
Output Format
Return JSON:
json
{
"claims": [
{
"claimText": "The claim in clear form",
"claimType": "prediction",
"topic": "reasoning",
"stance": "bullish",
"bullishness": 0.8,
"confidence": 0.7,
"timeframe": "medium-term",
"evidenceProvided": "moderate",
"quoteworthiness": 0.6,
"relatedTo": ["o1", "chain-of-thought"],
"originalQuote": "Brief relevant quote if notable"
}
]
}
Guidelines
- •Extract MULTIPLE claims from a single piece of content if present
- •Don't over-extract - only substantive, meaningful claims
- •A tweet saying "Interesting paper" is NOT a claim
- •Look for IMPLICIT claims ("We've made a lot of progress" implies capability gains)
- •Pay attention to WHO is speaking - lab researchers hinting at their own work is high signal
- •Critics often make claims by contradiction ("X is wrong, therefore Y")
Author Context Matters
Consider the author's affiliation when assessing:
- •Lab researchers (Anthropic, OpenAI, DeepMind): May hint at unreleased work
- •Critics (Marcus, Chollet, Mitchell): Often make claims through critique
- •Independent (Simon Willison, Jim Fan): Provide practitioner perspectives