Earnings Attribution Analysis

Goal: Determine WHY a stock moved after an 8-K filing by calculating the SURPRISE (actual vs expected).

Thinking: ALWAYS use ultrathink for maximum reasoning depth.

End Goal: Build company-specific driver understanding for real-time prediction accuracy.

Core Principles

•Comprehensiveness Over Speed: Query ALL relevant data sources
•Evidence-Based Claims Only: Every claim must cite a source
•Surprise-Focused: Stock moves on actual vs expectations, not absolute results
•Learn from History: Each analysis builds company-specific knowledge
•Self-Audit: Validate all claims have sources before completing

Confidence Levels

•High: Multiple sources agree, clear fundamental reason, unambiguous evidence
•Medium: Sources partially agree, conclusion fits with caveats
•Insufficient: Sources conflict, missing critical data, no clear driver

If evidence is insufficient, say so. Never fabricate certainty.

Resources

•Neo4j schema & queries: neo4j_schema.md
•Output format: output_template.md
•Evidence audit checklist: evidence_audit.md
•Usage examples: examples.md
•Self-improvement: update-skills.md
•Known data gaps: data_gaps.md

Workflow (10 Steps)

Use TodoWrite to track progress. Mark each step in_progress before starting, completed immediately after.

Step	Action	Notes
1	Data Inventory	Map what data exists (REQUIRED FIRST)
2	Get Report + Returns	Move magnitude and context
3	Get Consensus	Always query Perplexity for expectations
4	Query Neo4j	News, Transcript, XBRL history, Dividends, Splits
5	Query Perplexity	Fill gaps (use deep_research for complex cases)
6	Synthesize	Calculate surprises, identify primary driver
7	Output Report	Save to `drivers/Movers/Companies/{TICKER}/{accession}.md`
8	Self-Audit	Run evidence_audit.md and validate Evidence Ledger + sources
9	Propose Skill Updates	Follow update-skills.md
10	Mark Completed	Update tracking CSV (`completed=TRUE`)

Rules: Step 1 always first. Steps 2-5 case-by-case sequencing. Steps 6-10 sequential.

Step 1: Data Inventory

Query what data exists before making claims. See neo4j_schema.md for full query.

Check for: News count, Transcript exists, XBRL reports count, Dividends, Splits.

Step 2: Get Report and Returns

Get filing details and price reaction. See neo4j_schema.md for query.

Returns to capture:

•Macro-adjusted (vs SPY) - primary
•Sector-adjusted - context (may be null)
•Industry-adjusted - context (may be null)

Step 3: Get Consensus Estimates

Always query Perplexity - do not rely on Neo4j News for consensus.

code

mcp__perplexity__perplexity_search:
  query: "{ticker} Q{quarter} FY{year} EPS revenue estimate consensus before {date}"

Source preference for Evidence Ledger:

•Best: Perplexity pre-filing search (explicit consensus before event)
•Acceptable: News headline citing estimate (note: post-filing, weaker provenance)
•Unacceptable: Unsourced or inferred values

If using News as consensus source, cite it accurately in Evidence Ledger—do not claim Perplexity in Data Sources Used.

Step 4: Query Neo4j Sources

Query based on Data Inventory results. See neo4j_schema.md for all queries.

4A: News

•Start ±2 trading days; expand to ±5 if <3 items
•Filter: WHERE r.daily_stock IS NOT NULL (data quality)
•Extract: EPS vs estimate, guidance, analyst reactions

4B: Transcript (Item 2.02 only)

•Get Prepared Remarks + Q&A exchanges
•Look for: analyst concerns, management hedging, specific numbers

4C: Historical XBRL

•Last 4 quarters of 10-K/10-Q data
•Look for: EPS trend, revenue trend, margin changes

4D: Dividends

•Cuts = bearish, Raises = bullish, Special = one-time

4E: Splits

•Rarely explain fundamental reactions

Step 5: Query Perplexity

Use when Neo4j doesn't provide complete picture.

Situation	Tool
Quick facts, specific data	`mcp__perplexity__perplexity_search`
Complex multi-factor analysis	`mcp__perplexity__perplexity_reason`
Conflicting sources, major moves (>10%)	`mcp__perplexity__perplexity_research`

Query Principles:

•Be specific about time (exact date, not "recently")
•Include company name AND ticker
•For consensus: specify fiscal quarter AND year
•For "why stock moved": include date AND direction

Step 6: Synthesize

Calculate SURPRISE for each metric using these exact formulas:

Surprise Calculation Formulas

code

EPS Surprise % = ((Actual - Consensus) / |Consensus|) × 100
Revenue Surprise % = ((Actual - Consensus) / |Consensus|) × 100

Guidance Surprise:

•Calculate midpoint of guidance range: (Low + High) / 2
•Compare to consensus: ((Midpoint - Consensus) / |Consensus|) × 100
•If range width changed significantly, note this separately

Rounding: All percentages to 2 decimal places.

Example Calculation

code

Actual EPS: $2.03
Consensus EPS: $1.98
Surprise = (($2.03 - $1.98) / |$1.98|) × 100 = +2.53%

Guidance Range: $12.00 - $13.50
Midpoint: $12.75
Consensus: $13.22
Surprise = (($12.75 - $13.22) / |$13.22|) × 100 = -3.56%

Surprise Analysis Table

Metric	Consensus	Actual	Surprise %
EPS	$1.98	$2.03	+2.53%
Revenue	$2.1B	$2.15B	+2.38%
FY Guidance	$13.22	$12.75 (midpoint)	-3.56%

Rank surprises by absolute magnitude. Largest surprise typically explains the move.

Attribution Structure

Primary Driver (High Confidence)

•What: [Descriptive name]
•Surprise: Expected $X, Got $Y (Z% surprise)
•Evidence: [Source citation]

Contributing Factor(s) (if applicable)

•What: [Name]
•Evidence: [Source]

Evidence Extraction

Capture evidence immediately when reading each source. Populate the Evidence Ledger in output_template.md as you go; every numeric claim must appear there.

Strict Rules for Numbers (Used in Surprise Calculations)

Any number used to calculate surprise MUST have:

•Exact value (not rounded or paraphrased)
•Explicit source with timestamp/date
•Format: {metric}: ${exact_value} (Source: {source_name}, {date})

Examples:

code

Consensus EPS: $1.98 (Source: Perplexity, pre-filing)
Actual EPS: $2.03 (Source: 8-K EX-99.1, 2023-11-02)
Guidance midpoint: $12.75 (Source: 8-K EX-99.1, calculated from $12.00-$13.50)

If you cannot cite exact value + source, do NOT use the number in surprise calculations.

Flexible Rules for Qualitative Evidence

For non-numeric observations, paraphrasing is acceptable with source:

code

Management tone: Cautious on near-term demand (Source: Transcript Q&A #3)
Analyst concern: Margin pressure (Source: News "ROK Beats but Guides Lower")

Sources: News: "{headline}", Transcript Q&A #N, Transcript Prepared Remarks, Exhibit EX-99.1, Perplexity search, XBRL: {report_id}

Output

See output_template.md for full report format.

Save to: drivers/Movers/Companies/{TICKER}/{accession_no}.md

Required sections:

•Report Metadata
•Returns Summary
•Evidence Ledger
•Executive Summary
•Surprise Analysis
•Attribution (Primary + Contributing)
•Data Sources Used
•Confidence Assessment
•Historical Context

Core Rules

•Data Inventory First: Know what data exists before making claims
•Surprise-Based: Stock moves on actual vs expected
•Evidence Required: Every claim needs a cited source
•Perplexity for Consensus: Always query for market expectations
•Self-Audit: Validate all claims have sources
•Learn and Record: Store company-specific learnings
•Honest Confidence: State "Insufficient" when evidence doesn't support
•No XBRL for 8-K: Use historical 10-K/10-Q XBRL for trends

Perplexity Tools

Tool	Use For
`mcp__perplexity__perplexity_search`	Quick facts, consensus, simple questions
`mcp__perplexity__perplexity_ask`	Conversational Q&A with real-time web search
`mcp__perplexity__perplexity_reason`	Multi-factor analysis, complex reasoning
`mcp__perplexity__perplexity_research`	Comprehensive investigation, conflicting sources

Conflict Resolution Guidelines

When sources conflict, use these principles (guidelines, not rigid rules):

Source Reliability Hierarchy

Priority	Source Type	Examples
1 (Highest)	Primary filing	8-K, EX-99.1 press release
2	Transcript	Earnings call Q&A, prepared remarks
3	Official news	Company PR, SEC filing summary
4	Analyst coverage	Research notes, rating changes
5 (Lowest)	General news	Headlines, aggregated coverage

Driver Priority (Typical, Not Absolute)

•Forward-looking (guidance) usually > Backward-looking (EPS)
•BUT: If EPS miss is >10%, it can dominate
•AND: Validate against actual price reaction direction

Conflict Handling

code

If News says "beat" but Transcript reveals "guidance cut":
→ Trust Transcript (higher reliability)
→ Note the conflict in analysis

If two News sources give different consensus numbers:
→ Use the one with explicit source attribution
→ If neither has clear attribution, note both values and the discrepancy
→ Do NOT average (averages have no direct source)

Rule: When in doubt, note the conflict explicitly rather than silently choosing one.

Data Quality Guardrails

News Anomaly Filter (REQUIRED)

~1,746 News→Company relationships (~0.9%) have daily_industry but no daily_stock. Always filter:

cypher

WHERE r.daily_stock IS NOT NULL

Missing Returns Handling

If a report has no return data (pf.daily_stock IS NULL):

•Flag as incomplete data
•Do NOT use for driver sensitivity calculations
•Can still analyze qualitatively

Duplicate News Detection

If multiple news items have nearly identical titles (>80% similarity):

•Use the earliest timestamp
•Treat as single data point, not confirmation

Stale Consensus Warning

When Perplexity returns consensus estimates:

•Verify the date/quarter matches your target
•"Q3 2023 consensus" is wrong if you need Q4 2023
•Re-query with more specific date if unclear

Company-Specific Learning

After each analysis, update: drivers/Movers/Companies/{TICKER}/learnings.md

See output_template.md for learnings format.

Gating rule: Only update learnings.md after the Evidence Ledger is complete and evidence_audit.md passes.

Step 10: Mark Completed

After successful analysis (report saved, audit passed, learnings updated):

•Update tracking CSV: Set completed=TRUE for this accession_no in drivers/Movers/8k_fact_universe.csv
•Verify: Confirm the row is updated correctly

This ensures the analysis pipeline tracks which filings have been processed.