AgentSkillsCN

AIRS & Appropriate Reliance Research

掌握AI应用评估、心理测量工具开发,以及合理信赖度研究的相关领域知识。

SKILL.md
--- frontmatter
name: "AIRS & Appropriate Reliance Research"
description: "Domain knowledge for AI adoption measurement, psychometric instrument development, and appropriate reliance research"
applyTo: "**/*airs*,**/*reliance*,**/*adoption*,**/*utaut*,**/*psychometric*,**/*instrument*,**/*survey*,**/*scale*"

AIRS & Appropriate Reliance Research

Domain knowledge for AI adoption measurement, psychometric instrument development, and appropriate reliance research

This skill contains knowledge about the AIRS-16 validated instrument, the proposed AIRS-18 extension with Appropriate Reliance (AR), and research methodologies for studying AI adoption and human-AI collaboration.

When to Use

  • Discussing AIRS-16 or AIRS-18 instruments
  • Developing or extending psychometric scales
  • Analyzing AI adoption patterns
  • Researching appropriate reliance / trust calibration
  • Preparing academic papers or research briefs
  • Meeting preparation with researchers

Assets

FilePurpose
article/APPROPRIATE-RELIANCE-TECHNICAL-BRIEF.mdFull technical brief with AR implementation
article/HOFMAN-MEETING-BRIEF.mdResearch meeting preparation template
alex_docs/AR-TELEMETRY-DESIGN.mdBehavioral telemetry design for hypothesis validation

AIRS-16: AI Readiness Scale

Source: Correa, F. (2025). Doctoral dissertation, Touro University Worldwide.

Production: airs.correax.com | Time: 5 minutes | Built by: Alex Cognitive Architecture

Validation: N=523, CFI=.975, TLI=.960, RMSEA=.053, R²=.852

Quick Links

LinkPurpose
Take AssessmentStart the 16-item survey
View HistoryReview past results
Register OrgEnterprise organization setup
GitHub (Platform)AIRS Enterprise source code
GitHub (Research)Validation data & analysis

User Roles

RoleAccess
👤 ParticipantTake assessments, view personal results, download PDF reports
FounderOrganization creator, can be promoted to Admin
🛡️ AdminDashboard analytics, member management, invitations
👑 Super AdminPlatform-wide access, all orgs, AI prompts configuration

8 Constructs (2 items each)

ConstructCodeDescription
Performance ExpectancyPEBelief that AI will help achieve job performance gains
Effort ExpectancyEEPerceived ease of use of AI systems
Social InfluenceSIDegree to which colleagues/leadership encourage adoption
Facilitating ConditionsFCAvailability of organizational resources and training
Hedonic MotivationHMEnjoyment and curiosity when exploring AI capabilities
Price ValuePVPerceived benefit relative to effort invested (β=.505 — strongest predictor)
HabitHBExtent to which AI use has become automatic and routine
Trust in AITRConfidence in AI reliability, accuracy, and data handling

Key Finding: What Actually Predicts AI Adoption

PredictorβpStatus
Price Value (PV).505<.001✅ STRONGEST
Hedonic Motivation (HM).217.014✅ Significant
Social Influence (SI).136.024✅ Significant
Trust in AI (TR).106.064⚠️ Marginal
Performance Expectancy (PE)-.028.791❌ Not significant
Effort Expectancy (EE)-.008.875❌ Not significant
Facilitating Conditions (FC).059.338❌ Not significant
Habit (HB).023.631❌ Not significant

Insight: Traditional UTAUT2 predictors (PE, EE, FC, HB) do NOT predict AI adoption. Value perception, enjoyment, and social influence matter.

Scoring & Typology

python
# AIRS Score = sum of 8 construct means (range: 8-40)
AIRS = PE + EE + SI + FC + HM + PV + HB + TR

# Typology (94.5% accuracy)
if AIRS <= 20: "AI Skeptic"      # 17% of sample
elif AIRS <= 30: "Moderate User"  # 67% of sample
else: "AI Enthusiast"             # 16% of sample

Appropriate Reliance (AR): Proposed AIRS-18 Extension

The Research Question

Is it not how much you trust AI that predicts adoption, but how well your trust is calibrated to actual AI capability?

Why AR ≠ Trust (TR)

DimensionTrust (TR)Appropriate Reliance (AR)
MeasuresTrust levelTrust calibration accuracy
TypeAttitude (affective state)Metacognitive skill
Failure modeLow trust → under-useLow AR → over-reliance OR under-reliance
Item example"I trust AI tools...""I can tell when AI is reliable..."

Key distinction: TR asks "Do you trust AI?" — AR asks "Can you discern when trust is warranted?"

The 2×2 Independence Matrix

Low AR (Miscalibrated)High AR (Calibrated)
High TR⚠️ Over-reliance → bad outcomes → abandonment✅ Optimal adoption
Low TR❌ Under-reliance → missed value → rejection✅ Calibrated skeptic → gradual adoption

Proposed AR Items

ItemTextComponent
AR1I can tell when AI-generated information is reliable and when it needs verification.CAIR
AR2I know when to trust AI tools and when to rely on my own judgment instead.CSR

CAIR/CSR Framework (Schemmer et al., 2023)

User AcceptsUser Rejects
AI CorrectCAIR ✅ (Correct AI-Reliance)Under-reliance
AI IncorrectOver-relianceCSR ✅ (Correct Self-Reliance)

Metric: Appropriateness of Reliance (AoR) = 1 indicates optimal calibration.


Research Hypotheses for AIRS-18 Validation

#Hypothesis
H1AR demonstrates acceptable reliability (α ≥ .70, CR ≥ .70, AVE ≥ .50)
H2AR shows discriminant validity from TR (HTMT < .85)
H3AR positively predicts BI (β > 0, p < .05)
H4AR provides incremental validity beyond AIRS-16 (ΔR² > .02)
H5AR moderates TR→BI (high AR strengthens the relationship)
H6AR mediates Experience→BI (experience → better calibration → adoption)

Psychometric Standards

Reliability Thresholds

MetricMinimumGoodExcellent
Cronbach's α.70.80.90
Composite Reliability (CR).70.80.90
Average Variance Extracted (AVE).50.60.70

Model Fit Indices

IndexAcceptableGood
CFI≥ .90≥ .95
TLI≥ .90≥ .95
RMSEA≤ .08≤ .06
SRMR≤ .08≤ .05

Discriminant Validity

MethodCriterion
HTMT< .85 (conservative: < .90)
Fornell-Larcker√AVE > inter-construct correlations

Intervention Strategies by Typology

TypologyAIRS-16 Focus+ AR-Informed Focus
AI Skeptics (≤20)Trust-building, low-effort demosCalibration training: "Here's when AI excels vs. struggles"
Moderate Users (21-30)Clear use cases, ROI evidenceVerification skill-building: "How to spot AI errors"
AI Enthusiasts (>30)Advanced features, leadershipReliance audits: "Are you over-relying in high-stakes areas?"

Key References

ReferenceContribution
Correa (2025)AIRS-16 validation, UTAUT2 extension
Passi, Dhanorkar, & Vorvoreanu (2024)AETHER synthesis on appropriate reliance
Schemmer et al. (2023)CAIR/CSR framework
Venkatesh et al. (2012)UTAUT2 original model
Lee & See (2004)Trust calibration in human-automation interaction
Lin et al. (2022)LLMs can verbalize calibrated uncertainty

Troubleshooting

"Is AR just measuring AI experience?"

Problem: Concern that AR conflates with general AI familiarity.

Solution:

  • Include experience as covariate
  • Test discriminant validity (HTMT < .85)
  • AR should predict beyond experience level

"Can self-reported calibration be valid?"

Problem: People may not accurately assess their own calibration ability.

Solution:

  • Self-report measures perceived calibration
  • Future research: correlate with behavioral CAIR/CSR in task studies
  • Perceived calibration may still predict adoption intentions

"Why was Trust marginal in AIRS-16?"

Possible explanations:

  1. Trust level alone is insufficient — calibration matters more
  2. Trust may be necessary but not sufficient
  3. TR × AR interaction: trust only helps when calibrated
  4. Sample characteristics (tech-savvy population)