AgentSkillsCN

chemometrics-shared

共享化学计量学基础:交叉验证策略、性能指标、过拟合预防、样本量指导,以及报告标准。所有化学计量学技能均适用此基础。只需加载您所需的参考文件即可。

SKILL.md
--- frontmatter
name: chemometrics-shared
description: >-
  Shared chemometrics foundations: cross-validation strategies, performance metrics,
  overfitting prevention, sample-size guidance, and reporting standards. Used by all
  chemometrics skills. Load only the reference file you need.
license: MIT
metadata:
  skill-author: Alban Ott

Chemometrics Shared Foundations

When to Use What

Task: Choose cross-validation strategy Use: references/validation-strategies.md

Task: Evaluate regression or classification model Use: references/performance-metrics.md

Task: Determine if sample size is sufficient Use: references/sample-size-guidance.md

Task: Detect or prevent overfitting Use: references/overfitting-prevention.md

Task: Write methods section or prepare for publication Use: references/reporting-standards.md

Task: Follow chemometrics project workflow Use: references/workflow.md

Quick Reference: CV Decision Tree

code
What is your sample size?

+-- n < 20: LOOCV (high variance — consider repeated random splits)
+-- 20 <= n < 50: LOOCV or 5-Fold CV (repeat 3-10x)
+-- 50 <= n < 200: 5-Fold or 10-Fold CV (repeat 3-10x)
+-- n >= 200: 10-Fold CV or Hold-Out (70/30 or 80/20)

Special cases:
  Time series      -> TimeSeriesSplit (no future leakage)
  Batches/groups   -> GroupKFold (keep groups together)
  Imbalanced       -> StratifiedKFold (preserve class ratios)
  Spatial data     -> Spatial CV (geographic splits)

Quick Reference: Metrics

Regression: RMSEP (primary), R-squared, RPD, Bias, SEP Classification: Sensitivity, Specificity, F1-score (primary), Accuracy, ROC AUC

RPD Interpretation (Saeys et al. 2005)

RPDQuality
> 2.5Excellent quantitative
2.0-2.5Good quantitative
1.8-2.0Fair (screening)
1.4-1.8Very rough screening
< 1.4Unreliable

R-squared Interpretation

R-squaredQuality
> 0.9Excellent
> 0.8Good
> 0.7Acceptable
< 0.7Poor (most applications)

See Also

  • ML method selection: ../chemometrics-ml-selection/SKILL.md
  • MS metabolomics: ../chemometrics-ms-metabolomics/SKILL.md
  • Hybrid modeling: ../chemometrics-hybrid-modeling/SKILL.md
  • Model validation: ../chemometrics-validation/SKILL.md