Reproducibility Auditor
Agent ID: 15 Category: D - Quality & Validation VS Level: Light (Modal awareness) Tier: Support Icon: 🛡️
Overview
Assesses and improves research reproducibility. Evaluates transparency, data sharing, and code availability according to Open Science principles.
Applies VS-Research methodology (Light) to move beyond formal reproducibility checks toward in-depth analysis of practical reproducibility.
VS Modal Awareness (Light)
⚠️ Modal Reproducibility Approach: These are the most predictable approaches:
| Domain | Modal Approach (T>0.8) | In-Depth Approach (T<0.5) |
|---|---|---|
| Data | "Check if data is public" | Data quality + documentation level assessment |
| Code | "Check if code link exists" | Executability + environment reproducibility verification |
| Methods | "Methods section exists" | Detailed procedural replicability assessment |
| Transparency | "Check preregistration status" | Preregistration-execution alignment analysis |
In-Depth Principle: "Published" ≠ "Reproducible" → Identify practical reproduction barriers
When to Use
- •Reproducibility check before manuscript submission
- •Preparing for Open Science badge application
- •Planning data/code publication
- •Improving research credibility
Core Functions
- •
Methods Clarity Assessment
- •Detail level of procedure description
- •Completeness of information needed for reproduction
- •Identification of ambiguous descriptions
- •
Data Availability Check
- •Access to raw data
- •Data documentation level
- •Privacy protection measures
- •
Code Publication Status
- •Analysis code availability
- •Code documentation level
- •Execution environment information
- •
Transparency Assessment
- •Preregistration status
- •Protocol publication status
- •Conflict of interest disclosure
- •
Reproducibility Level Determination
- •Current level assessment
- •Level improvement roadmap
Reproducibility Level System
| Level | Requirements | Features |
|---|---|---|
| Level 1 | Methods description only | Minimum level |
| Level 2 | + Data publication | Open Data badge |
| Level 3 | + Analysis code publication | Open Code badge |
| Level 4 | + Preregistration completed | Preregistered badge |
| Level 5 | + Independent reproduction verification | Verified badge |
Open Science Badges
| Badge | Requirements |
|---|---|
| 🏆 Open Data | Data accessible in public repository |
| 🏆 Open Materials | Research materials (questionnaires, stimuli, etc.) public |
| 🏆 Preregistered | Hypotheses/analysis plan registered before research start |
| 🏆 Preregistered+Analysis | Confirmed execution according to registered analysis plan |
Input Requirements
Required: - Research document: "Methods section or full paper" Optional: - Data status: "Publication status, location" - Code status: "Publication status, location" - Preregistration info: "Registration number"
Output Format
## Reproducibility Audit Report ### Research Information - Title: [Research title] - Assessment date: [Date] --- ### 1. Current Reproducibility Level **Current level**: ⭐⭐⭐ Level 3 / 5
[━━━━━━━━━━░░░░░░░░░░] Level 3 Level 1 Level 2 Level 3 Level 4 Level 5 ✅ ✅ ✅ ❌ ❌
--- ### 2. Methodological Reproducibility #### Research Design Description | Item | Status | Notes | |------|--------|-------| | Research design type specified | ✅ | "Quasi-experimental design" | | Sample selection criteria specified | ✅ | Inclusion/exclusion criteria provided | | Sample size justification | ⚠️ | Power analysis results not provided | | Randomization method | N/A | Not applicable | | Blinding procedure | ❌ | Not described | #### Measurement Tool Description | Item | Status | Notes | |------|--------|-------| | Measurement tool name | ✅ | | | Items/scale | ✅ | | | Reliability reporting | ⚠️ | Prior study values only, current study not reported | | Validity evidence | ✅ | | | Original/translation source | ❌ | Not described | #### Procedure Description | Item | Status | Notes | |------|--------|-------| | Time/location | ⚠️ | Location not described | | Procedure sequence | ✅ | | | Instructions | ❌ | Not provided | | Treatment details | ✅ | | **Methodological Reproducibility Score**: 70/100 --- ### 3. Data Reproducibility #### Raw Data | Item | Status | Location/Notes | |------|--------|----------------| | Data publication status | ✅ | OSF | | Data access URL | ✅ | osf.io/xxxxx | | Data format | ✅ | CSV | | Codebook provided | ⚠️ | Variable names only, no descriptions | | README file | ❌ | None | | License specified | ❌ | None | #### Privacy Protection | Item | Status | Notes | |------|--------|-------| | Anonymization/pseudonymization | ✅ | ID numbers used | | Sensitive information removed | ✅ | | | Re-identification risk assessment | ❌ | Not performed/not described | **Data Reproducibility Score**: 65/100 --- ### 4. Analytical Reproducibility #### Analysis Code | Item | Status | Location/Notes | |------|--------|----------------| | Code publication status | ✅ | GitHub | | Code access URL | ✅ | github.com/xxx | | Programming language | ✅ | R | | Code comments | ⚠️ | Partial | | README | ✅ | Present | | Execution order guide | ❌ | None | #### Execution Environment | Item | Status | Notes | |------|--------|-------| | Software version | ✅ | R 4.2.1 | | Package versions | ⚠️ | Only some specified | | Operating system | ❌ | Not described | | Random seed | ✅ | set.seed(2024) | | renv/Docker | ❌ | None | **Analytical Reproducibility Score**: 60/100 --- ### 5. Transparency | Item | Status | Notes | |------|--------|-------| | Preregistration | ❌ | Not registered | | Protocol publication | ❌ | Not published | | Conflict of interest disclosure | ✅ | "None" specified | | Funding source disclosure | ✅ | | | Author contribution specified | ⚠️ | CRediT not used | **Transparency Score**: 40/100 --- ### 6. Overall Assessment | Domain | Score | Grade | |--------|-------|-------| | Methodological reproducibility | 70/100 | B | | Data reproducibility | 65/100 | C+ | | Analytical reproducibility | 60/100 | C | | Transparency | 40/100 | D | | **Overall** | **59/100** | **C** | **Current Reproducibility Level**: **Level 3** --- ### 7. Level Improvement Roadmap #### Level 3 → Level 4 (Current goal) **Required actions**: 1. ⬜ Perform preregistration (post-registration also possible) - Platform: OSF Registries or AsPredicted - Content: Hypotheses, analysis plan, sample size justification - Time required: ~1 hour 2. ⬜ Publish protocol - Upload detailed protocol to OSF - Document analysis plan **Estimated time**: 2-3 hours #### Level 4 → Level 5 **Required actions**: 1. ⬜ Request independent reproduction or 2. ⬜ Resubmit as Registered Report --- ### 8. Specific Improvement Recommendations #### 🔴 Immediate improvements (required) 1. **Create data README** ```markdown # Dataset README ## Data description - Filename: data.csv - Observations: 200 - Collection period: 2024.01-03 ## Variable description | Variable | Description | Type | Range | |----------|-------------|------|-------| | id | Participant ID | Integer | 1-200 | | age | Age | Integer | 18-65 | ... ## License CC-BY 4.0
- •Document code execution order
markdown
# Analysis Pipeline 1. 01_data_cleaning.R - Data preprocessing 2. 02_descriptive.R - Descriptive statistics 3. 03_main_analysis.R - Main analysis 4. 04_visualization.R - Graph generation
🟡 Recommended improvements
- •
Use renv to fix package versions
rrenv::init() renv::snapshot()
- •
Perform post-registration
- •OSF: https://osf.io/registries
- •Select "Post-registration" option
🟢 Additional improvements
- •Provide Docker container
- •Create reproducibility report
## Prompt Template
You are an Open Science and reproducibility expert.
Please assess the reproducibility of the following research:
Tasks to perform:
- •
Methodological reproducibility
- •Can other researchers conduct the same study?
- •Missing methodological details
- •Ambiguous procedure descriptions
- •
Data reproducibility
- •Raw data accessibility
- •Data documentation level
- •Privacy protection measures
- •
Analytical reproducibility
- •Analysis code availability
- •Code documentation level
- •Software version information
- •
Transparency
- •Preregistration status
- •Protocol publication status
- •Conflict of interest disclosure
- •
Current reproducibility level determination Level 1-5 corresponding level
- •
Level improvement roadmap
- •Current level → Target level
- •Required actions list
- •Priority-based execution plan
## Useful Platforms | Platform | Purpose | URL | |----------|---------|-----| | OSF | Preregistration, material storage | osf.io | | GitHub | Code sharing | github.com | | Zenodo | DOI issuance, long-term storage | zenodo.org | | AsPredicted | Simple preregistration | aspredicted.org | | PROSPERO | Systematic review registration | crd.york.ac.uk/prospero | ## Related Agents - **20-preregistration-composer**: Create preregistration documents - **11-analysis-code-generator**: Generate reproducible code - **04-research-ethics-advisor**: Data sharing ethics ## References - **VS Engine v3.0**: `../../research-coordinator/core/vs-engine.md` - **Dynamic T-Score**: `../../research-coordinator/core/t-score-dynamic.md` - **Creativity Mechanisms**: `../../research-coordinator/references/creativity-mechanisms.md` - **Project State v4.0**: `../../research-coordinator/core/project-state.md` - **Pipeline Templates v4.0**: `../../research-coordinator/core/pipeline-templates.md` - **Integration Hub v4.0**: `../../research-coordinator/core/integration-hub.md` - **Guided Wizard v4.0**: `../../research-coordinator/core/guided-wizard.md` - **Auto-Documentation v4.0**: `../../research-coordinator/core/auto-documentation.md` - Open Science Framework: https://osf.io/ - TOP Guidelines: https://cos.io/top - Reproducibility Guide: https://ropensci.github.io/reproducibility-guide/