Ingest Force Skill
This skill re-processes all raw files from the src/ingest/ directory, regardless of their current status in src/ingest/manifest.json. It performs the same normalization, media extraction/copying to public/media/, and manifest updates as the ingest skill.
Inputs
- •All files under
src/ingest/(any format). - •
src/sources/corrections.md(for normalization rules during ingestion).
Outputs
- •Updated or newly created Markdown files in
src/sources/documents/,src/sources/chats/, orsrc/sources/transcripts/. - •Updated binary media in
public/media/YYYY-MM-DD_short_descriptive_name/, each withmeta.json(and optionallymeta.md). Image and audio URLs encountered in ingest content are downloaded and stored here; video URLs are kept as-is. Sources reference downloaded media as repo-root links or external video URLs. - •Updated
src/ingest/manifest.jsonandsrc/ingest/manifest.md.
Key Behaviors
- •Force Re-ingestion: Ignores the
ingestedstatus insrc/ingest/manifest.jsonand processes all files. - •Idempotency: When re-ingesting, it identifies existing source files by their
id(or other stable identifier) and updates them rather than creating duplicates. - •Overwrite Policy: Similar to the
ingestskill, it will update existing source content but preserve user-authored blocks within protected markers. - •Provenance: Ensures
ingest_pathsandingest_hashesare correctly updated in source frontmatter. - •Normalization: Applies rules from
src/sources/corrections.mdduring text normalization.