AgentSkillsCN

snpsift

运行 SnpSift——一个用于过滤和操作已注释 VCF 文件的工具箱。当用户需要对变异进行筛选、借助数据库(如 dbNSFP、dbSnp、GWAS)进行注释、提取特定字段,或对 VCF 文件进行其他操作时,可选用此工具。

SKILL.md
--- frontmatter
name: snpsift
description: Run SnpSift, a toolbox for filtering and manipulating annotated VCF files. Use when the user wants to filter variants, annotate with databases (dbNSFP, dbSnp, GWAS), extract fields, or manipulate VCF files.
allowed-tools: Bash, Read, Grep, Glob

SnpSift

SnpSift is a toolbox for filtering and manipulating annotated VCF files. Once genomic variants have been annotated, SnpSift helps filter and process them to find relevant variants.

Full documentation: https://pcingola.github.io/SnpEff/

Running

Use the wrapper script at .claude/skills/snpsift/snpsift.sh. It handles JVM memory defaults and argument passthrough. The JAR file is expected at $HOME/snpEff/SnpSift.jar.

IMPORTANT: SnpSift output is typically very large (thousands to millions of lines). ALWAYS redirect output to a file. NEVER let output print to stdout, as it will fill the context window and make the conversation unusable.

bash
# Correct: redirect to file
.claude/skills/snpsift/snpsift.sh <command> [options] [arguments] > output.vcf 2> snpsift.log

# WRONG: never do this
.claude/skills/snpsift/snpsift.sh <command> [options] [arguments]

Commands

CommandDescriptionDocs
annotateAdd ID and INFO fields from a VCF database (e.g. dbSnp)annotate
annotateMemAnnotate from a VCF database loaded into memoryannotate_mem
caseControlCase vs control variant comparison with p-valuescasecontrol
concordanceConcordance metrics between two VCF filesconcordance
dbnsfpAnnotate using dbNSFP (SIFT, Polyphen2, etc.)dbnsfp
extractFieldsExtract VCF fields to tab-separated formatextractfields
filterFilter using arbitrary expressionsfilter
geneSetsAnnotate using MSigDb gene sets (GO, KEGG, etc.)genesets
gtCompress genotype fieldsgt
gwasCatAnnotate using GWAS Cataloggwascatalog
intersectIntersect intervals from multiple filesintersect
intervalsFilter variants by BED intervalsintervals
intervalsIndexFilter variants by intervals using file indexingintervalsindex
joinJoin files by genomic regionjoin
phastConsAnnotate using phastCons conservation scoresphastcons
privateAnnotate private variants (family/group)private
rmInfoRemove INFO fields from VCFrminfo
rmRefGenRemove reference genotypesrmrefgen
splitSplit VCF by chromosomesplit
tstvCalculate transition/transversion ratiotstv
varTypeAnnotate variant type (SNP, MNP, INS, DEL, MIXED)varianttype
vcfCheckCheck VCF file format correctnessvcfcheck
vcf2tpedConvert VCF to TPED formatvcf2ped

Additional Documentation

TopicFile
Introductionintroduction
FAQfaq
Download & Installdownload
Examplesexamples
Helphelp