AgentSkillsCN

bio-workflows-riboseq-pipeline

从 FASTQ 到翻译效率与 ORF 检测的全流程核糖体测序分析工作流。适用于分析核糖体图谱数据以研究翻译过程时使用。

SKILL.md
--- frontmatter
name: bio-workflows-riboseq-pipeline
description: End-to-end Ribo-seq analysis from FASTQ to translation efficiency and ORF detection. Use when analyzing ribosome profiling data to study translation.
tool_type: mixed
primary_tool: Plastid

Ribo-seq Pipeline

Pipeline Overview

code
FASTQ → Preprocessing → rRNA removal → Alignment → P-site → TE → ORF calling

Step 1: Preprocessing

bash
# Remove adapters
cutadapt -a CTGTAGGCACCATCAAT \
    --minimum-length 25 --maximum-length 35 \
    -o trimmed.fastq.gz reads.fastq.gz

# Remove rRNA
bowtie2 -x rRNA_index --un non_rrna.fastq.gz -U trimmed.fastq.gz

Step 2: Alignment

bash
# Align to transcriptome
STAR --genomeDir star_index \
    --readFilesIn non_rrna.fastq.gz \
    --readFilesCommand zcat \
    --outFilterMismatchNmax 2 \
    --alignEndsType EndToEnd \
    --outSAMtype BAM SortedByCoordinate

Step 3: P-site Calibration

python
from plastid import BAMGenomeArray

# Build metagene profile
metagene_generate annotation.gtf ribo.bam metagene_output/

# Calculate P-site offsets
psite annotation.gtf metagene_output/profile.txt psite_offsets.txt

Step 4: Translation Efficiency

python
# TE = Ribo-seq RPKM / RNA-seq RPKM
from plastid import BAMGenomeArray
import numpy as np

ribo_counts = count_reads(ribo_bam, genes)
rna_counts = count_reads(rna_bam, genes)
te = ribo_counts / rna_counts

Step 5: ORF Detection

bash
# RiboCode for ORF calling
RiboCode -a annotation.gtf -c config.txt -o ribocoded_orfs

Related Skills

  • ribo-seq/ - Individual Ribo-seq analysis skills
  • differential-expression - For differential TE