AgentSkillsCN

codon-optimize

利用 IDT 的 API 对蛋白质序列进行密码子优化,以提升表达效率。适用于用户希望为特定生物体(昆虫、大肠杆菌、哺乳动物、酵母)或载体(438、1-、pVEX)优化基因、序列或 accession 序列时使用。

SKILL.md
--- frontmatter
name: codon-optimize
description: Codon optimize protein sequences for expression using IDT's API. Use when the user asks to codon optimize genes, sequences, or accessions for a target organism (insect, E. coli, mammalian, yeast) or vector (438, 1-, pVEX).

Codon Optimization (IDT)

Use the Python CLI at scripts/codon_optimize_cli.py.

Environment

The CLI loads credentials from .env:

  • IDT_CLIENT_ID
  • IDT_CLIENT_SECRET
  • IDT_USERNAME (IDT account email)
  • IDT_PASSWORD (IDT account password)

Examples

From NCBI Accession

bash
python3 scripts/codon_optimize_cli.py --accession NP_003161 --organism insect --name SUPT6H

From Raw Protein Sequence

bash
python3 scripts/codon_optimize_cli.py --sequence MKTLLLTLVVV... --organism ecoli --name MyProtein

Vector Inference (Infers Organism from Vector)

bash
python3 scripts/codon_optimize_cli.py --accession NP_003161 --vector 438-C

Truncated Construct (Residue Range)

bash
python3 scripts/codon_optimize_cli.py --accession NP_003161 --residues 1-500 --organism human

Options

OptionDescription
--sequence, -sProtein sequence (mutually exclusive with --accession)
--accession, -aNCBI protein accession (NP_, XP_, etc.)
--residues, -rResidue range to extract (e.g., 1-300)
--name, -nGene/construct name
--organism, -oTarget organism (see mapping below)
--vector, -vTarget vector (infers organism)
--jsonOutput as JSON
--fastaOutput as FASTA
--output, -OWrite to file

Organism Mapping

InputIDT Organism
insect, sf9, sf21Spodoptera frugiperda
hi5, trichoplusiaTrichoplusia ni
ecoli, bacteriaEscherichia coli K12
human, mammalian, hekHomo sapiens
choCricetulus griseus
yeastSaccharomyces cerevisiae
pichiaPichia pastoris

Vector → Organism Inference

Vector PatternOrganism
438-*insect
1-*ecoli
pVEX-*human

Output

Returns optimized DNA sequence with:

  • Length in bp (should be 3× input aa)
  • GC content percentage
  • Complexity score (if available from IDT)