AgentSkillsCN

spark

提交并监控Apache Spark作业——管理集群、读取日志,以及运行Spark SQL查询。

SKILL.md
--- frontmatter
name: spark
description: >
  Submit and monitor Apache Spark jobs — manage clusters, read logs,
  and run Spark SQL queries.
metadata:
  openclaw:
    requires:
      bins: [clawdata]
    primaryEnv: SPARK_MASTER
    tags: [compute, spark, big-data, clusters, sql]

Apache Spark

Submit and monitor Spark jobs, manage clusters, and run Spark SQL.

Commands

TaskCommand
Submit jobclawdata spark submit app.py --master <url>
Spark SQLclawdata spark sql "SELECT ..."
List applicationsclawdata spark apps
Application statusclawdata spark status <app-id>
View logsclawdata spark logs <app-id>
Kill applicationclawdata spark kill <app-id>

Configuration

Env VarDefaultDescription
SPARK_MASTERlocal[*]Spark master URL
SPARK_HOMEauto-detectSpark installation directory

When to use

  • User needs distributed processing for large datasets → Spark
  • User wants to run PySpark scripts → clawdata spark submit
  • User asks about cluster utilisation → clawdata spark apps