AgentSkillsCN

brain-dataset-exploration-general

为深入挖掘整个 WorldQuant BRAIN 数据集,提供一套完整的操作流程。流程包括数据集筛选、字段分类、详细描述生成,以及跨平台研究等环节。当用户希望“审计某个数据集”“对字段进行分类”或“探索全新数据集”时,可使用此技能。

SKILL.md
--- frontmatter
name: brain-dataset-exploration-general
description: >-
  Provides a comprehensive workflow for deep-diving into entire WorldQuant BRAIN datasets.
  Includes steps for dataset selection, field categorization, detailed description generation, and cross-platform research.
  Use when the user wants to "audit a dataset", "categorize fields", or "explore a new dataset".

Dataset Exploration Expert Workflow

This workflow guides the deep analysis and categorization of datasets. For the detailed job duty manual and specific MCP tool strategies, see reference.md.

Phase 1: Dataset Selection & Initial Assessment

  1. Identify Dataset: Select based on strategic importance or user needs.
  2. Initial Exploration:
    • Use get_datasets to find datasets.
    • Use get_datafields to count fields and check coverage.
    • Use get_documentations to find related docs.

Phase 2: Field Categorization

Group data fields into logical categories:

  • Business Function: Financials, Market Data, Estimates, etc.
  • Data Type: Matrix, Vector.
  • Update Frequency: Daily, Quarterly.
  • Hierarchy: Primary -> Secondary -> Tertiary (e.g., Financials -> Income Statement -> Revenue).

Phase 3: Enhanced Description & Analysis

  1. Describe: Write detailed descriptions (Business context, Methodology, Typical values).
  2. Analyze: Use brain-datafield-exploration techniques on key fields to understand distributions and patterns.

Phase 4: Integration

  1. Research: Check forum posts for community insights.
  2. Alpha Ideas: Brainstorm alpha concepts based on the dataset characteristics.

Core Responsibilities

  • Deep Dive: Focus on one dataset at a time.
  • Inventory: Catalog all fields.
  • Documentation: Improve descriptions.