AgentSkillsCN

segment-automation

通过 Rube MCP(Composio)自动化 Segment 任务:追踪事件、识别用户、管理群组、页面浏览量、别名、批量操作。务必优先搜索工具,以获取最新的数据结构信息。

SKILL.md
--- frontmatter
name: segment-automation
description: "Automate Segment tasks via Rube MCP (Composio): track events, identify users, manage groups, page views, aliases, batch operations. Always search tools first for current schemas."
requires:
  mcp: [rube]

Segment Automation via Rube MCP

Automate Segment customer data platform operations through Composio's Segment toolkit via Rube MCP.

Prerequisites

  • Rube MCP must be connected (RUBE_SEARCH_TOOLS available)
  • Active Segment connection via RUBE_MANAGE_CONNECTIONS with toolkit segment
  • Always call RUBE_SEARCH_TOOLS first to get current tool schemas

Setup

Get Rube MCP: Add https://rube.app/mcp as an MCP server in your client configuration. No API keys needed — just add the endpoint and it works.

  1. Verify Rube MCP is available by confirming RUBE_SEARCH_TOOLS responds
  2. Call RUBE_MANAGE_CONNECTIONS with toolkit segment
  3. If connection is not ACTIVE, follow the returned auth link to complete Segment authentication
  4. Confirm connection status shows ACTIVE before running any workflows

Core Workflows

1. Track Events

When to use: User wants to send event data to Segment for downstream destinations

Tool sequence:

  1. SEGMENT_TRACK - Send a single track event [Required]

Key parameters:

  • userId: User identifier (required if no anonymousId)
  • anonymousId: Anonymous identifier (required if no userId)
  • event: Event name (e.g., 'Order Completed', 'Button Clicked')
  • properties: Object with event-specific properties
  • timestamp: ISO 8601 timestamp (optional; defaults to server time)
  • context: Object with contextual metadata (IP, user agent, etc.)

Pitfalls:

  • At least one of userId or anonymousId is required
  • event name is required and should follow consistent naming conventions
  • Properties are freeform objects; ensure consistent schema across events
  • Timestamp must be ISO 8601 format (e.g., '2024-01-15T10:30:00Z')
  • Events are processed asynchronously; successful API response means accepted, not delivered

2. Identify Users

When to use: User wants to associate traits with a user profile in Segment

Tool sequence:

  1. SEGMENT_IDENTIFY - Set user traits and identity [Required]

Key parameters:

  • userId: User identifier (required if no anonymousId)
  • anonymousId: Anonymous identifier
  • traits: Object with user properties (email, name, plan, etc.)
  • timestamp: ISO 8601 timestamp
  • context: Contextual metadata

Pitfalls:

  • At least one of userId or anonymousId is required
  • Traits are merged with existing traits, not replaced
  • To remove a trait, set it to null
  • Identify calls should be made before track calls for new users
  • Avoid sending PII in traits unless destinations are configured for it

3. Batch Operations

When to use: User wants to send multiple events, identifies, or other calls in a single request

Tool sequence:

  1. SEGMENT_BATCH - Send multiple Segment calls in one request [Required]

Key parameters:

  • batch: Array of message objects, each with:
    • type: Message type ('track', 'identify', 'group', 'page', 'alias')
    • userId / anonymousId: User identifier
    • Additional fields based on type (event, properties, traits, etc.)

Pitfalls:

  • Each message in the batch must have a valid type field
  • Maximum batch size limit applies; check schema for current limit
  • All messages in a batch are processed independently; one failure does not affect others
  • Each message must independently satisfy its type's requirements (e.g., track needs event name)
  • Batch is the most efficient way to send multiple calls; prefer over individual calls

4. Group Users

When to use: User wants to associate a user with a company, team, or organization

Tool sequence:

  1. SEGMENT_GROUP - Associate user with a group [Required]

Key parameters:

  • userId: User identifier (required if no anonymousId)
  • anonymousId: Anonymous identifier
  • groupId: Group/organization identifier (required)
  • traits: Object with group properties (name, industry, size, plan)
  • timestamp: ISO 8601 timestamp

Pitfalls:

  • groupId is required; it identifies the company or organization
  • Group traits are merged with existing traits for that group
  • A user can belong to multiple groups
  • Group traits update the group profile, not the user profile

5. Track Page Views

When to use: User wants to record page view events in Segment

Tool sequence:

  1. SEGMENT_PAGE - Send a page view event [Required]

Key parameters:

  • userId: User identifier (required if no anonymousId)
  • anonymousId: Anonymous identifier
  • name: Page name (e.g., 'Home', 'Pricing', 'Dashboard')
  • category: Page category (e.g., 'Docs', 'Marketing')
  • properties: Object with page-specific properties (url, title, referrer)

Pitfalls:

  • At least one of userId or anonymousId is required
  • name and category are optional but recommended for proper analytics
  • Standard properties include url, title, referrer, path, search
  • Page calls are often automated; manual use is for server-side page tracking

6. Alias Users and Manage Sources

When to use: User wants to merge anonymous and identified users, or manage source configuration

Tool sequence:

  1. SEGMENT_ALIAS - Link two user identities together [Optional]
  2. SEGMENT_LIST_SCHEMA_SETTINGS_IN_SOURCE - View source schema settings [Optional]
  3. SEGMENT_UPDATE_SOURCE - Update source configuration [Optional]

Key parameters:

  • For ALIAS:
    • userId: New user identifier (the identified ID)
    • previousId: Old user identifier (the anonymous ID)
  • For source operations:
    • sourceId: Source identifier

Pitfalls:

  • ALIAS is a one-way operation; cannot be undone
  • previousId is the anonymous/old ID, userId is the new/identified ID
  • Not all destinations support alias calls; check destination documentation
  • ALIAS should be called once when a user first identifies (e.g., signs up)
  • Source updates may affect data collection; review changes carefully

Common Patterns

User Lifecycle

Standard Segment user lifecycle:

code
1. Anonymous user visits -> PAGE call with anonymousId
2. User interacts -> TRACK call with anonymousId
3. User signs up -> ALIAS (anonymousId -> userId), then IDENTIFY with traits
4. User takes action -> TRACK call with userId
5. User joins org -> GROUP call linking userId to groupId

Batch Optimization

For bulk data ingestion:

code
1. Collect events in memory (array of message objects)
2. Each message includes type, userId/anonymousId, and type-specific fields
3. Call SEGMENT_BATCH with the collected messages
4. Check response for any individual message errors

Naming Conventions

Segment recommends consistent event naming:

  • Events: Use "Object Action" format (e.g., 'Order Completed', 'Article Viewed')
  • Properties: Use snake_case (e.g., 'order_total', 'product_name')
  • Traits: Use snake_case (e.g., 'first_name', 'plan_type')

Known Pitfalls

Identity Resolution:

  • Always include userId or anonymousId on every call
  • Use ALIAS only once per user identity merge
  • Identify before tracking to ensure proper user association

Data Quality:

  • Event names should be consistent across all sources
  • Properties should follow a defined schema for downstream compatibility
  • Avoid sending sensitive PII unless destinations are configured for it

Rate Limits:

  • Use BATCH for bulk operations to stay within rate limits
  • Individual calls are rate-limited per source
  • Batch calls are more efficient and less likely to be throttled

Response Parsing:

  • Successful responses indicate acceptance, not delivery to destinations
  • Response data may be nested under data key
  • Check for error fields in batch responses for individual message failures

Timestamps:

  • Must be ISO 8601 format with timezone (e.g., '2024-01-15T10:30:00Z')
  • Omitting timestamp uses server receive time
  • Historical data imports should include explicit timestamps

Quick Reference

TaskTool SlugKey Params
Track eventSEGMENT_TRACKuserId, event, properties
Identify userSEGMENT_IDENTIFYuserId, traits
Batch callsSEGMENT_BATCHbatch (array of messages)
Group userSEGMENT_GROUPuserId, groupId, traits
Page viewSEGMENT_PAGEuserId, name, properties
Alias identitySEGMENT_ALIASuserId, previousId
Source schemaSEGMENT_LIST_SCHEMA_SETTINGS_IN_SOURCEsourceId
Update sourceSEGMENT_UPDATE_SOURCEsourceId
WarehousesSEGMENT_LIST_CONNECTED_WAREHOUSES_FROM_SOURCEsourceId