AgentSkillsCN

tech-stack

概述本代码库所采用的主要技术、框架、库以及语言规范。

SKILL.md
--- frontmatter
name: tech-stack
description: Describes the primary technologies, frameworks, libraries, and language conventions used in this codebase.

Tech Stack Overview

Language-Specific Conventions

  • Python: See PYTHON.md for detailed conventions and examples

Primary Languages

LanguageUsageVersion
Pythondatagen, notebooks3.10+
PySparkFabric notebooksSpark 3.x
KQLEventhouse queriesN/A
JSON/YAMLFabric item definitionsN/A

Frameworks & Libraries

Data Generation (datagen)

  • DuckDB: Local analytical database for historical data
  • Faker: Synthetic data generation
  • Pydantic: Data validation and event schemas
  • azure-eventhub: Event streaming to Azure

Lakehouse

  • Delta Lake: ACID transactions, schema enforcement
  • PySpark: Distributed data processing

Real-Time Analytics

  • Microsoft Fabric Eventhouse: KQL-based analytics
  • Eventstream: Event routing and transformation

Project Architecture

code
Event Flow:
  datagen (Python)
    → Azure Event Hubs
    → Eventstream
    → KQL Tables + Lakehouse Bronze

Data Layers:
  Bronze (raw JSON)
    → Silver (typed Delta)
    → Gold (aggregated Delta)
    → Semantic Model (Power BI)

Key Dependencies

See datagen/pyproject.toml for Python dependencies.

Core packages:

  • pydantic - Schema validation
  • duckdb - Local analytics
  • faker - Data generation
  • azure-eventhub - Event streaming