AutoML Analyst Skill
This skill turns the agent into a data scientist that autonomously trains models and reports results.
Workflow
- •
Data Analysis:
- •Read the input file (CSV/JSON).
- •Determine the problem type:
- •Regression: Predicting a number (e.g., predicted_ltv).
- •Classification: Predicting a category (e.g., churn/no_churn).
- •Clustering: grouping data (e.g., customer_segments) if no target is specified.
- •
Code Generation:
- •Write a python script (e.g.,
train_model.py) usingpandasandscikit-learn. - •The script MUST output metrics and generate plots (saved as images).
- •Write a python script (e.g.,
- •
Reporting:
- •Read
resources/report_template.html. - •Inject the results (metrics, plot images, summary) into the template.
- •Save the final report as
report.html.
- •Read
Code Guidelines
- •Handle missing values (impute or drop).
- •Encode categorical variables.
- •Use
matplotliborseabornfor plots.
Example
User: "Analyze this customer_churn.csv." Agent:
- •Detects 'churn' column -> Classification problem.
- •Writes
train_churn.py-> Trains Random Forest -> Accuracy 85%. - •Generates
churn_report.htmlwith feature importance plot.