Name: retrain
Rating: 87
Author: wxl387

Model Retraining

Retrain ML models using scripts/retrain_models.py. Parse the arguments from $ARGUMENTS.

For individual model tuning: tune-xgboost, tune-lstm, tune-cnn, tune-transformer map to their respective flags.

•Parse arguments and build the CLI command
•Show the user the exact command being run
•CRITICAL: Run with .venv/bin/python only (system Python 3.9.6 causes TF mutex deadlocks)
•
Run: .venv/bin/python scripts/retrain_models.py <flags>
- •Timeout: 600 seconds for single model, 1800 seconds for all models or tuning
•
After completion, summarize:
- •Training accuracy and test accuracy for each model
- •Whether the model improved over production
- •Whether deployment happened (if --deploy was used)
•If the user asked to retrain the transformer specifically, remind them that it currently has 49.8% accuracy and its ensemble weight is 0.3 (lowered to avoid diluting signal)

•TF import order bug: pandas before TF causes model.fit() deadlock on macOS. The scripts already handle this, but always use .venv/bin/python.
•XGBoost requires libomp (already installed via Homebrew)
•LSTM/CNN training uses MPS GPU acceleration on Apple M4
•Transformer is undertrained (49.8% accuracy) — needs more data/tuning before increasing its ensemble weight
•Default deployment threshold is 1% accuracy improvement