YData Profiling EDA
Purpose
Create consistent EDA reports for train and test or accepted and rejected datasets using ProfileReport, including comparisons and saved HTML outputs.
Usage
- •"generate ydata-profiling report"
- •"compare train and test EDA"
- •"create HTML EDA report"
Instructions
- •Set a sampling fraction for large datasets and a fixed random seed.
- •Create
ProfileReportobjects withprogress_bar=False, and disable duplicates and interactions when speed matters. - •Compare reports using
.compare()and save to HTML with.to_file(). - •Use
./scripts/generate_eda_report.pyfor consistent report creation. - •Use
./templates/eda_compare_block.mdto document inputs, sample fractions, and output paths.