Extract Hyperparameters
Locate and document all hyperparameters mentioned in research papers including learning rates, batch sizes, and model configurations.
When to Use
- •Reproducing paper results
- •Setting up model training configurations
- •Comparing hyperparameter choices across papers
- •Planning hyperparameter tuning experiments
Quick Reference
bash
# Extract numeric values and parameters from papers pdftotext paper.pdf - | grep -i "learning rate\|batch\|epochs\|weight decay\|dropout" | head -20 # Common pattern search grep -E "\\b(lr|batch_size|epochs|momentum|dropout|layers)\\s*[=:]" config.py
Workflow
- •Find hyperparameter table: Look for "Table 1" or "Hyperparameters" section
- •Document architecture parameters: Layer sizes, activation functions, normalization
- •Extract training parameters: Learning rate, batch size, epochs, optimizers
- •Note regularization: Dropout, weight decay, batch normalization
- •Create configuration file: Translate to implementation format (YAML/JSON/Mojo)
Output Format
Hyperparameter documentation:
- •Model architecture (layers, sizes, activations)
- •Training parameters (LR, batch size, epochs)
- •Optimizer configuration (type, momentum, decay)
- •Regularization settings (dropout, L1/L2)
- •Data preprocessing (normalization, augmentation)
- •Hardware and precision (float32, float64)
References
- •See
prepare-datasetskill for data configuration - •See
train-modelskill for training implementation - •See
/notes/review/mojo-ml-patterns.mdfor Mojo configuration patterns