Model Upload & Card Generator
Create model cards and upload fine-tuned models to Hugging Face Hub.
Gather Context
If coming from training manager, you should have:
- •
model_path,base_model,dataset,technique - •
training_config(LoRA rank, LR, epochs) - •
final_loss,training_time,hardware
If missing, ask for essential information.
Configuration
1. Repository Settings
Ask for:
- •Repo name:
username/model-name - •Visibility: Public or Private
- •License: MIT, Apache 2.0, CC-BY-4.0, Llama 3 Community, etc.
2. Export Formats
Options:
- •LoRA adapter only (~50-200MB) - Users merge themselves
- •Merged 16-bit (15-140GB) - Ready to use
- •GGUF quantized (4-8GB) - For llama.cpp/Ollama
- •All of the above (Recommended)
3. GGUF Quantization
If GGUF selected, ask which levels. See references/GGUF_GUIDE.md.
| Method | Size | Quality |
|---|---|---|
| Q4_K_M | ~4GB | Good (Recommended) |
| Q5_K_M | ~5GB | Better |
| Q8_0 | ~8GB | Best |
Generate Model Card
Create README.md with:
- •YAML Metadata - license, tags, base_model, datasets
- •Model Description - Table with key attributes
- •Training Details - Hyperparameters, LoRA config, results
- •Usage Examples - Transformers, Unsloth, Ollama, llama.cpp
- •Intended Use - Primary use cases, out-of-scope
- •Limitations - Biases, known issues
- •Citation - BibTeX entry
Execute Upload
1. Create Repository
python
from huggingface_hub import create_repo
create_repo("username/model-name", private=False, exist_ok=True)
2. Upload Files
python
from huggingface_hub import HfApi api = HfApi() # LoRA adapter api.upload_folder(folder_path="./outputs/lora_adapter", repo_id="username/model") # Model card api.upload_file(path_or_fileobj="README.md", path_in_repo="README.md", repo_id="username/model")
3. Generate GGUF (if selected)
python
from unsloth import FastLanguageModel
model, tokenizer = FastLanguageModel.from_pretrained("./outputs/lora_adapter")
model.save_pretrained_gguf("./gguf", tokenizer, quantization_method="q4_k_m")
Use scripts/convert_gguf.py for multiple quantizations.
4. Verify
python
from huggingface_hub import list_repo_files
print(list_repo_files("username/model"))
Final Report
Upload Complete!
Model: https://huggingface.co/{repo_name}
Uploaded:
- •LoRA adapter
- •Model card
- •GGUF files (if selected)
Next steps:
- •Verify model page
- •Add example outputs
- •Run benchmarks
- •Share on social media
Model Card Best Practices
- •Be specific about limitations
- •Include usage examples - copy-pasteable
- •Document training details
- •Credit sources - base model, dataset, tools
- •Use tables - easier to scan
Error Handling
| Error | Resolution |
|---|---|
| Repo exists | Use exist_ok=True |
| Permission denied | Check HF token has write access |
| Upload timeout | Use chunked upload |
Bundled Resources
- •scripts/convert_gguf.py - GGUF conversion
- •references/GGUF_GUIDE.md - GGUF details and Ollama setup
- •references/TROUBLESHOOTING.md - Upload issues