Implementation:Intel Ipex llm Save Pretrained DPO
Appearance
| Knowledge Sources | |
|---|---|
| Domains | NLP, Model_Deployment |
| Last Updated | 2026-02-09 00:00 GMT |
Overview
Concrete tool for saving DPO-trained model artifacts for deployment, used in the IPEX-LLM DPO workflow.
Description
After DPO training, dpo_trainer.model.save_pretrained() saves the LoRA adapter weights, and tokenizer.save_pretrained() saves the tokenizer. For full model export, the shared export_merged_model.py script calls merge_adapter to fold adapters into the base model.
Usage
Use immediately after dpo_trainer.train() to save model artifacts.
Code Reference
Source Location
- Repository: IPEX-LLM
- File: python/llm/example/GPU/LLM-Finetuning/DPO/dpo_finetuning.py
- Lines: 175-179
- File: python/llm/example/GPU/LLM-Finetuning/DPO/export_merged_model.py
- Lines: 28-44
Signature
# Save adapter
dpo_trainer.model.save_pretrained(output_path: str) -> None
tokenizer.save_pretrained(output_path: str) -> None
# Merge adapter (via export_merged_model.py)
# python export_merged_model.py --base_model MODEL --adapter_path ADAPTER --output_path OUTPUT
Import
# save_pretrained is a method on the model/tokenizer objects
# merge_adapter is from common.utils (see Merge_Adapter implementation)
I/O Contract
Inputs
| Name | Type | Required | Description |
|---|---|---|---|
| output_path | str | Yes | Directory to save adapter weights and tokenizer |
Outputs
| Name | Type | Description |
|---|---|---|
| adapter files | Files | adapter_model.bin, adapter_config.json saved to output_path |
| tokenizer files | Files | tokenizer.json, tokenizer_config.json saved to output_path |
Usage Examples
# After DPO training
dpo_trainer.train()
# Save adapter + tokenizer
dpo_trainer.model.save_pretrained("./dpo-output")
tokenizer.save_pretrained("./dpo-output")
# Optionally merge adapter for standalone deployment
# python export_merged_model.py \
# --base_model teknium/OpenHermes-2.5-Mistral-7B \
# --adapter_path ./dpo-output \
# --output_path ./dpo-merged
Related Pages
Implements Principle
Requires Environment
Page Connections
Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment