phi3.5-gutenberg-4B / README.md
nbeerbower's picture
Update README.md
9c6ffd0 verified
|
raw
history blame contribute delete
No virus
527 Bytes
metadata
library_name: transformers
base_model:
  - microsoft/Phi-3.5-mini-instruct
datasets:
  - jondurbin/gutenberg-dpo-v0.1
license: mit

phi3.5-gutenberg-4B

microsoft/Phi-3.5-mini-instruct finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

Finetuned using 2x RTX 4060 Ti for 3 epochs.

Fine-tune Llama 3 with ORPO