DrishtiSharma
/

opt-350m-hh-rlhf

Model card Files Files and versions Metrics Training metrics Community

No model card

New: Create and edit this model card directly on the website!

Contribute a Model Card

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference API

Unable to determine this model's library. Check the docs .

Collection including DrishtiSharma/opt-350m-hh-rlhf

Comparative Study:OPT-350M and GPT-2 w Reward-based Training

Comparative Study: Training OPT-350M and GPT-2 on Anthropic’s HH-RLHF Dataset Using Reward-Based Training • 2 items • Updated Sep 11, 2023