VLM-RLAIF: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Model Summary

This Hub repository contains a Hugging Face transformers implementation of the VLM-RLAIF model from the SNUMPR lab.

  • VLM-RLAIF-7b [HF]: 7B RLAIF model
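As a minimal sketch of fetching the checkpoint files, the snippet below uses `snapshot_download` from the `huggingface_hub` package. The repository id is taken from this page's Hub listing; the helper name `fetch_checkpoint` is hypothetical and is not part of the authors' code. The download is assumed to succeed only while the repository is publicly accessible.

```python
from pathlib import Path
from typing import Optional

# Repository id as shown in this page's Hub listing.
REPO_ID = "SNUMPR/vlm_rlaif_video_llava_7b"


def fetch_checkpoint(repo_id: str = REPO_ID, cache_dir: Optional[str] = None) -> Path:
    """Download all files in the repository and return the local snapshot directory.

    Hypothetical helper for illustration; requires network access and the
    huggingface_hub package.
    """
    # Imported lazily so this module can be imported without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=repo_id, cache_dir=cache_dir))


if __name__ == "__main__":
    local_dir = fetch_checkpoint()
    # List the downloaded files (config, tokenizer, weight shards, and so on).
    for path in sorted(local_dir.iterdir()):
        print(path.name)
```

This only retrieves the files. Loading the weights for video inference likely requires the model classes from the authors' own codebase, which this sketch does not cover.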
