MM IF-Eval

multimodal

About

A challenging multimodal instruction-following benchmark that includes both compose-level constraints for output responses and perception-level constraints tied to input images, with comprehensive evaluation pipeline.

Evaluation Stats

Total Models1

Organizations1

Verified Results0

Self-Reported1

Benchmark Details

Max Score1

Language

Performance Overview

Score distribution and top performers

Score Distribution

1 models

Top Score

52.7%

Average Score

52.7%

High Performers (80%+)

Top Organizations

#1Mistral AI

1 model

52.7%

Leaderboard

1 models ranked by performance on MM IF-Eval

			License		Links
#01Pixtral-12B	Mistral AI	Sep 17, 2024	Apache 2.0	52.7%

Resources

Research Paper