[EVAL] Add MathVista

## Evaluation short description
- Why is this evaluation interesting?
MathVista is one of the most popular and commonly reported benchmark in MultiModal LLM like Qwen2.5_VL and InternVL3_5
- How used is it in the community?
To evaluate multimodal vision reasoning capabilities on Math questions

## Evaluation metadata
Provide all available
- Paper url:https://arxiv.org/pdf/2310.02255
- Github url: https://github.com/lupantech/MathVista
- Dataset url: https://huggingface.co/datasets/AI4Math/MathVista

https://mathvista.github.io/


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[EVAL] Add MathVista #975

Evaluation short description

Evaluation metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[EVAL] Add MathVista #975

Description

Evaluation short description

Evaluation metadata

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions