OST-VQA: VQA dataset for On-Screen Text
VILQA: VQA dataset for visual perception
PLABA-EVAL: A Multi-Dimensional, In-Context Sentence Readability Dataset for Medical Text
CTSEG: CEFR-based Targeted Syntactic Evaluation Dataset for Grammatical Error Correction
JMWE-parallel: Dataset of literal usage examples for Japanese multiword expressions (MWEs)
JaFaithSum: A Japanese Faithfulness Evaluation Dataset for LLM Summarization
JADOS-eval: 文書レベルの日本語平易化の評価データセット