Participants will evaluate their models on short-answer questions (SAQs) to assess their model's ability to generate accurate responses while accounting for cultural and linguistic diversity. This ...