MedQA (USMLE)Dataset45/100 via “multilingual clinical knowledge assessment across english and chinese variants”
12.7K USMLE medical exam questions for clinical AI evaluation.
Unique: Includes validated multilingual variants (English, simplified Chinese, traditional Chinese) of USMLE questions, enabling direct cross-lingual evaluation of clinical knowledge; most medical QA datasets are English-only, and multilingual medical datasets typically lack the rigor of USMLE-aligned questions
vs others: Enables evaluation of clinical reasoning across languages using the same standardized exam format, whereas other multilingual medical datasets (e.g., PubMedQA) lack language-specific variants or use lower-quality translations without medical validation