Academic Subject Taxonomy And Hierarchical Filtering

1

MMLUBenchmark61/100

via “hierarchical subject organization and result aggregation”

57-subject knowledge benchmark — 15K+ questions across STEM, humanities, professional domains.

Unique: Implements hierarchical subject organization (57 subjects grouped into 4 major categories: STEM, humanities, social sciences, other) with multi-level result aggregation, enabling both granular subject-level analysis and high-level category comparison in a single evaluation framework

vs others: More structured than flat subject lists and more informative than single overall scores, enabling researchers to identify domain-specific weaknesses and guide targeted model improvements

2

mmluDataset23/100

Dataset by cais. 4,76,392 downloads.

Unique: Explicit subject labels for every question enable filtering without external knowledge graphs or NLP-based categorization. 57-subject taxonomy is comprehensive and expert-validated, covering STEM, humanities, social sciences, and professional domains in single dataset.

vs others: More granular than generic QA datasets (SQuAD, RACE) while maintaining simplicity of flat taxonomy versus complex hierarchical ontologies

3

objaverseDataset23/100

via “semantic object category filtering and hierarchical retrieval”

Dataset by allenai. 5,33,157 downloads.

Unique: Implements hierarchical category filtering across 12+ heterogeneous source taxonomies with automated normalization and deduplication — enables consistent semantic retrieval despite source inconsistencies, unlike raw source APIs that expose unharmonized category structures

vs others: Provides unified semantic filtering across multiple sources in a single query, whereas downloading from individual sources (Sketchfab, TurboSquid) requires separate API calls and manual taxonomy reconciliation

Top Matches

Also Known As

Company