Capability
Extensible Framework Architecture For Custom Evaluations
13 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
Microsoft's unified LLM evaluation and prompt robustness benchmark.
Unique: Uses inheritance-based extension pattern with base classes (LLMModel, Dataset, AttackMethod, Metric) that enable custom implementations to be registered and used without modifying core framework code.
vs others: More extensible than monolithic evaluation tools because it provides clear extension points and base classes, whereas tools like HELM require forking or external wrappers for custom components.