Model Architecture Registry With Automatic Name Resolution

1

vLLM (Framework) 63/100

via “model registry with automatic architecture detection”

High-throughput LLM serving engine: PagedAttention, continuous batching, OpenAI-compatible API.

Unique: Detects the model architecture automatically from config.json and supports dynamic plugin registration, so model-specific optimizations apply without user configuration.

vs others: Requires less configuration than manual architecture specification, and new models benefit from the optimizations automatically.

2

AutoAWQ (Repository) 59/100

via “multi-architecture model registry with automatic implementation selection”

4-bit weight quantization for LLMs on consumer GPUs.

Unique: Uses a centralized registry that maps model architecture strings to implementation classes, enabling single-line model loading (from_pretrained/from_quantized) without users needing to know which specific quantizer or inference kernel to use. This abstraction layer decouples user code from architecture-specific implementation details.

vs others: Simpler API than GPTQ (which requires manual kernel selection) and more maintainable than bitsandbytes (which uses conditional imports); the factory pattern makes it trivial to add new architectures without changing user code.
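
The registry-plus-factory pattern described in this entry can be sketched in a few lines of Python. All names here (MODEL_REGISTRY, register, QuantizedLlama, QuantizedMistral, load_quantized) are hypothetical and chosen for illustration; this is not AutoAWQ's actual code or API.

```python
from typing import Callable, Dict


class BaseQuantizedModel:
    """Hypothetical base class for architecture-specific implementations."""

    def __init__(self, path: str) -> None:
        self.path = path


# Hypothetical registry: architecture string -> implementation class.
MODEL_REGISTRY: Dict[str, type] = {}


def register(model_type: str) -> Callable[[type], type]:
    """Class decorator that records an implementation under its architecture string."""

    def decorator(cls: type) -> type:
        MODEL_REGISTRY[model_type] = cls
        return cls

    return decorator


@register("llama")
class QuantizedLlama(BaseQuantizedModel):
    pass


@register("mistral")
class QuantizedMistral(BaseQuantizedModel):
    pass


def load_quantized(path: str, model_type: str) -> BaseQuantizedModel:
    """Factory: the caller names the architecture string, never the class."""
    try:
        cls = MODEL_REGISTRY[model_type]
    except KeyError:
        raise ValueError(f"unsupported architecture: {model_type!r}") from None
    return cls(path)
```

Under these assumptions, supporting a new architecture means adding one decorated class; code that calls load_quantized does not change.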

3

vllm (Platform) 42/100

via “model registry with automatic architecture detection”

A high-throughput and memory-efficient inference and serving engine for LLMs.

Unique: Implements automatic architecture detection by parsing the model's config.json and matching it against a registry of known architectures, with a fallback to a generic transformer implementation for unknown models. Supports custom model registration through a plugin system without modifying core code.

vs others: Eliminates manual architecture specification for 95%+ of HuggingFace models; automatic detection reduces setup time from minutes to seconds vs. manual configuration approaches.
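
As a rough illustration of the detection flow in this entry (parse config.json, match against a registry, fall back to a generic implementation), here is a minimal sketch. It assumes the Hugging Face convention of an "architectures" list in config.json; ARCH_REGISTRY, register_architecture, resolve_model_class, and the model classes are hypothetical names, not vLLM's actual API.

```python
import json
from pathlib import Path
from typing import Dict, Union


class GenericTransformer:
    """Hypothetical fallback used when no registered architecture matches."""


class LlamaModel:
    """Hypothetical optimized implementation for one known architecture."""


# Hypothetical registry: architecture name from config.json -> implementation class.
ARCH_REGISTRY: Dict[str, type] = {"LlamaForCausalLM": LlamaModel}


def register_architecture(name: str, cls: type) -> None:
    """Plugin hook: add an implementation without editing the registry source."""
    ARCH_REGISTRY[name] = cls


def resolve_model_class(model_dir: Union[str, Path]) -> type:
    """Read config.json and return the first registered architecture, else the fallback."""
    config = json.loads((Path(model_dir) / "config.json").read_text())
    # "architectures" may be missing or null, so normalize to an empty list.
    for arch in config.get("architectures") or []:
        if arch in ARCH_REGISTRY:
            return ARCH_REGISTRY[arch]
    return GenericTransformer
```

In this sketch, a plugin would call register_architecture("MyCustomLM", MyCustomModel) at import time, after which a checkpoint whose config.json lists "MyCustomLM" resolves to that class instead of the generic fallback.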

4

unsloth (Web App) 39/100

via “model architecture registry with automatic name resolution”

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Unique: Uses a hierarchical registry pattern with architecture-specific submodules (llama.py, mistral.py, vision.py) that apply targeted patches for each model family. Automatic name resolution via regex and config inspection eliminates manual architecture specification.

vs others: More automatic than PEFT (which requires manual architecture specification) and more comprehensive than transformers' built-in optimizations, because it maintains a curated registry of proven optimization patterns for each major open model family.
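
Name resolution by regex plus config inspection, followed by a per-family patch lookup, might look like the sketch below. The lookup order (config first, then model name), the patterns, and every function and variable name are assumptions made for illustration; they are not taken from unsloth's source.

```python
import re
from typing import Callable, Dict, Optional


def patch_llama() -> str:
    # Placeholder for family-specific patches (hypothetical).
    return "llama patches applied"


def patch_mistral() -> str:
    # Placeholder for family-specific patches (hypothetical).
    return "mistral patches applied"


# Hypothetical registry: model family -> function applying that family's patches.
FAMILY_PATCHES: Dict[str, Callable[[], str]] = {
    "llama": patch_llama,
    "mistral": patch_mistral,
}

# Hypothetical name patterns, tried in order against the model name.
NAME_PATTERNS = [
    (re.compile(r"llama", re.IGNORECASE), "llama"),
    (re.compile(r"mistral", re.IGNORECASE), "mistral"),
]


def resolve_family(model_name: str, config: Optional[dict] = None) -> Optional[str]:
    """Prefer the config's model_type; otherwise guess the family from the name."""
    if config is not None:
        model_type = config.get("model_type")
        if model_type in FAMILY_PATCHES:
            return model_type
    for pattern, family in NAME_PATTERNS:
        if pattern.search(model_name):
            return family
    return None


def apply_patches(model_name: str, config: Optional[dict] = None) -> str:
    """Dispatch to the resolved family's patch function, if any."""
    family = resolve_family(model_name, config)
    if family is None:
        return "no family-specific patches"
    return FAMILY_PATCHES[family]()
```

Checking the config before the name is a design choice in this sketch: a fine-tune with an arbitrary repository name still resolves correctly as long as its config carries a known model_type.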

5

transformers (Framework) 38/100

via “unified model loading with auto-discovery across 400+ architectures”

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Unique: Uses a centralized registry pattern (src/transformers/models/auto/modeling_auto.py) that maps config class names to model classes, enabling zero-code-change support for new architectures added to the Hub. Unlike monolithic frameworks, Transformers decouples architecture definition from discovery, allowing community contributions without core library changes.

vs others: Faster model switching than frameworks requiring explicit imports (e.g., timm, torchvision) because architecture selection is data-driven from config.json rather than code-driven, and supports 400+ models vs ~50-100 in specialized vision/audio libraries.
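
The data-driven selection described in this entry, where config.json picks the config class and the config class picks the model class, can be illustrated with a toy two-step lookup. This is a simplified sketch with hypothetical names (CONFIG_MAPPING, MODEL_MAPPING, auto_config, auto_model and the toy classes); it does not reproduce the contents of modeling_auto.py.

```python
from dataclasses import dataclass, fields
from typing import Dict


@dataclass
class BertConfig:
    hidden_size: int = 768
    model_type: str = "bert"


@dataclass
class LlamaConfig:
    hidden_size: int = 4096
    model_type: str = "llama"


class BertModel:
    def __init__(self, config: BertConfig) -> None:
        self.config = config


class LlamaModel:
    def __init__(self, config: LlamaConfig) -> None:
        self.config = config


# Step 1 (hypothetical): model_type string from config.json -> config class.
CONFIG_MAPPING: Dict[str, type] = {"bert": BertConfig, "llama": LlamaConfig}
# Step 2 (hypothetical): config class -> model class.
MODEL_MAPPING: Dict[type, type] = {BertConfig: BertModel, LlamaConfig: LlamaModel}


def auto_config(raw: dict):
    """Build the matching config object from a parsed config.json dictionary."""
    cls = CONFIG_MAPPING[raw["model_type"]]
    known = {f.name for f in fields(cls)}
    # Ignore keys the toy config class does not define.
    return cls(**{k: v for k, v in raw.items() if k in known})


def auto_model(config):
    """Select the model class from the config's type, with no explicit import by the caller."""
    return MODEL_MAPPING[type(config)](config)
```

With this arrangement, switching models is a matter of pointing at a different config.json; the calling code stays the same.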
