Model Export To Onnx And Openvino Backends

1

KerasFramework57/100

via “model export to multiple deployment formats (savedmodel, onnx, litert, openvino)”

High-level deep learning API — multi-backend (JAX, TensorFlow, PyTorch), simple model building.

Unique: Keras 3's export system supports multiple formats (SavedModel, ONNX, LiteRT, OpenVINO) from a single model definition, enabling deployment across diverse hardware without framework-specific conversion tools. Export functions in keras/src/saving/ handle format-specific serialization, and the system supports quantization and optimization for each format independently.

vs others: Unlike PyTorch (torch.onnx.export for ONNX only) or TensorFlow (SavedModel-centric), Keras 3 provides unified export to four major formats from a single API, and unlike ONNX converters (which are format-specific), Keras export is built into the framework, ensuring consistency and reducing conversion errors.

2

Kokoro TTSRepository57/100

via “model export and optimization for production deployment”

Lightweight 82M parameter open-source TTS with high-quality output.

Unique: Provides explicit export utilities rather than automatic ONNX export, giving developers control over export parameters and optimization settings; separates export from inference, enabling offline optimization workflows

vs others: More flexible than automatic export because developers can customize export parameters; avoids runtime overhead of on-demand export compared to systems that export during first inference

3

all-MiniLM-L6-v2Model57/100

via “multi-format-model-export-and-inference”

sentence-similarity model by undefined. 23,35,18,673 downloads.

Unique: Distributed across multiple ecosystem projects (sentence-transformers for PyTorch, ONNX community for format conversion, OpenVINO toolkit for Intel optimization) rather than single unified export pipeline; enables best-in-class optimization per format but requires manual orchestration

vs others: More deployment flexibility than proprietary embedding APIs (OpenAI, Cohere) which lock you into their inference infrastructure; more mature ONNX support than newer models due to wide adoption in sentence-transformers ecosystem

4

YOLOv8Repository55/100

via “multi-format model export with autobackend inference”

Real-time object detection, segmentation, and pose.

Unique: Implements AutoBackend pattern that auto-detects exported format and dynamically routes inference to appropriate runtime (ONNX Runtime, TensorRT, CoreML, etc.) without explicit backend selection, handling format-specific preprocessing/postprocessing transparently

vs others: More comprehensive than ONNX Runtime alone (supports 13+ formats vs 1) and more automated than manual TensorRT compilation because format detection and backend routing are implicit rather than explicit

5

Piper TTSRepository55/100

via “onnx model export and optimization for edge deployment”

Fast local neural TTS optimized for Raspberry Pi and edge devices.

Unique: Implements ONNX export with built-in quantization and operator fusion specifically tuned for VITS architecture, enabling 50-70% model size reduction with minimal quality loss vs. generic ONNX converters

vs others: More optimized for TTS than generic ONNX export tools; supports quantization strategies specific to VITS; produces models 2-3x smaller than unoptimized exports while maintaining quality

6

UltralyticsRepository55/100

via “multi-format model export with quantization and optimization”

Unified YOLO framework for detection and segmentation.

Unique: Unified exporter interface abstracts 10+ format-specific implementations (ONNX, TensorRT, CoreML, OpenVINO, etc.) through a single export() call with format auto-detection. Built-in validation layer compares exported model outputs against PyTorch baseline to catch numerical drift. Generates deployment code snippets for each format.

vs others: More comprehensive format coverage than TensorFlow Lite (supports TensorRT, CoreML, OpenVINO natively) and simpler than ONNX Runtime alone (handles quantization and validation automatically)

7

all-MiniLM-L12-v2Model54/100

via “multi-format-model-export-and-deployment”

sentence-similarity model by undefined. 28,25,304 downloads.

Unique: Provides native export to four distinct inference formats with automatic tokenizer serialization and config preservation, enabling single-command deployment across CPU, GPU, mobile, and edge hardware without manual format conversion or architecture reimplementation; SafeTensors format ensures secure deserialization preventing arbitrary code execution

vs others: More deployment-flexible than OpenAI embeddings (API-only); simpler than custom ONNX conversion pipelines; safer than pickle-based PyTorch exports due to SafeTensors format

8

multi-qa-mpnet-base-dot-v1Model52/100

via “onnx-and-openvino-export-for-edge-deployment”

sentence-similarity model by undefined. 25,30,482 downloads.

Unique: Provides native ONNX and OpenVINO export support with quantization-friendly architecture (no custom ops). Enables deployment on edge devices and CPU-only infrastructure with minimal code changes, supporting both float32 and int8 quantized inference.

vs others: Faster edge deployment than PyTorch models because ONNX Runtime and OpenVINO use optimized inference engines with hardware-specific optimizations, and quantization support reduces model size by 4x and latency by 2-3x compared to full-precision models.

9

multilingual-e5-smallModel52/100

via “onnx and openvino model export for edge deployment”

sentence-similarity model by undefined. 70,32,108 downloads.

Unique: Provides pre-optimized ONNX and OpenVINO representations of multilingual-e5-small, enabling single-model deployment across diverse hardware (CPUs, mobile, edge) without language-specific optimizations. OpenVINO export includes graph-level optimizations (operator fusion, constant folding) and quantization-aware training compatibility, reducing inference latency by 2-4x on Intel CPUs.

vs others: Smaller and faster than PyTorch deployment for edge use cases; more portable than TensorFlow Lite (which lacks transformer support); enables privacy-preserving on-device inference without cloud dependencies.

10

openvinoFramework52/100

via “multi-framework model import with unified intermediate representation”

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

Unique: Implements dedicated frontend plugins for each framework (PyTorch, ONNX, TensorFlow) that parse framework-specific graph formats and map them to OpenVINO's unified Opset, rather than relying on a single generic conversion layer. This architecture allows framework-specific optimizations (e.g., PyTorch's traced graph structure) to be leveraged during conversion while maintaining a single downstream optimization pipeline.

vs others: Supports more input frameworks (7+) with dedicated parsers than ONNX Runtime (primarily ONNX-focused) and provides tighter integration with Intel hardware than generic converters like ONNX-to-TensorFlow bridges.

11

multilingual-e5-baseModel51/100

via “onnx and openvino model export for edge deployment”

sentence-similarity model by undefined. 36,60,082 downloads.

Unique: Supports three inference backends (PyTorch, ONNX Runtime, OpenVINO) from a single model artifact, with automatic optimization for each target platform — ONNX for cross-platform compatibility, OpenVINO for Intel hardware, PyTorch for development

vs others: More portable than PyTorch-only deployment and faster than unoptimized ONNX due to OpenVINO's graph-level optimizations; enables 2-4x latency reduction on CPU compared to PyTorch inference

12

ChatTTSAgent51/100

via “onnx export for cross-platform deployment”

A generative speech model for daily dialogue.

Unique: Provides ONNX export capability for all major pipeline components (GPT, DVAE, Vocos), enabling end-to-end deployment without PyTorch. The export process includes optimization and quantization options, enabling deployment on resource-constrained devices.

vs others: More flexible than PyTorch-only deployment because ONNX enables use of alternative inference runtimes (ONNX Runtime, TensorRT, CoreML). More portable than TorchScript because ONNX is a standard format with broad ecosystem support.

13

distil-large-v3Model50/100

via “onnx-export-and-cross-platform-inference”

automatic-speech-recognition model by undefined. 13,05,832 downloads.

Unique: Leverages ONNX's standardized opset to enable deployment across 10+ platforms (Windows, Linux, macOS, iOS, Android, web browsers, embedded systems) with a single model export — ONNX Runtime's execution providers automatically select optimal hardware acceleration (CPU, GPU, CoreML, NNAPI) without code changes

vs others: Enables true cross-platform deployment with a single model file, unlike PyTorch Mobile (iOS/Android only) or TensorFlow Lite (mobile-focused); ONNX Runtime's graph optimizations often match or exceed framework-native inference speed while providing broader platform coverage

14

paraphrase-mpnet-base-v2Model50/100

via “multi-format-model-export-and-deployment”

sentence-similarity model by undefined. 18,87,172 downloads.

Unique: Provides pre-converted artifacts for all major inference formats directly from HuggingFace Hub, eliminating manual conversion overhead; includes format-specific optimizations (attention fusion for ONNX, graph optimization for OpenVINO) baked into each export

vs others: Faster deployment than converting from PyTorch source (no conversion step required) and more reliable than manual ONNX export due to official format validation; supports more deployment targets than single-format models like BERT-base

15

e5-base-v2Model49/100

via “onnx and openvino model export for edge and on-premise deployment”

sentence-similarity model by undefined. 17,78,169 downloads.

Unique: Provides native ONNX and OpenVINO export through sentence-transformers' built-in conversion utilities, supporting both full-precision and quantized models without custom export code. The export process preserves the tokenizer and preprocessing logic, enabling end-to-end inference without reimplementing text preprocessing.

vs others: One-command export to multiple formats (ONNX, OpenVINO) with quantization support, whereas most models require separate conversion pipelines and manual tokenizer integration for edge deployment.

16

mDeBERTa-v3-base-xnli-multilingual-nli-2mil7Model47/100

via “onnx-model-export-and-inference”

zero-shot-classification model by undefined. 3,03,704 downloads.

Unique: Enables ONNX export of the DeBERTa-v3-base architecture with full transformer semantics preserved, supporting dynamic batch sizes and sequence lengths without reexport. Unlike simple PyTorch-to-ONNX conversion, this approach maintains cross-lingual capabilities and NLI reasoning patterns across different runtime environments.

vs others: Provides hardware-agnostic inference without PyTorch dependency, enabling 2-5x faster startup and lower memory overhead than PyTorch on CPU, and supports quantization for 4x model size reduction with minimal accuracy loss vs full-precision models.

17

mask2former-swin-large-cityscapes-semanticModel46/100

via “model export to onnx and torchscript formats”

image-segmentation model by undefined. 1,55,904 downloads.

Unique: Supports export to both ONNX and TorchScript, enabling deployment across diverse inference engines (ONNX Runtime, TensorRT, CoreML) — though deformable attention may require custom ONNX operators not available in standard opset

vs others: Enables multi-platform deployment vs PyTorch-only inference, though export complexity and potential operator compatibility issues add deployment friction

18

nli-MiniLM2-L6-H768Model43/100

via “multi-format model export and deployment”

zero-shot-classification model by undefined. 2,58,745 downloads.

Unique: Provides native multi-format export (ONNX, OpenVINO, SafeTensors) directly from Hugging Face Hub without custom conversion scripts, enabling one-click deployment to diverse runtimes — most NLI models require manual export pipelines or are locked to single frameworks

vs others: Eliminates custom export boilerplate compared to models that only ship PyTorch weights; more deployment-flexible than framework-specific alternatives, though quantization and hardware-specific optimization still require manual tuning

19

rtdetr_r18vd_coco_o365Model42/100

via “onnx and torchscript export for cross-platform deployment”

object-detection model by undefined. 5,21,638 downloads.

Unique: Supports both ONNX and TorchScript export with transformer-aware optimization, preserving attention mechanisms and dynamic shapes; many detection models only export to ONNX with limited shape flexibility

vs others: Enables deployment on 10+ inference runtimes (ONNX Runtime, TensorRT, CoreML, NCNN, OpenVINO) vs single-runtime models; reduces deployment friction across cloud, mobile, and edge

20

sat-12l-smModel41/100

via “onnx-optimized inference export for production deployment”

token-classification model by undefined. 3,07,609 downloads.

Unique: Provides pre-exported ONNX weights alongside safetensors format, eliminating conversion overhead and enabling immediate deployment to ONNX Runtime without requiring PyTorch/TensorFlow toolchains on target systems

vs others: Faster deployment than converting from PyTorch at runtime; ONNX format is hardware-agnostic unlike TensorRT (NVIDIA-only) or CoreML (Apple-only), enabling single export for multi-platform deployment

Top Matches

Also Known As

Company