span-marker-mbert-base-multinerd
Free token-classification model by tomaarsen. 284,856 downloads.
Capabilities (7 decomposed)
multilingual named entity recognition with span-based token classification
Medium confidence: Performs token-level classification using a span-marker architecture built on mBERT (multilingual BERT), enabling detection and classification of named entities across 10+ languages simultaneously. The model uses a two-stage span-based approach: first enumerating candidate entity spans, then assigning entity type labels to those spans. This differs from traditional sequence labeling by operating on variable-length spans rather than individual tokens, reducing cascading errors from boundary misalignment.
Uses the span-marker architecture with an mBERT base, handling entity boundary detection and type classification in a unified span-based framework rather than traditional BIO tagging; trained on MultiNERD's 15 entity types across 10 languages, providing broader entity coverage than single-language NER models
Outperforms spaCy's multilingual models on fine-grained entity types and handles more languages natively; far more adaptable than rule-based or regex approaches, and more accurate on entity boundaries than token-only classifiers
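A minimal usage sketch, assuming the SpanMarker Python library (pip install span_marker) and its documented from_pretrained/predict entry points; the example sentence is illustrative:

```python
# Sketch: load the checkpoint and extract typed entity spans.
from span_marker import SpanMarkerModel

model = SpanMarkerModel.from_pretrained(
    "tomaarsen/span-marker-mbert-base-multinerd"
)

# predict() returns one dict per detected entity, with the span text,
# its label, character offsets, and a confidence score.
for ent in model.predict(
    "Amelia Earhart flew her Lockheed Vega 5B across the Atlantic to Paris."
):
    print(ent["span"], ent["label"], round(ent["score"], 3))
```

Labels come from MultiNERD's category inventory (PER, ORG, LOC, and so on).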
cross-lingual entity type classification with shared embedding space
Medium confidence: Leverages mBERT's multilingual embedding space to classify entity types consistently across languages without language-specific fine-tuning. The model encodes text through mBERT's 12 transformer layers, projecting tokens into a shared 768-dimensional space where entity semantics align across languages. This enables zero-shot or few-shot entity classification for languages not explicitly seen during training, as long as they are covered by mBERT's 104-language pretraining.
Inherits mBERT's 104-language pretraining to enable cross-lingual entity classification without explicit language-specific training; span-marker architecture preserves entity boundary information across languages, enabling consistent entity type assignment even when entity mentions vary in length across languages
Requires no language-specific fine-tuning, unlike per-language NER models (e.g., separate German, French, and Spanish checkpoints); one multilingual model is cheaper to operate than a fleet of per-language models while delivering comparable accuracy on high-resource languages
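A sketch of that cross-lingual behavior: the same checkpoint, with no language flag or routing, applied to parallel sentences. The sentences are illustrative assumptions, and actual labels depend on the checkpoint:

```python
# Sketch: one model, several languages, no language detection step.
from span_marker import SpanMarkerModel

model = SpanMarkerModel.from_pretrained(
    "tomaarsen/span-marker-mbert-base-multinerd"
)

texts = {
    "en": "Angela Merkel visited the Louvre in Paris.",
    "de": "Angela Merkel besuchte den Louvre in Paris.",
    "es": "Angela Merkel visitó el Louvre en París.",
}
for lang, text in texts.items():
    entities = model.predict(text)
    # Entity types should stay consistent (PER, LOC, ...) across languages.
    print(lang, [(e["span"], e["label"]) for e in entities])
```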
fine-grained entity type disambiguation with 10+ entity categories
Medium confidence: Classifies detected entities into 15 distinct entity types (person, organization, location, event, food, animal, etc.) as defined by the MultiNERD dataset, enabling fine-grained information extraction beyond simple binary entity/non-entity classification. The model learns type-specific patterns through supervised training on MultiNERD's annotated corpus, using mBERT's contextual representations to disambiguate entities with identical surface forms but different types (e.g., 'Apple' as company vs. fruit).
Trained on MultiNERD's 15-category entity taxonomy across 10 languages, providing finer-grained entity classification than generic NER models; the span-marker architecture assigns types at the span level rather than the token level, preventing type fragmentation across multi-token entities
Supports more entity types than spaCy's default models (which typically support 7-8 types); more accurate than rule-based type assignment while maintaining interpretability through attention weights
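A disambiguation sketch under the same assumed API: the same surface form in two contexts. Which labels come back (e.g. an organization type for the company, FOOD for the fruit, per MultiNERD's categories) depends on the checkpoint:

```python
# Sketch: context decides the entity type for an ambiguous surface form.
from span_marker import SpanMarkerModel

model = SpanMarkerModel.from_pretrained(
    "tomaarsen/span-marker-mbert-base-multinerd"
)

for text in [
    "Apple announced a new iPhone at its Cupertino headquarters.",
    "She sliced an apple and a banana into the fruit salad.",
]:
    # Expect the company mention typed as an organization; the fruit may be
    # tagged FOOD or skipped entirely, depending on the model's thresholds.
    print([(e["span"], e["label"]) for e in model.predict(text)])
```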
batch entity extraction with efficient span enumeration
Medium confidence: Processes multiple documents, or long documents, through efficient span enumeration: the model generates all candidate entity spans up to a configurable maximum length (8 tokens for this checkpoint) and classifies each span's entity type. This avoids redundant token-level computations by reusing mBERT's contextual representations across the entire document, then scoring spans post hoc. Batch processing is optimized through padding and masking to handle variable-length inputs efficiently.
Implements span-based enumeration rather than token-level tagging, enabling efficient batch processing where all spans are scored in parallel; mBERT's shared embeddings across languages allow single-pass batch processing for multilingual documents without language-specific routing
Faster than sequential token-level classification on long documents because spans are scored in parallel; avoids materializing a separate representation for every candidate span, keeping memory use bounded
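A batch sketch: predict also accepts a list of sentences, and a batch_size argument controls how many are padded and scored together (argument names follow the SpanMarker library; verify against the installed version):

```python
# Sketch: score several documents in padded mini-batches.
from span_marker import SpanMarkerModel

model = SpanMarkerModel.from_pretrained(
    "tomaarsen/span-marker-mbert-base-multinerd"
)

docs = [
    "Leonardo da Vinci painted the Mona Lisa.",
    "The Amazon River flows through Brazil and Peru.",
    "Tesla opened a new factory near Berlin.",
]
# One list in, one list of entity lists out; batch_size trades memory
# for throughput.
for doc, entities in zip(docs, model.predict(docs, batch_size=8)):
    print(doc, "->", [(e["span"], e["label"]) for e in entities])
```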
contextual entity representation extraction for downstream tasks
Medium confidence: Exposes mBERT's intermediate-layer representations (768-dimensional contextual embeddings) for each detected entity span, enabling downstream tasks like entity linking, coreference resolution, or entity similarity matching. The model outputs not just entity type labels but also the pooled contextual representation of each entity span, computed by averaging mBERT's hidden states across the span's tokens. These representations capture semantic and syntactic context, enabling vector-based entity operations.
Exposes mBERT's contextual embeddings at the span level, enabling entity representations that capture both entity type and semantic context; span-based pooling (averaging tokens within entity boundaries) preserves entity-specific information better than token-level embeddings
Provides contextual embeddings natively without additional embedding models, reducing pipeline complexity; more accurate for entity linking than static embeddings (e.g., FastText) due to context awareness
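The model card documents no embedding-extraction API, so the sketch below hand-rolls the pooling described above: run mBERT via transformers and mean-pool its hidden states over the tokens inside an entity's character offsets. The hard-coded offsets stand in for values that would normally come from predict():

```python
# Sketch: average mBERT hidden states over an entity span's tokens.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
encoder = AutoModel.from_pretrained("bert-base-multilingual-cased")

text = "Marie Curie won the Nobel Prize in Physics."
char_start, char_end = 0, 11  # character offsets of "Marie Curie"

enc = tokenizer(text, return_tensors="pt", return_offsets_mapping=True)
offsets = enc.pop("offset_mapping")[0]  # (seq_len, 2) char range per token
with torch.no_grad():
    hidden = encoder(**enc).last_hidden_state[0]  # (seq_len, 768)

# Keep tokens whose character range overlaps the entity, then average them.
mask = (offsets[:, 0] < char_end) & (offsets[:, 1] > char_start)
span_embedding = hidden[mask].mean(dim=0)
print(span_embedding.shape)  # torch.Size([768])
```

The resulting 768-dimensional vector can feed entity linking or similarity search, as the capability describes.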
safetensors model serialization for secure and efficient model loading
Medium confidence: Uses the safetensors format for model weights instead of the traditional PyTorch pickle format, enabling faster model loading, reduced memory overhead, and protection against arbitrary code execution during deserialization. Safetensors is a binary format that stores tensor data with explicit type and shape information, allowing zero-copy memory mapping on compatible systems. The weights ship as a single safetensors file; the architecture config remains a separate JSON file, since safetensors stores only tensors and their metadata.
Distributed in safetensors format instead of PyTorch pickle, providing security benefits (no arbitrary code execution) and performance benefits (faster loading, memory-mapping support); explicit type/shape metadata lets loaders validate and lazily map tensors without executing any code
Safer than pickle-based models (no code-execution risk); loads directly in PyTorch with no conversion step, unlike ONNX export; simpler to distribute than TensorFlow's multi-file SavedModel format
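A loading sketch using the safetensors library's public entry points (load_file for an eager state dict, safe_open for lazy, memory-mapped access); the file path is a placeholder:

```python
# Sketch: read safetensors weights without any pickle deserialization.
from safetensors import safe_open
from safetensors.torch import load_file

# Eager load: a plain {name: tensor} dict, no code execution possible.
state_dict = load_file("model.safetensors")

# Lazy access: inspect names and shapes without loading every tensor.
with safe_open("model.safetensors", framework="pt") as f:
    for name in list(f.keys())[:3]:
        print(name, f.get_slice(name).get_shape())
```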
multilingual tokenization with mbert's shared vocabulary
Medium confidence: Leverages mBERT's 119K-token vocabulary shared across 104 languages, enabling consistent tokenization of multilingual text without language-specific tokenizers. The WordPiece tokenizer handles subword segmentation for out-of-vocabulary words, preserving morphological information across languages. This unified tokenization ensures that entities in different languages are represented in a shared token space, letting the span-marker model apply consistent entity classification rules across languages.
Uses mBERT's 119K shared vocabulary across 104 languages, enabling unified tokenization without language detection; WordPiece subword segmentation preserves morphological information across language families (e.g., Germanic, Romance, Slavic)
Simpler than language-specific tokenizer pipelines while maintaining reasonable compression; more consistent across languages than separate tokenizers, reducing entity boundary misalignment
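A tokenization sketch with the underlying bert-base-multilingual-cased tokenizer from transformers, showing one shared WordPiece vocabulary handling several languages without a language-detection step:

```python
# Sketch: the same WordPiece vocabulary segments text in any of the
# 104 pretraining languages.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
print(tokenizer.vocab_size)  # ~119K shared subword vocabulary

for word in ["unbelievable", "unglaublich", "incroyable", "невероятно"]:
    # Out-of-vocabulary words fall back to subword pieces (## continuations).
    print(word, "->", tokenizer.tokenize(word))
```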
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifacts (sharing capabilities)
Artifacts that share capabilities with span-marker-mbert-base-multinerd, ranked by overlap. Discovered automatically through the match graph.
wikineural-multilingual-ner
token-classification model. 805,229 downloads.
bert-base-multilingual-cased-ner-hrl
token-classification model. 351,203 downloads.
xlm-roberta-large-ner-hrl
token-classification model. 582,028 downloads.
spaCy
Industrial-strength NLP library for production use.
bert-base-NER
token-classification model. 1,878,235 downloads.
roberta-large-ner-english
token-classification model. 322,447 downloads.
Best For
- ✓ NLP teams building multilingual information extraction systems
- ✓ developers creating document processing pipelines for international content
- ✓ researchers working with low-resource languages covered by mBERT
- ✓ organizations needing entity recognition without language-specific model management
- ✓ multilingual NLP teams with limited annotation budgets for low-resource languages
- ✓ organizations processing documents in 50+ languages with a single model
- ✓ researchers studying cross-lingual transfer learning in NER tasks
- ✓ information extraction pipelines requiring structured entity type labels
Known Limitations
- ⚠ Trained only on the MultiNERD dataset; may not recognize domain-specific entities (medical, legal, financial terminology) outside the training distribution
- ⚠ mBERT base has roughly 178M parameters (the 119K-token embedding matrix accounts for much of that), so fp32 weights occupy about 700MB of memory; slower inference than distilled alternatives (50-100ms per document on CPU)
- ⚠ The span-marker approach assumes entities are contiguous sequences; it cannot handle discontinuous or overlapping entity mentions
- ⚠ Performance degrades on languages with limited mBERT pretraining data (e.g., low-resource African languages); best performance on high-resource languages (English, Chinese, Spanish, German)
- ⚠ Cross-lingual transfer quality depends on mBERT's pretraining coverage; languages with minimal Wikipedia representation (e.g., minority languages) see 10-20% accuracy drops
- ⚠ Entity types must be semantically similar across languages; culturally specific entity categories may not transfer well
Model Details
About
tomaarsen/span-marker-mbert-base-multinerd: a token-classification model on Hugging Face with 284,856 downloads