koelectra-small-v3-nsmc
Model · Free. Text-classification model by daekeun-ml. 2,355,884 downloads.
Capabilities (6 decomposed)
korean sentiment classification with electra-based fine-tuning
Medium confidence: Performs binary sentiment classification (positive/negative) on Korean text using a small ELECTRA discriminator model fine-tuned on the NSMC (Naver Sentiment Movie Comments) dataset. The model leverages ELECTRA's replaced-token detection pretraining approach combined with task-specific fine-tuning on 200K Korean movie reviews, enabling efficient sentiment inference with 23.5M parameters. Inference runs locally via PyTorch/Hugging Face Transformers without requiring API calls, supporting batch processing and custom confidence thresholds.
Uses ELECTRA's discriminator-based pretraining (replaced-token detection) rather than MLM, enabling smaller model size (23.5M params vs 110M for BERT-base) while maintaining competitive accuracy on Korean sentiment tasks. Fine-tuned specifically on NSMC's 200K movie reviews with domain-specific Korean tokenization, making it optimized for review-like Korean text patterns.
Smaller and faster than KoBERT-base (110M params) or multilingual BERT variants while maintaining NSMC-specific accuracy; more specialized for Korean sentiment than generic mBERT but less generalizable to non-review domains than larger models.
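A minimal local-inference sketch with PyTorch and Transformers is below. The label mapping (0 = negative, 1 = positive) is assumed from the usual NSMC convention; confirm it against model.config.id2label before relying on it.

```python
# Minimal single-example inference sketch (assumed mapping: 0 = negative, 1 = positive).
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "daekeun-ml/koelectra-small-v3-nsmc"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

inputs = tokenizer("이 영화 정말 재밌어요!", return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits          # shape: (1, 2)
pred = logits.argmax(dim=-1).item()
print(model.config.id2label[pred])           # e.g. "positive"
```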
batch inference with dynamic padding and token optimization
Medium confidence: Processes multiple Korean text samples in parallel batches using Hugging Face Transformers' DataCollator with dynamic padding, which pads sequences to the longest sample in each batch rather than a fixed max length. This reduces computational waste and memory overhead when processing variable-length Korean text. Supports configurable batch sizes and automatic device placement (CPU/GPU), enabling efficient throughput for production inference pipelines without manual padding logic.
Leverages Hugging Face Transformers' native DataCollator with dynamic padding, which automatically computes optimal padding per batch rather than padding to fixed max_length. This is implemented via the collate_fn in DataLoader, reducing wasted computation on padding tokens by ~30-50% for variable-length Korean text.
More memory-efficient than padding all sequences to fixed 512 tokens; simpler than manual bucketing strategies but less flexible than custom ONNX-optimized inference engines for ultra-low-latency requirements.
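A sketch of that setup follows; the batch size, device logic, and example sentences are illustrative choices, with DataCollatorWithPadding supplying the per-batch dynamic padding described above.

```python
# Batched inference sketch: DataCollatorWithPadding pads each batch only to
# its longest sequence, not to a global max_length.
import torch
from torch.utils.data import DataLoader
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          DataCollatorWithPadding)

model_id = "daekeun-ml/koelectra-small-v3-nsmc"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device).eval()

texts = ["배우 연기가 훌륭했다", "스토리가 지루하고 뻔하다", "그럭저럭 볼만함"]
encodings = [tokenizer(t, truncation=True) for t in texts]   # tokenize, no padding yet

loader = DataLoader(encodings, batch_size=32,
                    collate_fn=DataCollatorWithPadding(tokenizer))

with torch.no_grad():
    for batch in loader:
        batch = {k: v.to(device) for k, v in batch.items()}
        preds = model(**batch).logits.argmax(dim=-1)
```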
hugging face hub model versioning and safetensors format loading
Medium confidence: Loads model weights from Hugging Face Hub using safetensors format (a secure, fast serialization standard) instead of pickle, with automatic version management and caching. The model is stored as a public repository with git-based versioning, allowing reproducible downloads of specific commits/tags. Safetensors format enables faster deserialization (~10x vs pickle) and eliminates arbitrary code execution risks during weight loading, making it suitable for production and untrusted environments.
Uses safetensors format for model serialization, which is a secure, fast alternative to pickle that prevents arbitrary code execution during deserialization. Combined with Hugging Face Hub's git-based versioning, this enables reproducible, version-pinned model loading with built-in security guarantees.
Safer than pickle-based model loading (eliminates code execution risk); faster deserialization than PyTorch's native format; more reproducible than downloading from custom URLs due to Hub's version control integration.
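A loading sketch under those guarantees; the revision value here is a placeholder rather than a known tag or commit in this repository, so pin whatever commit hash you have verified.

```python
# Version-pinned, safetensors-only loading sketch.
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model = AutoModelForSequenceClassification.from_pretrained(
    "daekeun-ml/koelectra-small-v3-nsmc",
    revision="main",        # placeholder: pin a tag or commit hash for reproducibility
    use_safetensors=True,   # refuse pickle-based weights, load *.safetensors only
)
tokenizer = AutoTokenizer.from_pretrained(
    "daekeun-ml/koelectra-small-v3-nsmc", revision="main")
```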
tokenization with korean morphological awareness
Medium confidence: Tokenizes Korean text using ELECTRA's pretrained WordPiece tokenizer, which was trained on Korean corpora and includes morphological awareness for Korean-specific linguistic patterns (e.g., particles, verb conjugations, compound words). The tokenizer handles Korean-specific edge cases like spacing conventions, Hangul decomposition, and subword segmentation optimized for Korean morphology. Supports both encoding (text → token IDs) and decoding (token IDs → text) with configurable special tokens and truncation strategies.
Uses a Korean-specific WordPiece tokenizer trained on Korean corpora, which includes morphological awareness for Korean linguistic patterns (particles, verb conjugations, compound words). This is more effective than generic multilingual tokenizers for Korean text, reducing subword fragmentation and improving model performance.
More morphologically aware than generic multilingual tokenizers (mBERT) but less interpretable than dedicated Korean morphological analyzers (Mecab, Okt); optimized for ELECTRA's pretraining but not customizable for domain-specific vocabulary.
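A round-trip encoding/decoding sketch; the exact subword splits depend on the checkpoint's WordPiece vocabulary, so treat the printed tokens as illustrative.

```python
# Encode Korean text to token IDs and decode back; truncation and special
# tokens are configurable, as described above.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("daekeun-ml/koelectra-small-v3-nsmc")

text = "영화가 정말 재미있었어요"
encoded = tokenizer(text, truncation=True, max_length=128)
print(tokenizer.convert_ids_to_tokens(encoded["input_ids"]))   # WordPiece subwords with [CLS]/[SEP]
print(tokenizer.decode(encoded["input_ids"], skip_special_tokens=True))  # back to plain text
```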
transfer learning and fine-tuning foundation for korean text tasks
Medium confidence: Provides a pretrained ELECTRA discriminator checkpoint that can be fine-tuned for downstream Korean text classification tasks beyond sentiment analysis. The model's learned representations capture Korean linguistic patterns from pretraining, enabling efficient transfer learning with minimal labeled data. Supports standard fine-tuning workflows (adding task-specific head, freezing/unfreezing layers, learning rate scheduling) via Hugging Face Transformers' Trainer API or custom PyTorch training loops.
Provides a Korean-specific ELECTRA discriminator pretrained on large Korean corpora, enabling efficient transfer learning for downstream Korean tasks. Unlike generic multilingual models, it captures Korean-specific linguistic patterns (morphology, syntax, semantics) learned during pretraining, reducing fine-tuning data requirements.
More efficient for Korean tasks than fine-tuning from multilingual BERT or starting from scratch; smaller than KoBERT-base (23.5M vs 110M params) enabling faster fine-tuning and inference; less general-purpose than larger models but more specialized for Korean NLP.
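A fine-tuning sketch via the Trainer API; the three-class target, the CSV file, and the hyperparameters are hypothetical stand-ins chosen for illustration, not settings prescribed by this model.

```python
# Transfer-learning sketch: swap in a new classification head and fine-tune.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          DataCollatorWithPadding, Trainer, TrainingArguments)

model_id = "daekeun-ml/koelectra-small-v3-nsmc"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# num_labels=3 is a hypothetical downstream task; the mismatched head is re-initialized.
model = AutoModelForSequenceClassification.from_pretrained(
    model_id, num_labels=3, ignore_mismatched_sizes=True)

# Hypothetical labeled data with "text" and "label" columns.
dataset = load_dataset("csv", data_files={"train": "train.csv"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=3,
                           per_device_train_batch_size=32, learning_rate=5e-5),
    train_dataset=tokenized["train"],
    data_collator=DataCollatorWithPadding(tokenizer),
)
trainer.train()
```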
confidence scoring and probability calibration for sentiment predictions
Medium confidence: Outputs softmax-normalized probability distributions over sentiment classes (positive/negative), enabling confidence-based filtering and decision-making. The model produces logits that are converted to probabilities via softmax, allowing downstream systems to reject low-confidence predictions or apply different handling strategies based on confidence thresholds. Supports both hard predictions (argmax class) and soft predictions (probability distributions) for flexible integration into decision pipelines.
Provides raw logits and softmax probabilities for both sentiment classes, enabling confidence-based filtering and decision-making without additional uncertainty quantification. The small model size (23.5M params) makes confidence scores computationally cheap to generate at scale.
Simpler than Bayesian approaches (Monte Carlo Dropout, ensemble methods) but less robust to distribution shift; sufficient for basic confidence filtering but requires post-hoc calibration for well-calibrated probabilities.
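A confidence-filtering sketch; the 0.9 cutoff is an arbitrary example threshold that should be tuned per application and, as noted above, ideally calibrated post hoc.

```python
# Soft predictions with a confidence gate.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "daekeun-ml/koelectra-small-v3-nsmc"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id).eval()

inputs = tokenizer("그냥 그랬어요", return_tensors="pt", truncation=True)
with torch.no_grad():
    probs = torch.softmax(model(**inputs).logits, dim=-1).squeeze(0)  # shape: (2,)

confidence, pred = probs.max(dim=-1)
if confidence.item() < 0.9:                  # arbitrary example threshold
    print(f"low confidence ({confidence.item():.2f}): route to fallback handling")
else:
    print(f"class {pred.item()} with p={confidence.item():.2f}")
```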
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifacts (sharing capabilities)
Artifacts that share capabilities with koelectra-small-v3-nsmc, ranked by overlap. Discovered automatically through the match graph.
ko-sroberta-multitask
sentence-similarity model. 1,763,322 downloads.
koelectra-base-v3-finetuned-korquad
question-answering model. 84,777 downloads.
koelectra-small-v2-distilled-korquad-384
question-answering model. 153,788 downloads.
kobart-summary-v3
summarization model. 41,843 downloads.
opus-mt-ko-en
translation model. 406,769 downloads.
twitter-xlm-roberta-base-sentiment
text-classification model. 1,159,018 downloads.
Best For
- ✓Korean NLP teams building sentiment analysis systems without cloud API dependencies
- ✓Startups needing low-latency, on-device sentiment classification for Korean content
- ✓Researchers benchmarking Korean text classification models against ELECTRA baselines
- ✓Companies processing sensitive Korean customer data requiring on-premise inference
- ✓Backend engineers building batch processing pipelines for Korean sentiment analysis
- ✓Data teams running nightly jobs to classify large Korean text corpora
- ✓ML ops teams deploying inference services with SLA requirements for throughput
- ✓Production ML teams requiring secure, reproducible model loading without pickle vulnerabilities
Known Limitations
- ⚠Binary classification only (positive/negative) — no neutral/multi-class support or confidence-weighted gradations
- ⚠Trained exclusively on movie review domain (NSMC) — may have domain shift when applied to non-review Korean text (e.g., news, technical docs, social media slang)
- ⚠Small model size (23.5M params) trades off accuracy for speed — likely lower F1 than larger BERT-base or KoBERT models on out-of-domain data
- ⚠No built-in handling of sarcasm, negation scope, or context-dependent sentiment in Korean (e.g., '별로' as subtle negative)
- ⚠Requires Korean text preprocessing (tokenization, normalization) — no automatic handling of typos, abbreviations, or internet slang common in Korean social media
- ⚠Dynamic padding yields variable sequence lengths across batches — incompatible with strict fixed-shape tensor requirements (e.g., ONNX export with fixed input shapes)
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
Model Details
About
daekeun-ml/koelectra-small-v3-nsmc — a text-classification model on HuggingFace with 2,355,884 downloads
Alternatives to koelectra-small-v3-nsmc
⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts. 🎯 Say goodbye to information overload: an AI assistant for opinion monitoring and trending-topic filtering. Aggregates trending topics from multiple platforms plus RSS subscriptions, with precise keyword filtering. AI-curated news, AI translation, and AI analysis briefs are pushed straight to your phone; MCP integration enables natural-language conversational analysis, sentiment insight, and trend prediction. Docker is supported, with data self-hosted locally or in the cloud. Smart push notifications integrate with WeChat/Feishu/DingTalk/Telegram/email/ntfy/bark/Slack and more.
Compare →
The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.
Compare →