Capability

Bilingual Dense Transformer Inference With 34b Parameters

5 artifacts provide this capability.

Want a personalized recommendation?

Top Matches

01.AI's bilingual 34B model with 200K context option.

Unique: Unified bilingual architecture trained on 3 trillion tokens with balanced English-Chinese data composition, avoiding the performance degradation typical of post-hoc language adaptation or separate model ensembles. Maintains competitive MMLU performance (76.3%) while achieving 'particularly strong' Chinese capability through integrated training rather than fine-tuning.

vs others: Outperforms single-language 34B models on bilingual workloads by eliminating model-switching latency and inference overhead, while maintaining better English performance than Chinese-optimized models through unified training.

Bilingual Dense Transformer Inference With 34b Parameters

Top Matches

Also Known As

Company