Google: Gemma 3 4B (free)Model24/100 via “text generation with controlled output length and format”
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Unique: Learns format and length preferences from instruction-tuning data rather than using explicit token limits or template systems, enabling natural language format requests like 'write a 3-bullet summary' without API-level constraints
vs others: More flexible than template-based generation systems and more natural than models requiring explicit token limits, while remaining free and accessible via simple API calls without complex configuration