Best Alternatives to How I topped the HuggingFace open LLM leaderboard on two gaming GPUs
20 alternatives ranked by real usage data. How I topped the HuggingFace open LLM leaderboard on two gaming GPUs scores 43/100 — 20 tools score higher.
I found that duplicating a specific block of 7 middle layers in Qwen2-72B, without modifying any weights, improved performance across all Open LLM Leaderboard benchmarks and took #1. As of 2026, the top 4 models on that leaderboard are still descendants. The weird finding: single-layer duplication do
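
As a rough illustration of the block duplication described in the blurb, the minimal Python sketch below builds the order in which layers would run when a contiguous block is repeated once and no weights are changed. The function name `duplicate_block`, the 80-layer stack size, and the block start index of 40 are assumptions made for illustration only; the blurb states the block length (7) and that it sits in the middle of the model, not which layers were used.

```python
def duplicate_block(num_layers, start, length):
    """Return the layer execution order with layers [start, start + length) run twice.

    Only indices are rearranged: the repeated entries refer to the same
    layers (and so the same weights) as the first pass through the block.
    """
    if start < 0 or length <= 0 or start + length > num_layers:
        raise ValueError("block must lie inside the layer stack")
    block_end = start + length
    order = list(range(num_layers))
    # Run up to the end of the block, repeat the block, then continue as usual.
    return order[:block_end] + order[start:block_end] + order[block_end:]


# Hypothetical values, not taken from the source.
NUM_LAYERS = 80   # assumed depth of the base model
BLOCK_START = 40  # assumed position of the "middle" block
BLOCK_LEN = 7     # block length stated in the blurb

order = duplicate_block(NUM_LAYERS, BLOCK_START, BLOCK_LEN)
print(len(order))  # 87: the original 80 layers plus 7 repeated ones
```

The resulting list could then drive whatever tooling assembles the deeper model; that step is outside what this sketch shows.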