AI article

770 Experiments to Squeeze 30 tok/s Out of a 35B MoE Model on a $500 GPU

770 Experiments to Squeeze 30 tok/s Out of a 35B MoE Model on a $500 GPU 29.899 tokens per...

Dev.to | Apr 2, 2026 | AlexChen

Read the original article

More AI news