tonetegeatinst 2 days ago

They hit the nail on the head. "These efficient models can save money, time and compute"

Even if your not trying to run tiny models on mobile devices like phones, smaller models mean less traffic, faster responses, and lower barriers to entry.