How we sped up transformer inference 100x for 🤗 API customers
