Allgemein

Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system— up to 9x increase in output lets 213 GPUs perform like 1,192

Alibaba Cloud claims its new Aegaeon pooling system reduced the number of Nvidia GPUs required to serve large language models by 82% during a multi-month beta test inside its Model Studio marketplace.