FriendliAI targets idle GPU capacity in cloud clusters with an inference optimization stack to monetize unused hardware during downtime. Continuous batching unlocks token throughput gains. #AI #CloudInfra #GPU
bymachine.news/friendliai-inference-idl...