Alibaba reports that its 397B‑A17 Qwen 3.5 outperforms bigger rivals using multi‑token prediction and a mixture‑of‑experts design, while staying cheaper to run. Curious how sparse parameters reshape AI? Dive in. #Qwen3_5 #MixtureOfExperts #MultiTokenPrediction
🔗 aidailypost.com/news/alibaba...