CALM is a result of a collaboration between @convai-uiuc.bsky.social and #Oumi.
Special thanks for the great team work, it would not be possible without Jeremiah Greer, Akul Datta, Ze Yang, William Zeng, Oussama Elachqar, Manos Koukoumidis, @dilekh.bsky.social, and @gokhantur.bsky.social.
14.02.2025 18:54
π 2
π 1
π¬ 0
π 0
We are making everything open-source with open models, open data, open checkpoints!
πArxiv: arxiv.org/abs/2502.08820
π» Code: github.com/oumi-ai/oumi...
π€ Models: huggingface.co/collections/...
π€ Dataset: huggingface.co/datasets/uiu...
#ConversationalAgents #LLMs #Agents #OpenSourceAI #NLProc
14.02.2025 18:54
π 3
π 0
π¬ 1
π 0
How does the CALM model family perform?
β
Outperforms GPT-4o & other top domain-specific models on:
π MultiWOZ 2.4 (TOD)
π BFCL V3 (Function Calling)
π API-Bank (Function Calling)
Achieving top zero-shot scores not in one but across all benchmarks!
14.02.2025 18:54
π 1
π 0
π¬ 1
π 0
π₯ Trained on CALM-IT, our unified dataset blending multi-turn ReAct style TOD & complex API use, trained using the Oumi AI platform in partnership with #Oumi and #TogetherAI.
π Models: CALM 8B, CALM 70B, CALM-405B trained from Llama model series
14.02.2025 18:54
π 1
π 0
π¬ 1
π 0
Most models struggle with either long-term conversations and dialogue state tracking (TOD) or function-calling (LA).
CALM (Conversational Agentic Language Model) bridges this gap! π‘
π¦Spoiler: CALM 405B is the largest open model in BFCL V3 Leaderboard ranking #7, surpassing many proprietary models.
14.02.2025 18:54
π 1
π 0
π¬ 1
π 0
πCan a Single Model Master Both Multi-turn Conversations and Tool Use?
Introducing CALM, fully open-source Conversational Agentic Language Models with CALM 8B, CALM 70B, and CALM 405B-excelling in both multi-turn dialogue management & function calling.
πProject Page: emrecanacikgoz.github.io/CALM/
14.02.2025 18:54
π 7
π 1
π¬ 1
π 1