Acyr Locatelli

@acyrl

Lead pre-training @Cohere

73 Followers
85 Following
2 Posts
Joined 14.09.2024

Latest posts by Acyr Locatelli @acyrl

One feature missing from @bsky.app is bookmarks. Need to keep feeding the hoarding monster

23.11.2024 10:28 👍 3 🔁 0 💬 0 📌 0
Post image

Great book by Thurston -- changed my approach to the field.
His talks are great as well.

22.11.2024 23:14 👍 4 🔁 0 💬 0 📌 0

Laura Ruis, Maximilian Mozes, Juhan Bae, Siddhartha Rao Kamalakara, Dwarak Talupuru, Acyr Locatelli, Robert Kirk, Tim Rocktäschel, Edward Grefenstette, Max Bartolo
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
https://arxiv.org/abs/2411.12580

20.11.2024 07:01 👍 14 🔁 6 💬 0 📌 1

How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️

20.11.2024 16:31 👍 854 🔁 138 💬 36 📌 24