Hi! Sorry for the confusion -- we aren't the organizers, just lucky to be presenting in an ELLIS reading group. I'm guessing a non-Gmail email should also work once they confirm the registration.
And thank you, @wzuidema.bsky.social! The calendar file Jelle shared contains the Zoom info.
06.02.2026 08:36
Why don't neural networks learn all at once, but instead progress from simple to complex solutions? And what does "simple" even mean across different neural network architectures?
Sharing our new paper @iclr_conf led by Yedi Zhang with Peter Latham
arxiv.org/abs/2512.20607
03.02.2026 16:19
How does in-context learning emerge in attention models during gradient descent training?
Sharing our new Spotlight paper @icmlconf.bsky.social: Training Dynamics of In-Context Learning in Linear Attention
arxiv.org/abs/2501.16265
Led by Yedi Zhang with @aaditya6284.bsky.social and Peter Latham
04.06.2025 11:22