@dylanslack
Researcher at Google DeepMind, previously PhD at UC Irvine
One of Irvine's most storied institutions
Cold start means running rl on an already sft'd checkpoint?