hailey schoelkopf's Avatar

hailey schoelkopf

@hails.computer

2,358
Followers
127
Following
4
Posts
01.07.2023
Joined
Posts Following

Latest posts by hailey schoelkopf @hails.computer

so academic twitter is like actually-actually migrating this time huh?

i still don’t know if i have it in me to actively use another social network yet 😖

19.11.2024 15:33 👍 48 🔁 0 💬 7 📌 0

thank you for the kind words!! :)

12.11.2024 14:34 👍 2 🔁 0 💬 0 📌 0
Post image

introducing the new Vacuum Use (beta)

27.10.2024 20:50 👍 1 🔁 0 💬 0 📌 0

👋

05.01.2024 20:55 👍 1 🔁 0 💬 1 📌 0
Preview
Dolma: 3 Trillion Token Open Corpus for Language Model Pretraining We released Dolma, OLMo’s pretraining dataset. Dolma open dataset of 3 trillion tokens. Available on HuggingFace under the ImpACT license

We released Dolma, the dataset for OLMo, AI2's LLM. It's 3+ trillion tokens. We hope it will help w study of language models!

Available on HuggingFace w/ ImpACT license huggingface.co/datasets/allenai/dolma

Overview+datasheet blog.allenai.org/dolma-3-trillion-tokens-open-llm-corpus-9a0ff4b8da64

18.08.2023 22:21 👍 23 🔁 10 💬 1 📌 1