Original essay: papers.ssrn.com/sol3/papers....
New article!
My thoughts on the "slow death of scaling" essay by Sara Hooker
Ok, I'll confess! I too like Roland Emmerich's Godzilla. I even like the creature design in this film!
The one Frankenstein film to rule them all!
Thank you, @realgdt.bsky.social!
The Consciousness API: What if consciousness isn't contained within us, but rather we are temporary antennas, tuning into a vast, universal broadcast of awareness?
In this article, I explore the story behind some of the ideas introduced in the Transformer paper, from the fundamental attention mechanism that lies at its heart to the surprisingly simple explanation for its name.
You may find it interesting!
Link below 👇
We're particularly proud to release Aya Vision 8B - it's compact and efficient, outperforming models up to 11x its size.
Releasing open weights helps to make breakthroughs in VLMs accessible to the research community.
Event on Mozilla AI discord: discord.gg/QTCRfefF?eve...
ProGen paper: www.biorxiv.org/content/10.1...
🧬 Join us this Wednesday on the @mozilla.ai Discord server for the second session of our Biological Representation Learning series, where we discuss landmark papers in the field!
We will be presenting the ProGen protein language model paper from Salesforce. See you there!
📢 Join us on Discord for our first Blueprints Hub event 📢
Discover Blueprints and learn how to transform text into podcast-style conversations using entirely open source tools.
🗓️ Wednesday, Jan. 22nd
⏰ 1:30-2:00 PM EST
📍 Event: discord.gg/BaYFBaeh?eve...
#OpenSource #AI #Blueprints #MozillaAI
As @cohereforai.bsky.social joins the Bluesky family, we will be sharing paper gems from when we first started as a lab.
This paper is part of a larger research agenda focused on better representing the long tail: making AI work for almost all real-world distributions.
Meet Helium-1 preview, our 2B multilingual LLM targeting edge and mobile devices, released under a CC-BY license. Start building with it today!
huggingface.co/kyutai/heliu...
And lastly, big thanks to you for making it this far 🤗, and don't forget to read the paper!
www.dataprovenance.org/Multimodal_D...
11/n
Big thanks to Melissa Heikkilä for featuring our work in MIT Tech Review.
www.technologyreview.com/2024/12/18/1...
Xuhui Zhou, Caiming Xiong, Luis Villa,
@stellaathena.bsky.social, Alex Pentland,
@sarahooker.bsky.social, Jad Kabbara
9/n
An Dinh, Shrestha Mohanty, Deividas Mataciunas,
Tobin South, Jianguo Zhang,
@arielnlee.bsky.social, Campbell S. Lund, Christopher Klamm, Damien Sileo, Diganta Misra, Enrico Shippole, Kevin Klyman, Lester JV Miranda, Niklas Muennighoff, Seonghyeon Ye, Seungone Kim, Vipul Gupta, Vivek Sharma
8/n
A big thanks to all the contributors to this huge and magnificent effort. I'm truly honored to have had the chance to work alongside all of you: Manan Dey, Nayan Saxena,
Ahmad Mustafa Anis, Emad A. Alghamdi, Vu Minh Chien, Naana Obeng-Marnu, Da Yin, Kun Qian, Yizhi Li, Minnie Liang
7/n
This work was supported by the Mozilla Foundation Data Futures Lab, and was led by: @shaynelongpre.bsky.social, Nikhil Singh, Manuel Cherep, Kushagra Tiwary, Joanna Materzynska,
William Brannon, and Robert Mahari
6/n
4️⃣ Linguistic representation has not improved by most measures: Gini coefficients for text and speech datasets show significant concentration, indicating limited progress in diversifying data sources.
5/n
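As a side note for readers unfamiliar with the measure: a minimal sketch of how a Gini coefficient captures the concentration the thread describes. The counts below are hypothetical, not taken from the audit.

```python
def gini(values):
    """Gini coefficient: 0.0 = perfectly even shares, values near 1 = highly concentrated."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    # Rank-weighted sum identity for the Gini coefficient over sorted values
    cum = sum((i + 1) * x for i, x in enumerate(xs))
    return (2 * cum) / (n * total) - (n + 1) / n

# Hypothetical token counts per data source
even = [100, 100, 100, 100]
skewed = [1000, 10, 5, 1]
print(gini(even))    # → 0.0 (no concentration)
print(gini(skewed))  # ≈ 0.74 (one source dominates)
```

A coefficient that stays high over time is what "limited progress in diversifying data sources" means here: a few sources keep holding most of the mass.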
3️⃣ Geographical representation has not improved in a decade: datasets from African and South American organizations account for less than 0.2% of content across all modalities, while North American and European organizations account for 93% of text tokens and over 60% of speech and video hours.
4/n
2️⃣ Inconsistent dataset licenses: while ~30% of datasets have permissive licenses, 78%+ of their sources carry hidden anti-crawling or licensing restrictions, making compliance a minefield.
3/n
🔑 Key Findings
1️⃣ The web is still the primary source: the internet, social media platforms, and synthetically generated data are increasingly the predominant sources of multimodal data, displacing curated sources.
2/n
✨ Excited to share our latest work from The Data Provenance Initiative!
This is the most comprehensive audit of multimodal training data to date, covering ~4000 datasets from 1990 to 2024 and more than 400 unique tasks in 608 languages!
🧵 1/n
EPIC!
500!
Our Community Computer Vision Course repo just reached 500 stars on GitHub: github.com/johko/comput... 🤩
I'm really proud of all the amazing content the community has contributed here, and that people keep adding cool and helpful material 💪
The Hudsucker Proxy is the most underrated Coen Brothers film!
Funny thought: if "post-training" refers mostly to supervised instruction-tuning and alignment of a "pre-trained" model, then where does the actual "training" happen?!