๐ณ Some notes on "DeepSeek and export control"
Finally took time to go over Dario's essay on DeepSeek and export control and wrote some notes. I mostly disagree and I think it missed the point.
I wrote some reflections on DeepSeek, open-source, AI, US and China, starting from Dario's recent essay calling for stronger export controls.
I mostly disagree with his essay and think it missed the point
You can read it here: thomwolf.io/blog/deepsee...
01.02.2025 15:07
๐ 52
๐ 9
๐ฌ 2
๐ 0
Yeah I just attempted that a few weeks ago and ended up bricking it. Was fun reinstalling everything and dealing with a bunch of broken drivers.
26.11.2024 13:48
๐ 1
๐ 0
๐ฌ 0
๐ 0
Deep global descriptors give a convenient way for retrieval, but local descriptors are a game changer in finding needles in a haystack (particular objects in clutter). Due to their high cost, with AMES we optimize the performance/memory trade-off during re-ranking. #ECCV2024
20.11.2024 21:14
๐ 32
๐ 8
๐ฌ 1
๐ 0
Nvidia's Hymba - an efficient small language model with hybrid architecture.
Their architecture consists of Hymba hybrid blocks, with Mamba and Attention connected in parallel. They found this design to be more effective in disentangling attention into linear and non-linear components.
22.11.2024 05:41
๐ 32
๐ 3
๐ฌ 1
๐ 1
๐๐ผ๐ฒ๐ ๐ฎ๐๐๐ผ๐ฟ๐ฒ๐ด๐ฟ๐ฒ๐๐๐ถ๐๐ฒ ๐ฝ๐ฟ๐ฒ-๐๐ฟ๐ฎ๐ถ๐ป๐ถ๐ป๐ด ๐๐ผ๐ฟ๐ธ ๐ณ๐ผ๐ฟ ๐๐ถ๐๐ถ๐ผ๐ป? ๐ค
Delighted to share AIMv2, a family of strong, scalable, and open vision encoders that excel at multimodal understanding, recognition, and grounding ๐งต
paper: arxiv.org/abs/2411.14402
code: github.com/apple/ml-aim
HF: huggingface.co/collections/...
22.11.2024 08:32
๐ 59
๐ 19
๐ฌ 3
๐ 1