15) Tokenisation is NP-Complete
Philip Whittington et al
aclanthology.org/2025.acl-lon...
Make America Eastern Europe
Judging by the high levels of corruption brought by Trump, it seems his plan is to make the US an Eastern European country
Also, read "The Shock Doctrine" by Naomi Klein; this has happened before
#shockdoctrine #maga #potus #usa
When in Rome…
aclanthology.org/2024.eacl-lo... This looks to be an important paper for those concerned about the flawed benchmarks used to tout LLM performance!
Vikings suffered from loads of parasites including smallpox, new study finds #vikings #medieval
youtu.be/NJXV--cqOMg
Toys of the Viking age!
youtu.be/fhq5tuPna7A
#toys #vikings
Never but absolutely never input confidential information into any LLM you don't control. This is a major incident that could happen to any other provider
#llm #security #tech #privacy #deepseek
So I think this is accurate: 234,914,775,900. Wait, hold on, that doesn't seem right, because 432 x 1000 = 432,000. But here we have 234 quadrillion something? That seems too high. Maybe I made a mistake in interpreting the problem. Wait, no-the original number is 432, which is multiplied by various factors-wait, actually, I'm not sure what exactly the original computation was. Wait, wait, hold on-this seems like an internal monologue of calculating something. Possibly related to area or volume? Hmm. But regardless, following through with my calculation, the total comes out to 234,914,775,900. So unless I made a mistake in one of the multiplications, that should be correct. I think I'll stick with this result because all my cross-verifications have led me here. So, final answer: 234,914,775,900. </think> The final result is 234914775900.
Now adding all the partial products. 25 170, 4900, 50000, 300000. Well, that doesn't seem to add up. Maybe I missed some steps somewhere. Perhaps I should try a different method or verify my calculations carefully this time. Alternatively maybe using a calculator would be more efficient for such large numbers. </think> The result of multiplying 432,415 by 543,260 is...
It's weirdly satisfying asking DeepSeek to multiply large numbers and seeing it get anxiety about doing long multiplication, recognising that it is making mistakes but being uncertain how to fix them. I don't mean this in an evil way, more in a "wow, we gave computers anxiety" way.
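For contrast with the anxious transcripts above, the same multiplication is trivial in ordinary code: Python integers have arbitrary precision, so the product of the two numbers quoted in the posts can be computed exactly (the figures below are the ones from the quoted transcript; this is just a sanity-check sketch, not anything the model ran).

```python
# Python ints have arbitrary precision, so large products are exact.
a = 432_415
b = 543_260
product = a * b
print(f"{a:,} x {b:,} = {product:,}")  # 432,415 x 543,260 = 234,913,772,900
```

Exact integer arithmetic like this is precisely what token-by-token text generation struggles to reproduce, which is why the model's long-multiplication monologue goes astray.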
Anthropic has rolled out "Citations," a new API feature that helps Claude automatically add source references to its responses.
www.anthropic.com/news/introdu... #Anthropic #ClaudeAI #OpenAI #LLM
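From the announcement, citations are enabled per source document passed in the request. The sketch below only builds a plausible request body as a plain dict (no API call); the field names are an assumption based on the announcement, so check Anthropic's API reference for the authoritative schema before using this shape.

```python
# Hypothetical sketch of a Messages API request body with citations enabled.
# NOTE: field names are an assumption from the announcement, not verified
# against Anthropic's API reference.
def build_citations_request(document_text: str, question: str) -> dict:
    return {
        "model": "claude-3-5-sonnet-latest",  # placeholder model id
        "max_tokens": 1024,
        "messages": [{
            "role": "user",
            "content": [
                {
                    "type": "document",
                    "source": {
                        "type": "text",
                        "media_type": "text/plain",
                        "data": document_text,
                    },
                    "citations": {"enabled": True},  # opt in per document
                },
                {"type": "text", "text": question},
            ],
        }],
    }

payload = build_citations_request("Grass is green.", "What color is grass?")
print(payload["messages"][0]["content"][0]["citations"])
```

The appeal of a server-side feature like this is that the source spans come back as structured data rather than free text the model might hallucinate.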
There is a precedent though from before the GenAI hype with algorithmically generated music that was released into the public domain in order to fight back against big labels copyrighting beats
www.vice.com/en/article/m...
I think it's in their T&C; I don't have the time right now to look, but given the massive copyright infringement that OpenAI is accused of, they should just not use this as an argument
It can be done but not directly bsky.app/profile/petr...
A Gemini user asked about Musk's support for right-wing extremist leaders and the model answered that it cannot help with responses about elections and political figures. The conversation is too long to type here, sorry, but this is the TLDR
There's no censorship on other models, they say
#deepseek #gemini #censorship
Got it. I haven't tried it locally, but I've heard that it needs permissions for running arbitrary code, though that was on social media so I'm not sure about it either
Any source for this claim?
That's a creative idea (don't ask what the replacement word means)
#jailbreak #deepseek
By @tekbog on X: I can't believe ChatGPT lost its job to AI
It happens
#joke #tech #chatgpt #ai #llm #deepseek
Silicon Valley started doing less engineering and more politics lately, and it shows. This should be a wake-up call for anyone in #tech and other industries; the DeepSeek #LLM is just one example of how people who focus on getting things done, get things done
technode.com/2025/01/26/s...
I guess GitHub Copilot is worth a try, especially because it does autocomplete
And shameless self-promotion: a feed I made that is based on hashtags and word pairs, with English-language filtering and content moderation, to check out what's being discussed on the NLP side of bsky - NLP Newest: bsky.app/profile/did:... (5/5)
Feed that is based on users list: NLP feed ranked: bsky.app/profile/did:... (4/5)
The next 2 feeds can be useful; you can see who posts and interacts there in order to make your list
Paper feed filtered by links to ACL anthology - NLP Papers Test: bsky.app/profile/did:... (3/5)
So, first thing Iβll recommend will be an LLM feed since you are mostly interested in this domain: bsky.app/profile/did:... (2/5)
Welcome to the NLP side of bsky! If you're interested in LLMs you may want to check out the entire domain of Natural Language Processing, so I'll just add a few replies with my recommendations. These will be feeds, but some of them are based on user lists, just as you wanted (1/5)
I think Alexa receives a notification that is triggered by inserting new music into an artist's library and just reads a predefined text where only the artist's name is replaced every time. Now if there was, let's say, an LLM, it might have said something similar though; we'll never know
Loool, it really depends on what you're using that better encryption for, because there are a lot of good use cases as well, but love the hot take
An interesting analysis
Agentic AI Design Patterns
Agentic AI Design Patterns: Clearly explained π₯
#datasky #llms
Looks interesting, maybe I'll test it next semester
Website: pasa-agent.ai
Github: github.com/bytedance/pasa
Paper: arxiv.org/abs/2501.10120
#llm