
Dr. Miko

@doctormiko

Accidental Data Scientist, former mathematician and theoretical computer scientist. Love all the things. Some current and past interests: boardgames, home brewing, coffee, D&D, self-hosting, Argentine tango. Dormant blog: https://datacasual.com/

614 Followers · 524 Following · 59 Posts · Joined 26.10.2024

Latest posts by Dr. Miko @doctormiko

Oh wow, deepseek is starting to make serious progress on LLMs that offload memory to external storage: github.com/deepseek-ai/...

12.01.2026 18:44 👍 219 🔁 25 💬 6 📌 8

Just finished AoC for the first time (a bit late ok). Thanks @was.tl !

10.01.2025 21:12 👍 1 🔁 0 💬 0 📌 0

It took him almost 6 years, but @howard.fm finally did it: I made my first contribution to an open source project :D

15.12.2024 11:50 👍 1 🔁 0 💬 1 📌 0

Google: oh by the way, we have Gemini 2 Flash. AND A REAL-TIME MULTIMODAL API. What?

11.12.2024 17:40 👍 0 🔁 0 💬 0 📌 0

I’m wondering: are there #OpenAi folks here?

07.12.2024 08:43 👍 0 🔁 0 💬 0 📌 0

Happy to report that the mysterious "David Mayer" problem is no more. ChatGPT can now David Mayer to your heart's content...

07.12.2024 01:16 👍 1 🔁 0 💬 0 📌 0

Fun fact: on my "What is the smallest integer such that its square is larger than 15 and smaller than 35?" test, o1 did worse than o1-preview, sticking to its answer of 4 even after giving the correct definition of an integer.

05.12.2024 19:54 👍 0 🔁 0 💬 0 📌 0

I wish “data driven” didn’t mostly mean that the data is being driven

04.12.2024 18:20 👍 0 🔁 0 💬 0 📌 0

Apparently not just LLMs completely misunderstand the issue...

04.12.2024 13:17 👍 0 🔁 0 💬 0 📌 0
re — Regular expression operations Source code: Lib/re/ This module provides regular expression matching operations similar to those found in Perl. Both patterns and strings to be searched can be Unicode strings (str) as well as 8-...

Python regex TIL: `.` does _not_ match a newline character. If you want to treat all the lines as one big string, use the `re.DOTALL` flag. Also interesting: the `re.MULTILINE` flag makes `^` and `$` match at the start and end of each line, respectively. docs.python.org/3/library/re...

03.12.2024 11:18 👍 1 🔁 0 💬 0 📌 0
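
A minimal sketch of the two flags mentioned in the post above, using only the standard `re` module. The sample string and the variable names are made up for illustration; they are not from the post.

```python
import re

# Hypothetical two-line sample text.
text = "first line\nsecond line"

# By default `.` stops at a newline, so the match ends with the first line.
default_match = re.search(r"first.*", text).group()

# With re.DOTALL, `.` also matches "\n", so the match runs to the end of the string.
dotall_match = re.search(r"first.*", text, flags=re.DOTALL).group()

# By default `^` only matches at the very start of the string.
default_starts = re.findall(r"^\w+", text)

# With re.MULTILINE, `^` and `$` also match at the start and end of each line.
multiline_starts = re.findall(r"^\w+", text, flags=re.MULTILINE)
multiline_ends = re.findall(r"\w+$", text, flags=re.MULTILINE)

print(repr(default_match))
print(repr(dotall_match))
print(default_starts, multiline_starts, multiline_ends)
```

The two flags are independent and can be combined with `|` (for example `re.DOTALL | re.MULTILINE`) when both behaviours are wanted.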

After witnessing the democratic wreck that the American constitution allows, I am wondering "What did Gödel see?"

02.12.2024 10:13 👍 0 🔁 0 💬 0 📌 0
Post image

😎

30.11.2024 21:14 👍 1 🔁 1 💬 0 📌 0
Post image

ChatGPT cannot say “David Mayer”. WTF?

30.11.2024 19:19 👍 0 🔁 0 💬 2 📌 0

Once again I need to thank @howard.fm who inspired me (and taught me a few tricks) to be able to gain a really good intuition around the content of the “Attention is all you need” paper

30.11.2024 18:18 👍 0 🔁 0 💬 0 📌 0

Then:
- On 🦋: 🦋 is so much better than birdsite!
- On birdsite: oh, they only talk about birdsite on 🦋!

Now:
- On 🦋: BURN AI TO THE GROUND! (also, very interesting AI stuff)
- On birdsite: Lol! Look at 🦋 falling!

WHY CAN'T WE HAVE GOOD THINGS?

29.11.2024 12:12 👍 1 🔁 0 💬 0 📌 0

EDIT: What is the smallest integer such that its square is larger than 15 and **smaller** than 35?
Dammit. Long thread and I get the first post wrong.

29.11.2024 11:18 👍 1 🔁 0 💬 0 📌 0

1. I suspect that the biggest issue is in _comparing_ numbers rather than tokenisation. Especially when negatives are involved.
2. Prompting and system prompts matter: the fact that AVM tends to wander and gets it wrong way more often than 4o is very interesting
3. Yay for QwQ! 🎉 (6/6)

29.11.2024 11:16 👍 1 🔁 0 💬 0 📌 0

I then asked "What about negative numbers?"

- 4o got it right once ✅ and another time decided the answer is -4 ❌
- 4o in AVM decided that 5 and -5 are both solutions ⁉️
- Sonnet 3.5 changed the answer to -4 ❌
- Opus 3, Gemini-exp-1121 and Gemini-1.5-Pro got it right ✅

What to make of it? (5/6)

29.11.2024 11:16 👍 1 🔁 0 💬 1 📌 0

- o1-preview got it right ✅
- o1-mini got it right ✅, but also added -4 as an alternative 🤷
- 4o stubbornly stuck to its guns, adding a cheeky smile ❌
- 4o in Advanced Voice Mode changed its answer to 5. ❌🤷
- Sonnet 3.5, Opus 3, Gemini-exp-1121, and Gemini 1.5 Pro insisted on 4 ❌ (4/6)

29.11.2024 11:16 👍 1 🔁 0 💬 1 📌 0

These answered 4 ❌
- OpenAI o1-preview, o1-mini and 4o
- Anthropic Sonnet 3.5 and Opus 3
- Google Gemini-exp-1121 and Gemini 1.5 Pro

I then asked "what is an integer?" (which they all answered correctly) and then again "do you want to change your original answer?"

The results: (3/6)

29.11.2024 11:16 👍 1 🔁 0 💬 1 📌 0

QwQ 32B Preview is the only model that got it right out of the box. Most of the time. Sometimes it did not doubt itself enough and stopped early on 4. Another time it found that, depending on the interpretation of the question, both 4 and -5 might be correct, and it chose 4. Pass ✅. (2/6)

29.11.2024 11:16 👍 1 🔁 0 💬 1 📌 0

I asked this question

What is the smallest integer such that its square is larger than 15 and smallest than 35?

To a bunch of models. They ALL* answered 4 instead of the correct answer (-5).

Let me dive into a 🧵:

*Ok, almost all of them. See below. (1/6)

29.11.2024 11:16 👍 3 🔁 0 💬 2 📌 0
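
The claim in the post above (that the answer is -5, not 4) can be checked by brute force. This is only a sketch: it reads the second condition as "smaller than 35", as the EDIT post states, and the search range of -10 to 10 is an assumption chosen because any integer with absolute value 6 or more already has a square of at least 36.

```python
# Assumed search range: |n| >= 6 gives n*n >= 36, so -10..10 covers every candidate.
candidates = [n for n in range(-10, 11) if 15 < n * n < 35]

# Smallest integer overall, negatives included.
smallest = min(candidates)

# Smallest candidate if only non-negative integers are considered,
# which is the reading that leads to the answer 4.
smallest_non_negative = min(n for n in candidates if n >= 0)

print(candidates)             # [-5, -4, 4, 5]
print(smallest)               # -5
print(smallest_non_negative)  # 4
```

The gap between the two results is exactly the gap discussed in the thread: 4 is correct only if negative integers are silently excluded.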
Arguing About D&D in the 1970s. YouTube video by Matthew Colville

I really really liked this video from @mattcolville.bsky.social

If you are interested in D&D, its history and evolution, and have an hour or so to spare well worth che

youtu.be/wDCQspQDchI?...

28.11.2024 10:51 👍 1 🔁 0 💬 0 📌 0

How do you block/mute a list?

28.11.2024 08:15 👍 0 🔁 0 💬 0 📌 0

I don’t get it: for the first problem it’s the only model giving the correct answer. Or am I missing something?

28.11.2024 00:23 👍 0 🔁 0 💬 2 📌 0

What is the verdict based on?

28.11.2024 00:17 👍 0 🔁 0 💬 1 📌 0

of course

27.11.2024 22:25 👍 0 🔁 0 💬 0 📌 0

I like using the standard library when I can, but this is good to know

27.11.2024 21:52 👍 0 🔁 0 💬 1 📌 0

Python TIL: `prod` in the `math` module exists. Thanks @howard.fm

27.11.2024 16:59 👍 8 🔁 1 💬 1 📌 0
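
For readers who have not met it, a short sketch of `math.prod` (available in the standard library since Python 3.8). The input list and variable names are illustrative only.

```python
import math

# Hypothetical input values.
factors = [2, 3, 7]

# Product of all items in the iterable.
product = math.prod(factors)

# The keyword-only `start` argument sets the initial value (default 1).
product_with_start = math.prod(factors, start=10)

# The product of an empty iterable is the start value, i.e. 1 by default.
empty_product = math.prod([])

print(product, product_with_start, empty_product)
```

It is the multiplicative counterpart of the built-in `sum`, and replaces the older `functools.reduce(operator.mul, ...)` idiom.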