Alex Miller's Avatar

Alex Miller

@alexmillerdb

Database Papers as a Service

1,929
Followers
130
Following
360
Posts
22.10.2024
Joined
Posts Following

Latest posts by Alex Miller @alexmillerdb

Post image Post image

[CIDR '25] Adaptive Factorization Using Linear-Chained Hash Tables
vldb.org/cidrdb/pape...

Adaptive execution + factorization + WCOJ = great paper.

The best intro to factorized databases I know of is www.youtube.com/watc....

10.03.2026 16:00 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Burnout is breaking a sacred pact Here’s how to fix it

usefulfictions.substack.com/p/burnout-is... has had me thinking a lot about what is β€œrewardingβ€œ work and trying to separate β€œI should be doing this to achieve things I want to have done” vs β€œI enjoy this”

10.03.2026 02:02 πŸ‘ 6 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Sign Up Β· Scour Scour interesting reads from noisy feeds you can't keep up with and smaller sites you didn't know to check.

And scour.ing/@linearizabl... now works for the interests

09.03.2026 22:17 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image Post image

[VLDB '25] MD-MVCC: Multi-version Concurrency Control for Schema Changes in Azure SQL Database
www.vldb.org/pvldb/v...

A great discussion of the end-to-end impact of allowing multiple versions of schema metadata information to be live concurrently, in a real, production system.

09.03.2026 16:00 πŸ‘ 8 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

I think scour.ing/@linearizabl... or scour.ing/feed/https:%... should work for the likes

08.03.2026 23:29 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

@emschwartz.me has been super responsive to feedback about improving the signal to noise ratio too! :) There's already been a couple great rounds of adjustments, and I look forward to the continued refinement

08.03.2026 18:27 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Scour Scour interesting reads from noisy feeds you can't keep up with and smaller sites you didn't know to check.

https://scour.ing/ has gotten pretty good at surfacing what new stuff I actually want to read on the internet, better than following subreddits. You can see my feed of mostly database things at scour.ing/@linearizable. It surfaces small personal blogs particularly well.

08.03.2026 17:51 πŸ‘ 21 πŸ” 2 πŸ’¬ 1 πŸ“Œ 1

The recording finally went great this time. I also demo'd doing a backup audio recording so that we can more reliably get a good recording to post, so hopefully the trend will continue 🀞

02.03.2026 01:55 πŸ‘ 8 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Preview
three cartoon animals with arrows on their heads are standing in the grass ALT: three cartoon animals with arrows on their heads are standing in the grass
28.02.2026 01:08 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Sitting down with a coding agent and Kuzu/ladybugdb to understand how factorized representations work at the code level across query processing is worth the time and effort

22.02.2026 17:53 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

β€œThe Manga Guide to Databases” fits the criteria for sure

(Which I think I’ve seen you show you have a copy of it already.)

18.02.2026 17:51 πŸ‘ 4 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
LadybugDB - Open Source Columnar Graph Database

ladybugdb.com is the fork & continue project, but I think there’s a slightly different roadmap to be more object storage integrated

15.02.2026 17:13 πŸ‘ 8 πŸ” 1 πŸ’¬ 1 πŸ“Œ 1
Preview
Faster, more flexible databases could be coming to FileMaker or iWork Despite owning FileMaker, Apple has never included a database app with iWork. Apple has now acquired Kuzu, Inc, a firm developing fast, flexible graph databases.

Turns out the rumor of this being an Apple acquisition was actually true: appleinsider.com/articles/26/...

14.02.2026 21:56 πŸ‘ 5 πŸ” 0 πŸ’¬ 0 πŸ“Œ 1

Does anyone know of a good webapp or discord bot or something to help manage a reading group? Something that keeps a list of suggesting things to read, can do voting on the next thing to read, and maybe has a bit of curation support for when the to-read list gets unmanageable?

01.02.2026 02:59 πŸ‘ 5 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

One of our recordings had spotty audio because the presenter would step to the side while talking to gesture at slides. Would that also then mean they’d step out of line for a shotgun mic? I have no idea how precisely directional those actually are.

07.01.2026 18:12 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I’m willing to spend the money, I just know nothing about audio equipment. If you have some audio gadget friend to give a trusted answer for β€œwhat type of microphone and which product should I go buy for this?” that’d be great. I think I can borrow a cheap lapel mic as a test to see if that’s good.

07.01.2026 18:08 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

A 2026 hope of mine is to get our own recording setup figured out so that we can more reliably get recordings up. We’ve been about 50/50, and I feel bad for the speakers when they come give a great talk, but then the recording doesn’t work out for whatever reason. (Like the Morel and QOaaS talks 😒)

06.01.2026 07:46 πŸ‘ 2 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
South Bay Systems: Innovative Data Systems Research Β· Luma Welcome to another edition of South Bay Systems! This time we bring you three wonderful talks from authors at the just-finishing Conference in Innovative Data…

Our next event will be on January 21st, featuring speakers from (the just-finishing) CIDR! Come to Databricks to hear about:
* DuckDB on xNVMe by @pinartozun.bsky.social of ITU
* Spilling in QP by Maximilian Kuschewski of TUM
* NPUs in DBs by Alexander Baumstark of TU-Ilmenau
luma.com/8a54z94d

05.01.2026 19:46 πŸ‘ 12 πŸ” 4 πŸ’¬ 2 πŸ“Œ 2

"Diva: Making MVCC Systems HTAP-Friendly" dl.acm.org/doi/pdf/10.1... also feels underappreciated, as they literally did an implementation in *both* mysql and postgres.

Seoul National University's DBX Lab has been looking into this area overall for a little while dbx.snu.ac.kr/publications

30.12.2025 22:44 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I've seen a lot of MySQL and Postgres storage discussion as MVCC Wars: VACUUM vs Undo Log. I'd love to see an implementation of a Time Split B-Tree (dl.acm.org/doi/10.1145/...). It's a simple, yet very different design point. You gain the MVCC scan benefits of a CoW BTree, but can be multi-writer.

30.12.2025 22:44 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

"The Case for 2-Tree for Skewed Datasets" www.cidrdb.org/cidr2023/pap... is a really fun read paired with Bf-tree, as the two papers try to solve the same high-level problem of not keeping cold data in cache, but with two very different approaches.

And, if you're interested in other reading...

30.12.2025 22:44 πŸ‘ 1 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

(The uptick occurred after bsky.app/profile/benj... )

30.12.2025 20:11 πŸ‘ 7 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

I’ve recently seen multiple, unrelated instances of people referencing Bf-trees. Good job, @benjdd.com.

30.12.2025 20:09 πŸ‘ 9 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

Do you have to tell it in the prompt anything about β€œplease look up any key function’s documentation” or something to get the tools to be used, or do you generally see it making reasonable decisions already?

28.12.2025 21:47 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

It's a notable bit slower, but gemini has a surprisingly generous free tier for its CLI, and I'd rather have slower and correct than the loops of incorrect fixes I'd be sent on before.

Maybe there's some "fetch rust docs" tool that'd be even more helpful that I don't know about?

28.12.2025 20:10 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Asking a coding agent to run `cargo build` and read referenced source files for context has made LLMs significantly more helpful and accurate at actually understanding why a compilation error is happening and being able to explain an appropriate fix. Much better than copy-pasting into online LLMs.

28.12.2025 20:10 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Database Research needs an Abstract Relational Query Language For decades, SQL has been the default language for composing queries, but it is increasingly used as an artifact to be read and verified rather than authored. With Large Language Models (LLMs), querie...

Looks like you manifested a paper
arxiv.org/abs/2512.12957

16.12.2025 23:52 πŸ‘ 7 πŸ” 1 πŸ’¬ 1 πŸ“Œ 1
Post image
07.12.2025 20:23 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

gist.github.com/thisismiller...

After ~2015, the focus seemed to shift to looking at stats on SSD failures from large deployments, but that's no longer a "does this SSD work right?" but a "how long until it dies?", and so I don't get why the latter replaced the former.

03.12.2025 17:27 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I had once started compiling SSD powerfault testing papers, and found that academia testing SSDs stopped ~2015. 😱

If you still have any notes of all the sources you found and looked at, I’d greatly appreciate a copy to update the posts with anything I’ve missed!

29.11.2025 23:53 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0