's Avatar

@bruceritchie

35
Followers
67
Following
25
Posts
28.10.2024
Joined
Posts Following

Latest posts by @bruceritchie

Post image

This is interesting. I was testing out Zed with the Apache DataFusion repo. It's definitely quick though I'm still toying with the UI and it's behaviour. However, I happened to look at it's memory usage and it's a fair bit more than RustRover:

28.12.2025 16:43 👍 1 🔁 0 💬 0 📌 0
Inside China's Mini PC Production: How Tiny Computers Are Made
Inside China's Mini PC Production: How Tiny Computers Are Made YouTube video by SatisFactory Process

Inside China's Mini PC Production: How Tiny Computers Are Made
youtu.be/ohwI3V207Ts

14.11.2025 09:22 👍 1076 🔁 194 💬 73 📌 13

I enjoyed that video! I have a very similar mini pc (geekom) with an extremely similar wifi card and connections. Those connections are incredibly difficult to get back on after they pop off!

14.11.2025 21:47 👍 3 🔁 0 💬 0 📌 0

Rolf has some amazing pictures on his website - www.rolfolsenastrophotography.com

14.11.2025 21:19 👍 0 🔁 0 💬 0 📌 0

Continuing my habit of reading a paper a week, this week it's SQLStorm - db.in.tum.de/people/sites... I've been meaning to look into some of the failures reported against Apache DataFusion via github.com/2010YOUY01/d... for some time though it might take a holiday vacation to have the time.

14.11.2025 21:17 👍 0 🔁 0 💬 0 📌 0

Hey Google - can you stop with the firefox bullshit and "making sure you're not a bot" captchas for links to youtube? You are just pissing off a premium member with that shit.

07.11.2025 16:50 👍 0 🔁 0 💬 0 📌 0

I wonder if anyone has done a cost analysis of Python code running in the wild compared to a language that actually is performant. I suspect companies are needlessly spending millions because of lazy developers.

17.10.2025 17:54 👍 0 🔁 0 💬 0 📌 0

Not me, a different Bruce RItchie

01.10.2025 17:45 👍 0 🔁 0 💬 0 📌 0
F3: The Open-Source Data File Format for the Future
SIGMOD 2025

F3: The Open-Source Data File Format for the Future SIGMOD 2025

Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next generation open-source data file format is out. F3 is a future-proof file format avoids the mistakes of Parquet.
📄 Paper: db.cs.cmu.edu/papers/2025/...
📁 Code: github.com/future-file-...

01.10.2025 13:49 👍 70 🔁 21 💬 4 📌 5
Preview
GitHub - future-file-format/F3 Contribute to future-file-format/F3 development by creating an account on GitHub.

github.com/future-file-...

01.10.2025 15:43 👍 2 🔁 0 💬 0 📌 0
Preview
Optimizing ClickHouse for Intel's ultra-high core count processors Intel's latest processor generations are pushing the number of cores in a server to unprecedented levels. For analytical databases like ClickHouse, ultra-high core counts represent a huge opportunity ...

Interesting read on what it takes to optimize a database for high core count machines - clickhouse.com/blog/optimiz...

18.09.2025 14:06 👍 0 🔁 0 💬 0 📌 0
Preview
Vortex | An extensible, SOTA columnar file format Vortex is an extensible, state-of-the-art columnar file format, with associated tools for working with compressed Apache Arrow arrays in-memory, on-disk, and over-the-wire.

I'm tempted to try out the vortex file format (vortex.dev) in my project to see if it has an appreciable impact on performance.

12.09.2025 15:38 👍 0 🔁 0 💬 0 📌 0

ashtom.github.io/developers-r... ... so much absurdity in this it's crazy. Never trust a damn thing from someone whose job depends on selling you something.

06.08.2025 18:38 👍 0 🔁 0 💬 0 📌 0
Apache DataFusion 49.0.0 Released - Apache DataFusion Blog

@apachedatafusion.bsky.social 49.0.0 released. Async UDF's, Parquet modular encryption, WITHIN GROUP support, Dynamic Filters and TopK pushdown and much more ... datafusion.apache.org/blog/2025/07...

29.07.2025 21:04 👍 0 🔁 0 💬 0 📌 0

Medium has turned into a wasteland of AI generated or AI augmented posts. I'd say less than 25% of the daily digest highlights are actual 'real' articles. Sad.

10.07.2025 14:01 👍 0 🔁 0 💬 0 📌 0

A 200 Ok response from S3 ... isn't always ok. Way to go AWS for making your service horrendous to support. repost.aws/knowledge-ce...

30.05.2025 19:56 👍 0 🔁 0 💬 0 📌 0

I am unsure whether Google Summer of Code is a benefit or a hindrance to an open source project. Time will tell I suppose by the PR's submitted.

08.04.2025 16:48 👍 0 🔁 0 💬 0 📌 0

It's been well over a year since I started the process of rewriting a large and very long running job from Apache Spark/Scala to Apache DataFusion/Rust. We're now well into doing poc's to rewrite a few other expensive jobs the same way. It's a very nice feeling.

03.04.2025 21:08 👍 0 🔁 0 💬 0 📌 0
Post image

This one was going around the office today and made me chuckle :)

28.03.2025 13:30 👍 0 🔁 0 💬 0 📌 0

Not by me, I'm not in Florida, nor in the US.

27.02.2025 15:53 👍 0 🔁 0 💬 0 📌 0

coworker in chat: "... cluster is rebalancing and I'm trying to get the jello to stop shaking". Best explanation of rebalancing I've heard in a long time 🤣

27.02.2025 14:43 👍 0 🔁 0 💬 0 📌 0

Thank you Doug Ford for the $200 vote bribe. I'll use it to contribute to another party and vote to get your kind out of office.

09.02.2025 14:20 👍 0 🔁 0 💬 0 📌 0

Had a good chuckle this morning. Gemini was enabled on company corporate accounts and lasted all of 2 days before it was disabled.

31.01.2025 13:58 👍 0 🔁 0 💬 0 📌 0

The latest paper from the #1 CMU-DB PhD student @samarchdb.bsky.social is wild compilation magic! He automatically makes UDFs run 300x faster on SQL Server and 1.3x faster on DuckDB.
Code: github.com/SamArch27/PR...
Paper: www.vldb.org/pvldb/vol18/...

06.12.2024 14:56 👍 53 🔁 10 💬 2 📌 1

Working in Rust for the last year has really made me aware of just how useful some features in other languages really are.

- variadic functions
- Default values for arguments
- Named arguments
- Enum variants as types

Rust is getting if let chains in the 2024 edition though so that is something.

26.11.2024 15:46 👍 3 🔁 0 💬 2 📌 0

Lately there are two things I've been wishing that #Rust had: variadic functions and enum variants as types. Using a builder or macro to work around the first is just that, a workaround. Having the second would make some things much nicer

19.11.2024 20:32 👍 2 🔁 0 💬 0 📌 0

64GB of ram is not enough any more.

17.11.2024 17:45 👍 0 🔁 0 💬 0 📌 0
Post image

Datafusion v43 has seen a lot of performance work especially around reading parquet and the numbers are very nice! From the clickbench benchmark on the same hardware type:

15.11.2024 16:17 👍 0 🔁 0 💬 0 📌 0