The future of data connectivity is columnar. Today we launched
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more ๐
The future of data connectivity is columnar. Today we launched
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more ๐
ODBC is getting tired. It just can't keep up with the fast new kids in the data world these days. The next generation is ready to take the torch. Meet ADBC, a fast, modern data connectivity standard built on @arrow.apache.org. Watch my new talk from the @db.cs.cmu.edu seminar: youtu.be/TjlmNGNx77E
A screenshot of the Rakuten Kobo shopping cart showing 4 physical and 16 digital books in the cart.
me: i have a huge backlog of things to read
also me:
So I wonder if the term was already floating around the database community, and Julien (or someone else) (unintentionally?) swapped "striping" for "shredding" in the Parquet docs, and then the term took hold as Parquet became popular.
The docs for SQL Server 2000 here talk about shredding
> OPENXML calls can be used to provide rowset view...and process them, for example, inserting them into different tables (this process is also referred to as "Shredding XML into tables")
www.microsoft.com/en-nz/downlo...
This page, supposedly from 2003, talks about SQL Server 2000 adding a function to "shred" XML
> Microsoft SQL Server 2000 also provides the OPENXML function to shred an XML document and provide a rowset representation of the XML data.
web.archive.org/web/20120115...
It seems the first mention in the Parquet repos is from 2013, though. There Julien Le Dem links to a page about "striping" (as used in the Dremel paper) but calls it "shredding". So maybe you should ask him directly :)
github.com/apache/parqu...
I went into a rabbit hole on "record shredding"...Here's something interesting: there's an SO question from 2008 asking about "shredding XML data into relational tables". Maybe the term sort of already existed? stackoverflow.com/questions/61...
I will not trust macOS with external drives ever again. "First Aid" seems to have deleted directories from my local backup SSD...thankfully I have another backup in Backblaze but it's a bit older :/
Good (and painful) reminder to back up regularly!
@yatosaking.bsky.social ๆงใ @galetteweb.bsky.social ๆง
ๅคฑ็คผใใใใพใใ
่ฒ็ดใๅฟใใๆ่ฌใใฆใใใพใใ
๏ผใญใฉใญใฉใใฆใใใจใใ็นใซใๆฐใซๅ
ฅใใพใ๏ผ
ใใใใใๅ
็ใจใฌใฌใใใๅฟๆดใใใใพใ๏ผ
ๆฅๆฌใญโฆใใใฐใในใผใใผใซๆใฃใฆ่กใฃใฆใใๅบๅกใใใฏๅ่ฒทใ็ฉใไธๅฏงใซใใใผใซ่ขใใคใใฆใใใ๐
ไปๆฅTimeLeftใขใใชใฎใใผใใขใใใใใฃใฆใใ็งใฏไผ่ฉฑ่ฝๅ(?)ใใใพใใชใใฆใใใฃใๆฅฝใใใฃใใจๆใ
ใใใใ้ ๅผตใใชใใ
PRใฌใใฅใผใฎใใใซPostgresใจๆฏ่ผใใใใจๆใฃใฆใใใฉโฆใใใฏใใใฃ๐ฑ
Baby blue eyes (nemphila) flowers in Showa Kinen Park
The favorite flower of Himmel the Hero.
Showa Kinen Park, Tachikawa, Tokyo
Olympus E-M10 Mk2/TTArtisans 35mm f/1.4
Spotted in discord (and paraphrased to protect the innocent):
"I don't particularly like actually doing the job, but thinking about it? Hoo boy"
(Someday I'll make that teaching implementation of Arrow...)
The sakura are not _quite_ there, but almost!
Kou has a blog post too: x.com/ktou/status/...
April is a great time to visit :)
Anyone want to book a last minute trip to Japan? Kou, Rok, and I will be there :)
red-data-tools.connpass.com/event/349680/
I'm biased but maybe things like Apache Parquet, Apache Arrow? They have multiple implementations across different languages and Arrow gets used as a means of interchange between different data vendors (Spark, BigQuery, ClickHouse <-> Pandas, polars, etc.)
Check out what is new on the Apache Arrow ADBC 17 libraries release: arrow.apache.org/blog/2025/03...
Data wants to be free: comparing and explaining how Arrow's data serialization can be better than what's in protocols like PostgreSQL's
arrow.apache.org/blog/2025/02...
#apachearrow #arrow
2025 is shaping up to be a breakout year for fast query result transfer with Apache Arrow. But what exactly makes it so fast? David Li, Matt Topol, and I break it down in this new blog post: arrow.apache.org/blog/2025/01...