Trending

#apacheArrow

Latest posts tagged with #apacheArrow on Bluesky

Latest Top
Trending

Posts tagged #apacheArrow

Things worth sharing

If you keep hearing about Apache Arrow, but never quite got what it really is about, check out my blog post. I did a deep dive on Apache Arrow and wrote an educational introduction: thingsworthsharing.dev/arrow/

#dataengineering #apachearrow #softwareengineering

1 0 0 0
Post image

Wes McKinney built pandas in a mouse-infested NYC apartment on founder hours. Now he runs parallel Claude Code sessions and says AI is forcing "radical accountability" on every software vendor shipping mediocre products.

Full conversation: youtu.be/Uso8-yaERkE

#DataRenegades #pandas #ApacheArrow

1 0 0 0
tiny.c

I finally got around to writing my first ADBC driver and it doesn't do anything (and that's the point!): amoeba.github.io/tiniest-adbc...

#apachearrow

3 0 1 0
image for R Consortium webinar: Scaling Up Data Analysis in R with Arrow

image for R Consortium webinar: Scaling Up Data Analysis in R with Arrow

R Consortium webinar: Scale up data analysis in R with Arrow—fast, memory-efficient analytics without a DB or cluster. With Dr Nic Crane (Arrow R maintainer, Apache Arrow PMC). Register:

Don't miss it!
r-consortium.org/webinars/sca...

#rstats #apachearrow

10 0 0 1
Preview
Changelog

We're excited to announce the release of {arrow} 23.0.0 🏹📦

Here's a roundup of the new features and changes in a 🧵

Full details can be found at arrow.apache.org/docs/r/news/

#rstats #apachearrow

26 3 2 0
Preview
Software Engineer (Rust) - Backend (Starting $130K) at Rerun Rerun is hiring a Software Engineer (Rust) - Backend (Starting $130K).

📢 Rerun is hiring a Software Engineer (Rust) - Backend

Salary: $130K - $225K
Locations: 🇺🇸 East Coast - United States (Remote), 🇪🇺 EU (Remote)

#ai #rustlang #aws #gcp #azure #apachearrow #apachedatafusion

www.remotehiro.com/jobs/softwar...

3 5 0 0
Preview
From the dataengineering community on Reddit Explore this post and more from the dataengineering community

It's nice to see people bringing up ADBC in conversations like this one: www.reddit.com/r/dataengine... #apachearrow

1 0 0 0
Preview
Cómo optimizar UDFs en Python para Arrow en Spark Cómo mejorar el rendimiento optimizando funciones UDFs de Python para Apache Arrow con la llegada de la nueva versión de Apache Spark 3.5

⚙️ Optimiza UDFs en #Python para Arrow en Spark

✳️ El uso de UDFs en PySpark ha sido una solución flexible pero ineficiente
✳️ Desde #ApacheSpark 3.5, la integración con #ApacheArrow ha supuesto una mejora significativa de rendimiento
➡️ blog.damavis.com/como-optimiz...

#Spark #Arrow

0 0 0 0

Was anyone else like me, wondering if you can use ADBC with $5 Postgres from @planetscale.com? Well, you can! (No surprise)

I wrote up my test at brycemecum.com/2025/11/15/a...

#apachearrow #adbc

7 2 0 0
Video

Two More Days till Subsuface NYC!

Register at Dremio.com/subsurface

#DataLakehouse #NYC #ApacheIceberg #ApachePolaris #ApacheArrow

0 0 0 0
Post image

REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)

Register here:

#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow

0 0 0 0
Post image

REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)

Register here:

#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow

1 0 0 0
Post image

REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)

Register here:

#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow

0 0 0 0
Post image

REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)

Register here:

#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow

0 0 0 0

I may have fallen down the rabbit hole here but...
#apacheArrow had my curiosity but now had my attention...
And I'm building wal2arrow for #postgres and #mysql...
It's kind of awesome not to have json /avro around in the hot paths.

3 0 1 0
Preview
amoeba/githubRepoDailyEmailDigest

I updated my @val.town for my daily GitHub repo email digest so it can handle multiple repos: www.val.town/x/amoeba/git.... I like the new, more condensed view. #apachearrow

3 0 1 0

Check out @andrewlamb1111.bsky.social 's talk at the recent Iceberg meetup for a condensed overview of the the new Variant type coming to Parquet #apacheparquet #apachearrow

4 1 1 0
Preview
Active Record ADBC adapter - ADBCで大量データを高速移動!Ruby on RailsアプリからDuckDBも使えるよ! - 2025-08-21 - ククログ Ruby用のデータ処理ツールを提供するプロジェクトRed Data Toolsをやっている須藤です。 数年前からRuby on RailsアプリケーションでADBCを使えるとよさそうだなぁと思っていた(証拠1、証拠2)のですが、ついにそれが動くようになったのでactiverecord-adbc-adapterとしてリリースしました。 ADBCとはArrow Database Connectivit...

Ruby on Rails (ActiveRecord) now supports ADBC with a new adapter written by Sutou Kouhei. Check out the blog post (in Japanese): www.clear-code.com/blog/2025/8/.... The gem is available at rubygems.org/gems/activer.... #apachearrow #rubyonrails

1 0 0 0

Centralized storage, decentralized compute: maximizing data use (OLAP) while minimizing ETL and disparate versions of data.

#hotTake #dataEngineering #dataAnalytics #dataScience #machineLearning #ai #cloudComputing #s3 #apacheArrow #apacheParquet #apacheIceberg

2 0 1 0

Looking forward to attending this! #apachearrow

3 1 0 0
Preview
GitHub - tonbo-io/typed-arrow: Compile‑time Arrow schemas for Rust. Compile‑time Arrow schemas for Rust. Contribute to tonbo-io/typed-arrow development by creating an account on GitHub.

Compile-time Arrow schemas for Rust. Looks pretty nice for use cases where the data schema is well-defined, i.e. either internal to an app, or part of a client/server protocol. #RustLang #ApacheArrow

4 2 0 0
Preview
Parquet Content-Defined Chunking We’re on a journey to advance and democratize artificial intelligence through open source and open science.

PyArrow 21 was a great release, especially for @hf.co users: PyArrow now seamlessly handles hf:// URIs and does content-defined chunking to reduce transfer and storage costs on HF. Check out this blog post: huggingface.co/blog/parquet... #apachearrow #apacheparquet

2 1 0 0
Preview
Recent Improvements to Hash Join in Arrow C++ A deep dive into recent improvements to Apache Arrow’s hash join implementation — enhancing stability, memory efficiency, and parallel performance for modern analytic workloads.

New post up on the Arrow blog about some of the recent improvements to the embedded query engine inside Arrow C++: arrow.apache.org/blog/2025/07... #apachearrow

0 0 0 1
Preview
New Arrow C-API by pdet · Pull Request #18246 · duckdb/duckdb This PR introduces the new Arrow C API, which is intended to replace the deprecated Arrow API. Why a new Arrow C-API? We decided to rewrite the Arrow C-API a while ago and marked all current method...

@duckdb.org just merged a PR for a new, simpler, and -- mostly importantly -- non-deprecated @arrow.apache.org C API: github.com/duckdb/duckd.... #apachearrow #duckdb

6 1 2 0
Video

Next Tuesday, get ready to meet the mind behind #Pandas & #ApacheArrow!

@wesmckinney.com shares his origin story (Part 1) on #TheTestSet. From speedruns to shaping the data stack, this is one you won't want to miss.

Mark your calendar for Tuesday & subscribe at thetestset.co!

#DataScience #Python

22 9 1 0

This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSC’s BigWig/BigBed.

1 0 1 0

This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSC’s BigWig/BigBed.

0 0 1 0

This is an exciting release! The Swift implementation of Arrow has been split off into its own repo which means we can now publish it on the @SwiftPackageIndex.mas.to.ap.brid.gy: swiftpackageindex.com/apache/arrow.... #apachearrow #swift #swiftlang

2 0 0 0

It’s true! #apachearrow

1 0 0 0
Post image

Great catch up this morning with Raul + Alenka about maintaining Arrow.

No one holds the whole codebase in their head - & that’s fine. Pick a bit, ask questions, keep things moving. Code helps, but so do comments and nudges.

Even the experienced folks are still learning.

#opensource #apachearrow

4 0 0 0