
#apacheParquet

Latest posts tagged with #apacheParquet on Bluesky


Posts tagged #apacheParquet

GH-48277: [C++][Parquet] unpack with shuffle algorithm by AntoinePrv · Pull Request #47994 · apache/arrow
Rationale for this change: The current bit-unpacking algorithm (which is implemented as a C++ code generator script in Python) does not fully leverage SIMD operations: all loads and some bitshifts u...

At @quantstack.bsky.social we designed novel bit-unpacking SIMD optimizations for @arrow.apache.org and #ApacheParquet, and implemented them entirely using C++ metaprogramming instead of Python-based code generation.

We'll publish a deep dive blog post soon.

github.com/apache/arrow...


Great news for Arrow (and more work for us 😄).
Those work items also implicitly apply to #ApacheParquet. @julien.ledem.net

Apache Parquet vs. Newer File Formats (BtrBlocks, FastLanes, Lance, Vortex) For over a decade, Apache Parquet has been the cornerstone of analytical data storage. Parquet emerged in the Hadoop era as an open…

dipankar-tnt.medium.com/apache-parquet-vs-newer-...

#data #DataEngineering #ApacheParquet


Check out @andrewlamb1111.bsky.social's talk at the recent Iceberg meetup for a condensed overview of the new Variant type coming to Parquet #apacheparquet #apachearrow


Centralized storage, decentralized compute: maximizing data use (OLAP) while minimizing ETL and disparate versions of data.

#hotTake #dataEngineering #dataAnalytics #dataScience #machineLearning #ai #cloudComputing #s3 #apacheArrow #apacheParquet #apacheIceberg

Original post on mastodon.ronandev.ovh

https://blog.ronandev.ovh/apache-parquet/

Apache Parquet is the essential columnar storage format for Big Data. Optimized for performance and compression, it speeds up queries and reduces costs. Integrated with Spark, Iceberg, and lakehouse architectures, Parquet offers […]

Parquet Content-Defined Chunking We’re on a journey to advance and democratize artificial intelligence through open source and open science.

PyArrow 21 was a great release, especially for @hf.co users: PyArrow now seamlessly handles hf:// URIs and does content-defined chunking to reduce transfer and storage costs on HF. Check out this blog post: huggingface.co/blog/parquet... #apachearrow #apacheparquet


https://github.com/cwida/FastLanes

#ApacheParquet #FastLanes #Data

NYC Apache Iceberg™ Community Meetup · Luma 🧊 Apache Iceberg Meetup is coming to the Big Apple! 🗽 Join us in NYC for an afternoon of ideas, innovation, and Iceberg. Whether you're building lakehouses…

I am speaking at the #ApacheIceberg NYC Meetup on July 10th about Variant in #ApacheParquet, which enables more efficient processing of semi-structured data such as that found in JSON.

lu.ma/95a5qys1

Iceberg GEO: Technical Insights and Implementation Strategies In our previous blog post, we announced Apache Iceberg and Parquet’s support for spatial data types and discussed their significance. Today, we take a closer look at these GEO data types in Iceberg…

Geospatial support has been added to the #ApacheParquet and #ApacheIceberg table formats in their latest releases, in the form of Geometry and Geography data types. This is a very good write-up of this effort: wherobots.com/blog/iceberg...
#dataengineering

Screenshot of email message highlighting the Apache Parquet vulnerability.


On April 1st, 2025, CVE-2025-30065 in Apache Parquet’s module was disclosed, with a CVSS score of 10.

Despite fears, exploitation is challenging and the risk is limited.

Discover what we found: https://go.f5.net/6vqac3fy

#F5Labs #ApacheParquet #Cybersecurity

GitHub - F5-Labs/parquet-canary-exploit-rce-poc-CVE-2025-30065 Contribute to F5-Labs/parquet-canary-exploit-rce-poc-CVE-2025-30065 development by creating an account on GitHub.

Is your environment vulnerable to CVE-2025-30065?

We created a user-friendly Canary Exploit Tool to test your systems against the recently disclosed #ApacheParquet vulnerability.

🔧 Check it out: https://go.f5.net/xze11bfh

#F5Labs #Cybersecurity

Visual with white text that reads, "Navigating the CVE-2025-30065 Vulnerability."


#ApacheParquet is key in data science. Understanding CVE-2025-30065 is essential. Our article covers the vulnerability, its risks, and steps to protect your apps.

➡️ Discover more: https://go.f5.net/gsgv6ng3

#F5Labs #Cybersecurity

GitHub - F5-Labs/parquet-canary-exploit-rce-poc-CVE-2025-30065 Contribute to F5-Labs/parquet-canary-exploit-rce-poc-CVE-2025-30065 development by creating an account on GitHub.

Hello folks in compliance, #AppSec, and vault management. We’ve made a tool for you!

Our #CanaryExploit tool helps you test your systems against the #ApacheParquet vulnerability and ensures your patches are effective.

➡️ Try it today: https://go.f5.net/al3et16k

Critical CVE-2025-30065 Apache Parquet Exploit Tool Unleashed A dangerous new CVE-2025-30065 Apache Parquet exploit tool has surfaced. Learn how it works, who’s vulnerable, and how to protect your....

Are your systems exposed? Learn what this means & how to stay secure:

🔗 technijian.com/cyber-security/critical-cve-2025-30065-apache-parquet-exploit-tool-unleashed-what-it-means-how-to-stay-secure/

#CVE202530065 #ApacheParquet #CyberSecurity #DataBreach #ZeroDay #ThreatIntelligence #ExploitTool


CRITICAL: Apache Parquet Java vulnerability (CVE-2025-46762) allows RCE; upgrade to 1.15.2 immediately. #ApacheParquet #RCE #Potatosecurity

Canary Exploit tool allows to find servers affected by Apache Parquet flaw F5 Labs researchers released a PoC tool to find servers vulnerable to the Apache Parquet vulnerability CVE-2025-30065.

Canary Exploit tool allows to find servers affected by Apache Parquet flaw #SecurityAffairs (May 7)

#ApacheParquet #CVE202530065 #リモートコード実行 #セキュリティ脆弱性 #F5Labs

securityaffairs.com/177565/secur...

Apache Parquet exploit tool detect servers vulnerable to critical flaw ... what was classified as a remote code execution ...

Apache Parquet exploit tool detect servers vulnerable to critical flaw reconbee.com/apache-parqu...

#apacheparquet #vulnerable #criticalflaw #vulnerability #apache #cyberattack

Critical Flaw in Apache Parquet Allows Remote Attackers to Execute Arbitrary Code ... a susceptible system must be tricked into reading ...

Critical Flaw in Apache Parquet Allows Remote Attackers to Execute Arbitrary Code reconbee.com/critical-fla...

#apacheparquet #remoteattackers #arbitarycode #CyberSecurity #CyberSecurityAwareness

Addressing the Critical CVE-2025-30065 Vulnerability in Apache Parquet | The DefendOps Diaries Learn about the critical CVE-2025-30065 vulnerability in Apache Parquet and how to mitigate its risks.

A critical flaw in Apache Parquet could let attackers run code remotely on your systems—rated a perfect 10.0 for severity. Is your big data framework safe? Read up on the fix and protect your data today.

#cve202530065
#apacheparquet
#rcevulnerability
#bigdatasecurity
#cybersecurity

GitHub - mluttikh/xml2arrow: Efficiently convert XML data to Apache Arrow format for high-performance data processing Efficiently convert XML data to Apache Arrow format for high-performance data processing - mluttikh/xml2arrow

Have _I_ ever needed to convert XML documents into #ApacheArrow data? No, why do you ask?

Behold: github.com/mluttikh/xml... (bindings available in Python too).

I could see this being useful for folks in the sciences who work with systems that don't yet speak #ApacheArrow and #ApacheParquet.

GitHub - motherduckdb/grafana-duckdb-datasource Contribute to motherduckdb/grafana-duckdb-datasource development by creating an account on GitHub.

I have so much work to do today but what I really want to do is kick the tires on the new DuckDB-Grafana plugin: github.com/motherduckdb.... This unlocks some really cool use cases involving Parquet and Arrow data. #DuckDB #ApacheArrow #ApacheParquet

Flight, DataFusion, Arrow, and Parquet: Using the FDAP Architecture to build InfluxDB 3.0 The FDAP stack, which consists of Apache Flight, DataFusion, Arrow, and Parquet, finally permits developers to build new systems without reinventing the wheel, resulting in more features and better pe...

What’s powering #InfluxDB 3.0? The FDAP stack: Apache Flight, Apache DataFusion, #ApacheArrow, and #ApacheParquet.

These #OpenSource technologies form the backbone of fast, scalable, and interoperable analytics systems.

InfluxData’s andrewlamb1111.bsky.social shares more: bit.ly/3DosJuC
