Shiv Gupta's Avatar

Shiv Gupta

@sgupta.xyz

Data Engineer @ Chainalysis. Views my own πŸ“Toronto sgupta.xyz

71
Followers
391
Following
22
Posts
30.10.2024
Joined
Posts Following

Latest posts by Shiv Gupta @sgupta.xyz

Also got to attend a workshop hosted by Runway folks earlier this week here in Toronto - I really loved the product!

09.05.2025 19:50 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Shivanshu Gupta is a Data Engineer at Chainalysis, where they run Dagster to orchestrate and maintain complex data pipelines.

πŸ” In today's data landscape, engineers like Shivanshu need tools that reduce cognitive load while handling increasingly complex pipelines.

24.03.2025 17:06 πŸ‘ 4 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

Used Cursor briefly but have recently been pretty happy with GH Copilot + VS Code Insiders which supports agent mode

27.02.2025 02:27 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

8/10 times the latter

25.02.2025 02:45 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Pretty cool - ty for sharing!

10.02.2025 16:52 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

This is great, will there be a direct link to the talk so I can share with my team?

31.01.2025 17:13 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

right*

18.12.2024 16:01 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

You are write in that you still have to call `op` every time, and that can be annoying. I’d like to work out a solution that can set env vars for the current shell session using `op` so you don’t have to call it every time.

18.12.2024 16:01 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Practicing secret hygiene Ever since I accidentally pushed a set of credentials to a branch on GitHub, and frantically tried to santizize the repo late on a Friday, I’ve adopted a couple of practices to prevent this from happe...

The CLI has become a part of my dev workflow. I wrote about it here, but the gist is: you can have your environment vars populated dynamically by 1Password using a bash alias and a .env file. www.sgupta.xyz/posts/secret...

18.12.2024 15:58 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Yep. Any company claiming AI is ready to replace humans who provide value that goes far beyond writing code doesn’t know what it’s talking about

10.12.2024 22:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Makes sense when you put into perspective the $ constraints + the relative nascency of the Iceberg SDK ecosystem. Appreciate the insight, @benesch.bsky.social.

04.12.2024 21:40 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
A First Look at S3 (Iceberg) Tables AWS announced S3 Tables today, which brings native support for Apache Iceberg to S3. It’s hard to overstate how exciting this is for the data analytics ecosystem. This post is a quick rundown of my th...

S3 (Iceberg) Tables is everything I dreamt of, and more. I blogged some long-form thoughts: meltware.com/2024/12/04/s...

I think we're about to see an explosion of data tools (@materialize.com, @clickhouse.com, @duckdb.org, et al.) learn to write Iceberg tables via S3 table buckets.

#databs

04.12.2024 10:02 πŸ‘ 107 πŸ” 37 πŸ’¬ 15 πŸ“Œ 7

Great read! It sounds like the absence of table compaction feature in the Python and Rust libraries is a blocker to write support, curious to hear your thoughts on why vendors have not taken on a more active role in contributing towards this feature in these libraries. Thanks!

04.12.2024 12:38 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Awesome. Will DM you. Huge fan of the product already btw, and Rill fills the operational/tactial, low-latency analytics gap in BI in a way that no other tool does, kudos to you and the team!

03.12.2024 03:34 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Anyone here try to self-host Rill (@rilldata.com) or is that not a thing? $250/mo to deploy on Rill Cloud feels prohibitive for most hobby projects... #databs

29.11.2024 18:23 πŸ‘ 3 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

Definitely feel this way with Pydantic/type hints for most production-bound Python code I’ve written recently!

28.11.2024 03:02 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

1

25.11.2024 22:08 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
epochconverter.com still balling

epochconverter.com still balling

this is the only good WebApp out there, don't at me

23.11.2024 02:20 πŸ‘ 99 πŸ” 6 πŸ’¬ 9 πŸ“Œ 0

I've been seeing posts from people coming to #dataBS and getting overwhelmed and worrying they can't contribute.

I get it. I still feel that sometimes.

But I'm trying to use my impostor syndrome as a strength. It's permission to accept that we'll never know everything.

So join in the discussion!

20.11.2024 09:45 πŸ‘ 35 πŸ” 6 πŸ’¬ 3 πŸ“Œ 2
Preview
GitHub - duckdb/uc_catalog: Proof-of-concept extension combining the delta extension with Unity Catalog Proof-of-concept extension combining the delta extension with Unity Catalog - duckdb/uc_catalog

Anyone here manage to attach a Databricks-hosted Unity Catalog to DuckDB? github.com/duckdb/uc_ca...

20.11.2024 17:58 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Love that #DuckDB & #R2 are powering quick & easy access to open datasets. How long until we get the entire library of BQ public datasets but without the GCP fluff? #DataBS

20.11.2024 04:09 πŸ‘ 5 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Custom Bluesky handle off my domain, neat

20.11.2024 04:04 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I open my Bluesky feed and it is full of really cool people saying really smart things and being really passionate about really interesting stuff.

I like this version of social media, and the world.

Thanks, folks.

19.11.2024 12:53 πŸ‘ 51551 πŸ” 3961 πŸ’¬ 902 πŸ“Œ 298

I underestimated how far you can get with prompt engineering. Every time I thought I needed to bite the fine-tuning bullet, what I really needed was better prompts

20.11.2024 03:28 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Going from CSV > #DuckDB > Delta Lake in S3 is a breeze. Only thing that's missing is write support to Delta Lake tables natively in SQL (vs having to go through Python). And DuckDB Google Sheets integration makes me want to use it for everything

20.11.2024 03:27 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Starting to really love DuckDB

20.11.2024 03:23 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

#databs

30.10.2024 03:17 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0