Stefan Grafberger's Avatar

Stefan Grafberger

@stefan-grafberger.com

PhD Student at BIFOLD & TU Berlin, researching data management for ML. Previously worked with UvA, Microsoft GSL, Amazon Research, Oracle Labs, and others. https://stefan-grafberger.com

215
Followers
236
Following
7
Posts
15.11.2024
Joined
Posts Following

Latest posts by Stefan Grafberger @stefan-grafberger.com

Thanks Adam!

03.09.2025 23:09 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I'm joining the SQL Data Types team in Berlin.

The VLDB demo paper:
"mlidea: Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'"
PDF: www.vldb.org/pvldb/vol18/...
Video demo: youtu.be/ePGm1J6S2qk

01.09.2025 23:50 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Very excited to share that I've started as a Software Engineer at Snowflake! πŸ₯³

I’m also wrapping up my PhD: this week I’m at VLDB in London to present the last demo paper from my time as PhD student, and on September 17 I’ll defend my PhD in Amsterdam.

Really looking forward to this next chapter!

01.09.2025 23:46 πŸ‘ 3 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Proceedings of the Workshop on Data Management for End-to-End Machine Learning | ACM Conferences

Join us for discussions and talks on data management aspects for end-to-end ML on 27 June at @deem-workshop.bsky.social in Berlin. Keynotes by @pinartozun.bsky.social and @gaelvaroquaux.bsky.social 🀩

Check the full schedule deem-workshop.github.io#schedule & proceedings dl.acm.org/doi/proceedi...

16.06.2025 07:49 πŸ‘ 8 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0
Post image

Our demo "mlidea: Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'" was accepted at VLDB! πŸ₯³

We demo suggestions for ML pipelines, similar to IntelliJ code inspections or Grammarly suggestions

youtu.be/ePGm1J6S2qk

Joint work w/ @mersault.bsky.social @p-groth.bsky.social

30.05.2025 19:09 πŸ‘ 12 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
DEEM: Workshop on Data Management for End-to-End Machine Learning @ ACM SIGMOD 2025

πŸ“’ Deadline extension for DEEM 2025 @sigmod2025.bsky.social!

Following requests, we're extending the submission deadline to April 1, 5pm Pacific Time. More info at: deem-workshop.github.io

15.03.2025 19:15 πŸ‘ 2 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image

Our visionΒ "Towards Regaining Control over Messy ML Pipelines"Β was accepted for theΒ DAIS workshop at ICDE! πŸ₯³

Initial experiments show LLMs are promising for extracting declarative query plans from messy ML code.

Joint work w/Β @guangchen811.bsky.social @oovcharenko.bsky.social @mersault.bsky.social

07.03.2025 13:56 πŸ‘ 11 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0

Please help spread the word by reposting!

We've just created the official DEEM Workshop account: @deem-workshop.bsky.social

07.02.2025 21:10 πŸ‘ 6 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0

We have a **Postdoc opening** in Berlin on Responsible Data Engineering!

This is a fully-funded position with salary level E14 at the newly founded DEEM Lab, as part of @bifold.berlin .

Details available at deem.berlin#jobs-57624

05.02.2025 08:31 πŸ‘ 6 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Post image

@stefan-grafberger.com, a Ph.D. student in the DEEM Lab at BIFOLD is among the author team, which presented the paper "Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Platform: Can One QO Rule Them All? at the #CIDR2025.

#QOaaS #CIDR

www.bifold.berlin/news-events/...

22.01.2025 14:14 πŸ‘ 3 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

Interested in a *PhD in Data Engineering* in Berlin? Our institute has several openings for PhD positions as part of its graduate school, see the post below!

And check out the following page for details on how to work with the DEEM Lab as part of the graduate school deem.berlin#jobs-189196

06.01.2025 13:49 πŸ‘ 9 πŸ” 4 πŸ’¬ 0 πŸ“Œ 0

Our CIDR'25 paper "Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Ecosystem: Can One QO Rule Them All?" is now on ArXiv! Excited to have been a part of this project during my internship at Microsoft GSL!

arxiv.org/pdf/2411.13704

22.11.2024 20:18 πŸ‘ 9 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

Pls repost:

We, the DEEM Lab at TU Berlin, are hiring a postdoctoral researcher in data engineering for machine learning. Details available at:

deem.berlin#jobs-57624

This fully-funded position is part of the Berlin Institute for the Foundations of Learning and Data (BIFOLD).

#databs #datasky

15.11.2024 08:17 πŸ‘ 10 πŸ” 9 πŸ’¬ 0 πŸ“Œ 1