Valerio Marsocci's Avatar

Valerio Marsocci

@valeriomarsocci

🌏🌱 trying to make Geospatial Foundation Models work Research Fellow at @ESA PhiLab Previously at @KULeuven, @Cnam PhD in Data Science at @Sapienza website: https://sites.google.com/uniroma1.it/valeriomarsocci #AI4EO #GeoAI #SSL4EO

949
Followers
302
Following
46
Posts
18.11.2024
Joined
Posts Following

Latest posts by Valerio Marsocci @valeriomarsocci

Post image

#8 Copernicus-FM

This paper introduces: a) a new pre-training dataset; b) a new benchmark dataset; c) a GFM, all based on a diverse set of Copernicus data.

⬆️: really appreciate the grid embeddings part
⬇️: some doubts about claims about generalizability

arxiv.org/pdf/2503.11849

27.03.2025 09:30 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

πŸ”₯ 🎯 Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection (#7)

New preprint around :)

Incorporating inductive biases specific to MSI can enhance the fine-tuning of large Earth observation models, pre-trained on RGB

arxiv.org/pdf/2503.09493

17.03.2025 10:18 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

#6 Lossy Neural Compression for Geospatial Analytics

The authors introduce NC and discuss the characteristics of EO and climate data, w.r.t natural images

⬆️: great entry point
⬇️: no baseline exps

arxiv.org/pdf/2503.01505

10.03.2025 13:00 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

#5 Is SSL on Satellite Imagery Better than ImageNet? A Systematic Study with Sentinel-2

This study pretrains two SSL methods on ImageNet and GeoNet. The improvement with GeoNet is minimal.

⬆️ useful to reduce computation?
⬇️ more considerations about the resolutions?

arxiv.org/pdf/2502.10669

24.02.2025 09:08 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

#4 Galileo

Galileo is a family of pretrained RS models designed to flexibly process multimodal RS data. It has two loss: one in the pixel space, one in the latent space.

⬆️: multi-modal/temporal/sensor
⬇️: why just using Sentinel data?

arxiv.org/pdf/2502.09356

14.02.2025 08:43 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Globally scalable glacier mapping by deep learning matches expert delineation accuracy - Nature Communications A deep learning model using open satellite data for scalable, global glacier mapping is developed. This model matches expert-level accuracy, facilitating more reliable glacier monitoring to support cl...

#3 GlaViTU

This paper presents a novel world-wide dataset and a novel convolutional-transformer, named Glacier-VisionTransformer-U-Net (GlaViTU), for multitemporal and global glacier mapping.

⬆️ relevant task and nice results
⬇️ weak zero-shot transferability?

www.nature.com/articles/s41...

05.02.2025 13:24 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

#2 Can Location Embeddings Enhance Super-Resolution of Satellite Imagery?

It looks like they can :)

⬆️: validating it on a real-world task
⬇️: is it super-resolution or mapping S2 to NAIP?

arxiv.org/pdf/2501.15847

30.01.2025 09:59 πŸ‘ 5 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0
Post image

#1 Diffusion Models for RS

This paper provides a comprehensive review of the applications of diffusion models in remote sensing

⬆️ excellent entry point
⬇️ not sure about the statement about the "inherent denoising ability" of diffusion models

arxiv.org/abs/2404.08926

21.01.2025 13:44 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Hellooooo πŸš€

I'll make it swift: I have just started my new position as Internal Research Fellow at European Space Agency - ESA Phi-Lab

I am very happy because it looks like a great place where to do research and because I am back in my beloved hometown, Rome 😍

16.01.2025 14:16 πŸ‘ 7 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

uoooo great news, how many are you trying to cover?

16.01.2025 12:51 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I start the new challenge this week :)

Also, other very cool personal news is coming out

So stay tuned if interested ✨

14.01.2025 14:35 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Back on social, after a break (can you guess where?)

Last year I decided to do a #50paperschallenge

I ended up with 43. Still:
πŸ₯΅ I read more than 50 papers. I just didn't post all
πŸ˜‡ the strategy worked independently of the posted ones

For this reason, this year I will do a #40paperschallenge!

14.01.2025 14:35 πŸ‘ 4 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Post image

#41 Beyond Grid Data

GNNs open new possibilities for EO, handling irregular, multi-source datasets (e.g. point clouds) for smarter weather forecasts, disaster relief, etc..

⬆️: excels at non-Euclidean spatial data
⬇️: limited scalability across diverse data (?)

arxiv.org/abs/2411.03223

12.12.2024 13:33 πŸ‘ 3 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

Did we overlook something? Are you interested in this kind of topic?

We are already considering future updates, so feel free to reach out to give feedbacks and to talk about geospatial foundation models

✨🌏

06.12.2024 14:22 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

A great team collaborated on it!

Thx @yurujia.bsky.social @lebellig.bsky.social @nshaud.bsky.social and all the others 🀩

🧡

06.12.2024 14:22 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

We observed interesting insights, such as:

1. generally speaking GFMs don't really excel when compared to supervised baselines

2. for some specific scenarios (e.g. HR data), it makes sense to use them

3. multi-temporal data are still under-estimated

other insights in the paper!

🧡

06.12.2024 14:22 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

With this benchmark (PANGAEA), we tried to address the following research challenges:

* provide a robust evaluation protocol to benchmark GFMs
* investigate GFMs capabilities, with a focus on a) domain generalization, b) comparison to supervised baselines, c) performance with limited labels

🧡

06.12.2024 14:22 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

We collected 11 datasets to create an inclusive, diverse benchmarks, based on these criteria:
* application domain
* geographical distribution
* type of task
* modality
* temporality

Spoiler: no patch-level classification tasks are included!

🧡

06.12.2024 14:22 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸš€πŸš€πŸŒ

Are geospatial foundation models really impactful?

Check it in our new pre-print!

Welcome to **PANGAEA: a global and inclusive benchmark for GFMs**

arxiv.org/abs/2412.04204

Check also the public GitHub repo (other news/updates soon):
github.com/VMarsocci/pa...

a short thread 🧡

06.12.2024 14:22 πŸ‘ 10 πŸ” 4 πŸ’¬ 2 πŸ“Œ 2
Preview
GitHub - yurujaja/pangaea-bench: Towards Robust Evaluation for Geospatial Foundation Models Towards Robust Evaluation for Geospatial Foundation Models - yurujaja/pangaea-bench

Another paper shows that global models are not always the best choice.

If you are interested in this topic, and in geospatial foundation models in general, next week we will publish an interesting pre-print, connected to our Pangaea repo

Check it here: github.com/yurujaja/pan...

29.11.2024 15:15 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

#39 TCH in African savannas

Can global SatML models solve local challenges?
This study finds local models outperform global & fine-tuned models for TCH mapping in Africa

⬆️: interesting set of research questions
⬇️: what about "generalist" geospatial foundation models?

arxiv.org/pdf/2411.14354

29.11.2024 15:15 πŸ‘ 5 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

also, in the past I posted about this interesting benchmark paper:

#32 GeoFMs for crop type mapping

it investigates the ability of geoFMs to transfer to new geographic regions in agriculture

⬆️the pivotal topic for real-world applications
⬇️the limited number of geoFMs

arxiv.org/pdf/2409.09451

28.11.2024 14:47 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - yurujaja/pangaea-bench: Towards Robust Evaluation for Geospatial Foundation Models Towards Robust Evaluation for Geospatial Foundation Models - yurujaja/pangaea-bench

here is the GitHub of PANGAEA code (that we used for the experiments):
github.com/yurujaja/pan...

28.11.2024 14:47 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

πŸš€πŸŒ

If you want to see how geospatial foundation models are working in real-world tasks w.r.t. supervised baselines, stay tuned cause next week we are releasing the pre-print of PANGAEA, showing interesting results on this topic!

28.11.2024 14:43 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

#38 SPECIALIZED FOUNDATION MODELS STRUGGLE
TO BEAT SUPERVISED BASELINES

Specialized FMs in genomics, satellite imaging, and time series, struggle w.r.t. supervised learning pipelines

⬆️: very relevant work
⬇️: just classification, limiting the real-world capabilities*

arxiv.org/abs/2411.02796

28.11.2024 14:43 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

As I haven't found it out there yet, I made the Women in computer vision started pack.

Many more missing, please let me know how is already in bsky to add them!

go.bsky.app/BowzivT

22.11.2024 23:43 πŸ‘ 43 πŸ” 14 πŸ’¬ 11 πŸ“Œ 0
Post image

The strain on scientific publishing: we set out to characterise the remarkable growth of the scientific literature in the last few years, in spite of declining growth in total scientists. What is going on?

direct.mit.edu/qss/article/...

A 🧡 1/n
#AcademicSky #PhDchat #ScientificPublishing #SciPub

19.11.2024 12:27 πŸ‘ 997 πŸ” 560 πŸ’¬ 45 πŸ“Œ 134

πŸ‘‰next week I will post some papers on this topic to be ready for our preprint release

⁉️if you have suggestions or questions let us know!

🌍🌎🌏

22.11.2024 15:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

πŸš€ What’s Next?

A paper detailing Pangaea with many results is coming in 1-2 weeks

It highlights the differences with other benchmarks, and shows interesting insights on models' performance

🀯Spoiler: geospatial foundation models are far from being generalist

πŸ‘‡

22.11.2024 15:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

🌟 Why is Pangaea relevant?

- Comprehensive Coverage: global datasets from multiple domains

- Multimodal & Multitemporal Data: diverse sensors both single and multi-temporal

- Easy to Extend: you can contribute with models and datasets

πŸ‘‡

22.11.2024 15:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0