Forthcoming IJIO article by Laura Abrardi, Carlo Cambini
@flaviopino.bsky.social "Data brokers competition, synergic datasets, and endogenous information value" doi.org/10.1016/j.ij...
@sciencedirect.bsky.social
The DBs' behavior has strong welfare implications: higher prices reduce entry, which in turn harms consumers. In other words, we show that competing DBs can cause significant harm without needing to collude, even in the absence of strong data synergies.
If modularity is high enough to ensure superadditivity, DBs coordinate their prices and fully extract firms' WTP, as in a nice paper by Gu,
@leonardomadio.bsky.social and @marcoreggiani.bsky.social
(Data Broker co-opetition). Coordination emerges even when data synergies are very low!
The intuition is that data brokers sell data to multiple competing firms. Choosing not to buy data then puts a firm at a disadvantage against its rivals, increasing its WTP. Anticipating this, DBs charge higher prices!
Instead, datasets are superadditive if the value of the combined dataset is higher than the sum of the individual ones. We find that supermodularity is not necessary for superadditivity!
Formally, we have two data brokers (DBs) that sell datasets of different accuracy (i.e., the probability of enabling first-degree price discrimination). Datasets are supermodular if the accuracy of the combined dataset is higher than the sum of the individual ones.
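The two notions can be separated with a tiny numeric sketch (hypothetical numbers for illustration, not taken from the paper's model):

```python
# Accuracy = probability a dataset enables first-degree price discrimination.
a1, a2 = 0.4, 0.3        # accuracies of the two individual datasets (assumed)
a_combined = 0.6         # accuracy of the merged dataset (assumed)

# Supermodularity compares ACCURACIES of combined vs. individual datasets:
supermodular = a_combined > a1 + a2        # 0.6 > 0.7 -> False

# Superadditivity compares VALUES (firms' willingness to pay).
# Competition among data buyers can inflate the value of holding both:
v1, v2 = 5.0, 2.0        # stand-alone values (the thread's 5 and 2 example)
v_combined = 10.0        # value of the bundle under competitive pressure (assumed)
superadditive = v_combined > v1 + v2       # 10.0 > 7.0 -> True

print(supermodular, superadditive)         # prints: False True
```

With these illustrative numbers the bundle is submodular in accuracy yet superadditive in value, which is exactly the wedge the thread describes between technological and economic additionality.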
shorturl.at/4nuab The answer to this conundrum: technological additionality and economic additionality differ once you take firm competition into account! We refer to the former as "modularity" and to the latter as "additionality".
Ladies and gentlemen... this is paper #5, published OA in IJIO (see thread)! Suppose you buy data from two sources, and the combined accuracy is lower than the sum of the individual accuracies. The first dataset increases your profits by 5, the second by 2, and yet you agree to pay 10 for both? What?!
Yeah that makes sense, so training data will be even more essential now
So, now that the AI stack has been de facto cut in half and cloud computing/chips seem less essential, the question that remains is: which data are more important for foundation models? Training data, or user generated data that may create reinforcement effects? Any insights from tech savvy people?
Hi, IO assistant professor focused on data markets theoretical models here!
Impressive work @jschneebacher.bsky.social
Thank you!