{"id":4340,"date":"2024-10-04T13:35:47","date_gmt":"2024-10-04T11:35:47","guid":{"rendered":"https:\/\/ssr.upm.es\/?p=4340"},"modified":"2026-01-16T14:14:34","modified_gmt":"2026-01-16T13:14:34","slug":"design-and-implementation-of-data-pricing-models","status":"publish","type":"post","link":"https:\/\/ssr.upm.es\/en\/2024\/10\/04\/design-and-implementation-of-data-pricing-models\/","title":{"rendered":"Design and implementation of data pricing models"},"content":{"rendered":"<div class=\"vc_row wpb_row vc_row-fluid\"><div class=\"wpb_column vc_column_container vc_col-sm-12\"><div class=\"vc_column-inner\"><div class=\"wpb_wrapper\">\n\t<div class=\"wpb_text_column wpb_content_element \" >\n\t\t<div class=\"wpb_wrapper\">\n\t\t\t<div align=\"right\">Responsable: <strong>Santiago Andr\u00e9s Azcoitia\u00a0<\/strong>[&#x73;&#x61;&#x6e;&#x74;&#x69;&#x61;&#x67;&#x6f;&#x2e;&#x61;&#x6e;&#x64;&#x72;&#x65;&#115;&#64;&#117;&#112;&#109;&#46;&#101;&#115;]<\/div>\n\n\t\t<\/div>\n\t<\/div>\n<\/div><\/div><\/div><\/div><div class=\"vc_row wpb_row vc_row-fluid\"><div class=\"wpb_column vc_column_container vc_col-sm-12\"><div class=\"vc_column-inner\"><div class=\"wpb_wrapper\"><div class=\"vc_empty_space\"   style=\"height: 64px\"><span class=\"vc_empty_space_inner\"><\/span><\/div><\/div><\/div><\/div><\/div><div class=\"vc_row wpb_row vc_row-fluid\"><div class=\"wpb_column vc_column_container vc_col-sm-12\"><div class=\"vc_column-inner\"><div class=\"wpb_wrapper\">\n\t<div class=\"wpb_text_column wpb_content_element \" >\n\t\t<div class=\"wpb_wrapper\">\n\t\t\t<p><strong>Datos TFM<\/strong><br \/>\nSupervisor: Santiago Andr\u00e9s Azcoitia, Departamento de Se\u00f1ales, Sistemas y Radiocomunicaciones<br \/>\nFecha de inicio: 1 de febrero, 2025<br \/>\nRequisitos: Estudiante de M\u00e1ster de un t\u00edtulo oficial de la ETSIT, preferiblemente en Ingenier\u00eda de Telecomunicaci\u00f3n, o en Tratamiento de la Se\u00f1al y Comunicaciones.<br \/>\nSolicitudes: Enviar CV y expediente acad\u00e9mico a <a href=\"&#x6d;a&#x69;l&#x74;&#111;&#x3a;&#115;a&#x6e;t&#x69;&#97;&#x67;&#111;&#x2e;&#97;n&#x64;r&#x65;&#115;&#x40;&#117;&#x70;&#x6d;&#46;&#x65;s\">s&#97;&#x6e;&#x74;&#x69;a&#103;&#x6f;&#x2e;&#x61;n&#100;&#x72;&#x65;&#x73;&#64;&#117;&#x70;&#x6d;&#x2e;e&#115;<\/a> antes del 7 de enero de 2025.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Background:<\/strong><br \/>\nSpurred by the widespread adoption of AI \/ ML, \u2018data\u2019 is becoming a key production factor, comparable in importance to capital, land, or labour in an increasingly digital economy. In spite of an ever-growing demand for third-party data in the B2B market, firms are generally reluctant to share their information. This is due to the unique characteristics of \u2018data\u2019 as an economic good (a freely replicable, nondepletable asset holding a highly combinatorial and context-specific value). As a result, most of those valuable assets still remain unexploited in corporate silos nowadays.<br \/>\nThere is already an ecosystem of companies that trade data over the Internet [1]. Some analysts have estimated the potential value of the data economy at $ 2.5 trillion globally by 2025 [2, 3], and the development of healthy data markets would be the key to making the most of AI\/ML, which is expected to reach a market of $ 15-20 trillion in 2030 [4,5]. Recent studies revealed more than 2k data providers offering data products in commercial data marketplaces [6]. Setting the price for their data assets represents a significant challenge for companies offering their data, which would value a price reference based on the existing offer in the market.<\/p>\n<p><strong>Objective<\/strong><br \/>\nThis Master Thesis aims to design, build and optimize prediction models to estimate the value of a data product based on already-available information about data products in the market. The models will be developed using Python and libraries like pytorch, tensorflow, or keras. The student will also carry out an explainability analysis of the resulting models to provide insights on the most relevant features driving the value of data, answering questions such as what characteristics of data were more valuable, what kind of data products command lower prices, and why, etc.<\/p>\n<p><strong>Methodology<\/strong><br \/>\nThis research will involve the design and development and optimization of a DNN model regressor to guess the price of data out of the metadata that describes a data product [6]. The student will use sentence transformers to capture the semantics of data product description, and AI interpretability techniques such as SHAP to understand the price predictions of data products, feature importance techniques to understand the features of data driving its price in the market and why, etc.<\/p>\n<p><strong>Expected results<\/strong><br \/>\nThis Master Thesis is expected to produce a DNN regression model that outperforms SOTA in estimating the price of data products in data marketplaces, and generate explainable and reasonable predictions based on existing data [6]. Optionally, the student will participate in writing a research paper to disseminate the results of the project.<\/p>\n<p>&nbsp;<\/p>\n<p>[1] S. Andr\u00e9s Azcoitia and N. Laoutaris, A Survey of Data Marketplaces and Their Business Models. ACM SIGMOD Record, 51(3), (Sep 2022), ACM, New York, NY, USA.<br \/>\n[2] N. Henke, J. Bughin, M. Chui, J. Manyika, T. Saleh, B. Wiseman and G. Sethupathy. The Age of analytics: Competing in a data-driven world. McKinsey Global Institute. Dec. 2016<br \/>\n[3] G. Micheletti; N, Raczko, C. Moise; D. Osimo, and G. Cattaneo. European DATA Market Study 2021\u20132023. IDC &amp; The Lisbon Council. May 2023<br \/>\n[4] PWC Consulting. Sizing the prize What\u2019s the real value of AI for your business and how can you capitalise? 2017<br \/>\n[5] J. Bughin, J. Seong, J. Manyika, M. Chui, and R. Joshi. Notes from the AI frontier: Modeling the impact of AI on the world economy. McKinsey Global Institute. 2018<br \/>\n[6] S. Andr\u00e9s Azcoitia, C. Iordanou and N. Laoutaris, \u00abUnderstanding the Price of Data in Commercial Data Marketplaces<\/p>\n\n\t\t<\/div>\n\t<\/div>\n<div class=\"vc_empty_space\"   style=\"height: 64px\"><span class=\"vc_empty_space_inner\"><\/span><\/div><\/div><\/div><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Responsable: Santiago Andr\u00e9s Azcoitia\u00a0[s&#97;&#x6e;&#x74;&#x69;a&#103;&#x6f;&#x2e;&#x61;n&#100;&#x72;&#x65;&#x73;&#64;&#117;&#x70;&#x6d;&#x2e;e&#115;] Datos TFM Supervisor: Santiago Andr\u00e9s Azcoitia, Departamento de Se\u00f1ales, Sistemas y Radiocomunicaciones Fecha de inicio: 1 de febrero, 2025 Requisitos: Estudiante de M\u00e1ster de un t\u00edtulo oficial de la ETSIT, preferiblemente en Ingenier\u00eda de Telecomunicaci\u00f3n, o en Tratamiento de la Se\u00f1al y Comunicaciones. Solicitudes: Enviar CV y expediente acad\u00e9mico a sa&#110;&#116;&#105;&#97;&#x67;&#x6f;&#x2e;&#x61;&#x6e;dr&#101;&#115;&#64;&#117;&#x70;&#x6d;&#x2e;&#x65;&#x73;&hellip;<\/p>\n","protected":false},"author":10,"featured_media":4341,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[35],"tags":[],"_links":{"self":[{"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/posts\/4340"}],"collection":[{"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/comments?post=4340"}],"version-history":[{"count":2,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/posts\/4340\/revisions"}],"predecessor-version":[{"id":4343,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/posts\/4340\/revisions\/4343"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/media\/4341"}],"wp:attachment":[{"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/media?parent=4340"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/categories?post=4340"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ssr.upm.es\/en\/wp-json\/wp\/v2\/tags?post=4340"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}