Twitter data mining for the diagnosis of leaks in drinking water distribution networks
Jiménez-Cabas, Javier | 2023-03-14
This article presents a methodology for using social network data, specifically from Twitter, to diagnose leaks in drinking water distribution networks. The methodology comprises three steps: collecting tweets in which citizens report leaks, extracting information from those tweets, and processing that information to perform the diagnosis. To demonstrate its viability, 358 Twitter leak reports from Mexico City, covering 1 May to 31 December 2022, were collected and analyzed. From these reports, leak density and leak probability were calculated; these metrics can be used to develop forecasting algorithms, identify root causes, and schedule repairs. The metrics were then compared with those calculated from telephone reports provided by SACMEX, the entity that manages water in Mexico City. The results show that the metrics obtained from Twitter and from telephone reports were highly comparable, indicating that social media data are useful and reliable for diagnosing leaks.
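The abstract does not define how leak density and leak probability are computed, so the following is only a minimal sketch under stated assumptions: density is taken as the number of reports per square kilometre in a zone, and probability as a zone's share of all reports. The function name `leak_metrics`, the zone labels, the areas, and the report list are hypothetical placeholders, not data or code from the study.

```python
from collections import Counter


def leak_metrics(report_zones, zone_area_km2):
    """Return {zone: (density, probability)} for each zone in zone_area_km2.

    Assumed definitions (not taken from the article):
      density     = reports in the zone / zone area in km^2
      probability = reports in the zone / all reports in known zones
    """
    # Count only reports that fall in a zone with a known area.
    counts = Counter(z for z in report_zones if z in zone_area_km2)
    total = sum(counts.values())

    metrics = {}
    for zone, area in zone_area_km2.items():
        n = counts.get(zone, 0)
        density = n / area
        probability = n / total if total else 0.0
        metrics[zone] = (density, probability)
    return metrics


# Made-up example: each entry is the zone assigned to one geolocated report.
reports = ["A", "A", "B", "A", "C", "B"]
areas = {"A": 2.0, "B": 4.0, "C": 1.0}  # km^2, invented values

metrics = leak_metrics(reports, areas)
for zone, (density, probability) in sorted(metrics.items()):
    print(f"{zone}: density={density:.2f} reports/km^2, probability={probability:.2f}")
```

Applying the same function to two report sources, for example Twitter and telephone reports, would give per-zone values that can be compared side by side, which is the kind of comparison the abstract describes.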