Mining point-of-interest data from social networks for urban land use classification and disaggregation
Authors
Shan Jiang
Ana Cristina da Costa Oliveira Alves
Filipe Rodrigues
Joseph Ferreira
Francisco Câmara Pereira
Ana Cristina da Costa Oliveira Alves
Filipe Rodrigues
Joseph Ferreira
Francisco Câmara Pereira
Abstract
Over the last few years, much online volunteered geographic information (VGI) has emerged and has been increasingly analyzed to understand places and cities, as well as human mobility and activity. However, there are concerns about the quality and usability of such VGI. In this study, we demonstrate a complete process that comprises the collection, unification, classification and validation of a type of VGI—online point-of-interest (POI) data—and develop methods to utilize such POI data to estimate disaggregated land use (i.e., employment size by category) at a very high spatial resolution (census block level) using part of the Boston metropolitan area as an example. With recent advances in activity-based land use, transportation, and environment (LUTE) models, such disaggregated land use data become important to allow LUTE models to analyze and simulate a person’s choices of work location and activity destinations and to understand policy impacts on future cities. These data can also be used as alternatives to explore economic activities at the local level, especially as government-published census-based disaggregated employment data have become less available in the recent decade. Our new approach provides opportunities for cities to estimate land use at high resolution with low cost by utilizing VGI while ensuring its quality with a certain accuracy threshold. The automatic classification of POI can also be utilized for other types of analyses on cities.Keywords
Information extraction; Machine learning; Points of interest; Land use; Volunteered geographic informationRelated Project
InfoCrowds - Social Web Information Retrieval for crowds mobility managementJournal
Computers, Environment and Urban Systems, J.C. Thill, January 2015DOI
Cited by
Year 2020 : 2 citations
Hu, S., He, Z., Wu, L., Yin, L., Xu, Y., & Cui, H. (2020). A framework for extracting urban functional regions based on multiprototype word embeddings using points-of-interest data. Computers, Environment and Urban Systems, 80, 101442
Zhou, W., Ming, D., Lv, X., Zhou, K., Bao, H., & Hong, Z. (2020). SO–CNN based urban functional zone fine division with VHR remote sensing image. Remote Sensing of Environment, 236, 111458
Year 2019 : 37 citations
Yue, W., Chen, Y., Zhang, Q., & Liu, Y. (2019). Spatial Explicit Assessment of Urban Vitality Using Multi-Source Data: A Case of Shanghai, China. Sustainability, 11(3), 638.
Sparks, K., Thakur, G., Pasarkar, A., & Urban, M. (2019). A global analysis of cities’ geosocial temporal signatures for points of interest hours of operation. International Journal of Geographical Information Science, 1-18.
De Kok, R., Mauri, A., & Bozzon, A. (2019). Automatic processing of user-generated content for the description of energy-consuming activities at individual and group level. Energies, 12(1), 15.
Cao, K., Guo, H., & Zhang, Y. (2019). Comparison of approaches for urban functional zones classification based on multi-source geospatial data: A case study in Yuzhong district, Chongqing, China. Sustainability, 11(3), 660.
Niu, H., & Silva, E. (2019, September). Crowdsourced Data Mining for Urban Activity: Review of Data Sources, Applications, and Methods. ASCE.
Ge, P., He, J., Zhang, S., Zhang, L., & She, J. (2019). An Integrated Framework Combining Multiple Human Activity Features for Land Use Classification. ISPRS International Journal of Geo-Information, 8(2), 90.
Yi, D., Yang, J., Liu, J., Liu, Y., & Zhang, J. (2019). Quantitative Identification of Urban Functions with Fishers’ Exact Test and POI Data Applied in Classifying Urban Districts: A Case Study within the Sixth Ring Road in Beijing. ISPRS International Journal of Geo-Information, 8(12), 555.
Hu, Y., & Han, Y. (2019). Identification of Urban Functional Areas Based on POI Data: A Case Study of the Guangzhou Economic and Technological Development Zone. Sustainability, 11(5), 1385.
Lee, D., & Lee, S. (2019, September). Inferring the character of urban commercial areas from age-biased online search results: how place recommendation data can reveal dynamic seoul neighborhoods. In Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers (pp. 991-995).
Wu, J., Li, J., & Ma, Y. (2019). Exploring the Relationship between Potential and Actual of Urban Waterfront Spaces in Wuhan Based on Social Networks. Sustainability, 11(12), 3298.
Liu, X., Huang, Q., & Gao, S. (2019). Exploring the uncertainty of activity zone detection using digital footprints with multi-scaled DBSCAN. International Journal of Geographical Information Science, 33(6), 1196-1223.
Han, Z., Long, Y., Wang, X., & Hou, J. (2019). Urban redevelopment at the block level: Methodology and its application to all Chinese cities. Environment and Planning B: Urban Analytics and City Science, 2399808319843928.
Min, M., Lin, C., Duan, X., Jin, Z., & Zhang, L. (2019). Spatial distribution and driving force analysis of urban heat island effect based on raster data: a case study of the Nanjing metropolitan area, China. Sustainable Cities and Society, 101637.
Yang, J., Zhu, J., Sun, Y., & Zhao, J. (2019). Delimitating urban commercial central districts by combining kernel density estimation and road intersections: a case study in nanjing city, china. ISPRS International Journal of Geo-Information, 8(2), 93.
Zhao, Y., Li, Q., Zhang, Y., & Du, X. (2019). Improving the Accuracy of Fine-Grained Population Mapping Using Population-Sensitive POIs. Remote Sensing, 11(21), 2502.
Sideris, N., Bardis, G., Voulodimos, A., Miaoulis, G., & Ghazanfarpour, D. (2019). Using Random Forests on Real-World City Data for Urban Planning in a Visual Semantic Decision Support System. Sensors, 19(10), 2266.
Hong, Y., & Yao, Y. (2019). Hierarchical community detection and functional area identification with OSM roads and complex graph theory. International Journal of Geographical Information Science, 33(8), 1569-1587.
Gao, J., Zhang, Y. C., & Zhou, T. (2019). Computational socioeconomics. Physics Reports.
Zhu, Y., Deng, X., & Newsam, S. (2019). Fine-grained land use classification at the city scale using ground-level images. IEEE Transactions on Multimedia.
Zhai, W., Bai, X., Shi, Y., Han, Y., Peng, Z. R., & Gu, C. (2019). Beyond Word2vec: An approach for urban functional region extraction and identification by combining Place2vec and POIs. Computers, Environment and Urban Systems, 74, 1-12.
Pan, Y., Chen, S., Li, T., Niu, S., & Tang, K. (2019). Exploring spatial variation of the bus stop influence zone with multi-source data: A case study in Zhenjiang, China. Journal of Transport Geography, 76, 166-177.
Ye, T., Zhao, N., Yang, X., Ouyang, Z., Liu, X., Chen, Q., ... & Jia, P. (2019). Improved population mapping for China using remotely sensed and points-of-interest data within a random forests model. Science of the total environment, 658, 936-946.
Yang, X., Ye, T., Zhao, N., Chen, Q., Yue, W., Qi, J., ... & Jia, P. (2019). Population Mapping with Multisensor Remote Sensing Images and Point-Of-Interest Data. Remote sensing, 11(5), 574.
Li, Q., Zhou, S., & Wen, P. (2019). The relationship between centrality and land use patterns: Empirical evidence from five Chinese metropolises. Computers, Environment and Urban Systems, 78, 101356.
Martí, P., Serrano-Estrada, L., & Nolasco-Cirugeda, A. (2019). Social media data: Challenges, opportunities and limitations in urban studies. Computers, Environment and Urban Systems, 74, 161-174.
Chen, M., Arribas-Bel, D., & Singleton, A. (2019). Understanding the dynamics of urban areas of interest through volunteered geographic information. Journal of Geographical Systems, 21(1), 89-109.
Lee, D., & Lee, S. (2019). Inferring the Character of Urban Commercial Areas from Age-biased Online Search Results.
Martí Ciriquián, P., Serrano-Estrada, L., & Nolasco-Cirugeda, A. (2019). Social Media data: Challenges, opportunities and limitations in urban studies.
Yang, C. (2019). A new perspective on urban form with the integration of Space Syntax and MCDA–An exploratory analysis of the city of Xi’an, China (Master's thesis, University of Waterloo).
Lin, Y., & Geertman, S. (2019, July). Can Social Media Play a Role in Urban Planning? A Literature Review. In International Conference on Computers in Urban Planning and Urban Management (pp. 69-84). Springer, Cham.
Wu Wanyu, & Niu Xinyi. (2019). Research on the Impact of the Diversity of the Built Environment on the Vitality of Streets: A Case Study of Nanjing West Road in Shanghai. Southern Architecture , (2), 14.
Soundararaj, B. (2019). Estimating Footfall From Passive Wi-Fi Signals: Case Study with Smart Street Sensor Project (Doctoral dissertation, UCL (University College London)).
Bahadorizadeh, H., & Malek, M. R. (2019). User Generate Spatial Content in Land Administration and Cadastre: Types and Usage. Geospatial Engineering Journal, 10(2), 51-62.
Vedernikov, O. (2019). Optimal route planning for hitchhiking (Doctoral dissertation). University of Melbourne, Australia. http://hdl.handle.net/11343/227596
Sideris, N. (2019). Spatial decision support in urban environments using machine learning, 3D geo-visualization and semantic integration of multi-source data (Doctoral dissertation). Université de Limoges, France. https://tel.archives-ouvertes.fr/tel-02449667/file/2019LIMO0083.pdf
Chen, E., Ye, Z., Wang, C., & Zhang, W. (2019). Discovering the spatio-temporal impacts of built environment on metro ridership using smart card data. Cities, 95, 102359.
Firzatullah, R. M. (2019). A Development of Spatial Skyline Query Based on Surrounding Environment for Data Streaming Using Apache-Spark (Master thesis dissertation). Institut Pertanian Bogor University, Indonesia. http://repository.ipb.ac.id/handle/123456789/98738
Year 2018 : 22 citations
Akerkar, R., & Hong, M. (2018, May). Unlocking Value from Ubiquitous Data. In International Conference on Information and Communication Technologies in Education, Research, and Industrial Applications (pp. 3-17). Springer, Cham.
Rosina, K., Batista e Silva, F., Vizcaino, P., Marín Herrera, M., Freire, S., & Schiavina, M. (2018). Increasing the detail of European land use/cover data by combining heterogeneous data sets. International Journal of Digital Earth, 1-25.
Fan, D., Qin, K., & Kang, C. (2018, June). Understanding Urban Functionality from POI Space. In 2018 26th International Conference on Geoinformatics (pp. 1-6). IEEE.
Mou, F., He, Y., Peng, J., Ma, Y., Zheng, Z. Z., Wang, S. L., & Li, J. (2018, December). A New Urban Functional Regions Minig Method with MPETM. In 2018 15th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) (pp. 73-76). IEEE.
Liu, X. (2018). Detection and Exploration of Individual Semantic Trajectories Using Social Media Data (Doctoral dissertation).
Yu, Y., Li, J., Zhu, C., & Plaza, A. (2018). Urban Impervious Surface Estimation from Remote Sensing and Social Data. Photogrammetric Engineering & Remote Sensing, 84(12), 771-780.
Wang, Y., de Almeida Correia, G. H., van Arem, B., & Timmermans, H. H. (2018). Understanding travellers’ preferences for different types of trip destination based on mobile internet usage data. Transportation Research Part C: Emerging Technologies, 90, 247-259.
Lin, J., & Cromley, R. G. (2018). Inferring the home locations of Twitter users based on the spatiotemporal clustering of Twitter data. Transactions in GIS, 22(1), 82-97.
Chen, W., Huang, H., Dong, J., Zhang, Y., Tian, Y., & Yang, Z. (2018). Social functional mapping of urban green space using remote sensing and social sensing data. ISPRS Journal of Photogrammetry and Remote Sensing, 146, 436-452.
Aubrecht, C., Ungar, J., Aubrecht, D. O., Freire, S., & Steinnocher, K. (2018). Mapping Land Use Dynamics Using the Collective Power of the Crowd. In Earth Observation Open Science and Innovation (pp. 247-253). Springer, Cham.
Novack, T., Peters, R., & Zipf, A. (2018). Graph-Based Matching of Points-of-Interest from Collaborative Geo-Datasets. ISPRS International Journal of Geo-Information, 7(3), 117.
Zhou, H., & Hirasawa, K. (2018). Spatiotemporal traffic network analysis: technology and applications. Knowledge and Information Systems, 1-37.
Huang, L., Wu, Y., Zheng, Q., Zheng, Q., Zheng, X., Gan, M., ... & Zhang, J. (2018). Quantifying the Spatiotemporal Dynamics of Industrial Land Uses through Mining Free Access Social Datasets in the Mega Hangzhou Bay Region, China. Sustainability, 10(10), 3463.
Wang, S., Xu, G., & Guo, Q. (2018). Street Centralities and Land Use Intensities Based on Points of Interest (POI) in Shenzhen, China. ISPRS International Journal of Geo-Information, 7(11), 425.
Yang, S., Shen, J., Kone?ný, M., Wang, Y., & Štampach, R.( 2018). STUDY ON THE SPATIAL HETEROGENEITY OF THE POI QUALITY IN OPENSTREETMAP.
Liu, X., Niu, N., Liu, X., Jin, H., Ou, J., Jiao, L., & Liu, Y. (2018). Characterizing mixed-use buildings based on multi-source big data. International Journal of Geographical Information Science, 32(4), 738-756.
Song, J., Lin, T., Li, X., & Prishchepov, A. (2018). Mapping Urban Functional Zones by Integrating Very High Spatial Resolution Remote Sensing Imagery and Points of Interest: A Case Study of Xiamen, China. Remote Sensing, 10(11), 1737.
Martí, P., Serrano-Estrada, L., & Nolasco-Cirugeda, A. (2018). Social Media data: Challenges, opportunities and limitations in urban studies. Computers, Environment and Urban Systems.
Lei, P., Marfia, G., Pau, G., & Tse, R. (2018). Can we monitor the natural environment analyzing online social network posts? A literature review. Online Social Networks and Media, 5, 51-60.
Zhu, Y., Deng, X., & Newsam, S. (2018). Fine-grained land use classification at the city scale using ground-level images. arXiv preprint arXiv:1802.02668.
Chen, Y., Ge, Y., An, R., & Chen, Y. (2018). Super-Resolution Mapping of Impervious Surfaces from Remotely Sensed Imagery with Points-of-Interest. Remote Sensing, 10(2), 242.
Jana, Arnab & Verma, Deepank & Ramamritham, Krithivasan. (2018). HOW DIVERSE ARE THE NEIGHBOURHOODS? A DIVERSITY INDEX TO ASSESS LAND USE MIX THROUGH OPEN SOURCE AND ONLINE DATASETS.
Year 2017 : 27 citations
Lin J, Cromley RG. Inferring the home locations of Twitter users based on the spatiotemporal clustering of Twitter data. Transactions in GIS. 2017;00:1–16. https://doi.org/10.1111/tgis.12297
Khoshamooz, G. and Taleai, M. (2017), Multi-Domain User-Generated Content Based Model to Enrich Road Network Data for Multi-Criteria Route Planning. Geogr Anal, 49: 239–267. doi:10.1111/gean.12124
Zhang, Y.; Li, Q.; Huang, H.; Wu, W.; Du, X.; Wang, H. The Combined Use of Remote Sensing and Social Sensing Data in Fine-Grained Urban Land Use Mapping: A Case Study in Beijing, China. Remote Sens. 2017, 9, 865.
Jeyasree, J., & Bhuvaneshwari, K. (2017). The Segmentation of Age Related Macular Degeneration in Color Fundus Image. Asian Journal of Applied Science and Technology (AJAST), 1(3), 27-30.
Bao, J., Xu, C., Liu, P., & Wang, W. (2017). Exploring Bikesharing Travel Patterns and Trip Purposes Using Smart Card Data and Online Point of Interests. Networks and Spatial Economics, 1-23.
Jia, T., & Ji, Z. (2017). Understanding the Functionality of Human Activity Hotspots from Their Scaling Pattern Using Trajectory Data. ISPRS International Journal of Geo-Information, 6(11), 341.
Wang, H., Dong, Y., & Zhang, K. (2017, May). A spatial-temporal model to improve PM2. 5 inference. In Computer and Information Science (ICIS), 2017 IEEE/ACIS 16th International Conference on (pp. 173-177). IEEE.
Xing, H., Meng, Y., Hou, D., Song, J., & Xu, H. (2017). Employing Crowdsourced Geographic Information to Classify Land Cover with Spatial Clustering and Topic Model. Remote Sensing, 9(6), 602.
Xing, H., Meng, Y., Hou, D., Cao, F., & Xu, H. (2017). Exploring point-of-interest data from social media for artificial surface validation with decision trees. International Journal of Remote Sensing, 38(23), 6945-6969.
Mesbah, S., Bozzon, A., Lofi, C., & Houben, G. J. (2017, February). Describing data processing pipelines in scientific publications for big data injection. In Proceedings of the 1st Workshop on Scholarly Web Mining (pp. 1-8). ACM.
Emmanouil Chaniotakis, Constantinos Antoniou, Georgia Aifadopoulou, and Loukas Dimitriou. Inferring Activities from Social Media Data. Transportation Research Record: Journal of the Transportation Research Board 2017 2666:, 29-37
Xiao, Y.; Chen, X.; Li, Q.; Yu, X.; Chen, J.; Guo, J. Exploring Determinants of Housing Prices in Beijing: An Enhanced Hedonic Regression with Open Access POI Data. ISPRS Int. J. Geo-Inf. 2017, 6, 358.
Chaniotakis, E., Antoniou, C., Aifadopoulou, G., & Dimitriou, L. (2017). Inferring activities from social media data. Transportation research record, 2666(1), 29-37.
e Silva, F. B., Rosina, K., Schiavina, M., Marin, M., Freire, S., Craglia, M., & Lavalle, C. (2017). Spatiotemporal mapping of population in Europe: The “ENACT” project in a nutshell. In 57th european regional science association (ERSA) congress.
Lei, P., Marfia, G., Pau, G., & Tse, R. (2017). Online Social Networks and Media.
Deng, X., & Newsam, S. (2017, November). Quantitative Comparison of Open-Source Data for Fine-Grain Mapping of Land Use. In Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics (p. 4). ACM.
Wood, S., Muthyala, R., Jin, Y., Qin, Y., Rukadikar, N., Rai, A., & Gao, H. (2017, December). Automated industry classification with deep learning. In Big Data (Big Data), 2017 IEEE International Conference on (pp. 122-129). IEEE.
Touya, Guillaume, et al. "Assessing Crowdsourced POI Quality: Combining Methods Based on Reference Data, History, and Spatial Relations." ISPRS International Journal of Geo-Information 6.3 (2017): 80.
Ermagun, Alireza, et al. "Real-time trip purpose prediction using online location-based search and discovery services." Transportation Research Part C: Emerging Technologies 77 (2017): 96-112.
Liu, Xiaoping, et al. "Classifying urban land use by integrating remote sensing and social media data." International Journal of Geographical Information Science (2017): 1-22.
Yue, Yang, et al. "Measurements of POI-based mixed use and their relationships with neighbourhood vibrancy." International Journal of Geographical Information Science 31.4 (2017): 658-675.
Gao, Song, Krzysztof Janowicz, and Helen Couclelis. "Extracting urban functional regions from points of interest and human activities on location?based social networks." Transactions in GIS 21.3 (2017): 446-467.
Yao, Yao, et al. "Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model." International Journal of Geographical Information Science 31.4 (2017): 825-848.
Ricciato, Fabio, et al. "Beyond the “single-operator, CDR-only” paradigm: An interoperable framework for mobile phone network data analyses and population density estimation." Pervasive and Mobile Computing 35 (2017): 65-82.
Yao, Yao, et al. "Simulating urban land-use changes at a large scale by integrating dynamic land parcel subdivision and vector-based cellular automata." International Journal of Geographical Information Science (2017): 1-28.
Niu, Ning, et al. "Integrating multi-source big data to infer building functions." International Journal of Geographical Information Science (2017): 1-20.
Chen, Y., Liu, X., Li, X., Liu, X., Yao, Y., Hu, G., Xu, X., Pei, F. Delineating urban functional areas with building-level social media data: A dynamic time warping (DTW) distance based k-medoids method. Landscape and Urban Planning
volume 160, issue , year 2017, pp. 48 - 60
Year 2016 : 11 citations
Muhammad Adnan, Francisco C. Pereira, Carlos Lima Azevedo, Kakali Basak, Milan Lovric, Sebastián Raveau, Yi Zhu, Joseph Ferreira, Christopher Zegras, Moshe Ben-Akiva, SimMobility: A Multi-scale Integrated Agent-Based Simulation Platform (2016)
Yao, Yao, et al. "Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model." International Journal of Geographical Information Science (2016): 1-24.
Vedernikov, Oleksii, Lars Kulik, and Kotagiri Ramamohanarao. "The Hitchhiker’s guide to the pick-up locations." Open Geospatial Data, Software and Standards 1.1 (2016): 12.
Gong, X. "Exploring Human Activity Patterns Across Cities through Social Media Data." MSc Thesis. TU Delft. Netherlands (2016).
Umwelt, Ingenieurfakultät Bau Geo. "Visual Analysis of Large Floating Car Data-A Bridge-Maker between Thematic Mapping and Scientific Visualization." Master Thesis. 2016 TECHNISCHE UNIVERSITÄT MÜNCHEN
Psyllidis, Achilleas. "Revisiting Urban Dynamics through Social Urban Data." A+ BE| Architecture and the Built Environment 6.18 (2016): 1-334.
Ricciato, Fabio, et al. "Beyond the “single-operator, CDR-only” paradigm: An interoperable framework for mobile phone network data analyses and population density estimation." Pervasive and Mobile Computing (2016).
Milad Mirbabaie, Stefan Stieglitz, and Stephan Volkeri. 2016. Volunteered Geographic Information and Its Implications for Disaster Management. In Proceedings of the 2016 49th Hawaii International Conference on System Sciences (HICSS) (HICSS '16). IEEE Computer Society, Washington, DC, USA, 207-216. DOI=http://dx.doi.org/10.1109/HICSS.2016.33
Guy Lansley, Paul A. Longley, The geography of Twitter topics in London, Computers, Environment and Urban Systems, Volume 58, July 2016, Pages 85-96, ISSN 0198-9715, http://dx.doi.org/10.1016/j.compenvurbsys.2016.04.002.
Jonietz, D.; Zipf, A. Defining Fitness-for-Use for Crowdsourced Points of Interest (POI). ISPRS Int. J. Geo-Inf. 2016, 5, 149. doi:10.3390/ijgi5090149
Yimin Chen, Xiaoping Liu, Xia Li, Xingjian Liu, Yao Yao, Guohua Hu, Xiaocong Xu, Fengsong Pei, Delineating urban functional areas with building-level social media data: A dynamic time warping (DTW) distance based k-medoids method, Landscape and Urban Planning, Volume 160, April 2017, Pages 48-60, ISSN 0169-2046, http://dx.doi.org/10.1016/j.landurbplan.2016.12.001.
Year 2015 : 1 citations
E. Chaniotakis, C. Antoniou and E. Mitsakis.Data for Leisure Travel Demand from Social Networking Services. hEART 2015. 4th symposium of European Association for Research in Transportation. September 2015. http://www.heart2015.transport.dtu.dk/-/media/Sites/hEART2015/abstracts hEART/hEART_2015_submission_60.ashx?la=da