Urban2Vec

Urban neighborhoods are complex systems comprising local environments, local businesses, and the people who live there. A neighborhood produces multiple modalities of data: street view images that reflect the local environment, and textual data describing local points of interest (POIs), such as the ratings, categories, and customer reviews of restaurants and stores. The characteristics of urban neighborhoods are important for supporting people-centric decision making, yet the common approaches to collecting such information (e.g., a census) are expensive and infrequently updated.

In this project, we propose an unsupervised learning model, called Urban2Vec, that captures neighborhood characteristics from open multi-modal data, including street view images and POI textual data. The model projects each neighborhood into a high-dimensional vector space, and the resulting vector representations support many applications. For example, they can serve as input features for predicting neighborhood-level characteristics such as demographic variables, real estate sale prices, and solar adoption rates. They can also power a search engine over neighborhoods, retrieving the neighborhoods most similar to a query neighborhood based on vector similarity: given a neighborhood in Chicago, for instance, we can retrieve the most and the least similar neighborhoods to it in New York.
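As a rough sketch of how such embeddings might be consumed downstream, the snippet below shows the two uses named above: embeddings as input features for a predictor, and similarity-based neighborhood retrieval. The array names, dimensions, and target variable are illustrative assumptions, not part of the released Urban2Vec code.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical data: one 200-d Urban2Vec-style embedding per neighborhood.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(500, 200))   # embeddings of 500 neighborhoods
median_income = rng.normal(size=500)       # a neighborhood-level target variable

# Use 1: embeddings as input features for a downstream predictor.
model = Ridge().fit(embeddings[:400], median_income[:400])
predictions = model.predict(embeddings[400:])

# Use 2: neighborhood retrieval by vector similarity.
query = embeddings[0]                      # e.g., a Chicago neighborhood
sims = cosine_similarity(query[None, :], embeddings[1:])[0]
most_similar = np.argsort(sims)[::-1][:5]  # indices of the 5 most similar
least_similar = np.argsort(sims)[:5]       # indices of the 5 least similar
```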

Publications:

  • Zhecheng Wang*, Haoyuan Li*, and Ram Rajagopal (2020). Urban2Vec: Incorporating Street View Imagery and POIs for Multi-Modal Urban Neighborhood Embedding. AAAI Conference on Artificial Intelligence. (* equal contribution) (link)

  • Tianyuan Huang*, Zhecheng Wang*, Hao Sheng*, Andrew Ng, and Ram Rajagopal (2021). M3G: Learning Urban Neighborhood Representation from Multi-Modal Multi-Graph. ACM SIGKDD Workshop on Deep Learning for Spatiotemporal Data. (* equal contribution) (link)