2014/08 - present, Assistant Professor, Department of Geography, Texas State University, TX, USA
2014/08 - present, Faculty Scholar, Texas Center for Geographic Information Science, Texas State University, TX, USA
2012/04-2013/12, Visiting Researcher (Academic Guest) , Institute of Cartography and Geoinformation, ETH Zurich, Switzerland
- Evaluating the effectiveness of location-based social media in human activity modeling (in progress)
Recent studies have investigated the use of location-based social media (LBSM) to generalize, model and predict human mobility. However, LBSM data suffer from different levels of data quality issues (such as accuracy, precision, scale, and sampling bias) across population groups. The representativeness and reliability of LBSM data and its influence on the quality of derived human mobility (i.e., inference accuracy) have not been systematically investigated. Answers to these questions are fundamental to the proper use of such data for human mobility related decision making. Given the need to evaluate the efficacy of LBSM data in modeling human mobility, this research proposes to examine the sampling biases of LBSM data and its reliability to support deriving human mobility patterns at both individual and aggregated levels, as different analytical scales may reveal distinct patterns. The evaluation of population bias of LBSM data fills the research gap on the representativeness of LBSM data in human mobility modeling, which has been a continuing challenge in data mining.
Figure 1. Approximating LBSM activity space based on different indicators.
- Exploring human mobility from social network usage - a cross country and cross-cultural study (in progress)
The continued development of social networking websites like Twitter, Flickr and Facebook provides ever-increasing opportunities to explore activity patterns of population groups in diverse geographic environments, social statuses and cultural backgrounds. These sites can be used to gain insight into human activity in an effort to analyze long term choices such as where one lives; medium term choices such as the purchase of cars; and shorter term choices such as activity scheduling on a weekly or daily scale. Despite of potential issues such as low sampling resolution, previous studies have demonstrated the effectiveness of these datasets to construct a more powerful mobility model. However, most existing research focuses only on travel distances and the overall distribution of human mobility without looking into various aspects of human activities. Additionally, there is a deficit in research on how these datasets can be utilized in a cross-country and/or cross-cultural setting. Hence, the proposed research concentrates on characterizing human activities in various cities and countries based on social media data (with a focus on China, which is considered as one of the most rapidly-developing countries globally).
- Exploring inter-country connection in mass media (completed)
The development of theories and techniques for big data analytics offers tremendous possibility for investigating large-scale events and patterns that emerge over space and time. In this research, we used a unique open dataset “The Global Data on Events, Location and Tone” (GDELT) to model the image of China in mass media, specifically, how China has related to the rest of the world and how this connection has evolved upon time. We examined and compared the distance decay effect in mass media, social media, and public transportation data. This study generated valuable input regarding the interpretation of the diplomatic and regional correlation for the nation of China. It also provides methodological references for investigating international relations in other countries and regions in the big data era.
Figure 2. Top 15 countries with the closest connection with China in GDELT dataset
- Characterizing human mobility from mobile phone usage (completed)
This project was funded by the Swiss national science foundation (SNSF) (PI: M.Raubal) and the U.S. Department of Transportation (PI: M.Raubal, Co-PI: Y.Yuan).
Our mobile information society depends increasingly on the use of Information and Communication Technologies (ICTs) such as mobile phones. People’s usage of these technologies impacts many aspects of their lives, but the relationship between ICTs and human activities is not fully known. A better understanding of this relationship will help in predicting people’s mobility patterns and provide important guidelines for maintaining sustainable transportation, updating environmental policies, and designing early warning and emergency response systems. The goal of this project is to develop a framework for extracting and characterizing human mobility patterns from georeferenced mobile phone datasets. We analyzed different types of information that can be stored in mobile phone datasets, and developed human mobility models and data mining methodologies for spatio-temporal knowledge discovery. These models provide the basis for investigating and quantifying the relationship between human physical travel, communication travel, and environmental structure. This research also addressed issues of uncertainty, which may arise from the natural variability of human mobility, the inaccuracy and imprecision of recorded trajectories, and the imperfection of the underlying models. The results advance conventional geographic knowledge discovery by focusing on knowledge extraction from sparse datasets with low resolution and individual attributes. The case study from northeast China allows us to examine the influence of mobile phone usage in a highly populated and rapidly developing country.
Figure 3. Trajectory of an example user
Figure 4. Mobility patterns at different time of a day
- Context-based place name searching system (completed)
With the development of the World Wide Web (WWW), unstructured or semi-structured documents that contain place names are considered an important source for retrieving geographical information. This project (funded by the Chinese National Science Foundation, PI: Y. Liu) designed a geographical knowledge-informed digital gazetteer service, KIDGS, which provides a standard web service that enables XML-based interfaces for various applications. Previous digital gazetteer systems are mostly implemented based on relational databases; however, it is important to consider geographical knowledge, such as categories and relation information when conducting queries in a gazetteer service. To this end, KIDGS adopts OWL(Protégé) and PostGIS to conceptualize and incorporate geographic knowledge into a new place name searching system.
Figure 5. The framework of KIDGS