Search query and tourism forecasting during the pandemic: When and where can digital footprints be helpful as predictors?

https://doi.org/10.1016/j.annals.2022.103365Get rights and content

Highlights

  • We predicted daily tourism demand across 74 countries in 2020.

  • A lasso method was used to incorporate Google search queries as predictors.

  • We evaluated the usefulness of search queries in boosting forecasting accuracy.

  • Usefulness of search queries in forecasting is associated with pandemic severity.

Abstract

During the COVID-19 pandemic, daily tourism demand forecasting provides actionable insight on tourism operations amid intense uncertainty. This paper applies the lasso method to predict daily tourism demand across 74 countries in 2020. We evaluate the usefulness of online search queries in boosting forecasting accuracy. The lasso method is used to select appropriate predictors and their lag orders. Results indicate that, in general, no evidence supports the usefulness of Google Trends data in generating more accurate forecasts. However, in some countries, the data can be useful for reducing the forecasting errors. Regression analysis further demonstrates that the relative usefulness of online search queries is associated with pandemic severity, the dominance of inbound tourism, and island geography. Lastly, implications are provided.

Introduction

The outbreak of COVID-19, a contagious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has significantly affected the global tourism and travel industry. Declared a Public Health Emergency of International Concern on January 30, 2021, the pandemic has led to 180,817,269 confirmed cases and 3,923,238 deaths as of June 28, 2021 (WHO, 2021). Tourism and travel involve human movement and interaction and may contribute to the spread of disease (Farzanegan et al., 2021). Therefore, many countries have shut down their borders and enforced mobility restrictions; these policies dealt a devastating blow to the global tourism industry. As demand plummeted, many tourism businesses, such as airlines, hotels, and attractions, laid off and furloughed employees. A supply shrink followed. According to UNWTO statistics, the pandemic led to a loss of US $910 billion to US $1.2 trillion from tourism exports and was projected to reduce global GDP by 1.5% to 2.8% in 2020 (UNWTO, 2021).

During this highly uncertain time, tourism forecasting has come to play a more important role than ever in informing decisions among government personnel, industry professionals, and other tourism stakeholders. First, forecasting allows for tourism recovery predictions and can provide valuable insight for tourism policy design and implementation. Second, as demand has been volatile during the pandemic, accurate forecasts can help industry players better allocate their resources (e.g., inventory and staffing) to serve incoming demand. In addition to industry forecasting reports, tourism researchers have striven to develop and evaluate multiple forecasting methods to predict tourism demand at different levels (Kourentzes et al., 2021; Liu et al., 2021; Qiu et al., 2021). Various forecasting methods have since been leveraged and compared in terms of forecasting accuracy. Examples include time series models (Wickramasinghe & Ratnasiri, 2021), artificial neural network (ANN) models (Polyzos et al., 2021), stacking models (Qiu et al., 2021), forecasting combinations (Kourentzes et al., 2021), and the scenario-based mixed judgmental forecasting method (Liu et al., 2021).

Many tourism studies have highlighted the importance of big data indicators in enhancing tourism forecasting accuracy (Tian et al., 2021). Digital footprints, such as online search queries, web traffic, and social media posts, reflect prospective travelers' tourism interests and can reasonably predict tourism demand. Customer information processing theory suggests that consumers' path to purchase entails a funnel-like process from search to purchase. Digital footprints can reflect information about consumers' potential choices, some of which will be bought. Empirical studies have confirmed the usefulness of these footprints in improving tourism forecasting accuracy in various contexts (Pan et al., 2012; Pan and Yang, 2017a, Pan and Yang, 2017b); however, it remains unclear whether this usefulness applies in the COVID-19 era due to the volatility of tourism demand and travelers' unique information search behavior during this crisis.

To bridge this knowledge gap, we adopted the lasso method to forecast daily tourism demand across 74 countries in 2020. The lasso method is particularly useful when selecting variables from high-frequency, high-dimensional predictors of tourism demand (Tian et al., 2021). Different predictors were incorporated into our model to improve forecasting accuracy. More specifically, we compared the forecasting performance of a model with and without Google search query data. Doing so enabled us to examine the effectiveness of digital footprints in reducing forecasting error during the pandemic. Following our forecasting performance comparison, we employed regression analysis to understand when and where Google search query data were most helpful in improving forecasting accuracy. Results make at least three major contributions to the literature. First, we evaluated models' forecasting performance for daily tourism demand during the COVID-19 pandemic, providing guidelines for model and predictor selection in a time of great uncertainty. Our findings, based on a global sample, offer generalizable implications for academics and practitioners. Second, we scrutinized whether Google search query data (as a typical form of digital footprints) can improve forecasting accuracy across different forecasting horizons. Our conclusions helped evaluate the potential usefulness of digital footprints in tourism forecasting during this chaotic time. Last but not least, we systematically assessed factors explaining the effectiveness of Google search query data in tourism forecasting via rigorous regression analysis of results from 11,396 forecasting models. Although previous meta-analyses addressed potential determinants of forecasting accuracy (Kim & Schwartz, 2013; Peng et al., 2014), no single study has systematically evaluated these contributions in a context allowing for large-scale model comparison. Our findings thus clarify predictors' applicability in tourism forecasting.

Section snippets

Tourism demand forecasting

Many forecasting methods have been applied in tourism demand analysis, spanning quantitative and qualitative approaches (Law et al., 2019). Among qualitative methods, Delphi and consensus approaches are commonly adopted to forecast tourism demand; these methods rely on qualitative insight from experts in specific tourism markets. Qualitative methods' inherent constraints, such as limited generalizability (Witt & Witt, 1995), have led researchers to become more interested in quantitative means

Data sources

We collected daily data from several sources between January 2019 and November 2020. We referred to hotels' daily occupancy change versus the same-day level in 2019 to measure tourism demand (Yang et al., 2014). These data were obtained from STR, LLC, the global leading hotel data vendor, and were collected from a large and representative sample of hotel properties (Haywood et al., 2017). Additional data were gathered to predict tourism demand. First, we obtained daily flight departure data

Forecasting results

We first estimated the model using the lasso method for each of 74 countries in the dataset. We estimated a model for two periods: from January 1 to July 18, 2020 and from January 1 to November 15, 2020. To evaluate the effectiveness of Google Trends data, we estimated a model with and without Google Trends variables. In total, we estimated 74 (number of countries) × 2 (two forecasting periods: July and November 2020) × 2 (two models: one with and without Google Trends data) × 5 (five λ

Conclusion and discussion

In this study, we forecasted daily tourism demand across 74 countries in 2020. A lasso method was used to estimate the forecasting model based on a set of predictors, including Google search queries, air traffic, governmental policies related to the pandemic, and human mobility patterns. In particular, we assessed the effectiveness of online search queries in tourism forecasting by comparing forecasting errors between models with and without Google Trends data across different forecasting

CRediT authorship contribution statement

Yang Yang: Conceptualization; Formal analysis; Project management.

Yawen Fan: Data curation; Formal analysis; Writing - original draft.

Lan Jiang: Writing - original draft.

Xiaohui Liu: Conceptualization; Visualization; Writing- revision.

Declaration of competing interest

None.

References (85)

  • R. Law et al.

    Tourism demand forecasting: A deep learning approach

    Annals of Tourism Research

    (2019)
  • X. Li et al.

    Forecasting tourism demand with composite search index

    Tourism Management

    (2017)
  • X. Li et al.

    Impacts of COVID-19 on tourists' destination preferences: Evidence from China

    Annals of Tourism Research

    (2021)
  • A. Liu et al.

    Visitor arrivals forecasts amid COVID-19: A perspective from the Europe team

    Annals of Tourism Research

    (2021)
  • L.F. Martins et al.

    An empirical analysis of the influence of macroeconomic determinants on world tourism demand

    Tourism Management

    (2017)
  • M. Meleddu et al.

    Evaluation of individuals’ intention to pay a premium price for ecotourism: An exploratory study

    Journal of Behavioral and Experimental Economics

    (2016)
  • E. Nebolsina

    The impact of the Covid-19 pandemic on the business interruption insurance demand in the United States

    Heliyon

    (2021)
  • B. Peng et al.

    A meta-analysis of international tourism demand forecasting and implications for practice

    Tourism Management

    (2014)
  • R.T.R. Qiu et al.

    Visitor arrivals forecasts amid COVID-19: A perspective from the Asia and Pacific team

    Annals of Tourism Research

    (2021)
  • R. Rivera

    A dynamic linear model to forecast hotel registrations in Puerto Rico using Google trends data

    Tourism Management

    (2016)
  • C. Seabra et al.

    Heterogeneity in risk and safety perceptions of international tourists

    Tourism Management

    (2013)
  • S. Shen et al.

    Combination forecasts of international tourism demand

    Annals of Tourism Research

    (2011)
  • H. Song et al.

    Tourism demand modelling and forecasting—A review of recent research

    Tourism Management

    (2008)
  • H. Song et al.

    A review of research on tourism demand forecasting: Launching the Annals of Tourism Research Curated Collection on tourism demand forecasting

    Annals of Tourism Research

    (2019)
  • S. Sun et al.

    Forecasting tourist arrivals with machine learning and internet search index

    Tourism Management

    (2019)
  • B. Uniejewski et al.

    Understanding intraday electricity markets: Variable selection and very short-term price forecasting using LASSO

    International Journal of Forecasting

    (2019)
  • S.F. Witt et al.

    Forecasting tourism demand: A review of empirical research

    International Journal of Forecasting

    (1995)
  • X. Yang et al.

    Forecasting Chinese tourist volume with search engine data

    Tourism Management

    (2015)
  • Y. Yang et al.

    Spatial-temporal forecasting of tourism demand

    Annals of Tourism Research

    (2019)
  • H. Zhang et al.

    Forecasting tourism recovery amid COVID-19

    Annals of Tourism Research

    (2021)
  • D. Aaronson et al.

    Forecasting unemployment insurance claims in realtime with Google trends

    International Journal of Forecasting

    (2021)
  • H. Akaike

    Maximum likelihood identification of Gaussian autoregressive moving average models

    Biometrika

    (1973)
  • I. Blal et al.

    The differential effects of the quality and quantity of online reviews on hotel room sales

    Cornell Hospitality Quarterly

    (2014)
  • H. Choi et al.

    Predicting the present with Google trends

    The Economic Record

    (2012)
  • H.C. Cousins et al.

    Regional Infoveillance of COVID-19 case rates: Analysis of search-engine query patterns

    Journal of Medical Internet Research

    (2020)
  • A. Dutta et al.

    Brexit uncertainty and volatility persistence in tourism demand

  • M. Ettredge et al.

    Using web-based search data to predict macroeconomic statistics

    Communications of the ACM

    (2005)
  • M.R. Farzanegan et al.

    International tourism and outbreak of coronavirus (COVID-19): A cross-country analysis

    Journal of Travel Research

    (2021)
  • C.J. Ferguson et al.

    Publication bias in psychological science: Prevalence, methods for identifying and controlling, and implications for the use of meta-analyses

    Psychological Methods

    (2012)
  • D. Fesenmaier et al.
  • J. Ginsberg et al.

    Detecting influenza epidemics using search engine query data

    Nature

    (2009)
  • C. Goh et al.

    Analyzing and forecasting tourism demand: A rough sets approach

    Journal of Travel Research

    (2008)
  • Cited by (36)

    • Addressing Google Trends inconsistencies

      2024, Technological Forecasting and Social Change
    View all citing articles on Scopus

    Yang Yang's research focuses on tourism analytics.

    Yawen Fan's research focuses on applied statistics.

    Lan Jiang's research focuses on hospitality finance.

    Xiaohui Liu's research interests include theoretical and applied statistics.

    View full text