Before and After COVID-19 Outbreak Using Variance Representation Comparative Analysis of Newspaper Articles on the Travel Hotel Industry

Yeqing Yang, Yumi Asahi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study explores the impact of COVID-19 on Japan’s travel industry by analyzing differences before and after the pandemic through articles from the Nihon Keizai Shimbun. It employs text mining techniques like Latent Semantic Analysis (LSA) and BERT (a natural language model) to process and categorize information from newspaper titles. The analysis involves extracting nouns using MeCab, creating a frequency matrix, decomposing it with NMF for clustering, and setting topics. BERT is used for text classification, focusing on token attention weights and variance representation. The data includes articles from Nikkei Morning News pre- and post-COVID-19, specifically tagged with “Travel & Hotel,” totaling 792 articles. Analysis revealed ten topics such as vaccines, business structures, and financial results. Hierarchical clustering grouped these topics across eight clusters. Findings indicate a shift in topics post-COVID-19 towards financial impacts and business activities, highlighting tokens related to company activities and keywords associated with the pandemic. Future work aims at improving classification accuracy and leveraging data insights.

Original languageEnglish
Title of host publicationHuman Interface and the Management of Information - Thematic Area, HIMI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Proceedings
EditorsHirohiko Mori, Yumi Asahi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages157-174
Number of pages18
ISBN (Print)9783031601132
DOIs
Publication statusPublished - 2024
EventThematic Area Human Interface and the Management of Information, HIMI 2024, Held as Part of the 26th HCI International Conference, HCII 2024 - Washington, United States
Duration: 29 Jun 20244 Jul 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14690 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceThematic Area Human Interface and the Management of Information, HIMI 2024, Held as Part of the 26th HCI International Conference, HCII 2024
Country/TerritoryUnited States
CityWashington
Period29/06/244/07/24

Keywords

  • BERT Text Classification
  • COVID-19
  • Travel Hotel Industry of Japan

Fingerprint

Dive into the research topics of 'Before and After COVID-19 Outbreak Using Variance Representation Comparative Analysis of Newspaper Articles on the Travel Hotel Industry'. Together they form a unique fingerprint.

Cite this