Automated Construction and Mining of Text-Based Modern Chinese Character Databases: A Case Study of Fujian
Historical figures are crucial for understanding historical processes and social changes. However, existing databases of historical figures primarily focused on ancient Chinese individuals and are limited by the simplistic organization of textual information, lacking structured processing. Therefore...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-04-01
|
| Series: | Information |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2078-2489/16/4/324 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Historical figures are crucial for understanding historical processes and social changes. However, existing databases of historical figures primarily focused on ancient Chinese individuals and are limited by the simplistic organization of textual information, lacking structured processing. Therefore, this study proposes an automatic method for constructing a spatio-temporal database of modern Chinese figures. The character state transition matrix reveals the spatio-temporal evolution of historical figures, while the random walk algorithm identifies their primary migration patterns. Using historical figures from Fujian Province (1840–2009) as a case study, the results demonstrate that this method effectively constructs the spatio-temporal chain of figures, encompassing time, space, and events. The character state transition matrix indicates a fluctuating trend of state change from 1840 to 2009, initially increasing and then decreasing. By applying keyword extraction and the random walk method, this study finds that the state transitions and their causes align with the historical trends. The four-dimensional analytical framework of “character-time-space-event” established in this study holds significant value for the field of digital humanities. |
|---|---|
| ISSN: | 2078-2489 |