The Classical Model of Type-Token Systems Compared with Items from the Standardized Project Gutenberg Corpus
We compare the “classical” equations of type-token systems, namely Zipf’s laws, Heaps’ law and the relationships between their indices, with data selected from the Standardized Project Gutenberg Corpus (SPGC). Selected items all exceed 100,000 word-tokens and are trimmed to 100,000 word-tokens each....
| Main Authors: | Martin Tunnicliffe, Gordon Hunter |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | MDPI AG, 2025-06-01 |
| Series: | Analytics |
| Online Access: | https://www.mdpi.com/2813-2203/4/2/16 |
Similar Items
- Diachronic Emergence of Zipf-like Patterns in Construction-Specific Frequency Distributions: A Quantitative Study of the Way Too Construction
  by: Quentin Feltgen
  Published: (2020-12-01)
- Scaling Laws in Language Families
  by: Maelyson Rolim Fonseca dos Santos, et al.
  Published: (2025-05-01)
- Compilation, Analysis and Application of a Comprehensive Bangla Corpus KUMono
  by: Aysha Akther, et al.
  Published: (2022-01-01)
- Bottlenose Dolphins’ Clicks Comply with Three Laws of Efficient Communication
  by: Arthur Stepanov, et al.
  Published: (2025-06-01)
- Relevansi Pemeringkatan Kata Kunci Dengan Menggunakan Dalil Zipf Pada Abstrak Skripsi Hukum Perdata Fakultas Hukum Universitas Sriwijaya Tahun 2018-2022
  by: Novita Vitriana, et al.
  Published: (2023-12-01)