Word frequencies and bigrams in bahasa Melayu
This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular te...
| Main Authors: | Zuraidah Mohd. Don, Gerry Knowles |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Universiti Malaya, 2017-07-01 |
| Series: | Journal of Modern Languages |
| Online Access: | http://borneojournal.um.edu.my/index.php/JML/article/view/3746 |
| _version_ | 1850146919800635392 |
|---|---|
| author | Zuraidah Mohd. Don Gerry Knowles |
| author_facet | Zuraidah Mohd. Don Gerry Knowles |
| author_sort | Zuraidah Mohd. Don |
| collection | DOAJ |
| description | This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular text. Bigrams are studied both as sequences of individual words and as sequences of grammatical tags. Whereas the tag sequences reflect syntactic rules and thus the hierarchical structure of syntax, sequences of individual words reflect quite a different kind of linear structure which has begun to emerge in recent years in corpus linguistics. |
| format | Article |
| id | doaj-art-433ca4bee37845fda1000e51263c1047 |
| institution | OA Journals |
| issn | 1675-526X 2462-1986 |
| language | English |
| publishDate | 2017-07-01 |
| publisher | Universiti Malaya |
| record_format | Article |
| series | Journal of Modern Languages |
| spelling | doaj-art-433ca4bee37845fda1000e51263c1047 2025-08-20T02:27:42Z eng Universiti Malaya Journal of Modern Languages 1675-526X 2462-1986 2017-07-01 151 Word frequencies and bigrams in bahasa Melayu Zuraidah Mohd. Don 0 Gerry Knowles 1 University of Malaya, Malaysia Lancaster University, UK This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular text. Bigrams are studied both as sequences of individual words and as sequences of grammatical tags. Whereas the tag sequences reflect syntactic rules and thus the hierarchical structure of syntax, sequences of individual words reflect quite a different kind of linear structure which has begun to emerge in recent years in corpus linguistics. http://borneojournal.um.edu.my/index.php/JML/article/view/3746 |
| spellingShingle | Zuraidah Mohd. Don Gerry Knowles Word frequencies and bigrams in bahasa Melayu Journal of Modern Languages |
| title | Word frequencies and bigrams in bahasa Melayu |
| title_full | Word frequencies and bigrams in bahasa Melayu |
| title_fullStr | Word frequencies and bigrams in bahasa Melayu |
| title_full_unstemmed | Word frequencies and bigrams in bahasa Melayu |
| title_short | Word frequencies and bigrams in bahasa Melayu |
| title_sort | word frequencies and bigrams in bahasa melayu |
| url | http://borneojournal.um.edu.my/index.php/JML/article/view/3746 |
| work_keys_str_mv | AT zuraidahmohddon wordfrequenciesandbigramsinbahasamelayu AT gerryknowles wordfrequenciesandbigramsinbahasamelayu |