Word frequencies and bigrams in bahasa Melayu

This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular te...

Bibliographic Details
Main Authors: Zuraidah Mohd. Don, Gerry Knowles
Format: Article
Language: English
Published: Universiti Malaya 2017-07-01
Series: Journal of Modern Languages
Online Access: http://borneojournal.um.edu.my/index.php/JML/article/view/3746
_version_ 1850146919800635392
author Zuraidah Mohd. Don
Gerry Knowles
author_facet Zuraidah Mohd. Don
Gerry Knowles
author_sort Zuraidah Mohd. Don
collection DOAJ
description This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular text. Bigrams are studied both as sequences of individual words and as sequences of grammatical tags. Whereas the tag sequences reflect syntactic rules and thus the hierarchical structure of syntax, sequences of individual words reflect quite a different kind of linear structure, which has begun to emerge in recent years in corpus linguistics.
format Article
id doaj-art-433ca4bee37845fda1000e51263c1047
institution OA Journals
issn 1675-526X
2462-1986
language English
publishDate 2017-07-01
publisher Universiti Malaya
record_format Article
series Journal of Modern Languages
spelling doaj-art-433ca4bee37845fda1000e51263c1047 2025-08-20T02:27:42Z eng Universiti Malaya Journal of Modern Languages 1675-526X 2462-1986 2017-07-01 151 Word frequencies and bigrams in bahasa Melayu Zuraidah Mohd. Don 0 Gerry Knowles 1 University of Malaya, Malaysia Lancaster University, UK This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular text. Bigrams are studied both as sequences of individual words and as sequences of grammatical tags. Whereas the tag sequences reflect syntactic rules and thus the hierarchical structure of syntax, sequences of individual words reflect quite a different kind of linear structure, which has begun to emerge in recent years in corpus linguistics. http://borneojournal.um.edu.my/index.php/JML/article/view/3746
spellingShingle Zuraidah Mohd. Don
Gerry Knowles
Word frequencies and bigrams in bahasa Melayu
Journal of Modern Languages
title Word frequencies and bigrams in bahasa Melayu
title_full Word frequencies and bigrams in bahasa Melayu
title_fullStr Word frequencies and bigrams in bahasa Melayu
title_full_unstemmed Word frequencies and bigrams in bahasa Melayu
title_short Word frequencies and bigrams in bahasa Melayu
title_sort word frequencies and bigrams in bahasa melayu
url http://borneojournal.um.edu.my/index.php/JML/article/view/3746
work_keys_str_mv AT zuraidahmohddon wordfrequenciesandbigramsinbahasamelayu
AT gerryknowles wordfrequenciesandbigramsinbahasamelayu