Grammar or Crammer? The Role of Morphology in Distinguishing Orthographically Similar but Semantically Unrelated Words
We show that n-gram-based distributional models fail to distinguish unrelated words due to the noise in semantic spaces. This issue remains hidden in conventional benchmarks but becomes more pronounced when orthographic similarity is high. To highlight this problem, we introduce OSimUnr, a dataset o...
| Main Authors: | Gokhan Ercan, Olcay Taner Yildiz |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2025-01-01 |
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10947740/ |
Similar Items
- PAP900: A dataset of semantic relationships between affective words in Portuguese (Mendeley Data)
  by: André Fernandes dos Santos, et al.
  Published: (2025-08-01)
- Orthographic processing of proper names: A proposal to investigate the orthographic cue for second language readers
  by: Kimberly Klassen
  Published: (2022-12-01)
- Morpho-orthographic segmentation on visual word recognition in Brazilian Portuguese speakers
  by: Humberto dos Reis Pereira, et al.
  Published: (2024-11-01)
- Catching a CAPTCHA: the impact of variable input on the processing of emerging orthographic representations
  by: Olga Solaja, et al.
  Published: (2025-01-01)
- Orthographic influence on English word recognition by Spanish and Korean learners
  by: Maria Teresa Martinez Garcia
  Published: (2025-06-01)