Nova Publishers
My Account Nova Publishers Shopping Cart
HomeBooksSeriesJournalsReference CollectionseBooksInformationSalesImprintsFor Authors
            
  Top » Catalog » Books » Language and Linguistics » Bilinguals: Cognition, Education and Language Processing Chapters » My Account  |  Cart Contents  |  Checkout   
Quick Find
  
Use keywords to find the product you are looking for.
Advanced Search
What's New? more
Doxycycline: Medical Uses and Effects
$73.80
Shopping Cart more
0 items
Information
Shipping & Returns
Privacy Notice
Conditions of Use
Contact Us
Notifications more
NotificationsNotify me of updates to A Novel Transliteration Approach in an English-Arabic Cross Language Information Retrieval System, pp.229-242
Tell A Friend
 
Tell someone you know about this product.
A Novel Transliteration Approach in an English-Arabic Cross Language Information Retrieval System, pp.229-242 $100.00
Authors:  Ghita Amor-Tijani and Abdelghani Bellaachia, Department of Computer Science, The George Washington University, Washington DC
Abstract:
One of the main issues facing Cross Language Information Retrieval (CLIR) is untranslatable
words, i.e., words not found in dictionaries, which are usually referred to as Out Of
Vocabulary (OOV) words. Bilingual dictionaries in general do not cover most proper nouns
(e.g., names of places, people, countries, etc.), which constitute a large proportion of OOV
words. As they are often primary keys in a query, their correct translation is often necessary to
maintain a good retrieval performance. Because they are spelling variants of each other in
most languages, an approximate string matching technique against the target database index is
usually used to find the target language correspondents of the original query key. The n-gram
technique has proven to be the most effective among other approximate string matching
techniques. A more complicated issue arises when the languages dealt with have different
alphabets. The approach usually taken is transliteration. It is applied based on phonetic
similarities between the languages involved. However, transliteration by itself cannot
guarantee the exact spelling of the transliterated words as found in the document collection.
There are a variety of ways that a transliterated word can be spelled despite conventions that
might exist. The fact that there is no one correct way of spelling a transliterated word shows
the need for a technique that is capable of generating the different spellings found in the
document collection. In this study, we chose to combine both transliteration and the n-gram
technique in an English-Arabic CLIR system, in which Arabic documents were searched
using English queries. We evaluated the effectiveness of this approach and compared it with
other transliteration approaches. Experimental results showed the retrieval improvement
gained using our transliteration approach over other existing approaches. 


Available Options:
Version:
This Item Is Currently Unavailable.
Special Focus Titles
01.Violent Communication and Bullying in Early Childhood Education
02.Cultural Considerations in Intervention with Women and Children Exposed to Intimate Partner Violence
03.Chronic Disease and Disability: The Pediatric Lung
04.Fruit and Vegetable Consumption and Health: New Research
05.Fire and the Sword: Understanding the Impact and Challenge of Organized Islamism. Volume 2

Nova Science Publishers
© Copyright 2004 - 2021

A Novel Transliteration Approach in an English-Arabic Cross Language Information Retrieval System, pp.229-242