FULL TEXT SEARCH AND INDEXING IN LANGUAGES WITH TWO ALPHABETS

Journal Title: Journal of Information Technology and Application (JITA) - Year 2014, Vol 4, Issue 1

Abstract

The languages spoken in Bosnia and Herzegovina use the Cyrillic and Latin alphabets equally, which complicates indexing and full-text search. This paper analyzes that problem and presents a solution built with the tools available in PostgreSQL and ispell dictionaries. As part of the solution, we created a stop-word dictionary, adapted the affix file for both alphabets, and built working dictionaries for indexing and searching from a word list. The resulting full-text search configuration can index texts in both alphabets.
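To illustrate why two alphabets complicate indexing, the following minimal Python sketch shows the same word written in Cyrillic and in Latin collapsing to a single index key once tokens are normalized to one script. This is not the paper's method, which relies on PostgreSQL and ispell dictionaries with an affix file adapted for both alphabets; script normalization is a simpler technique swapped in here for illustration. The names `CYR_TO_LAT`, `normalize`, `build_index`, and `search` are hypothetical, and stemming and stop words are left out.

```python
# Illustrative sketch only: script normalization for a toy inverted index.
# The paper's own solution uses PostgreSQL with ispell dictionaries instead.

# Serbian Cyrillic -> Latin mapping for lowercase letters.
# Input is lowercased before translation, so capitals need no entries.
CYR_TO_LAT = str.maketrans({
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "đ",
    "е": "e", "ж": "ž", "з": "z", "и": "i", "ј": "j", "к": "k",
    "л": "l", "љ": "lj", "м": "m", "н": "n", "њ": "nj", "о": "o",
    "п": "p", "р": "r", "с": "s", "т": "t", "ћ": "ć", "у": "u",
    "ф": "f", "х": "h", "ц": "c", "ч": "č", "џ": "dž", "ш": "š",
})


def normalize(token: str) -> str:
    """Lowercase a token and map any Cyrillic letters to Latin."""
    return token.lower().translate(CYR_TO_LAT)


def build_index(docs: dict[int, str]) -> dict[str, set[int]]:
    """Build an inverted index keyed by the normalized (Latin) token."""
    index: dict[str, set[int]] = {}
    for doc_id, text in docs.items():
        for token in text.split():
            index.setdefault(normalize(token), set()).add(doc_id)
    return index


def search(index: dict[str, set[int]], query: str) -> set[int]:
    """Look up a single-word query written in either alphabet."""
    return index.get(normalize(query), set())


if __name__ == "__main__":
    docs = {
        1: "Претрага текста",   # Cyrillic
        2: "pretraga teksta",   # Latin
        3: "indeks",
    }
    index = build_index(docs)
    print(search(index, "pretraga"))
    print(search(index, "ПРЕТРАГА"))
```

The sketch normalizes toward Latin because that direction is unambiguous letter by letter, whereas Latin digraphs such as "nj" do not always correspond to a single Cyrillic letter.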

Authors and Affiliations

Tijana Talić

  • EP ID EP244493
  • DOI 10.7251/JIT1401041T

How To Cite

Tijana Talić (2014). FULL TEXT SEARCH AND INDEXING IN LANGUAGES WITH TWO ALPHABETS. Journal of Information Technology and Application (JITA), 4(1), 41-45. https://www.europub.co.uk/articles/-A-244493