Development of the method for filtering verbal noise while search keywords for the English text
Journal Title: Технологический аудит и резервы производства - Year 2018, Vol 6, Issue 2
Abstract
<p><em>The object of research is the processing of verbal information to identify keywords in the text. The most important step in the search for key terms is the calculation of their weights in the document in question, which makes it possible to evaluate their significance relative to each other in this context. To solve this problem, there are many approaches that are conditionally divided into two groups: they require learning and do not require learning. Learning implies the need to pre-process the original body of texts in order to extract information about the frequency of occurrence of terms in the entire body. An alternative approach is using linguistic ontologies, which are more or less approximate models of the existing set of words in a given language. On the basis of both approaches, systems are created for the automatic extraction of key terms. Nevertheless, in the direction of searching for keywords, research is not stopped in order to improve the accuracy and completeness of the results, as well as to use methods of extracting information from the text to solve new problems.</em></p><p><em>Existing approaches to the definition of keywords are characterized. The best quality of text processing is achieved by linguistic methods or when their combinations are statistical. A system for automatically determining key phrases from natural language text should be developed using the morphological dictionary and syntax rules.</em></p><em>The study uses an approach to defining keywords based on finding syntactic links between word forms in sentences in English text using the instrumental capabilities of modern linguistic packages. In the framework of the general approach to reducing verbal noise in the method, it is proposed that it is achieved with the help of formalized operations: the replacement of pronouns with the corresponding nouns; removal of noise connections; removing noise words; withdrawal of stop words. The described operations can be used as additional modules that improve the results of finding keywords for both the developed method for determining keywords of English text and other algorithms for finding keywords.</em>
Authors and Affiliations
Oleg Bisikalo, Alexander Yahimovich, Yaroslav Yahimovich
Modeling of the optimal composition of the enterprise technical development program
<p><em>The object of this research is the program of technical development of the enterprise. This research is devoted to the problem of forming the optimal composition of a technical development program within the frame...
Development of a methodology for creating adaptive energy efficiency clusters of the architecture and construction industry
<p><em>The object of research is the process of creating adaptive clusters of energy efficiency in the architecture and construction industry. Today, it is important to solve infrastructural problems of energy saving; th...
Analysis of modern approaches to the formation of the portfolio investor shares stock
<p><em>The object of research is an investment portfolio consisting of a set of investment instruments (securities, assets, projects, etc.) in which the investor's finances are distributed. The main purpose of forming an...
Studying of the power modes in the traction line for ensuring the high-speed traffic
<p><em>The object of research is the power regimes in traction power systems for both centralized and distributed power when introducing high-speed traffic. The introduction of high-speed traffic on electrified railways...
Development of the information platform model for the neutralization of potentially dangerous underwater objects
<p><em>The object of research is the processes of managing the creation of information support for projects to neutralize potentially dangerous underwater objects. In such projects, complex information flows circulate at...