|

Application of Statistic Methods for Development of Linguistic Support for Data Retrieval System

Authors: Smirnov Yu.M., Andreev A.M., Berezkin D.V., Brik A.V. Published: 04.09.2014
Published in issue: #2(43)/2001  
DOI:

 
Category: Informatics & Computing Technology  
Keywords:

Problems of the data retrieval system development with natural language interface of requests are considered, among them, the preparation of dictionaries and search index taking into account syntactic structure of the document sentences. A method of the automatic creation of both the morphological and word-combination dictionary is suggested using statistical analysis of the sufficient amount of texts. The two-stage algorithm of the text syntax analysis is considered (using the simple formal and grammatical analysis at the first stage and the statistical refinement of the analysis results - at the second stage), and the text search algorithm as well, based on results of the two-stage algorithm application. Experimental estimations of the suggested methods operation quality are given.