Algorithm of the detection of the outdated information on the basis of analysis of data sites

DOI №______

Authors

  • А. О. Аронов, (Aronov A. O.) State University of Telecommunications, Kyiv

Abstract

The paper proposes an algorithm for implementing the method of identifying outdated information on the basis of the analysis of text data of the pages of the site. The algorithm of the software application for the search of outdated information on the pages of the site, which describes its settings. The criteria for finding outdated information and the sequence of their checks are determined. It is foreseen to execute search queries in nested pages. The main criteria of relevance of the site information on different indicators are determined. Describes the process of running search queries, which is governed by separate settings: date editing pages; start date of the search query; periodicity of the search query. After performing the preparatory steps to find out the outdated information in the page section, a search query is performed in the database to select the pages in which the texts will search for outdated information. As a result of the operation of the algorithm, templates are used that convert text data into a single unified representation. The scientific novelty of the results obtained is that an algorithm for the automatic detection of outdated information on the basis of information analytical analysis of the site's data, which differs from the existing ones, that the detection of outdated information is analyzed not only using time indices of the time of creation / updating of pages of the site, but directly the content of text page. The principle of the function in the software is described in detail, all regular expressions are described, which is used by the function to identify date markers in the text data of the analyzed pages. The proposed algorithm is intended for use by system administrators of the site.

Keywords: maintenance of the site, outdated information, information analytical analysis of data, site administrator, criteria for finding outdated information, periodicity of the search query.

References (MLA)
1. Aronov A. O. "Development of the Model of Structural-Logical Representation of the Data of the Site of the Higher Educational Institution on the Basis of the Hierarchical Classifier." Collection of scientific works of the Military Institute of Taras Shevchenko Kyiv National University 59 (2018): 70-75. Print.
2. Aronov A. O., Vyshnivskyi V. V., and Zamaruieva I. V. "The Method of the Detection of Outdated Information on the Basis of Information Analytical Analysis of the Data of the Site."Suchasni Informatsiyni Systemy 1 (2018): 28-31. Print.
3. Gojverst Yan, and Levitan Stiven. Regular Expressions. A Collection of Recipes. Symvol-Pius, 2010. Print.

Published

2018-12-06

Issue

Section

Articles