A survey of methods for the extraction of information from Web resources
- Authors: Varlamov M.I. (1), Turdakov D.Y. (1, 2)
- Affiliations:
  1. Institute for System Programming
  2. Lomonosov Moscow State University
- Issue: Vol 42, No 5 (2016)
- Pages: 279-291
- Section: Article
- URL: https://journals.rcsi.science/0361-7688/article/view/176446
- DOI: https://doi.org/10.1134/S0361768816050078
- ID: 176446
Abstract
Earlier surveys of research on extracting structured data from Web pages are analyzed, and a classification scheme for the available approaches, based on the extent of their application, is proposed.
About the authors
M. I. Varlamov
Institute for System Programming
Author for correspondence.
Email: varlamov@ispras.ru
Russian Federation, ul. Solzhenitsyna 25, Moscow, 109004
D. Yu. Turdakov
Institute for System Programming; Lomonosov Moscow State University
Email: varlamov@ispras.ru
Russian Federation, ul. Solzhenitsyna 25, Moscow, 109004; Moscow, 119991