Chapter

Title

Template-Driven Semantic Parsing for Focused Web Crawler

Authors

[ 1 ] Instytut Automatyki i Inżynierii Informatycznej, Wydział Elektryczny, Politechnika Poznańska | [ P ] employee

Year of publication

2015

Chapter type

paper

Publication language

English

Keywords
EN
  • template
  • parsing
  • focused web crawler
  • Semantic Web
  • expression language
Abstract

EN We present the Template-Driven Semantic Parser (TDSP), which is capable of representing, at least to some degree, the semantics of the Web pages being processed. The data extraction process carried out by TDSP is driven by a set of instructions stored in an easily modifiable XML-based template. To enhance the precision of Web page data extraction, the TDSP template format allows the use of a specialized Expression Language (EL). The template may be easily created and modified using a tool called Visual Template Designer. TDSP produces an output document containing an RDF graph composed of triples that represent the website resources under exploration. In accordance with the Semantic Web paradigm, each resource has its semantics assigned and is connected to other resources by one or more relations. The semantic types of the resources and the relations between them are predefined in an ontology of Web artifacts.

Pages (from - to)

351 - 358

DOI

10.1007/978-3-319-15615-6_26

URL

https://link.springer.com/chapter/10.1007/978-3-319-15615-6_26

Book

Semantic Technology : 4th Joint International Conference, JIST 2014, Chiang Mai, Thailand, November 9-11, 2014. Revised Selected Papers

Presented on

4th Joint International Semantic Technology Conference, 9-11.11.2014, Chiang Mai, Thailand

Publication indexed in

WoS (15)
