Derzis: A Path Aware Linked Data Crawler

dc.contributor.author André Fernandes Santos en
dc.contributor.author José Paulo Leal en
dc.contributor.other 7076 en
dc.contributor.other 5125 en
dc.date.accessioned 2023-08-02T08:18:35Z
dc.date.available 2023-08-02T08:18:35Z
dc.date.issued 2021 en
dc.description.abstract Consuming Semantic Web data presents several challenges, from the number of datasets it is composed of, to the (very) large size of some of those datasets and the uncertain availability of querying endpoints. According to its core principles, linked data can be accessed simply by dereferencing the IRIs of RDF resources. This is a lightweight alternative for both clients and servers when compared to dataset dumps or SPARQL endpoints. The linked data interface does not support complex querying, but using it recursively may suffice to gather information about RDF resources, or to extract the relevant sub-graph, which can then be processed and queried using other methods. We present Derzis, an open source semantic web crawler capable of traversing the linked data cloud starting from a set of seed resources. Derzis maintains information about the paths followed while crawling, which allows property path-based restrictions to be defined on the crawling frontier. en
dc.identifier P-00V-AKZ en
dc.identifier.uri https://repositorio.inesctec.pt/handle/123456789/14294
dc.language eng en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Derzis: A Path Aware Linked Data Crawler en
dc.type Publication en
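
The abstract describes crawling by recursively dereferencing IRIs while remembering the path followed, so that the frontier can be restricted by properties of that path. The following is a minimal sketch of that general idea only, not Derzis's actual implementation or API. Everything in it is an assumption made for illustration: the in-memory `TOY_WEB` stands in for real HTTP dereferencing, the `ex:` IRIs are invented, and the two restrictions (`max_path_length`, `max_distinct_predicates`) are hypothetical examples of path-based limits.

```python
from collections import deque

# Toy in-memory "web": maps an IRI to the triples obtained by dereferencing it.
# All IRIs and predicates here are invented for illustration.
TOY_WEB = {
    "ex:a": [("ex:a", "ex:knows", "ex:b"), ("ex:a", "ex:livesIn", "ex:c")],
    "ex:b": [("ex:b", "ex:knows", "ex:d")],
    "ex:c": [("ex:c", "ex:partOf", "ex:e")],
    "ex:d": [("ex:d", "ex:livesIn", "ex:c")],
    "ex:e": [],
}


def dereference(iri):
    """Stand-in for an HTTP GET on the IRI followed by RDF parsing."""
    return TOY_WEB.get(iri, [])


def crawl(seeds, max_path_length, max_distinct_predicates):
    """Breadth-first crawl that tracks the predicate path leading to each IRI.

    An object IRI joins the frontier only if the path reaching it satisfies
    both (hypothetical) restrictions. Simplification: each IRI is dereferenced
    at most once, using the first path that reached it.
    """
    triples = set()
    visited = set()
    queue = deque((seed, ()) for seed in seeds)  # (IRI, predicates followed so far)

    while queue:
        iri, path = queue.popleft()
        if iri in visited:
            continue
        visited.add(iri)

        for s, p, o in dereference(iri):
            triples.add((s, p, o))
            new_path = path + (p,)
            if len(new_path) > max_path_length:
                continue  # path too long: do not extend the frontier
            if len(set(new_path)) > max_distinct_predicates:
                continue  # too many different predicates on this path
            queue.append((o, new_path))

    return triples, visited


if __name__ == "__main__":
    found, seen = crawl(["ex:a"], max_path_length=3, max_distinct_predicates=1)
    print(sorted(seen))
    print(len(found), "triples collected")
```

Keeping the path alongside each queued IRI is what makes the restrictions possible: a plain visited-set crawler only knows that a resource was reached, not how, so it cannot stop following a branch because of the predicates that led to it.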
Files
Original bundle
Name:
P-00V-AKZ.pdf
Size:
895.33 KB
Format:
Adobe Portable Document Format