Project RSS Feeds

Contextualized Knowledge Repositories with Justifiable Exceptions

Planet Data - Fri, 10/17/2014 - 11:33
Authors: Loris Bozzato, Thomas Eiter, Luciano Serafini
Year: 2014
Presentation Date: Sunday, 20 July, 2014
Presented at: 27th International Workshop on Description Logics (DL2014)

Categories: Project RSS Feeds

A Contextualized Knowledge Repository for Open Data about Trentino

Planet Data - Thu, 10/16/2014 - 16:58
Authors: Loris Bozzato, Gaetano Calabrese, Luciano Serafini
Year: 2014
Presentation Date: Wednesday, 30 July, 2014
Presented at: Internal presentation to OpenData Trentino team


Reconciling Schema Matching Networks Through Crowdsourcing

Planet Data - Thu, 10/16/2014 - 16:44
Year: 2014
Published in: ICST Transactions
Authors: Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltán Miklós, Karl Aberer
Abstract:

Schema matching is the process of establishing correspondences between the attributes of database schemas for data integration purposes. Although several automatic schema matching tools have been developed, their results are often incomplete or erroneous. To obtain a correct set of correspondences, usually human effort is required to validate the generated correspondences. This validation process is often costly, as it is performed by highly skilled experts. Our paper analyzes how to leverage crowdsourcing techniques to validate the generated correspondences by a large group of non-experts.


In our work we assume that one needs to establish attribute correspondences not only between two schemas but in a network. We also assume that the matching is realized in a pairwise fashion, in the presence of consistency expectations about the network of attribute correspondences. We demonstrate that formulating these expectations in the form of integrity constraints can improve the process of reconciliation. Since the users' input in crowdsourcing is unreliable, we need specific aggregation techniques to obtain good quality. We demonstrate that consistency constraints not only improve the quality of aggregated answers, but also enable us to estimate the quality of individual workers' answers more reliably and to detect spammers. Moreover, these constraints enable us to minimize the human effort needed for the same expected quality of results.
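The idea of combining vote aggregation with a consistency constraint can be illustrated with a small sketch. This is not the aggregation technique of the paper: it assumes simple (optionally weighted) voting, a one-to-one constraint enforced greedily, and worker quality measured as agreement with the aggregated result. All function names and the attribute names in the usage note are hypothetical.

```python
from collections import defaultdict


def aggregate(votes, reliability=None):
    """Sum signed votes per correspondence.

    votes: dict worker -> dict {(source_attr, target_attr): bool}
    reliability: optional dict worker -> weight (defaults to 1.0 each)
    """
    score = defaultdict(float)
    for worker, answers in votes.items():
        weight = 1.0 if reliability is None else reliability[worker]
        for corr, accepted in answers.items():
            score[corr] += weight if accepted else -weight
    return dict(score)


def enforce_one_to_one(score):
    """Greedily accept positively scored correspondences, best first,
    skipping any that reuse an already matched source or target attribute
    (an assumed one-to-one integrity constraint)."""
    used_src, used_tgt, accepted = set(), set(), set()
    for (src, tgt), s in sorted(score.items(), key=lambda kv: -kv[1]):
        if s > 0 and src not in used_src and tgt not in used_tgt:
            accepted.add((src, tgt))
            used_src.add(src)
            used_tgt.add(tgt)
    return accepted


def worker_agreement(votes, accepted):
    """Fraction of each worker's answers that agree with the
    constraint-consistent result; low values hint at unreliable workers."""
    result = {}
    for worker, answers in votes.items():
        agree = sum((corr in accepted) == ans for corr, ans in answers.items())
        result[worker] = agree / len(answers)
    return result
```

For example, if three workers all accept ("a.name", "b.title") but only two accept the conflicting ("a.name", "b.label"), plain voting would keep both correspondences, while the constraint keeps only the stronger one and lowers the agreement score of the workers who accepted the other.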

Attachment: Reconciling Schema Matching Networks Through.pdf (638.55 KB)

Towards Enabling Probabilistic Databases for Participatory Sensing

Planet Data - Wed, 10/15/2014 - 16:26
Authors: Nguyen Quoc Viet Hung, Saket Sathe, Duong Chi Thang, Karl Aberer
Year: 2014
Presentation Date: Friday, 24 October, 2014
Presented at: CollaborateCom 2014


Privacy-Preserving Schema Reuse

Planet Data - Wed, 10/15/2014 - 16:22
Authors: Nguyen Quoc Viet Hung, Do Son Thanh, Nguyen Thanh Tam, Karl Aberer
Year: 2014
Presentation Date: Tuesday, 22 April, 2014
Presented at: DASFAA 2014


Pay-as-you-go Reconciliation in Schema Matching Networks

Planet Data - Wed, 10/15/2014 - 16:21
Authors: Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltan Miklos, Karl Aberer, Avigdor Gal and Matthias Weidlich
Year: 2014
Presentation Date: Tuesday, 1 April, 2014
Presented at: ICDE 2014


Crowdsourcing Event Extraction

Planet Data - Mon, 10/13/2014 - 12:24
Year: 2014
Publication Date: Friday, 29 August, 2014
Published in: NewsKDD2014 - Data Science for News Publishing Workshop
Authors: Aljaz Kosmerlj, Jenya Belyaeva, Gregor Leban, Blaz Fortuna and Marko Grobelnik
Abstract:

We present a system for manually extracting structured event information from freeform news wire text. The extraction is performed on news articles preprocessed by services developed within the XLike project and is guided by suggestions the system produces using machine learning techniques. Results of testing performed using human annotators show the system can produce meaningful data and suggest several avenues for improvement of the system.

Attachment: newskdd2014_submission_8.pdf (1.03 MB)

Global Media Monitoring

Planet Data - Mon, 10/13/2014 - 12:05
Authors: Marko Grobelnik
Year: 2014
Presentation Date: Tuesday, 2 September, 2014
Presented at: ESWC2014 Summer School

Global Media Monitoring presented through several systems for collecting, extracting and enriching data, forming and exploring events across languages in real-time - ...resulting in the system Event Registry (


M48 Deliverables are available on our website

Planet Data - Thu, 10/09/2014 - 11:46

M48 deliverables are now available at the PlanetData website:

  • D21.3 LinkedMap Final report and project showcase Call2: Linked Map Management 
  • D28.1 ETIHQ Tourism Decision Support using Linked Data Call2: ETIHQ System Development
  • D29.2 ETIHQ Dissemination and Exploitation Report Call2: ETIHQ Dissemination
  • D30.1 ETIHQ Project Report Call2: ETIHQ Management
  • D18.2 LinkedMap Platform Beta version Call2: Linked Map Platform integration&development
  • D1.7 Distributed streaming RDF storage engine WP 1 - Data Streams and Dynamicity
  • D18.3 LinkedMap Platform Monitoring report Call2: Linked Map Platform integration&development
  • D20.3 LinkedMap Exploitation plan LinkedMap Exploitation plan
  • D33.1 MetaReasons Prototype evaluation: modelling and scalability Call2: Meta Reasons Prototype evaluation
  • D20.4 LinkedMap Standardization report Call2: Linked Map Dissemination  
  • D4.5 PlanetData data sets, vocabularies and provisioning tools catalogue and access portal WP 4 - Data Provisioning
  • D20.5 LinkedMap Web enabled public showcase Call2: Linked Map Dissemination
  • D5.4 PlanetData data management tools catalogue and access portal WP 5 - PlanetData Lab 
  • D19.2 LinkedMap Report on crowdsourcing trade-offs for geospatial data curation

A case study on propagating and updating provenance information using the CIDOC CRM

Planet Data - Wed, 10/08/2014 - 23:32
Year: 2014
Publication Date: Friday, 29 August, 2014
Published in: International Journal on Digital Libraries #799
Authors: Christos Strubulis, Giorgos Flouris, Yannis Tzitzikas, Martin Doerr
Abstract:

Provenance information of digital objects maintained by digital libraries and archives is crucial for authenticity assessment, reproducibility and accountability. Such information is commonly stored in metadata placed in various Metadata Repositories (MRs) or Knowledge Bases (KBs). Nevertheless, in various settings it is prohibitive to store the provenance of each digital object, because complete provenance requires a large amount of storage space. In this paper, we introduce provenance-based inference rules as a means to complete the provenance information, to reduce the amount of provenance information that has to be stored, and to ease quality control (e.g., corrections). Roughly, we show how provenance information can be propagated by identifying a number of basic inference rules over a core conceptual model for representing provenance. The propagation of provenance concerns fundamental modelling concepts such as actors, activities, events, devices and information objects, and their associations. However, since an MR/KB is not static but changes over time due to several factors, the question that arises is how we can satisfy update requests while still supporting the aforementioned inference rules. Towards this end, we elaborate on the specification of the required add/delete operations, consider two different semantics for deletion of information, and provide the corresponding update algorithms. Finally, we report extensive comparative results for different repository policies regarding the derivation of new knowledge, in datasets containing up to one million RDF triples. The results allow us to understand the tradeoffs related to the use of inference rules on storage space and performance of queries and updates.


Attachment: IJDL paper.pdf (2.46 MB)
Publisher: Springer-Verlag Berlin Heidelberg 2014

Provenance Management for SPARQL Updates

Planet Data - Wed, 10/08/2014 - 23:24
Authors: Argyro Avgoustaki, Giorgos Flouris, Irini Fundulaki
Year: 2014
Presentation Date: Wednesday, 10 September, 2014
Presented at: Reasoning Web Summer School 2014

In the last few years we have witnessed an explosion in the publication of data in the form of Linked Data. Recording the provenance information of Linked Data is an essential task in order to effectively support trustworthiness, accountability and repeatability. In this context, our work:

  • Introduces a new provenance model for SPARQL INSERT Updates
  • Allows SPARQL INSERT Updates to be reconstructed from their provenance
  • Provides algorithmic support via the Provenance Construction and the Update Reconstruction algorithms
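The general idea of recording provenance at insertion time and later reconstructing an update from it can be sketched as follows. This is only an illustration and not the Provenance Construction or Update Reconstruction algorithms of the presented work: it assumes that provenance is a plain mapping from each triple to the identifiers of the updates that inserted it, and the class, method and identifier names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ProvenanceStore:
    # Maps each (subject, predicate, object) triple to the set of
    # update identifiers that inserted it (assumed provenance model).
    provenance: dict = field(default_factory=dict)

    def insert(self, update_id, triples):
        """Record that the update `update_id` inserted the given triples."""
        for triple in triples:
            self.provenance.setdefault(triple, set()).add(update_id)

    def reconstruct(self, update_id):
        """Return the set of triples attributed to the given update."""
        return {t for t, ids in self.provenance.items() if update_id in ids}


def to_insert_data(triples):
    """Render triples (terms already in SPARQL syntax, e.g. <iri>)
    as a SPARQL 1.1 INSERT DATA request."""
    lines = [f"  {s} {p} {o} ." for s, p, o in sorted(triples)]
    return "INSERT DATA {\n" + "\n".join(lines) + "\n}"
```

Calling `to_insert_data(store.reconstruct("u1"))` then yields an INSERT DATA request containing exactly the triples recorded for the hypothetical update "u1", even when some of those triples were also inserted by other updates.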



Hotel Websites, Web 2.0, Web 3.0 and Online Direct Marketing: The Case of Austria

Planet Data - Wed, 10/08/2014 - 23:04
Year: 2014
Publication Date: Friday, 24 January, 2014
Published in: In Proceedings of the 21st International Conference on Information and Communication Technologies in Travel and Tourism (ENTER2014)
Authors: Ioannis Stavrakantonakis, Ioan Toma, Anna Fensel, Dieter Fensel
Abstract:

Direct communication with customers in order to increase sales has become one of the most important marketing methods used by small, medium and large hotels alike. With the rapid development of ICT, including the Internet, Web, and recently Web 2.0 and 3.0, the number of channels in which hotels can interact directly with customers has grown even larger. Being visible on all these channels and using these technologies has now become a requirement if effective marketing and massive direct sales are to be achieved. In this chapter, we perform a rigorous empirical analysis of the advances towards the employment of Web 2.0 and 3.0 technologies in the tourism domain. We begin by presenting our methodology, including criteria and evaluation metrics, and follow by analysing the uptake of Web 2.0 and 3.0 technologies for Austrian hotels. As this chapter demonstrates, despite the benefits of new Web technology for online marketing, the hotels in Austria are not using these technologies and do not follow the online developments. Since employing them is a relatively cheap undertaking, a severe competence gap seems to emerge either directly in the touristic service industry, or in the industry providing them with their on-line presence.

Attachment: ENTER2014_Stavrakantonakis.pdf (590.54 KB)

Using semantic technologies for scalable multi-channel communication

Planet Data - Wed, 10/08/2014 - 22:59
Year: 2014
Publication Date: Thursday, 29 May, 2014
Published in: 11th European Semantic Web Conference (ESWC'2014)
Authors: Ioan Toma, Christoph Fuchs, Corneliu Stanciu, and Dieter Fensel
Abstract:

The development of the Web in the direction of user-generated content, information sharing, online collaboration and social media has drastically increased the number of communication channels that can be used to interact with potential customers. In this demonstration we present the latest developments of our multi-channel communication solution, which supports touristic service providers, e.g. hoteliers and touristic associations, in dealing with the challenge of improving and maintaining their communication needs. We make use of semantic technologies, i.e. semantic analysis, semantic annotations, ontologies, semantic matching and rules, in order to automate several multi-channel communication tasks.

Attachment: Using semantic technologies for scalable multi-channel communication.pdf (263.29 KB)

Improving the online visibility of touristic service providers by using semantic annotations

Planet Data - Wed, 10/08/2014 - 22:56
Year: 2014
Publication Date: Thursday, 29 May, 2014
Published in: In Proceedings of the 11th European Semantic Web Conference (ESWC'2014)
Authors: Toma, I., Stanciu, C.-V., Fensel, A., Stavrakantonakis, I., Fensel, D.
Abstract:

The vast majority of people use the Internet to search for various products and services, including touristic ones. Now more than ever it is critical for touristic businesses to have a strong online presence. To achieve this goal, however, it is essential that multiple communication channels and technologies are properly used. In particular, having semantic annotations on the website that can be understood by search engines is extremely important. In this paper we present our ongoing effort on using Linked Data technologies to improve the online visibility of touristic service providers from Innsbruck and its surroundings. We show which technologies are relevant and how they can be applied in our real-world pilot, and we measure the impact of using such technologies.
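A semantic annotation of the kind search engines can read might look like the following sketch. The abstract does not say which vocabulary or serialization the pilot uses, so the choice of schema.org terms (Hotel, PostalAddress) serialized as JSON-LD is an assumption, and the function names and the hotel data in the usage note are hypothetical.

```python
import json


def hotel_jsonld(name, locality, country, url):
    """Build a JSON-LD description of a hotel using schema.org terms
    (assumed vocabulary; not confirmed by the paper's abstract)."""
    return {
        "@context": "https://schema.org",
        "@type": "Hotel",
        "name": name,
        "url": url,
        "address": {
            "@type": "PostalAddress",
            "addressLocality": locality,
            "addressCountry": country,
        },
    }


def as_script_tag(data):
    """Wrap the annotation in the script element commonly used to
    embed JSON-LD in an HTML page."""
    return (
        '<script type="application/ld+json">\n'
        + json.dumps(data, indent=2)
        + "\n</script>"
    )
```

A call such as `as_script_tag(hotel_jsonld("Example Hotel", "Innsbruck", "AT", "https://hotel.example"))` produces a snippet that could be placed in the page markup of a hypothetical hotel website.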

Attachment: Improving the online visibility of touristic service providers by using semantic annotations.pdf (216.6 KB)
Publisher: Springer-Verlag, LNCS

Matching Web Entities with Potential Actions

Planet Data - Wed, 10/08/2014 - 22:53
Year: 2014
Publication Date: Thursday, 4 September, 2014
Published in: In Proceedings of the Poster and Demo Paper track International Conference on Semantic Systems (I-SEMANTICS 14), CEUR-WS
Authors: Stavrakantonakis, I., Fensel, A., Fensel, D.
Abstract:

The creation of as the de facto vocabulary for the implementation of Semantic Annotations was the dawn of a new era for the Web by motivating the Web developers to start weaving semantics in the content, mainly, for visibility reasons in search engine results. Moving further, the new version of the vocabulary enables Web entities to self-describe the Actions with which they interact with users, agents or services. In this scope, we present our ongoing work on automatic weaving of Actions based on the existing semantic annotations of a website.

Attachment: Matching Web Entities with Potential Actions.pdf (196.65 KB)

Updates on Cataloguing Activities of PlanetData

Planet Data - Tue, 10/07/2014 - 13:22

The PlanetData Data Sets and Vocabularies and Access Portal deliverable, introducing the BTC2014 dataset, has been released:

This deliverable reports on the cataloguing activities of PlanetData. During the reporting period, the following results were achieved. To discover additional Linked Datasets, KIT and UMA performed large-scale crawls of the Linked Data Web. The Web data corpus crawled by KIT consists of 4 billion RDF triples and is offered as evaluation data for the Semantic Web Challenge 2014, the premier event for showcasing Semantic Web applications. The Web data corpus crawled by UMA was further analysed concerning the topical domains of the published data as well as the compliance of the data sources with the Linked Data best practices. The results of the analysis were published in the form of a data catalog covering 1900 datasets and classifying each dataset according to its topical domain, vocabulary usage and compliance with the best practices. The linkage relationships between the datasets were visualized in the form of an updated Linked Data Cloud diagram. A paper describing the results of the analysis was accepted for publication at the International Semantic Web Conference and will be presented there in October. More at

Semantic Web, Linked Data, Computing, Data Web, Information, Internet, VoiD, Technology, Web applications, Web Challenge, Web data corpus

PlanetData at ISWC2014

Planet Data - Tue, 10/07/2014 - 11:55
News Type: External Event

A publication about the "Adoption of the Linked Data Best Practices in Different Topical Domains" has been accepted for the ISWC2014 - RDB Track. More information about the paper and supplementary material is found at

Linked Data, Semantic Web, World Wide Web, Adoption, Social Issues, Technology

Linked Data Cloud diagram and Linked Dataset catalog updated

Planet Data - Tue, 10/07/2014 - 11:52

As part of deliverable 4.5, UMA has released an updated version of the Linked Data Cloud diagram and a Linked Dataset catalog. More information about the diagram and the catalog is found at: and

Linked Data, Semantic Web, World Wide Web, Cloud computing, Diagram, Computing, Technology