Project RSS Feeds
Work package 4 is happy to announce the release of a corpus containing 147 million quasi-relational Web tables.
The Web contains vast amounts of HTML tables. Most of these tables are used for layout purposes, but a fraction of the tables is also quasi-relational, meaning that they contain structured data describing a set of entities. A corpus of Web tables can be useful for research and applications in areas such as data search, table augmentation, knowledge base construction, and for various NLP tasks.
The WDC Web Tables corpus has been extracted from the 2012 version of the Common Crawl, the largest Web crawl that is available to the public. The corpus contains the subset of the 11 billion HTML tables found in the Common Crawl that are likely quasi-relational. More information about the corpus, its application domains as well as information about how to download the corpus is found at: http://webdatacommons.org/webtables/
We want to thanks the Common Crawl Foundation for providing their great web crawl and thus enabling the creation of the WDC Web Tables corpus.
The creation of the WDC Web Tables corpus was supported by the German Research Foundation (DFG), the EU FP7 project PlanetData and by Amazon Web Services. We thank our sponsors a lot.
Applied linguisticsCorporaInternational Corpus of EnglishTechnologyHTMLWeb Tables corpusdata searchquasi-relational Web tablesWeb CrawlWeb tables
HealthCare Use Case
The Health Use Case Demo showcases the use of the access control enforcement techniques developed in PlanetData in order to provide selective exposure of patients' Personal Health Record information to various users/roles (doctors, medical staff, public services, organizations, hospitals etc) according to the access rights that the patient himself/herself has provided. More details on the demo can be found bellow:
Event Registry Use Case
The “Event Registry” system is developed as a prototype to support a standardization working group at the IPTC level (publishers’ standardization organization – IPTC.org). The aim is to release recommendations to collect, annotate and interoperate information on global events and storylines across languages, domains and granularities.
Smart Cities Use Case
The usecase describes some of the challenges and opportunities that arise from the existence of diverse sets of open (and closed) public and private data related to city infrastructures and territory, demography, public transport facilities and commercial activities across the city, specifically, focusing on the area of geomarketing.
A System for Aligning Taxonomies and Debugging Taxonomies and Their Alignments
Valentina Ivanova, Patrick Lambrix
PCKS: Commandments for Benchmarking Semantic Flow Processing Systems
Thomas Scharrenbach, Emanuele Della Valle
Applying SemanticWeb Technologies for Diagnosing Road Traffic Congestions
ourSpaces - Design and Deployment of a Semantic Virtual Research Environment
Google: “Get the best summary”
Andreas Thalhammer, Magnus Knuth
Market Sensing Using a Graph Database
Formal Verification of Data Provenance Records
A Formal Semantics for Weighted Ontology Mappings
Feature LDA: a Supervised Topic Model for Automatic Detection of Web API Documentations from the Web
Link Discovery with Guaranteed Reduction Ratio in Ane Spaces with Minkowski Measures
Axel-Cyrille Ngonga Ngomo
MORE: Modular Classification of OWL 2 Ontologies
Rapidly Integrating Services into the Linked Data Cloud
On the Diversity and Availability of Temporal Information in Linked Open Data
Discovering Concept Coverings in Aligning Ontologies of Linked Data
Applications for the ESWC 2014 Summer school are now open. The fourth ESWC Summer School will take place the beautiful village of Kalamaki Crete 1st – 6th September, 2014. Keynotes include: Abraham Bernstein, University of Zurich, Marko Grobelnik, Josef Stefan Institute, Dominic Oldman from the British Museum and Guus Schreiber, VU University Amsterdam.VU University AmsterdamUniversity of ZurichJosef Stefan Institutefourth ESWC Summer SchoolVrije UniversiteitSchreiberGuusZurichJoseph StefanEuropeSloveniaEducationHospitalityKalamaki CreteUniversity of ZurichBritish MuseumVU UniversityJosef Stefan Institute