TaskForces/CommunityProjects/LinkingOpenData/THALIATestbed - W3C Wiki (original) (raw)

THALIA testbed

This page collects information about THALIA testbed for benchmarking relational database to RDF mapping tools.

The page is part of the Linking Open Data Project.

You can download the latest version of THALIA benchmark immediately now.

Introduction

THALIA (Test Harness for the Assessment of Legacy information Integration Approaches) is a publicly available testbed and benchmark for testing and evaluating integration technologies. It provides researchers and practitioners with a collection of 40 relational database tables representing University course catalogs from computer science departments around the world. The data in the testbed provide a rich source of syntactic and semantic heterogeneities since we believe they still pose the greatest technical challenges to the research community. In addition, this testbed provides a set of twelve benchmark queries as well as a scoring function for ranking the performance of an integration system.

Testbed content

The initial XML/XMLSchema/XQuery version is accessible here. Our SQL/OWL/SPARQL version is presented below.

SQL scripts

You can either download THALIA benchmark or THALIA testbed SQL Schema versions. Benchmark version includes challenge university schemas and data. Testbed version contains schemas and data for all available universities courses.

MySQL 5.0.37 (bundled with archive)

PostgreSQL 8.2.3-1 (bundled with archive)

Virtuoso 5.x (and higher)

First public version of THALIA benchmark schema already available.

Full testbed version doesn't construct yet.

University computer science course ontology

First public version of universities computer science departments courses around the world already available too.

Benchmark SPARQL queries

A set of twelve benchmark queries represented in SPARQL already available too.

Examples of testbed SQL data in RDF format

Benchmark data is required for SPARQL queries in RDF format already available too.

THALIA testbed downloads

Some testbed examples

Arizona State University

110 Principles of Programming with Java. (3) MORE INFO Concepts of problem solving using Java, algorithm design, structured programming, fundamental algorithms and techniques, and computer systems concepts. Social and ethical responsibility. Lecture, lab. Prerequisite: MAT 170.

SQL to RDF mapping tools

There's a more comprehensive list at RdfAndSql.

Resources

Bibliography

Presentations

People Interested in the Area