Semantic Publishing Instance Matching Benchmark

  • Posted on: 29 December 2014
  • By: Irini Fundulaki

The Semantic Publishing Instance Matching Benchmark (SPIMBench) is a novel benchmark for the assessment of instance matching techniques for RDF data with an associated schema. SPIMBench extends the state-of-the art instance matching benchmarks for RDF data in three main aspects: it allows for systematic scalability testing, supports a wider range of test cases including semantics-aware ones, and provides an enriched gold standard.

Further Developments in SNB BI Workload

  • Posted on: 18 December 2014
  • By: Orri Erling

We are presently working on the SNB BI workload. Andrey Gubichev of TU Munchen and myself are going through the queries and are playing with two SQL based implementations, one on Virtuoso and the other on Hyper.

As discussed before, the BI workload has the same choke points as TPC-H as a base but pushes further in terms of graphiness and query complexity.

Sizing AWS instances for the Semantic Publishing Benchmark

  • Posted on: 12 December 2014
  • By: Iliya Enchev. V...

LDBC’s Semantic Publishing Benchmark (SPB) measures the performance of an RDF database in a load typical for metadata-based content publishing, such as the famous BBC Dynamic Semantic Publishing scenario. Such load combines tens of updates per second (e.g.

DATAGEN – A realistic social network data generator

  • Posted on: 4 December 2014
  • By: Duc Pham

In previous posts (Getting started with SNB, DATAGEN: data generation for the Social Network Benchmark), Arnau Prat discussed the main features and characteristics of DATAGEN: realism, scalability, determinism, usability. DATAGEN is the social network data generator used by the three LDBC-SNB workloads, which produces data simulating the activity in a social network site duri

Making Semantic Publishing Execution Rules

  • Posted on: 18 November 2014
  • By: Orri Erling

LDBC SPB (Semantic Publishing Benchmark) is based on the BBC linked data platform use case. Thus the data modelling and transaction mix reflects the BBC's actual utilization of RDF. But a benchmark is not only a condensation of current best practices. The BBC linked data platform is an Ontotext Graph DB deployment. Graph DB was formerly known as Owlim.

On the upcoming, 5th LDBC TUC meeting and the previous, 4th LDBC TUC meeting

  • Posted on: 23 October 2014
  • By: Ioan Toma

The 5th LDBC TUC meeting will take place in Athens on 14.11.2014, this is the agenda. We welcome RDF and graph database users to explain their use-cases, describe the limitations they have found in current technology and see the progress of the LDBC benchmarks i.e. the Semantic Publishing Benchmark (SPB) and the Social Network Benchmark (SNB).

The Day of Graph Analytics

  • Posted on: 9 October 2014
  • By: Orri Erling

Note: consider this post as a continuation of the "Making it interactive" post by Orri Erling.

I have now completed the Virtuoso TPC-H work, including scale out.  Optimization possibilities extend to infinity but the present level is good enough. TPC-H is the classic of all analytics benchmarks and is difficult enough, I have extensive commentary on this on my blog (In Hoc Signo Vinces series), including experimental results.  This is, as it were, the cornerstone of the true science.  This is however not the totality of it.  From the LDBC angle, we might liken this to the last camp before attempting a mountain peak.

Making It Interactive

  • Posted on: 26 September 2014
  • By: Orri Erling

Synopsis: Now is the time to finalize the interactive part of the Social Network Benchmark (SNB). The benchmark must be both credible in a real social network setting and pose new challenges. There are many hard queries but not enough representation for what online systems in fact do. So, the workload mix must strike a balance between the practice and presenting new challenges.