Sixth TUC Meeting: 20 billion triples in production by Jerven Bolleman (Swiss Institute of Bioinformatics)
The second part of presentations during the first day at the TUC Meeting in Barcelona started with the presentation from Jerven Bolleman from the Swiss Institute of Bioinformatics called "20 billion triples in production"
With a very striking title Jerven Bolleman explains why and how they provide a SPARQL endpoint which makes public the data available for biological datasets and allows answering science questions that need cross database queries while being common to have up to 200 joins, through a REST interface.
It's also specially interesting to learn about the challenges they have been facing as an SPARQL endpoint, like the loading time being unpredictable depending on how the query is done, query plan also making a big difference, simple data discovery queries are hard with TB of data and query timeouts not being sufficient and limited by the http support.
Watch Jerven Bolleman talking about Uniprot SPARQL endpoint that allows working with 20 billion triples.
You can also read more about Virtuoso and SPARQL in our blog.
Stay tuned for the rest of presentations from the TUC Meeting!