  • Posted on: 8 February 2017
  • By: Adrian Diaz

Juan Sequeda, co-founder of Capsenta, gave an interesting talk on how we can integrate data using graphs and semantics (semantic data virtualization). As Mr. Sequeda said, the idea is to integrate data without needing to move it around. He started his presentation by describing the huge gap that exists between IT departments, the guardians of the data, and business development departments, which are trying to extract insights from that data. He used a clear example to illustrate this gap:

Imagine that an e-commerce manager asks how many orders were placed during the last month, and three departments each return a different number even though the answer should be the same. What is going on? Sequeda explains that four factors are at play here:

  1. Ambiguity in the data.
  2. Lack of semantics.
  3. No shared understanding.
  4. Data coming from different sources.

Now imagine that we want to integrate data from various organizations to conduct a study. Usually the databases come from different vendors, and the organizations are not willing to release their data into data warehouses. Sometimes not even different departments of the same company are willing to do so. This, Sequeda explains, is a major issue. Continuing with the example, he talked about Constitute, a search engine for the World’s Constitutions created by Capsenta, Google and a group of political scientists. They worked with 200 different databases containing highly structured information about constitutions. What they did was help the researchers organise their knowledge and integrate it using RDF, so that they could use it via a search engine.

In his words, graphs are cool because they are flexible, easy to integrate, treat data and metadata as one, and permit traversal, navigation and reachability, all while also being semantic.
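To make these properties a little more concrete, here is a minimal sketch in Python. It is not code from the talk: the triples, the identifiers (`order:1`, `placedBy` and so on) and the `reachable` helper are all hypothetical, and plain tuples stand in for RDF. It only illustrates that two graphs can be integrated by a simple union, that metadata can sit alongside data as just another triple, and that reachability is a short traversal.

```python
from collections import deque

# Hypothetical triples (subject, predicate, object); all names are illustrative.
sales = {
    ("order:1", "placedBy", "customer:7"),
    ("customer:7", "locatedIn", "city:Austin"),
}
catalog = {
    ("order:1", "contains", "product:42"),
    # Metadata about a predicate is stored as a triple, like any other data.
    ("placedBy", "label", "placed by"),
}

# Integrating the two sources is a set union; no shared schema is required.
graph = sales | catalog


def reachable(graph, start):
    """Return every node reachable from start by following edges forward."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for s, _, o in graph:
            if s == node and o not in seen:
                seen.add(o)
                queue.append(o)
    return seen
```

Calling `reachable(graph, "order:1")` walks from the order to its customer, the customer's city and the product, even though those facts came from two separate sources.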

After this introduction, Mr. Sequeda presented the main concept of his talk: the integration of data using graphs and semantics. To do this at a business level, he pointed out the importance of building an Enterprise Knowledge Graph (EKG) that is fed by other graphs. Once this graph exists, another layer of specific applications can be built on top of it, using the EKG as a universal API.
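The layering described above could look roughly like the following Python sketch. This is an assumption-laden illustration, not Capsenta's implementation: the `KnowledgeGraph` class, its `feed` and `match` methods, and the sample order data are all invented here. The point is only that several source graphs feed one graph, and an application talks to that graph through a single query interface.

```python
class KnowledgeGraph:
    """Hypothetical in-memory stand-in for an enterprise knowledge graph."""

    def __init__(self):
        self.triples = set()

    def feed(self, source_graph):
        # Each source contributes its triples; duplicates collapse in the set.
        self.triples |= set(source_graph)

    def match(self, s=None, p=None, o=None):
        # None acts as a wildcard, loosely like a variable in a triple pattern.
        return sorted(
            t for t in self.triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)
        )


ekg = KnowledgeGraph()
# Two hypothetical departmental graphs that partly overlap.
ekg.feed([("order:1", "status", "shipped"), ("order:2", "status", "cancelled")])
ekg.feed([("order:1", "total", "30.00"), ("order:1", "status", "shipped")])


def shipped_orders(kg):
    # An "application" that only ever goes through match().
    return [s for s, _, _ in kg.match(p="status", o="shipped")]
```

Because every application goes through the same `match` call, a question such as "which orders were shipped?" is answered from one shared graph rather than from each department's own database.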

