LDBC SNB Datagen – The winding path to SF100K
Tags: DATAGEN, SNB

LDBC SNB provides a data generator that produces synthetic datasets mimicking a social network’s activity over a period of time. Datagen is defined by the characteristics of realism, scalability, determinism and usability. More than two years have elapsed since my last technical update on LDBC SNB Datagen, in which I discussed
the reasons for moving the code to Apache Spark from the MapReduce-based Apache Hadoop implementation and the …