Overview
The Graphalytics datasets are compressed using zstd. The total size of the compressed archives is approx. 350 GB. When decompressed, the datasets require approximately 1.5 TB of disk space.
For detailed information on the datasets, see the table with their statistics.
Download scripts
- Script to download all datasets
- Scripts to download size groups: test graphs, sizes up to S, size M, size L, size XL, sizes 2XL+
Dataset links
| dataset | nodes | edges | scale | size | download |
|---|---|---|---|---|---|
| cit-Patents | 3M | 16M | XS | 119.1 MB | tar.zst |
| com-friendster | 65M | 1B | XL | 6.7 GB | tar.zst |
| datagen-7_5-fb | 633k | 34M | S | 162.3 MB | tar.zst |
| datagen-7_6-fb | 754k | 42M | S | 200.0 MB | tar.zst |
| datagen-7_7-zf | 13M | 32M | S | 434.5 MB | tar.zst |
| datagen-7_8-zf | 16M | 41M | S | 544.3 MB | tar.zst |
| datagen-7_9-fb | 1M | 85M | S | 401.2 MB | tar.zst |
| datagen-8_0-fb | 1M | 107M | M | 502.5 MB | tar.zst |
| datagen-8_1-fb | 2M | 134M | M | 625.4 MB | tar.zst |
| datagen-8_2-zf | 43M | 106M | M | 1.4 GB | tar.zst |
| datagen-8_3-zf | 53M | 130M | M | 1.7 GB | tar.zst |
| datagen-8_4-fb | 3M | 269M | M | 1.2 GB | tar.zst |
| datagen-8_5-fb | 4M | 332M | L | 1.5 GB | tar.zst |
| datagen-8_6-fb | 5M | 421M | L | 1.9 GB | tar.zst |
| datagen-8_7-zf | 145M | 340M | L | 4.6 GB | tar.zst |
| datagen-8_8-zf | 168M | 413M | L | 5.3 GB | tar.zst |
| datagen-8_9-fb | 10M | 848M | L | 3.7 GB | tar.zst |
| datagen-9_0-fb | 12M | 1B | XL | 4.6 GB | tar.zst |
| datagen-9_1-fb | 16M | 1B | XL | 5.8 GB | tar.zst |
| datagen-9_2-zf | 434M | 1B | XL | 13.7 GB | tar.zst |
| datagen-9_3-zf | 555M | 1B | XL | 17.4 GB | tar.zst |
| datagen-9_4-fb | 29M | 2B | XL | 14.0 GB | tar.zst |
| datagen-sf3k-fb | 33M | 2B | XL | 12.7 GB | tar.zst |
| datagen-sf10k-fb | 100M | 9B | 2XL | 40.5 GB | tar.zst |
| dota-league | 61k | 50M | S | 114.3 MB | tar.zst |
| graph500-22 | 2M | 64M | S | 202.4 MB | tar.zst |
| graph500-23 | 4M | 129M | M | 410.6 MB | tar.zst |
| graph500-24 | 8M | 260M | M | 847.7 MB | tar.zst |
| graph500-25 | 17M | 523M | L | 1.7 GB | tar.zst |
| graph500-26 | 32M | 1B | XL | 3.4 GB | tar.zst |
| graph500-27 | 63M | 2B | XL | 7.1 GB | tar.zst |
| graph500-28 | 121M | 4B | 2XL | 14.4 GB | tar.zst |
| graph500-29 | 232M | 8B | 2XL | 29.6 GB | tar.zst |
| graph500-30 | 447M | 17B | 3XL | 60.8 GB | tar.zst |
| kgs | 832k | 17M | XS | 65.7 MB | tar.zst |
| twitter_mpi | 52M | 1B | XL | 5.7 GB | tar.zst |
| wiki-Talk | 2M | 5M | 2XS | 34.9 MB | tar.zst |
| example-directed | 10 | 17 | - | 1.0 KB | tar.zst |
| example-undirected | 9 | 12 | - | 1.0 KB | tar.zst |
| test-bfs-directed | <100 | <100 | - | <2.0 KB | tar.zst |
| test-bfs-undirected | <100 | <100 | - | <2.0 KB | tar.zst |
| test-cdlp-directed | <100 | <100 | - | <2.0 KB | tar.zst |
| test-cdlp-undirected | <100 | <100 | - | <2.0 KB | tar.zst |
| test-pr-directed | <100 | <100 | - | <2.0 KB | tar.zst |
| test-pr-undirected | <100 | <100 | - | <2.0 KB | tar.zst |
| test-lcc-directed | <100 | <100 | - | <2.0 KB | tar.zst |
| test-lcc-undirected | <100 | <100 | - | <2.0 KB | tar.zst |
| test-wcc-directed | <100 | <100 | - | <2.0 KB | tar.zst |
| test-wcc-undirected | <100 | <100 | - | <2.0 KB | tar.zst |
| test-sssp-directed | <100 | <100 | - | <2.0 KB | tar.zst |
| test-sssp-undirected | <100 | <100 | - | <2.0 KB | tar.zst |
Note
Some Graphalytics datasets were fixed in March 2023. If you downloaded the datasets prior to this point, some datasets had missing/incorrect reference outputs for certain algorithms. Therefore, we recommend to download the datasets again.