ldbc/ldbc_snb_docs

Add statistics Interactive v1.0 updates

GLaDAP opened this issue · 3 comments

The interactive v1.0 update stream statistics are missing in the documentation, only the totals are provided. Below are statistics derived from the numpart-1 files for scale factors up to SF30. Up to SF1000 still needs to be done.

T C Type SF1 SF3 SF10 SF30 SF100 SF300 SF1000
I E Add Like to Post 494410 1460471 4875874 14378128
I E Add Like to Comment 460487 1450891 5210730 16114277
I N Add Forum 19757 49223 131439 330288
I E Add Forum Membership 1566914 4874316 16647977 51095793
I N Add Post 229614 592875 1655168 4304447
I N Add Comment 490328 1372420 4414427 12588582
I E Add Friendship 40124 122714 431916 1304053
I N Add Person 1108 2672 7355 18570
Total insert node operations 740807 2017190 6208389 17241887
Total insert edge operations 2561935 7908392 27166497 82892251
TOTAL 3302742 9925582 33374886 100134138

I'm running a script to calculate these now. Where are the totals in the document? I took a glance but was unable to locate them.

The totals are shown in Appendix B, B1 Number of entities for SNB Interactive v1.0 (p.144)

Thanks! I was looking for the total of the entities inserted by the update streams. I'll add the table with the update stream statistics once the script finished.

for SF in 0.1 0.3 1 3 10 30 100 300 1000; do
    cat social_network-sf${SF}-numpart-1/updateStream_0_0_person.csv | wc -l | tr -d '\n'
    echo -n ' & '
done
echo
for UPDATE_TYPE in `seq 2 8`; do
    for SF in 0.1 0.3 1 3 10 30 100 300 1000; do
        grep "^[0-9]*|[0-9]*|${UPDATE_TYPE}|" social_network-sf${SF}-numpart-1/updateStream_0_0_forum.csv | wc -l | tr -d '\n'
        echo -n ' & '
    done;
    echo
done