The Jaccard Coefficient can be used to
measure the amount of perturbation introduced to the sanitized graph
during the release process:
where
is the centrality of the node
and
the Jaccard Coefficient
is defined in this context
as follows:
.
In the above expression, the numerator counts the number of edges that
are left unchanged in
, taking directionality into account.
The denominator counts all edges that exist in either direction in either
, or
.
A more obvious measure that simply counts the number of edges added or
removed, as a fraction of the total number of edges, would ignore the
effect of perturbation on individual nodes. By contrast, our measure
takes this into account, weighing nodes in proportion to their centrality
in the network (this is the purpose of the
factor).
Arvind Narayanan 2009-03-19