The shortest path is not always a straight line
2017.04.25 14:26
ABSTRACT
In this paper, we leverage the concept of the metric backbone
to improve the efficiency of large-scale graph analytics. The
metric backbone is the minimum subgraph that preserves
the shortest paths of a weighted graph. We use the metric
backbone in place of the original graph to compute vari-
ous graph metrics exactly or with good approximation. By
computing on a smaller graph, we improve the performance
of graph analytics applications on two di erent systems, a
batch graph processing system and a graph database.
Further, we provide an algorithm for the computation of
the metric backbone on large graphs. While one can com-
pute the metric backbone by solving the all-pairs-shortest-
paths problem, this approach incurs prohibitive time and
space complexity for big graphs. Instead, we propose a
heuristic that makes computing the metric backbone prac-
tical even for large graphs. Additionally, we analyze several
real datasets of di erent sizes and domains and we show
that we can approximate the metric backbone by removing
only rst-order semi-metric edges; edges for which a shorter
two-hop path exists.
We provide a distributed implementation of our algorithm
and apply it in large scale scenarios. We evaluate our algo-
rithm using a variety of real graphs, including a Facebook
social network subgraph of 50 billion edges. We measure
the impact of using the metric backbone on runtime per-
formance in two graph management systems. We achieve
query speedups of up to 6.7x in the Neo4j commercial graph
database and job speedups of up to 6x in the Giraph graph
processing system.
댓글 1
| 번호 | 제목 | 글쓴이 | 날짜 | 조회 수 |
|---|---|---|---|---|
| 57 |
jsp 소스
| 관리자 | 2006.03.20 | 5471 |
| 56 | Advanced SQL | 관리자 | 2006.03.22 | 5541 |
| 55 |
자바소스
| 관리자 | 2006.03.27 | 5417 |
| 54 |
ant
| 관리자 | 2006.04.06 | 5454 |
| 53 | 실습예제 | 관리자 | 2006.04.10 | 8230 |
| 52 | 실습예제2 | 관리자 | 2006.04.10 | 5442 |
| 51 | sqlite실행 | 관리자 | 2006.04.24 | 5374 |
| 50 |
sqlite
| 관리자 | 2006.04.24 | 5349 |
| 49 |
stack
| 관리자 | 2006.04.27 | 5377 |
| 48 |
sqlite
| 관리자 | 2006.05.01 | 5611 |
| 47 | Large Block Flash Memeory | 관리자 | 2006.05.15 | 5566 |
| 46 |
LSTAFF (Flash Memeory)
| 관리자 | 2006.05.29 | 6319 |
| 45 |
BinaryTree
| 관리자 | 2006.06.01 | 5422 |
| 44 |
바이너리트리
| 관리자 | 2006.06.05 | 5461 |
| 43 | Specification을 볼때 알아야할 의미 | 관리자 | 2009.01.21 | 6077 |