•  
  •  
 

Journal of System Simulation

Abstract

Abstract: The parallelized graph clustering algorithm is researched. A new parallelized graph clustering algorithm is proposed based on Spark. As the top operation of Spark occupies a lot of memory space, a new algorithm which is used to substitute the top operation is proposed to reduce the memory consumption. By improving bottom up hierarchical clustering algorithm, the speed of the proposed algorithm is improved. A new data filtering method based on the feature of graph data is proposed. By the method, the running time and memory space comsuption is reduced greatly. The reason of the high efficiency of this filtering method is explained. Simulation result indicates that the proposed algorithm is better than other parallelized graph clustering algorithms.

First Page

1038

Revised Date

2019-03-02

Last Page

1050

CLC

TP391

Recommended Citation

Liu Dongjiang, Li Jianhui. Study of Parallelized Graph Clustering Algorithm Based on Spark[J]. Journal of System Simulation, 2020, 32(6): 1038-1050.

DOI

10.16182/j.issn1004731x.joss.18-0722

Share

COinS