ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software - PubMed (original) (raw)

ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software

Mathieu Jacomy et al. PLoS One. 2014.

Abstract

Gephi is a network visualization software used in various disciplines (social network analysis, biology, genomics...). One of its key features is the ability to display the spatialization process, aiming at transforming the network into a map, and ForceAtlas2 is its default layout algorithm. The latter is developed by the Gephi team as an all-around solution to Gephi users' typical networks (scale-free, 10 to 10,000 nodes). We present here for the first time its functioning and settings. ForceAtlas2 is a force-directed layout close to other algorithms used for network spatialization. We do not claim a theoretical advance but an attempt to integrate different techniques such as the Barnes Hut simulation, degree-dependent repulsive force, and local and global adaptive temperatures. It is designed for the Gephi user experience (it is a continuous algorithm), and we explain which constraints it implies. The algorithm benefits from much feedback and is developed in order to provide many possibilities through its settings. We lay out its complete functioning for the users who need a precise understanding of its behaviour, from the formulas to graphic illustration of the result. We propose a benchmark for our compromise between performance and quality. We also explain why we integrated its various features and discuss our design choices.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1

Figure 1. Layouts with different types of forces.

Layouts with Fruchterman-Reingold (formula image), ForceAtlas2 (formula image) and the LinLog mode of ForceAtlas2 (formula image).

Figure 2

Figure 2. Regular repulsion vs. repulsion by degree.

Fruchterman-Rheingold layout on the left (regular repulsion) and ForceAtlas2 on the right (repulsion by degree). While the global scheme remains, poorly connected nodes are closer to highly connected nodes. (formula image).

Figure 3

Figure 3. Effects of the gravity.

ForceAtlas2 with gravity at 2 and 5. Gravity brings disconnected components closer to the center (and slightly affects the shape of the components as a side-effect).

Figure 4

Figure 4. Effects of the scaling.

ForceAtlas2 with scaling at 1, 2 and 10. The whole graph expands as scaling affects the distance between components as well as their size. Note that the size of the nodes remains the same; scaling is not zooming.

Figure 5

Figure 5. Effects of the edge weight influence.

ForceAtlas2 with Edge Weight Influence at 0, 1 and 2 on a graph with weighted edges. It has a strong impact on the shape of the network.

Figure 6

Figure 6. Effects of the overlapping prevention.

ForceAtlas2 without and with the nodes overlapping prevention.

Figure 7

Figure 7. The oscillation of nodes increases with speed.

Fruchterman-Rheingold layout at speeds 100, 500 and 2,500 (superposition at two successive steps).

Figure 8

Figure 8. Adaptive local speed is a good compromise.

Evolution of the quality of ForceAtlas2 variants at each iteration (the higher the better). Different values of the local speed give different behaviors. The adaptive local speed achieves the best compromise between performance and quality. The network used is “facebook_ego_0” from our dataset.

Figure 9

Figure 9. Effects of adaptive local speed on different networks.

Evolution of the quality of ForceAtlas2 variants at each iteration on the other facebook ego-networks of our dataset. The adaptive local speed is always the best. Local speed 0.001 converges poorly because the speed is too low. Local speed 0.1 converges poorly because it oscillates a lot: the speed is too high. Local speed 0.01 is sometimes adapted to the network, and sometimes not, but never outperforms the adaptive speed.

Figure 10

Figure 10. Records for a single network.

Evolution of the layout quality for a single network over 2048 steps. Rows are the 4 different layouts and columns the 3 different randomizations. The red dot is the “Quick and dirty point” where 50% of the maximum quality is reached, and the blue dot is the “Quasi-optimal point” where 90% of the maximum quality is reached. The full visualization is available at this URL:

https://github.com/medialab/benchmarkForceAtlas2/tree/master/benchmarkResults

.

Figure 11

Figure 11. Overall results of the benchmark.

Note that the second and third charts have logarithmic scales. FR is really slow, YH has a good performance and FA2 has a good quality.

Figure 12

Figure 12. Quasi-Optimal Time over network size.

The lower is the better. Note that both scales are logarithmic. On small networks, FR is the best while FA2_LL is slower. On large networks, FR has a poor performance while other algorithms perform similarly on large networks.

Figure 13

Figure 13. Layouts give visibly different results.

We find that FA2_LL and FA2 are more readable, because the different areas of the network are more precisely defined. However, we do not know any quality measure that captures this phenomenon.

References

    1. Bastian M, Heymann S, Jacomy M (2009) Gephi: an open source software for exploring and manipulating networks. In: International AAAI Conference on Weblogs and Social Media. Association for the Advancement of Artificial Intelligence.
    1. Diminescu D (2008) The connected migrant: an epistemological manifesto. Social Science Information 47: 565–579.
    1. Batagelj V, Mrvar A (1998) Pajek-program for large network analysis. Connections 21: 47–57.
    1. Adar E (2006) Guess: a language and interface for graph exploration. In: Proceedings of the SIGCHI conference on Human Factors in computing systems. ACM, 791–800.
    1. Frick A, Ludwig A, Mehldau H (1994) A fast adaptive layout algorithm for undirected graphs (extended abstract and system demonstration).

MeSH terms

Grants and funding

The authors have no support or funding to report.

LinkOut - more resources