No.072 Analytics on complex networks: Scalable solutions for empirical questions

Icon

NII Shonan Meeting Seminar 072

Organizers

  • George Fletcher, Eindhoven University of Technology
  • Taro Takaguchi, NII
  • Yuichi Yoshida, NII

Participants

  • George Fletcher, Eindhoven University of Technology
  • Taro Takaguchi, NII
  • Yuichi, Yoshida, NII
  • Andrew Lumsdaine, Indiana University
  • Aydın Buluç, Lawrence Berkeley Laboratory
  • Yuqing Melanie Wu, Pomona College
  • Matias Korman, Tohoku University
  • Makoto Hamana, Gunma University
  • Keisuke Nakano, The University of Electro-Communications
  • Juyong Park, KAIST
  • Jean-Charles Delvenne, Université catholique de Louvain
  • Hang-Hyun Jo, POSTECH
  • David F. Gleich, Purdue University
  • Makoto Onizuka, Osaka University
  • Herman Haverkort, Eindhoven University of Technology
  • Mykola Pechenizkiy, TU Eindhoven
  • Laetitia Gauvin, ISI foundation
  • Dragan Bosnacki, Eindhoven University of Technology
  • Alexandru Iosup, Delft University of Technology
  • Ana Lucia Varbanescu, University of Amsteram

Schedule

Arrival (Sunday, 7th February)

15:00-18:30 Check-In in Shonan Center
19:00-20:30 Welcome Dinner
21:00- Free Time

Day 1 (Monday, 8th February)

07:30-09:00 Breakfast
09:00-09:10 Shonan Introduction by Staff
09:10-12:00 – Presentations by the organizers
– Short introduction by each member
12:00-14:00 Group photo and Lunch
14:00-18:00 – Short introduction by each member (cont.)
– Group formation (4 groups of 5 members each)
– Group break-out discussions
Goal: to identify multidisciplinary (broad) research themes
18:30-19:30 Dinner
19:30- Free Time

Day 2 (Tuesday, 9th February)

07:30-09:00 Breakfast
09:00-12:00 – Plenary discussions by the groups
Goal: to present results of group discussions
 (20min for each) & select 4 most interesting themes to explore more deeply
– Group reformation around interests of 4 themes
12:00-14:00 Lunch
14:00-18:00 – Group break-out discussions
Goal: to identify concrete interdisciplinary research problems within group themes
18:30-19:30 Dinner
19:30- Free Time

Day 3 (Wednesday, 10th February)

07:30-09:00 Breakfast
09:00-12:00 – Plenary discussion
Goal: to present and share the results of the group break-out discussions on concrete problems
12:00-14:00 Lunch
14:00-21:00 Excursion (including dinner)
21:00- Free Time

Day 4 (Thursday, 11th February)

07:30-09:00 Breakfast
09:00-12:00 – Plenary discussion to consolidate problems
– Plenary discussion: multidisciplinary action routes
(e.g., new research collaborations, research exchange visits, …)
12:00-14:00 Lunch
14:00- Departure

 

Overview

Real-world networks, such as web graphs, social networks, and biological networks, have remarkable topological features in common: the scale-free property, that is, the degree distribution obeys a power-law function, and the small-world property, that is, the average distance between two vertices is small. These networks are collectively referred to as complex networks and have been intensively studied in the last decade.

In their empirical study of complex networks, researchers across the fields of data engineering, theoretical computer science, and network science are developing solutions for analytics. Due to rapid growth of the sizes of real-world networks, however, we are facing challenges in scaling these tools, as we will describe below. We desire to solve these issues by bridging the knowledge in the three fields with the single key notion: complex networks.

Researchers in the engineering of data-intensive systems, such as database and data mining systems, have traditionally focused on the study of scalable solutions for analytics on big data collections. With the increasing availability and broad interest in massive complex networks, researchers in data engineering have hence focused considerable effort in recent years on scalable mining and querying over such data collections. However, very little work has been done on coordinating the results of these investigations with the foundational work in this area in the theoretical computer science and network science communities. Stronger ties between these communities would help deepen the understanding and development of theoretically grounded analytics solutions for the empirical investigations of interest in the broader scientific community.

From the perspective of theoretical computer science, the important point is that we need algorithms for real-world networks but not for all the possible networks. The majority of theoretical studies on graph problems pay attention to worst-case time complexity or average-case time complexity, which may have little to do with real-world networks. On the other hand, practical algorithms on graph problems proposed in the fields of databases, data mining, and machine learning have shown their efficacy on real-world networks, but these algorithms lack theoretical groundings to understand why they work well in practice. By applying theoretical analysis frameworks to these algorithms, we want to answer foundational questions such as “what is the right time complexity of this particular problem when the given graph is a complex network?” and “why does this particular heuristic make the algorithm faster or more accurate when the given graph is a complex network?”

Network science, a new subfield of statistical physics seeking universality hidden behind real-world networks, can move on to a new stage with aids provided by data engineering and theoretical computer science. For example, a large number of models describing growth processes of complex networks and information spreading on them have been studied in network science. However, their computational properties are largely unexplored, which prevents us from discussing their properties with rigorous theoretical groundings. As another example, various kinds of vertex centralities, which measure the importance of vertices, have been proposed and applied to network data. However, the majority of these centralities are computationally hard and efficient (approximation) algorithms should be studied to apply them to real-world networks at a large scale. In theoretical computer science, strong analytical tools for graphs such as graph minor theory and spectral graph theory have been developed and these theories may shed new light on novel characteristics of complex networks.

As we have seen, complex networks are studied in diverse fields from different perspectives. The main goal of this workshop is bringing researchers in these fields together to take the first steps towards bridging works on scalable analytics over complex networks.