This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Brought to you by:

Modelling and inference in the dynamics of complex interaction networks

Peter Sollich, Yasser Roudi and Manfred Opper
Picture. Peter Sollich, Yasser Roudi and Manfred Opper

Guest Editors

Peter Sollich Kings College, London.
Yasser Roudi Kavli Institute and Centre for Neural Computation, Norway
Manfred Opper Technische Universitaet Berlin, Germany


Scope

There is a close relationship between statistical inference and the statistical physics of disordered systems which can be characterized as large systems of simple units linked by a complex network of interactions. While this relationship is by now well established, there has in the past few years been significant new work on the development of methods for statistical inference using concepts and approximations that have their roots in statistical mechanics. One of the main reasons for this wave of interest lies in the recent and ongoing developments in many areas of experimental biology, finance and other areas, where very large data sets are becoming available but there is a shortage of modern theoretical techniques for analysing them.

In the past, as well as the initial years of the current wave, the main interaction between statistical physics and statistical inference has been in the area of developing equilibrium tools and analysing data sets in an equilibrium framework. This means coming up with efficient methods for building stationary and equilibrium probability distributions over the state of the system under investigation, on the basis of recorded experimental or other empirical data, and to analyse such distributions for gaining insight into the system. However, more recently, the interest in the community has shifted more towards dynamic models. This shift has occurred for two main reasons:

(a) Most of the interesting systems for which statistical analysis techniques are required, e.g. neuronal networks, gene regulatory networks, protein-protein interaction networks, stock markets, routing problems exhibit very rich temporal or spatiotemporal dynamics; if this is ignored by focusing on stationary distributions alone it will lead to the loss of a significant amount of interesting information and possibly even qualitatively wrong conclusions (see e.g. Tyrcha et al, JSTAT 2013).

(b) Current technological breakthroughs in collecting data from the complex systems referred to above allow for ever increasing temporal resolution. This in turn allows in depth analyses of the fundamental temporal aspects of the function of the system, if combined with strong theoretical methods. It is widely accepted that these dynamical aspects are crucial for understanding the function of biological and financial system, warranting the development of techniques for studying them. The purpose of the proposed special issue is to gather original research articles as well as pedagogical review articles both on the theoretical aspects of the interface between statistical inference and the physics of non-equilibrium disordered and complex systems, and on applications to real data.

The purpose of the proposed special issue is to gather original research articles as well as pedagogical review articles both on the theoretical aspects of the interface between statistical inference and the physics of non-equilibrium disordered and complex systems, and on applications to real data.

The articles listed below are the first accepted contributions to the collection and further additions will appear on an ongoing basis.

Topical Review

Path integral methods for the dynamics of stochastic and disordered systems

John A Hertz et al 2017 J. Phys. A: Math. Theor. 50 033001

We review some of the techniques used to study the dynamics of disordered systems subject to both quenched and fast (thermal) noise. Starting from the Martin–Siggia–Rose/Janssen–De Dominicis–Peliti path integral formalism for a single variable stochastic dynamics, we provide a pedagogical survey of the perturbative, i.e. diagrammatic, approach to dynamics and how this formalism can be used for studying soft spin models. We review the supersymmetric formulation of the Langevin dynamics of these models and discuss the physical implications of the supersymmetry. We also describe the key steps involved in studying the disorder-averaged dynamics. Finally, we discuss the path integral approach for the case of hard Ising spins and review some recent developments in the dynamics of such kinetic Ising models.

Papers

Evolutionary optimization of network reconstruction from derivative-variable correlations

Marc G Leguia et al 2017 J. Phys. A: Math. Theor. 50 334001

Topologies of real-world complex networks are rarely accessible, but can often be reconstructed from experimentally obtained time series via suitable network reconstruction methods. Extending our earlier work on methods based on statistics of derivative-variable correlations, we here present a new method built on integrating an evolutionary optimization algorithm into the derivative-variable correlation method. Results obtained from our modification of the method in general outperform the original results, demonstrating the suitability of evolutionary optimization logic in network reconstruction problems. We show the method's usefulness in realistic scenarios where the reconstruction precision can be limited by the nature of the time series. We also discuss important limitations coming from various dynamical regimes that time series can belong to.

Open access
Joint statistics of strongly correlated neurons via dimensionality reduction

Taşkın Deniz and Stefan Rotter 2017 J. Phys. A: Math. Theor. 50 254002

The relative timing of action potentials in neurons recorded from local cortical networks often shows a non-trivial dependence, which is then quantified by cross-correlation functions. Theoretical models emphasize that such spike train correlations are an inevitable consequence of two neurons being part of the same network and sharing some synaptic input. For non-linear neuron models, however, explicit correlation functions are difficult to compute analytically, and perturbative methods work only for weak shared input. In order to treat strong correlations, we suggest here an alternative non-perturbative method. Specifically, we study the case of two leaky integrate-and-fire neurons with strong shared input. Correlation functions derived from simulated spike trains fit our theoretical predictions very accurately. Using our method, we computed the non-linear correlation transfer as well as correlation functions that are asymmetric due to inhomogeneous intrinsic parameters or unequal input.

The appropriateness of ignorance in the inverse kinetic Ising model

Benjamin Dunn and Claudia Battistin 2017 J. Phys. A: Math. Theor. 50 124002

We develop efficient ways to consider and correct for the effects of hidden units for the paradigmatic case of the inverse kinetic Ising model with fully asymmetric couplings. We identify two sources of error in reconstructing the connectivity among the observed units while ignoring part of the network. One leads to a systematic bias in the inferred parameters, whereas the other involves correlations between the visible and hidden populations and has a magnitude that depends on the coupling strength. We estimate these two terms using a mean field approach and derive self-consistent equations for the couplings accounting for the systematic bias. Through application of these methods on simple networks of varying relative population size and connectivity strength, we assess how and under what conditions the hidden portion can influence inference and to what degree it can be crudely estimated. We find that for weak to moderately coupled systems, the effects of the hidden units is a simple rotation that can be easily corrected for. For strongly coupled systems, the non-systematic term becomes large and can no longer be safely ignored, further highlighting the importance of understanding the average strength of couplings for a given system of interest.

Action selection in growing state spaces: control of network structure growth

Dominik Thalmeier et al 2017 J. Phys. A: Math. Theor. 50 034006

The dynamical processes taking place on a network depend on its topology. Influencing the growth process of a network therefore has important implications on such dynamical processes. We formulate the problem of influencing the growth of a network as a stochastic optimal control problem in which a structural cost function penalizes undesired topologies. We approximate this control problem with a restricted class of control problems that can be solved using probabilistic inference methods. To deal with the increasing problem dimensionality, we introduce an adaptive importance sampling method for approximating the optimal control. We illustrate this methodology in the context of formation of information cascades, considering the task of influencing the structure of a growing conversation thread, as in Internet forums. Using a realistic model of growing trees, we show that our approach can yield conversation threads with better structural properties than the ones observed without control.

Expectation propagation for continuous time stochastic processes

Botond Cseke et al 2016 J. Phys. A: Math. Theor. 49 494002

We consider the inverse problem of reconstructing the posterior measure over the trajectories of a diffusion process from discrete time observations and continuous time constraints. We cast the problem in a Bayesian framework and derive approximations to the posterior distributions of single time marginals using variational approximate inference, giving rise to an expectation propagation type algorithm. For non-linear diffusion processes, this is achieved by leveraging moment closure approximations. We then show how the approximation can be extended to a wide class of discrete-state Markov jump processes by making use of the chemical Langevin equation. Our empirical results show that the proposed method is computationally efficient and provides good approximations for these classes of inverse problems.

Variational perturbation and extended Plefka approaches to dynamics on random networks: the case of the kinetic Ising model

L Bachschmid-Romano et al 2016 J. Phys. A: Math. Theor. 49 434003

We describe and analyze some novel approaches for studying the dynamics of Ising spin glass models. We first briefly consider the variational approach based on minimizing the Kullback–Leibler divergence between independent trajectories and the real ones and note that this approach only coincides with the mean field equations from the saddle point approximation to the generating functional when the dynamics is defined through a logistic link function, which is the case for the kinetic Ising model with parallel update. We then spend the rest of the paper developing two ways of going beyond the saddle point approximation to the generating functional. In the first one, we develop a variational perturbative approximation to the generating functional by expanding the action around a quadratic function of the local fields and conjugate local fields whose parameters are optimized. We derive analytical expressions for the optimal parameters and show that when the optimization is suitably restricted, we recover the mean field equations that are exact for the fully asymmetric random couplings (Mézard and Sakellariou 2011 J. Stat. Mech. 2011 L07001). However, without this restriction the results are different. We also describe an extended Plefka expansion in which in addition to the magnetization, we also fix the correlation and response functions. Finally, we numerically study the performance of these approximations for Sherrington–Kirkpatrick type couplings for various coupling strengths and the degrees of coupling symmetry, for both temporally constant but random, as well as time varying external fields. We show that the dynamical equations derived from the extended Plefka expansion outperform the others in all regimes, although it is computationally more demanding. The unconstrained variational approach does not perform well in the small coupling regime, while it approaches dynamical TAP equations of (Roudi and Hertz 2011 J. Stat. Mech. 2011 P03031) for strong couplings.

Data quality for the inverse lsing problem

Aurélien Decelle et al 2016 J. Phys. A: Math. Theor. 49 384001

There are many methods proposed for inferring parameters of the Ising model from given data, that is a set of configurations generated according to the model itself. However little attention has been paid until now to the data, e.g. how the data is generated, whether the inference error using one set of data could be smaller than using another set of data, etc. In this paper we discuss the data quality problem in the inverse Ising problem, using as a benchmark the kinetic Ising model. We quantify the quality of data using effective rank of the correlation matrix, and show that data gathered in a out-of-equilibrium regime has a better quality than data gathered in equilibrium for coupling reconstruction. We also propose a matrix-perturbation based method for tuning the quality of given data and for removing bad-quality (i.e. redundant) configurations from data.

Extended Plefka expansion for stochastic dynamics

B Bravi et al 2016 J. Phys. A: Math. Theor. 49 194003

We propose an extension of the Plefka expansion, which is well known for the dynamics of discrete spins, to stochastic differential equations with continuous degrees of freedom and exhibiting generic nonlinearities. The scenario is sufficiently general to allow application to e.g. biochemical networks involved in metabolism and regulation. The main feature of our approach is to constrain in the Plefka expansion not just first moments akin to magnetizations, but also second moments, specifically two-time correlations and responses for each degree of freedom. The end result is an effective equation of motion for each single degree of freedom, where couplings to other variables appear as a self-coupling to the past (i.e. memory term) and a coloured noise. This constitutes a new mean field approximation that should become exact in the thermodynamic limit of a large network, for suitably long-ranged couplings. For the analytically tractable case of linear dynamics we establish this exactness explicitly by appeal to spectral methods of random matrix theory, for Gaussian couplings with arbitrary degree of symmetry.

Rare events statistics of random walks on networks: localisation and other dynamical phase transitions

Caterina De Bacco et al 2016 J. Phys. A: Math. Theor. 49 184003

Rare event statistics for random walks on complex networks are investigated using the large deviation formalism. Within this formalism, rare events are realised as typical events in a suitably deformed path-ensemble, and their statistics can be studied in terms of spectral properties of a deformed Markov transition matrix. We observe two different types of phase transition in such systems: (i) rare events which are singled out for sufficiently large values of the deformation parameter may correspond to localised modes of the deformed transition matrix; (ii) 'mode-switching transitions' may occur as the deformation parameter is varied. Details depend on the nature of the observable for which the rare event statistics is studied, as well as on the underlying graph ensemble. In the present paper we report results on rare events statistics for path averages of random walks in Erdős–Rényi and scale free networks. Large deviation rate functions and localisation properties are studied numerically. For observables of the type considered here, we also derive an analytical approximation for the Legendre transform of the large deviation rate function, which is valid in the large connectivity limit. It is found to agree well with simulations.

Mean-field inference of Hawkes point processes

Emmanuel Bacry et al 2016 J. Phys. A: Math. Theor. 49 174006

We propose a fast and efficient estimation method that is able to accurately recover the parameters of a d-dimensional Hawkes point-process from a set of observations. We exploit a mean-field approximation that is valid when the fluctuations of the stochastic intensity are small. We show that this is notably the case in situations when interactions are sufficiently weak, when the dimension of the system is high or when the fluctuations are self-averaging due to the large number of past events they involve. In such a regime the estimation of a Hawkes process can be mapped on a least-squares problem for which we provide an analytic solution. Though this estimator is biased, we show that its precision can be comparable to the one of the maximum likelihood estimator while its computation speed is shown to be improved considerably. We give a theoretical control on the accuracy of our new approach and illustrate its efficiency using synthetic datasets, in order to assess the statistical estimation error of the parameters.

A theory of solving TAP equations for Ising models with general invariant random matrices

Manfred Opper et al 2016 J. Phys. A: Math. Theor. 49 114002

We consider the problem of solving TAP mean field equations by iteration for Ising models with coupling matrices that are drawn at random from general invariant ensembles. We develop an analysis of iterative algorithms using a dynamical functional approach that in the thermodynamic limit yields an effective dynamics of a single variable trajectory. Our main novel contribution is the expression for the implicit memory term of the dynamics for general invariant ensembles. By subtracting these terms, that depend on magnetizations at previous time steps, the implicit memory terms cancel making the iteration dependent on a Gaussian distributed field only. The TAP magnetizations are stable fixed points if a de Almeida–Thouless stability criterion is fulfilled. We illustrate our method explicitly for coupling matrices drawn from the random orthogonal ensemble.