This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Brought to you by:
Letter The following article is Open access

Dead or alive: Distinguishing active from passive particles using supervised learning(a)

, , , , , and

Published 28 June 2023 Copyright © 2023 The author(s)
, , Focus Issue on Statistical Physics of Self-Propelled Colloids Citation Giulia Janzen et al 2023 EPL 143 17004 DOI 10.1209/0295-5075/acdf1b

0295-5075/143/1/17004

Abstract

A longstanding open question in the field of dense disordered matter is how precisely structure and dynamics are related to each other. With the advent of machine learning, it has become possible to agnostically predict the dynamic propensity of a particle in a dense liquid based on its local structural environment. Thus far, however, these machine-learning studies have focused almost exclusively on simple liquids composed of passive particles. Here we consider a mixture of both passive and active (i.e., self-propelled) Brownian particles, with the aim to identify the active particles from minimal local structural information. We compare a state-of-the-art machine learning approach for passive systems with a new method we develop based on Voronoi tessellation. Both methods accurately identify the active particles based on their structural properties at high activity and low concentrations of active particles. Our Voronoi method is, however, substantially faster to train and deploy because it requires fewer, and easy to compute, input features. Notably, both become ineffective when the activity is low, suggesting a fundamentally different structural signature for dynamic propensity and non-equilibrium activity. Ultimately, these efforts might also find relevance in the context of biological active glasses such as confluent cell layers, where subtle changes in the microstructure can hint at pathological changes in cell dynamics.

Export citation and abstract BibTeX RIS

Published by the EPLA under the terms of the Creative Commons Attribution 4.0 International License (CC-BY). Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.

Introduction

A central notion in the study of active particulate matter —systems of discrete entities which consume energy to perform work and move autonomously— is that the presence of activity can dramatically alter both spatial organization and (collective) dynamics [1]. The fact that active matter is intrinsically out of equilibrium renders the standard tools of statistical physics of limited use and leaves open the question of which quantifiers most accurately characterize and predict the dynamics of active matter [2]. This is a profound issue, especially in densely disordered phases such as liquids and glasses, where the relation between spatial structure and emergent dynamics is notoriously obscure [1,3]. A better understanding of structure-dynamics relations in active matter would be highly desirable both from a fundamental and more applied perspective. Notably, in the context of biological tissues and confluent cell layers, subtle structural changes can correlate with the motile properties of the individual cells, with relevance in processes such as cancer metastasis and embryonic development [4,5].

Recent efforts have demonstrated that machine learning (ML) approaches are extremely effective in finding simple structural indicators that predict dynamical properties in densely disordered (near-)equilbrium systems [625]. These findings firmly establish a correlation between local structure and the propensity of passive particles to move in a crowded environment. While ML has also been applied with considerable success in purely active systems [2533], in practice and particularly in biological settings the system of interest will generally contain actors whose activity parameters are different, and distributed. In tumor tissue, for instance, there may be a distribution of epithelial (more stationary, passive) phenotypes and mesenchymal (more motile, active) ones, with the presence of mesenchymal cells being generally associated with greater metastatic potential [5,34]. Thus, developing the ability to reliably distinguish active from passive elements in dense collectives holds diagnostic and prognostic potential.

Inspired by this challenge, our work addresses a seemingly simple issue: Can we identify the active species in a binary model system of active and passive particles? Here we refer to passive entities as "dead" particles that do not possess any self-propulsion, while the active, self-propelled particles are "alive". Naturally, the distinction would be most conveniently done based on dynamical (i.e., time-resolved) information, but in biological settings such dynamical data may be difficult to obtain. This is why we complicate the challenge considerably, and demand that the identification is performed based solely on static images ("snapshots") of the system.

Recent studies [3539] have shown that the presence of a small number of active particles within a dense passive bath can induce specific structural inhomogeneities. Explicitly, bath particles tend to accumulate at the front of an active particle while leaving a depletion zone behind it. This spatial anisotropy is usually irrelevant in the absence of activity, and is therefore also frequently ignored in ML methods designed for purely passive systems [17,20,22,40]. However, it could be essential to distinguish dead particles from those that are alive.

In this work, we quantify the spatial anisotropy of each particle's local environment through a simple Voronoi tesselation [40,41]; from this instantaneous structural information, we seek to predict the particle identity (dead or alive) using ML. We show that when the active particle fraction is low and the activity level is high, the shape of the Voronoi polygons around active particles exhibits distinct characteristics compared to those around passive particles, thus providing sufficient static information to reliably distinguish the two species. This observation highlights the effectiveness of Voronoi tessellation as a suitable tool for quantifying unique structural signatures associated with active particles, at least within a suitably chosen parameter range.

We also compare our method with an existing state-of-the-art ML approach based on particle configurations, originally developed for purely passive simulated supercooled liquids [16,17,22]. We demonstrate that both approaches are only successful when the fraction of active particles is low and the activity is high. Furthermore, we show that the Voronoi method is substantially faster and easier to compute because it requires fewer and cheaper input features. This emphasizes the significance of anisotropy in telling apart dead from alive. However, when the active particle concentration is not dilute or the active force is too small, all these static approaches fail to provide accurate predictions, suggesting that the instantaneous local structure no longer fully encodes the particle identity. For this more challenging regime, we develop a pseudo-static approach, detailed in the Supplementary Material Supplementarymaterial.pdf (SM), which also incorporates information on the averaged statistical fluctuations of the local structure.

Briefly, our Voronoi approach to distinguish between active and passive particles from a single snapshot is outlined in fig. 1. First, the snapshots are processed by performing a Voronoi tessellation [41], from which we compute 11 relevant features that effectively capture local anisotropies. At the core of our Voronoi approach is a gradient boosting model, discussed in the "Methods" section, that takes into account the local environment of each particle to predict if it is dead or alive. Furthermore, the predictions generated by our ML model are explainable, enhancing the interpretability of the results.

Fig. 1:

Fig. 1: Sketch of our Voronoi-based machine learning approach for the identification of active particles in an active/passive mixture. The snapshot is processed by constructing its Voronoi tessellation, then extracting particle-specific features. A gradient boosting model is trained to evaluate the probability that each particle is dead (passive) or alive (active), only using the static features extracted from the single snapshot.

Standard image

Ultimately, our findings lead us to conclude that there are two distinct regions within the parameter space defined by the fraction of active particles and the active force. In the first region, characterized by high activity and a low fraction of active particles, active and passive particles can be distinguished effectively using a static approach. However, in the second region, active and passive particles become indistinguishable from a static point of view.

Methods

Simulation model

Our model system is a two-dimensional (2D) Kob-Andersen mixture [4244] composed of 650 and 350 particles of type A and B, respectively. We extend the model by turning a fraction $\phi_a$ into active Brownian particles [4548] (same proportion for each particle type). We have also conducted similar investigations in a three-dimensional (3D) mixture, comprising 800 particles of type A and 200 particles of type B, and obtained similar results.

The self-propelled motion is characterized by a constant active force of magnitude Fa . To develop a model that can be applied to study highly crowded biological systems, such as dense cellular collectives, we have chosen to focus on steady-state configurations with a density of $\rho=1.2$ and temperature of $T=0.5$ (in simulation units) and retrieve 2000 different independent configurations for each studied setting. Moreover, to effectively characterize the large activity regime, we consider activities ranging from 0.5 to 100, as cells primarily move due to active motion. Furthermore, we have verified that similar results are obtained at a temperature of $T=0.6$ . Finally, to confirm that the dynamics at high activities are not influenced by finite-size effects, we performed the same analysis on a larger system with 10000 particles, and we obtained similar results. All the details of the simulation model and the data collection are reported in the SM.

Gradient boosting model

We treat the identification of the active particles as a binary classification problem. For this we use LightGBM, which is a gradient-boosting machine-learning method based on decision trees [49]. This algorithm is widely used due to its efficiency, accuracy, and interpretability, and has been successfully employed, e.g., in the context of low-temperature glassy materials [23]. We show in the SM that it outperforms neural networks for our task of distinguishing active from passive particles.

In general, gradient boosting decision trees [50] comprise an ensemble learning method that combines the power of decision trees [51] and gradient boosting. It involves building a sequence of decision trees, where each subsequent tree corrects the errors made by the previous ones, by performing a residual update step. By iteratively adding decision trees and adjusting their weights based on the residuals, the model gradually improves its ability to make accurate predictions.

LightGBM introduces gradient-based one-side sampling and exclusive feature bundling [49] in order to avoid the need to scan all the data to update the residuals at each step. As a result, gradient boosting can be used for large datasets, such as the collection of simulation snapshots used in this work. In the end, after calculating all the relevant structural input features (discussed in the next section), the training of our model takes only several minutes on a standard laptop.

To quantify our ML performance we use the accuracy, defined as the number of correct predictions divided by the total number of predictions. When the fraction of active particles $\phi_a\neq 0.5$ we simply discard a subset of the particles from the data such that the number of active and passive species is equal. This allows us to work with a balanced dataset, which enhances the classifier's generalizability. Since our input features are particle-resolved and additional data collection is quite inexpensive, this presents no practical challenges.

Structural input features

Our main goal is to distinguish the active particles from the passive ones using only instantaneous static information, i.e., a single snapshot. For this, we employ two alternative sets of local, particle-resolved structural properties as input for our machine-learning model. We refer to these as i) the shell-based approach, developed earlier for purely passive systems [16], and ii) the Voronoi approach, which we introduce here to capitalize on the information contained in local anisotropy.

The features of the shell-based approach have been introduced to effectively predict the dynamic propensity of purely passive particles [16,17]. Since the dynamic propensity quantifies the average squared displacement of a particle from its initial condition [52,53], and active particles have an additional self-propelling force, it is natural to expect that active particles have a larger dynamic propensity. We also confirm this observation by measuring the mean-squared displacement [54,55], reported in the SM. Notice that similar indicators also predict localized plastic events [56,57]. This means that if the dynamical information remains encoded in the structure even for active systems, then this approach should be able to pick it up and identify the active particles from their enhanced propensity.

To implement the shell-based approach we follow the definitions of ref. [16]. Briefly, these features consist of n-th–order radial and angular descriptors, denoted as $G_i^{(n)}$ and $q_i^{(n)}$ , respectively. The former measures for each particle i the local particle density within a radial shell, while the latter expands the local density within a shell in terms of spherical harmonics; the order n indicates the degree of averaging over the structural features of nearby particles. Here we consider $n=0, 1, 2$ . For n = 0, we compute 50 radial descriptors per species with radial distance $r \in [0,5]$ and shell width $\delta=0.1$ , and 192 angular descriptors with $r \in [1,2.5]$ , $\delta=0.1$ , and spherical harmonics of order $l \in [1,12]$ . In sum, we use a total of 876 static quantities as input in the shell-based approach.

As an alternative to the shell-based features of ref. [16], we introduce the Voronoi approach, which naturally quantifies the local anisotropy created by the active particles. Briefly, we perform a Voronoi tessellation of all particle positions and compute the features based on the shape of the Voronoi polygon around each particle. Explicitly, we consider the number of neighbors, the polygon area and perimeter, the distance between the center of the polygon and the actual particle position, the maximum and minimum distances between nearest neighbors, the maximum and minimum distances between the vertices of a polygon, the maximum and minimum distances between the particle position and its polygon vertices, and finally we add the particle type (A or B). Overall this amounts to only 11 relatively simple input features. Compared to the shell-based approach, the Voronoi approach is thus significantly cheaper.

Results and discussion

Distinguishing dead from alive

We report in fig. 2 the accuracy achieved by our Voronoi ML model in the classification of an active/passive mixture with $\phi_a=0.008$ , for different values of the active force Fa . We compare the accuracy that we get when the model is trained and tested at a single specific value of Fa (orange circles), with the accuracy of a model trained when fixing $F_a=50$ (blue squares) or $F_a=100$ (pink triangles). All three curves produce rather similar accuracies, implying that a single ML model trained at a fixed Fa can also produce good predictions for unseen parameter regimes. This shows reasonable generalizability of the model. However, the accuracy in all three cases drops significantly for lower activity. We attribute this to the fact that small values of Fa render it extremely challenging to differentiate between structural signatures induced by either active forces or passive Brownian motion. Hence, any static approach based purely on instantaneous structural properties fails.

Fig. 2:

Fig. 2: Accuracy of the Voronoi ML approach as a function of the active force $F_a=F_a^{\text{test}}$ , with $\phi_a=0.008$ . The orange circles represent scores obtained from separate models, where each one was trained using $F_a^{\text{train}}=F_a^{\text{test}}$ . The blue squares and pink triangles represent scores obtained from a single global model trained with data only for $F_a=50$ or $F_a=100$ , respectively. The inset shows the comparison between the Voronoi and the shell-based approaches for three different active forces $F_{a}^{\text{test}}=F_{a}^{\text{train}}=5,50,100$ . The purple circles, blue stars, and red diamonds represent the ratio between accuracy, computational times, and number of features, respectively.

Standard image

The inset of fig. 2 demonstrates that very similar accuracies can be achieved when training the ML algorithm with the more intricate structural properties $G_i^{(n)}$ and $q_i^{(n)}$ as input (shell-based approach). However, the shell-based approach is considerably more computationally expensive compared to the Voronoi method. The reason is that the shell-based approach employs almost two orders of magnitude more features. Consequently, we can conclude that, for this specific classification task, the Voronoi approach is more efficient. Moreover, we find similar results for both the Voronoi and the shell-based approach in 3D (see the SM).

Figure 3 reports the performance of the Voronoi active/passive classifiers as a function of the percentage of active particles $\phi_a$ , while keeping Fa fixed at 100. For $\phi_a \leq 0.1$ , our approach demonstrates excellent predictive capability when the ML model is trained and tested at a specific $\phi_a$ value (represented by orange circles). Additionally, we compare the accuracy obtained when training the model with $\phi_a$ fixed at 0.008 (blue squares) or 0.05 (pink triangles). The latter case yields better transferability than when training only on $\phi_a=0.008$ , especially in the region $\phi_a\leq 0.1$ , where it rivals the performance of the model trained at each $\phi_a$ value separately. Noticeably, however, when the fraction of active particles $\phi_a$ becomes large, the accuracy drops to below 70%, even for our best ML model. This indicates that the instantaneous structural input becomes less informative to differentiate between active and passive particles, which we attribute to the fact that the presence of more active particles increasingly affects the entire structure of the material and diminishes the front-wake asymmetry. Finally, we have also repeated the Voronoi tessellation analysis at a lower density of $\rho=1.0$ with a fraction of active particles $\phi_a=0.008$ . Interestingly, the results are qualitatively similar to those obtained at $\rho=1.2$ , but with a more pronounced anisotropy observed at a lower activity $(F_a=50)$ compared to the higher-density system (see the SM for more details).

Fig. 3:

Fig. 3: Accuracy of the Voronoi ML approach as a function of the fraction of active particles $\phi_a=\phi_a^{\text{test}}$ , with $F_a=100$ . The orange circles represent scores obtained from separate models, where each one was trained using $\phi_a^{\text{train}}=\phi_a^{\text{test}}$ . The blue squares and pink triangles represent scores obtained from a single global model trained with data only for $\phi_a=0.008$ or $\phi_a=0.05$ , respectively.

Standard image

A full overview of the Voronoi ML model accuracy in the $(F_a, \phi_a)\text{-plane}$ is reported in fig. 4. This figure reveals two distinct regions: a red region representing a "distinguishable" area and a blue region representing an "indistinguishable" area. In the red region, characterized by low $\phi_a$ and high Fa values, the algorithm achieves an accuracy greater than 0.75, enabling it to differentiate adequately between active and passive particles based on a single snapshot. In this region, the structure surrounding the active particle significantly differs from that of the passive particle. As a result, even a shell-based approach, which does not consider the directional dependence of passive particle density around the active particle, still yields accurate results (inset of fig. 2).

Fig. 4:

Fig. 4: Accuracy map of the Voronoi ML approach in the $(F_a, \phi_a)\text{-plane}$ for a 2D system at temperature $T=0.5$ and density $\rho=1.2$ .

Standard image

In the blue region, characterized by either high $\phi_a$ or low Fa , any static approach (either shell-based or Voronoi) is insufficient to effectively differentiate between active and passive particles. When Fa  < 40, the activity is not high enough to induce noticeable spatial inhomogeneities. On the other hand, when $\phi_a$ is too high, the interaction among active particles complicates the classification task. Particularly, in cases where both Fa and $\phi_a$ are high, the system can undergo motility-induced phase separation (MIPS) [37]. In view of the poor overall performance of our ML method for both high $\phi_a$ and low Fa , we conclude that if a local structure-dynamics relationship exists in active systems, it must be significantly different from that observed in passive systems, as indicated by the failure of the shell-based approach. Additionally, this relationship is not hidden in the anisotropy, as indicated by the failure of the Voronoi approach.

In order to still achieve good predictability in the regime where a purely static ML approach fails (blue region in fig. 4), we have developed a so-called pseudo-static method. This approach, detailed in the SM, is able to identify active particles at the cost of more input information. Briefly, while not requiring a fully time-resolved dynamical trajectory, it requires some knowledge of the averaged statistical fluctuations of the local structure. It is possible to obtain these statistics from a collection of snapshots that do not have to be time-ordered, but in which each particle has to be tracked. This tracking information makes the method naturally more computationally expensive. Overall, in the pseudo-static approach, the information about the local structure is complemented by statistical measurements of the same features at different times. We have verified that using this information on the structural fluctuations, an accuracy of 95%–100% can be achieved (see the SM).

Model explanation

Let us return to the parameter regime where our Voronoi ML model is most successful in distinguishing active from passive particles, i.e., small $\phi_a$ and large Fa . In this regime we can interpret the decisions made by the ML model. To do so, we calculate the model explanations using SHapley Additive exPlanation (SHAP) analysis [58]. Briefly, this method, based on cooperative game theory, calculates Shapley values that distribute the model's prediction among the features while considering their individual contributions to the output. We apply this analysis to the Voronoi model trained on $F_a=100$ and $\phi_a=0.008$ .

Figure 5 shows the SHAP beeswarm plot, which indicates the six most important features and how the values of these features influence the model's predictions. The first six features are, in order of importance: the distance between the center of the polygon and the particle (d(c, p)), the perimeter of the polygon, the maximum distance between the particle and its neighbors $(\max(d(n,p)))$ , the maximum distance between the vertices of the polygon $(\max(d(v,v)))$ , the area of the polygon, and the maximum distance between the particle and the vertices of the polygon $(\max(d(v,p)))$ . The colors show that the model interprets low values of d(c, p) as a passive particle and high values of d(c, p) as an active particle. Considering that passive particles tend to accumulate in front of active particles while creating a void behind them, we anticipate the polygons around active particles to exhibit elongation. Consequently, due to this anisotropy, we expect the active particle to be situated at a different position compared to the center of the polygon, as we confirm in representative polygons shown at the top of fig. 5. The SHAP analysis clearly validates this expectation. Hence, rather than using the ML model solely as a black box, we infer that the model captures the correct physical picture and confirms the importance of spatial anisotropy induced by activity. Lastly, the SHAP analysis has also been performed in 3D, and the corresponding results can be found in the SM.

Fig. 5:

Fig. 5: The six most important features for the Voronoi model, represented as a SHAP beeswarm plot at $F_a=100$ and $\phi_a=0.008$ . The position of the dots is determined by the SHAP values of the features, and the color is used to display the value of the features. The top two polygons in the figure are a detail of a snapshot that represents the typical shapes observed in the passive (left) and active (right) cases. In the passive scenario, the center of the polygon aligns with the particle position, whereas in the active case they do not coincide.

Standard image

Conclusions

The focus of this work is to identify active (alive) entities in a densely disordered mixture of active and passive (dead) particles, using only a single picture of the system as input. We compare two static ML approaches: one based on the Voronoi tessellation and another employing a shell-based approach that utilizes descriptors previously employed in passive systems to predict dynamic particle propensities [911,13,14,16].

We find that a static approach based on a single snapshot is able to predict whether a particle is dead or alive when the activity is high and the fraction of active particles is low, in both 2D and 3D. However, this approach fails when the activity is low or the fraction of active particles is high, implying a different structural signature for activity and dynamic propensity in passive systems. Moreover, we observe that while the Voronoi and shell-based approaches yield similar accuracies, the shell-based approach is computationally more expensive.

Within the region of the $(F_a,\phi_a)\text{-plane}$ where the static approach is effective for particle classification, our model exhibits reasonably good performance when extrapolating to active forces and compositions outside its training range, demonstrating its robustness. This transferability can be particularly useful in experiments, where the active force or the precise number of active species may be difficult to quantify.

Additionally, we explain the decisions of our ML model in order to understand how it is able to identify the active entities from the Voronoi input. We find that the distance between a particle and the center of its corresponding polygon is the most significant feature for this classification task, serving as a simple and reliable measure of the distinct spatial anisotropy surrounding an active particle.

We hypothesize that the region of the $(F_a,\phi_a)\text{-plane}$ where a purely static approach fails depends on system parameters such as density and temperature. Indeed, when considering a lower density, we have found that our classifier can perform better for certain levels of activity (see the SM). However, while the boundary of the distinguishable region may thus be varied with external control parameters, we still expect a region where active and passive particles remain indistinguishable from a single snapshot. In this regime more information is inevitably needed, as confirmed by our pseudo-static approach that incorporates additional knowledge of structural fluctuations.

In summary, this work demonstrates that the shape of the polygon is sufficient to differentiate between active and passive particles in a specific region of the $(F_a,\phi_a)\text{-plane}$ . This work serves as a step to better understand the elusive structure-dynamics relation in densely disordered non-equilibrium systems. We believe that our approach can also be a valuable tool for investigating experimental systems such as biological cells, where identifying the most active entities visually can be challenging. It is thus our hope that the here presented approach can help to process large experimental datasets and contribute to the discovery of new connections between structural and dynamical properties in complex non-equilibrium materials.

Acknowledgments

This work has been financially supported by the Dutch Research Council (NWO) through a START-UP grant (VED, CL, and LMCJ), Physics Projectruimte grant (GJ and LMCJ), ENW-XL grant (CS and LMCJ), and Vidi grant (LMCJ).

Data availability statement: The data that support the findings of this study are available upon reasonable request from the authors.

Please wait… references are loading.
10.1209/0295-5075/acdf1b