Brought to you by:
Regular Paper

State-space modeling for dynamic response of graphene FET biosensors

, , , , , , , , and

Published 6 February 2020 © 2020 The Japan Society of Applied Physics
, , Citation Shota Ushiba et al 2020 Jpn. J. Appl. Phys. 59 SGGH04 DOI 10.7567/1347-4065/ab65ac

1347-4065/59/SG/SGGH04

Abstract

Graphene field effect transistor (G-FET) biosensors exhibit high sensitivity owing to their high electron/hole mobilities and unique 2D nature. However, a baseline drift is observed in their response in aqueous environment, making it difficult to analyze their response against target molecules. Here, we present a computational approach to build state-space models (SSMs) for the time-series data of a G-FET biosensor; the approach helps separate the response against target molecules from the baseline drift. The charge neutral point of the G-FET sensor was continuously measured while sensing target molecules. The obtained time-series data were modeled using the proposed SSMs. The model parameters were estimated through Markov chain Monte Carlo methods. The SSMs were evaluated using the widely-applicable Bayesian information criterion. The SSMs well fitted the time-series data of the G-FET biosensor, and the sensor response to target molecules was extracted from the baseline-drift data.

Export citation and abstract BibTeX RIS

1. Introduction

Since the introduction of ion-sensitive field effect transistors (ISFETs) in 1970,1) FET-based biosensors have attracted significant interest owing to their promising potential for a wide range of applications.2,3) In particular, graphene films offer an ideal sensing platform owing to their high electron/hole mobilities4) and 2D nature,5) and thus graphene field effect transistor (G-FET) biosensors have been used to detect ions,6) biomolecules,7) and bacteria.8) However, a baseline drift is observed in the response of FET-based sensors,9,10) particularly in the case of G-FET biosensors because of their high sensitivity. The baseline drift makes it difficult to accurately estimate the concentration of target molecules. Although various approaches have been proposed to overcome this issue,11,12) the mechanism of baseline drift remains unclear. The uncertainties in the measurement system, such as the temperature, ion concentration, and pH, may also affect the sensor signal. Therefore, it remains challenging to compensate for the signal using hardware and/or describe the signal using simple models.

We have previously proposed simple state-space models (SSMs) to describe the time-series data of a G-FET biosensor.13) Herein, we present more sophisticated SSMs and provide detailed discussions. State-space modeling is a framework established to understand stochastic and deterministic dynamical systems, referred to as states, which are observed through a stochastic process.14) SSMs are widely used in time-series analyzes, e.g. in Apollo and Polaris aerospace programs15) and for studying animal movements.14) Least-squares methods16) are unsuitable for the state and parameter estimation of SSMs, because they systematically under- or overestimate unknown model parameters, as there is a serial correlation between successive observations.17) The Kalman filter (KF) algorithm is commonly used for the parameter estimation of SSMs.18) However, in the KF algorithm, the state and parameter are represented by a Gaussian probability distribution function. This makes it unsuitable for nonlinear dynamical systems. Therefore, we used the Markov chain Monte Carlo (MCMC) algorithms for state and parameter estimation.19) The MCMC algorithms provide a general methodology that can be applied to nonlinear and non-Gaussian state models, because they allow sampling from an arbitrary posterior distribution. After estimating the state and parameters through the MCMC methods, we divided the time-series data of the G-FET biosensor into sensor response to target molecules and baseline drift through the proposed SSMs. In addition, we built competitive SSMs to determine the one that best describes the obtained dynamic response of the G-FET biosensor.

2. Experimental background

Figure 1(a) shows a schematic of the measurement system including a G-FET sensor. A graphene film was first grown on a Cu foil by chemical vapor deposition and then transferred onto a Si/SiO2 substrate. A source/drain electrode was formed with 10 nm Ti and 90 nm Au. Finally, a graphene channel was formed by O2 plasma etching. The obtained G-FET was immersed in 15 mM D-PBS (Nacalai Tesque, Inc.). A bias voltage (VDS) of 0.1 V was applied, and a top-gate voltage (VGS) was applied through the solution using a Ag/AgCl electrode. While sweeping VGS, the drain current (IDS) was measured using a semiconductor parameter analyzer (Keysight Technologies, B1500A). Figure 1(b) shows a typical IDSVGS plot. From the IDSVGS plot, the charge neutral point (CNP), which is VGS at minimum IDS in the plot, was calculated. Figure 1(c) shows the time-series data of the CNP. While monitoring the CNP, bovine serum albumin (BSA) molecules (Sigma-Aldrich), as a sensing target, were intermittently exposed to the sensor, with varying concentrations of 50 pM, 500 pM, 5 nM, 50 nM, 500 nM, 5 μM, and 10 μM, in D-PBS solution [Fig. 1(d)]. The BSA molecules were physisorbed on graphene. In the dynamic response of the sensor, shown in Fig. 1(c), the CNP varies with the BSA concentration. In addition, there is an apparent baseline drift, as the CNP sequentially varies even when there is no BSA in the solution.

Fig. 1.

Fig. 1. (Color online) (a) Schematic of a G-FET sensor. (b) A typical IDSVGS plot. (c) Time-series data of the CNP against BSA. The arrows indicate the time at which BSA is introduced to the sensor. (d) Time-series data of the BSA concentration during the measurement. Note that the concentration during the time interval 0–65 min is not plotted in the semi-logarithm graph because the concentration was 0 nM during this time.

Standard image High-resolution image

3. Results and discussion

3.1. Results

To model the obtained time-series data in Fig. 1(c), we propose an SSM, shown in Fig. 2(a), described by Eqs. (M 1.1)–(M 1.6). We define the model as Model 1

Equation (M 1.1)

Equation (M 1.2)

Equation (M 1.3)

Equation (M 1.4)

Equation (M 1.5)

Equation (M 1.6)

Fig. 2.

Fig. 2. (Color online) (a) Graphical model of the developed SSM. (b) Trace plots of four chains for KD. (c) The posterior distribution of KD calculated from (b). The colors represent each chain.

Standard image High-resolution image

According to Eq. (M 1.1), which is called the observation equation, the observed CNP (yt) is a summation of the baseline (xt), regression component regarding BSA (qt), and observation noise [εt ∼ N (0, σε)]. The subscript t (= 1, 2, 3, ..., 104) corresponds to the data index. In Eq. (M 1.2), which is called the state equation, the unobserved baseline (xt) is assumed to follow a quadratic trend with the system noise [wt ∼ N (0, σw)]. The regression component (qt) is described based on two concepts. The first concept is that the signal reaches a plateau defined by the Langmuir model in Eq. (M 1.5), where KD, ci, and a are the dissociation constant, BSA concentration, and a coefficient, respectively. The subscript i (=0, 1, 2, ..., 7) is the sample index, which increases when BSA is introduced to the sensor. The assumption, represented by Eq. (M 1.5), agrees with experimental data,6,20) confirming that the sensor response to BSA follows the Langmuir equation. The shift direction of the CNP against the adsorption of BSA is debatable. Some studies showed that the CNP shifts in the positive direction,6) whereas others showed that it shifts in the negative direction.21,22) This is because the detection mechanism involves electrostatic gating effect,23) charge doping,24) and charged-impurity scattering.25) The difference is presumably due to the surface condition of graphene, which is influenced by the manufacturing process. According to our data, shown in Fig. 1(c), the CNP seems to shift in the negative direction. The second concept is that the signal varies following an exponential decay after the BSA is introduced [Eq. (M 1.4)]. According to previous studies, the response time of FET-based biosensors is in the order of seconds to minutes, even though the target concentration ranges from the order of fM to the order of μM.26,27) This implies that the decay time τi is not directly proportional to the concentration (ci). Therefore, in Eq. (M 1.6), we assume that the mean of τi is proportional to the logarithm of ci, not directly proportional to ci.

We estimate the states and parameters of Model 1 to fit the dynamic response shown in Fig. 1(c). As the regression component (qt) in our SSMs is highly nonlinear, the KF algorithms cannot be applied to the subsequent parameter estimation. Therefore, we used the MCMC methods19) implemented in Python and Stan's probabilistic programming languages28) to estimate the state and parameters of Eqs. (M 1.1)–(M 1.6). In the computation, four chains were run with 10 000 iterations for each chain. The first 5000 iterations were discarded as burn-in, and the last 5000 iterations in each chain were used for subsequent inferences. Figure 2(b) shows the trace plots for KD. The posterior distribution is well sampled, as there is no serial correlation, and the chains explored the sample space many times. The other model parameters have similar trace plots (some are shown in Fig. S1, available online at stacks.iop.org/JJAP/59/SGGH04/mmedia). The Rhat value, which is an indicator of model convergence,29) was lower than 1.1 for all the parameters (Table I and Table SI). This result indicates that the model reaches convergence. The histogram of the trace plots, shown in Fig. 2(b), corresponds to the posterior distribution of the dissociation constant KD [Fig. 2(c)]. The posterior mean and the 95% credible interval for KD were calculated to be 151 nM and (85 nM, 257 nM), respectively. These values are comparable to those reported previously.6) Table I lists some representative parameters (see other parameters in Table SI). Using the proposed SSM and estimated parameters, we divided the time-series data, shown in Fig. 1(c), into the baseline [xt, Fig. 3(a)] and a signal related to BSA [qt, Fig. 3(b)]. Figure 3(c) shows the sensor response against BSA as a function of the concentration. It should be noted that the effect of baseline drift is excluded in Fig. 3(c), and thus it is indicated that the dynamic response in Fig. 1(c), in which the BSA concentration ranges from 0 to 5 nM, was mainly attributed to the baseline drift.

Table I.  Summary of representative parameters for the SSM estimated using the MCMC method. The posterior mean, the 95% credible interval, and Rhat are shown for KD, a, σε, and τ6. τ6 corresponds to the decay time when the BSA concentration increases from 500 nM to 5 μM.

  Mean 2.5% 97.5% Rhat
KD [nM] 151 85 257 1.0
a [mV] −81 −120 −53 1.0
σε [mV] 2.2 1.9 2.6 1.0
τ6 [min] 6.35 1.46 13.83 1.0
Fig. 3.

Fig. 3. (Color online) (a) Measurement data of CNP (red) and baseline (xt) estimated using the SSM (black). (b) The dynamic response to BSA. The shaded area indicates the 95% Bayesian confidence interval. (c) The G-FET sensor response against BSA as a function of the concentration (ci) calculated from Model 1.

Standard image High-resolution image

3.2. Discussion

We compared Model 1 with three competing SSMs, namely Model 2, Model 3, and Model 4, with different hypotheses. Table II gives a summary of the difference between the four models. The equations are given in the supplementary data. The state and parameters of Models 2, 3 and 4 were estimated in the same manner as that done for Model 1. After MCMC sampling, all the models reached convergence. Figure S2 shows the fitting results obtained using Models 2, 3, and 4. To find the model that best fits the obtained data, the widely-applicable Bayesian information criterion (WBIC),30) which is an information criterion, was calculated for the four models (Table II). The WBIC for Model 1 is the lowest among the SSMs, indicating that Model 1 best describes the obtained G-FET dynamic response. This indicates that the baseline drift is well fitted with a quadratic trend, rather than a fixed or a linear trend. In addition, the WBIC calculation result confirms that the response time correlates with the target concentration ci, consistent with a previous study.27)

Table II.  Comparison between four SSMs and WBIC values.

  Model 1 Model 2 Model 3 Model 4
Baseline drift Quadratic trend Fixed trend Linear trend Quadratic trend
Response time Proportional to logarithm of ci Proportional to logarithm of ci Proportional to logarithm of ci No correlation with ci
WBIC −204 −129 −175 −194

4. Conclusions

We developed SSMs to describe the dynamic response of a G-FET biosensor against BSA. The parameters of the models were estimated using the MCMC methods. The SSMs effectively fitted the dynamic response. Using the estimated parameters, we divided the time-series data into BSA response and baseline drift. Although the true baseline or states cannot be observed directly, state-space modeling helps estimate the probabilities of the states and parameters by explicitly including the observation equation. Furthermore, it allows incorporating any established theories and/or empirical insights into the models, making it flexible. Although the true state equation is unknown, state-space modeling can provide an insight into the dynamic data by generating a series of competing models and comparing them using model selection methods based on criteria such as the WBIC. The proposed method can be applied not only to G-FET biosensors but also to FET-based sensors for an accurate analysis of the sensor response. Therefore, this work paves the way for further applications based on FET-based sensors.

Acknowledgments

This work was supported by JST CREST Grant Number JPMJCR15F4, Japan.

Please wait… references are loading.
10.7567/1347-4065/ab65ac