Architecture of zero-latency ultrafast amplitude detector for high-speed atomic force microscopy

The advent of high-speed atomic force microscopy has revolutionized the ﬁeld of biological science. However, there remains a myriad of phe-nomena that cannot be observed due to their near-instantaneous dynamics. Improving the temporal resolution is the ﬁrst step toward observing these biofunctional dynamics. So far, the narrow bandwidth of the amplitude detector circuit has been a major obstacle to this objective. Here, we invent a differential-based ultrafast amplitude detection method with zero intrinsic latency based on the basic trigonometric theorem. We demonstrate that our method manifests superior performance over the conventional methods in respect of the bandwidth and noninvasiveness. This will be a breakthrough in the development of faster high-speed atomic force microscopy.

Submolecular-scale biofunctional dynamic processes play central roles in life science, and hence, unraveling the mechanisms underlying these dynamic processes is essential for the advancement of such fields. High-speed atomic force microscopy (HS-AFM) is an emerging paradigm-shifting technique that can perform real-time observation of living biomolecular dynamics at submolecular spatial and sub-second temporal resolutions. [1][2][3][4][5][6][7][8][9][10] However, there are numerous biofunctional processes that cannot be imaged by the current HS-AFM system due to its inadequate temporal resolution, e.g., imaging the motion of ion transporters and lipid rafts embedded in a suspended lipid bilayer or cell membrane requires at least 1-10 ms temporal resolution. 11 Therefore, further broadening of the bandwidth of HS-AFM to fill the gap between the current and required temporal resolutions is an important challenge.
The critical scan rate-limiting devices in the current HS-AFM system are the Z piezo scanner, cantilever, and amplitude detector. The latencies of the Z piezo scanner and cantilever are dictated by their resonance frequencies (f z and f cl , respectively), which can be improved by miniaturizing their dimensions. A complementary approach is to double the scan rate by simply skipping the backward scan. 12 Most importantly, the HS-AFM setup adopts an amplitude-modulation scheme, 13 where the cantilever oscillation amplitude is exploited for the feedback signal, and thus, the amplitude detector constitutes the heart of the technology and is one of the most important rate-limiting devices. By improving its bandwidth, we aim to improve the scan rate and temporal resolution of HS-AFM, broadening its applicability to short timescale biofunctional processes.
In this study, we conceived a differential-based (DB) detection algorithm by modifying a recently proposed phase-shift-based (PSB) algorithm. 14 Our method is simple and eliminates intrinsic latency; hence, it is much faster than conventional methods. We believe that this invention will be an indispensable component in the faster HS-AFM.
The standard dynamic-mode AFM setups adopt a common RMS-DC converter, but its bandwidth is inadequate for HS-AFM applications because sufficient ripple suppression requires the time integral over at least several cantilever oscillation periods. Therefore, three types of amplitude detector have hitherto been proposed, Fourier-analysis-based (FAB), 15 sample-and-hold-based (SHB), 1 and PSB methods. 14 Figure 1 schematically summarizes the theoretical latencies of these methods. FAB is a type of digital 2-phase lock-in amplifier, scoring the Fourier coefficients at f cl in the cantilever oscillation signal every cycle via high-speed digital computing. This means that, when the cantilever has f cl of 500 kHz (period corresponds to 2 ls), the amplitude value is generated with a 2 ls delay and the phase delay becomes 360 . Despite this significant latency, it has been widely utilized in previous HS-AFM studies due to its excellent operability. SHB scores the difference between the oscillation peak top and bottom voltages every half oscillation cycle via two SH circuits. Therefore, its throughput is two times higher than FAB (only 1 ls latency and 180 delay); however, it is rarely used in practical experiments because a timing adjustment is required every time the cantilever resonance phase changes. Finally, Miyagi et al. recently proposed the PSB method; 14 our simulation revealed its latency to be only 0.17 and 0.25 ls (30 and 45 delay) for the analog and digital circuits, respectively (supplementary material, Note 1 and Figs. S2 and S5). In this study, we have conceived a DB method with no intrinsic latency, which results in the fastest performance. Below we compare and contrast the DB and PSB methods since they are analogous, demonstrating the utility of the DB method. Figure 2 shows the schematics of the PSB and DB methods, which are based on the basic trigonometric theorem as follows: where A and x c are oscillation amplitude and radial carrier (driving) frequency, respectively. Since f c (¼x c /2p) is not necessarily f cl , the cantilever drifts do not disturb the measurements. The input sine wave first undergoes a high-pass filter (HPF) to remove the low-frequency cantilever deflection arising from the tip-sample interaction force. The HPF cutoff frequency (f HPF ) should be higher than f z , above which the force is not modulated (Supplementary Note 1). Then, it is divided into two lines, one of which is converted to a cosine wave via a 90 phase shifter (PS) according to the following relationship: Then, both the signals are squared and summed up again to produce the squared amplitude (A 2 ), followed by a square root conversion to generate the A signal. Theoretical simulations revealed that the phase delay of the PSB analog circuit is predicted to be 30 .
The DB method can be achieved by simply replacing the PS with a differentiator circuit to produce the cosine wave as follows: where there is no theoretical phase delay and only the circuit processing limits the conversion rate. As discussed later, we revealed that the square root computation is the bottleneck in the analog circuit; to overcome this, the A 2 output can also be exploited for the feedback signal with a remarkably faster throughput than the A signal. Moreover, a simulation revealed that the A 2 output shows superior frequency characteristics over the A output even excluding the analog latency ( Fig. S2). Hereafter, we abbreviate the A 2 and A output of DB (PSB) as DB-A 2 (PSB-A 2 ) and DB-A (PSB-A), respectively. Importantly, for the PSB operation, the center frequency must be tuned to f cl for each experiment. Similarly, for the DB operation, since the differential value is proportional to the cantilever frequency, the differentiator gain must be optimized for each experiment. We built two types of DB and PSB test circuits; an analog circuit employing commercial operational amplifiers (Fig. S1) and a digital circuit employing a field-programmable gate array (FPGA) (National

ARTICLE
scitation.org/journal/apl Instruments: PXIe-7966R). To demonstrate the superiority of the DB method, we first measured the frequency characteristics using amplitude-modulated signals with f c of 300-1500 kHz [ Fig. 3(a)]. The sweeping of the modulation signal (modulation depth was 0.5) and the response detection were carried out using a frequency response analyzer (NF: FRA5097). In this experiment, we did not use an HPF to characterize the intrinsic detector performance, but an HPF with f HPF of 159 kHz was used in all other subsequent experiments. From acquired spectra (Figs. S6-S9), we revealed that FAB and SHB produce dips at the harmonic frequencies of f c while PSB creates a single broad dip at the second harmonics of f c , and thus, these techniques do not produce flat gain and phase curves. These characteristics were reproduced by the simulation, confirming the intrinsic latency that exists in these techniques. In contrast, the gain curve of DB-A 2 showed a nearly flat characteristic, especially at the frequency below f c [ Fig. 3(b)]. Although the phase curve decreased with increasing modulation frequency, this is not intrinsic latency but rather the analog circuit latency arising from the differentiator because this curve was well fitted by the dead time theorem, wherein the phase delay (Du Dead ) is expressed by Du Dead ðxÞ ¼ ÀxDs; (4) where Ds is the dead time.
We defined the detector bandwidth (f D ) as the frequency that results in a 45 phase delay [orange broken arrow in Fig. 3(b)] beyond which the feedback loop cannot be appropriately controlled. In the summarized f D graph [ Fig. 3(c)], the FAB and SHB results showed an almost linear increase in f D as f c increases. Notably, the SHB curve overlapped well with the theoretical 180 line while FAB was slightly lower than the theoretical 360 line due to the digitalization latency. In contrast, DB-A and PSB-A (the latter being equivalent to the previously published setup 14 ) exhibited almost a saturated flat appearance, indicating that the throughput of the whole device was bottlenecked by the square root circuit whose latency is much larger than the intrinsic ones.
DB-A 2 and PSB-A 2 were 5 times faster than DB-A, and PSB-A, respectively, while DB-A 2 was almost twice as fast as PSB-A 2 , 10-20  Fig. 3(c), while those of DB-A 2 and PSB-A 2 (50 and 90 ) were taken from the experimental data obtained with HPF (data are not shown).

ARTICLE
scitation.org/journal/apl times faster than FAB and 3-8 times faster than SHB. The PSB-A 2 curve showed a linear dependence on f c due to the intrinsic latency, but gradually saturated in the high frequency range because of the latency in the square processing. In contrast, DB-A 2 showed almost a saturated appearance over the entire f c range. Although zero latency could not be observed experimentally due to the analog circuit latency, surprisingly, below f c of 1 MHz, f D went beyond f c (corresponding to the 45 line) and even the theoretical PSB bandwidth of the 30 line. These observations clearly evidence the zero latency of the DB method. This saturated appearance is more clearly visible in the DB-A 2 digital circuit (see square marked broken line), whereby f D was 330 kHz over the entire f c range, because the cutoff frequency of the differentiator circuit slightly correlates with f c while the dead time of digital processing is constant. In our FPGA system, the main latency is from the transceiver (300 ns), an interface that converts the signals from analog to digital. Although f D was not as high as that achieved by the analog circuit [ Fig. 3(c), Figs. S10-S12, and supplementary material, Note 2], it exceeded those of the conventional methods. In the digitalization, DB has an additional advantage over PSB because DB retains the zero-latency performance while the phase delay of PSB slightly deteriorates from 30 to 45 (supplementary material, Note 1). Note that we did not employ the fastest models of operational amplifiers and transceiver because the present work is intended as a demonstration, and hence, the performance could be further improved using faster models. (We expect f D of faster than 2 and 1 MHz for the analog and digital circuits, respectively.) Importantly, in practical experiments, the signal-to-noise ratio (i.e., noise performance) of DB and that of the other methods are comparable. This is because the cantilever transfer function suppresses the noise in the high frequency range, averting significant deterioration in the noise performance of DB even through the derivative-and squarecomputations. A detailed analysis of the noise performance of all methods will be published elsewhere.
In practical measurements, a bandpass-like cantilever resonance characteristic suppresses even the AM signals, significantly deteriorating the feedback bandwidth (f FB ). To quantify the superiority of DB in such conditions, we subsequently evaluated f FB by employing an AFM mock circuit [ Fig. 3(d)]. 16 We set the resonance frequencies (f cl and f z ) to 1200 and 150 kHz and the quality factors of the cantilever and Z scanner (Q cl and Q z ) to 1.5 and 0.7, respectively. Upon optimizing the PID feedback parameters under certain criteria (Fig. S13), the frequency response characteristics of the whole feedback circuit were measured by sweeping the frequency of a sinusoidal topography wave [Figs. 3(e) and S14-S17].
We defined f FB as the 45 -crossover frequency and plotted them in Fig. 3(f) as a function of the setpoint/free amplitude ratio. The cantilever oscillation amplitude was set to five times larger than the topographic wave amplitude (also see Fig. S18 for other oscillation amplitudes). The bandwidth gradually increased as the setpoint ratio was reduced for all devices. DB-A 2 showed 20%-30% better performance than PSB-A 2 and SHB, and it was almost three times faster than FAB. DB-A 2 showed several times greater f D than that of SHB, but this difference became slight in f FB due to the existence of Q cl and Q z .
To explain this difference, we also make a theoretical consideration below. The f FB can be estimated by an explicit analytical equation, formulated based on a dead time theorem approximation in the open-loop system, 16 where Du amp (¼p/8 Â f cl /f D ) is the phase delay in the amplitude detector and d is the latency in the other components. Although the calculated values are on the same order as the experimental data, it could not reproduce the significantly large difference in f FB with respect to Du amp . This discrepancy may arise since the real transfer functions of the cantilever and Z piezo scanner are not expressed by the dead time theorem. To verify this hypothesis, we also simulated f FB based on the closed-loop feedback system (supplementary material, Note 3 and Fig.  S19) and obtained a much more quantitative agreement between the overall experimental and theoretical results with respect to the individual amplitude detectors [ Fig. 3(f)]. Note that spurious peaks caused by the acoustic excitation increase the apparent Q cl in liquid-AFM, 17 but it ought not to correlate with f FB because the tip-sample interaction force is directly exerted on the tip. Finally, to experimentally validate the performance of the DB-A 2 method, we performed HS-AFM imaging of biopolymeric actin filaments with FAB and DB-A 2 because they are easily depolymerized by the tip perturbation and well-suited for the evaluation of the noninvasiveness of the feedback control. We used a lab-built AFM setup with a BL-AC7DS-KU4 (Olympus) cantilever and a standard Z scanner whose resonance frequencies were 1440 and 190 kHz, respectively. The scan range and frame rate were 200 Â 200 nm 2 (100 Â 100 pixels 2 ) and 50 ms/frame, respectively. This corresponds to a scanning velocity of 800 lm/s, four times faster than a typical velocity of 200 lm/s in the standard setup. Although the sampling frequency (400 kHz) is oversampling, the resulting Nyquist frequency is 200 kHz, which is sufficiently above f FB [$80 kHz for DB-A 2 , see Fig. 3(f)] to avoid aliasing noise and maximize the effect of subsequent image processing.
In Fig. 4, both the first images exhibited a clear 36-nm-pitch helical structure (highlighted by the cyan arrows). However, in the FAB result, after 1 s elapsed, the filament began to collapse due to the insufficient feedback bandwidth. In contrast, using DB-A 2 , the molecule was stably observed and remained intact over a period of 30 s owing to the significantly faster feedback operation than FAB. Importantly, the noise performance of DB-A 2 did not produce any practical problems in AFM imaging. Furthermore, by performing force curve measurements, we found that the square conversion makes the amplitude distance dependence steeper; this could further reduce the invasiveness of bioimaging as a side effect (supplementary material, Note 4 and Fig.  S20).
The amplitude detector is a key device even in many AFM techniques, e.g., the viscoelastic measurements in frequency-modulation AFM, 18,19 bimodal AFM, 19,20 Kelvin-prove force microscopy, 21 and three-dimensional hydration measurements. 18,22,23 Moreover, this method can also be utilized for improved phase shift detection with further diverse applications 24,25 (supplementary material, Note 5).
In summary, we developed a differential-based amplitude detection method with no intrinsic latency, which achieves significantly faster performance than the conventional methods. This technique also contributes to reducing invasiveness of AFM imaging, making it possible to observe fragile biomolecules. By overcoming the obstacle of the amplitude detector bandwidth, we have opened the road to increasing the temporal resolution of HS-AFM. When improving the speed performance of Z-scanner and cantilever, it will bring out the full potential.
See the supplementary material for the details of the experimental works and theoretical considerations.