Hostname: page-component-848d4c4894-jbqgn Total loading time: 0 Render date: 2024-06-26T02:29:28.621Z Has data issue: false hasContentIssue false

Seamless and intuitive control of a powered prosthetic leg using deep neural network for transfemoral amputees

Published online by Cambridge University Press:  28 September 2022

Minjae Kim
Affiliation:
Department of Physical Medicine and Rehabilitation, Northwestern University, Evanston, IL, USA Center for Bionic Medicine, Shirley Ryan AbilityLab, Chicago, IL, USA
Ann M. Simon
Affiliation:
Department of Physical Medicine and Rehabilitation, Northwestern University, Evanston, IL, USA Center for Bionic Medicine, Shirley Ryan AbilityLab, Chicago, IL, USA
Levi J. Hargrove*
Affiliation:
Department of Physical Medicine and Rehabilitation, Northwestern University, Evanston, IL, USA Center for Bionic Medicine, Shirley Ryan AbilityLab, Chicago, IL, USA
*
*Author for correspondence: Levi J. Hargrove, Email: l-hargrove@northwestern.edu

Abstract

Powered prosthetic legs are becoming a promising option for amputee patients. However, developing safe, robust, and intuitive control strategies for powered legs remains one of the greatest challenges. Although a variety of control strategies have been proposed, creating and fine-tuning the system parameters is time-intensive and complicated when more activities need to be restored. In this study, we developed a deep neural network (DNN) model that facilitates seamless and intuitive gait generation and transitions across five ambulation modes: level-ground walking, ascending/descending ramps, and ascending/descending stairs. The combination of latent and time sequence features generated the desired impedance parameters within the ambulation modes and allowed seamless transitions between ambulation modes. The model was applied to the open-source bionic leg and tested on unilateral transfemoral users. It achieved the overall coefficient of determination of 0.72 with the state machine-based impedance parameters in the offline testing session. In addition, users were able to perform in-laboratory ambulation modes with an overall success rate of 96% during the online testing session. The results indicate that the DNN model is a promising candidate for subject-independent and tuning-free prosthetic leg control for transfemoral amputees.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press

1. Introduction

One of the greatest challenges to achieving a meaningful impact on the lives of amputees is the development of safe, robust, and intuitive control strategies for robotic prostheses. Amputation of the lower limb, affecting an estimated 1 million Americans (Ziegler-Graham et al., Reference Ziegler-Graham, MacKenzie, Ephraim, Travison and Brookmeyer2008) causes profound disability – significantly limiting mobility, independence, and potentially the ability to pursue employment or leisure activities. A new category of powered joints, including powered knees and ankles, may allow for better outcomes, such as enabling users to walk across more challenging terrains (e.g., walking up and down hills, ramps, and stairs) more efficiently than using passive prostheses (Montgomery and Grabowski, Reference Montgomery and Grabowski2018; Kim et al., Reference Kim, Wensman, Colabianchi and Gates2021). However, to take advantage of the device capabilities, it is critical that they are controlled properly.

Modern self-contained powered-leg devices are generally constructed from strong, lightweight materials and use a battery-powered brushless-DC motor(s) (Au et al., Reference Au, Berniker and Herr2008; Lawson et al., Reference Lawson, Mitchell, Truex, Shultz, Ledoux and Goldfarb2014). These devices generally include multiple sensors within their construction to provide information with regard to the current state of the devices. To provide actuation, the current is commanded to the motor, and the current is then transformed into a torque produced at the shaft of a motor. The motor torque is further amplified by the transmission of the device to result in torque available at the appropriate joint. Commercially available brushless-DC motor controllers can be purchased to ensure that the commanded current is properly executed by the motor (using hall effect sensors and ensuring that the motor is commutated properly). Well-established control principles from the robotics field can be used to allow for the calculation of the motor current required to achieve a certain position, velocity, force, torque, or impedances (e.g., allowing the system to behave like a virtual spring damper system). These are all examples of low-level controllers. Choosing the desired value that needs to be controlled (e.g., position trajectory, impedance values, and so forth) remains a more challenging problem in controlling assistive devices.

Determining what commands to send to the low-level controller is usually achieved using a finite-state machine. It is relatively straightforward to create a small and self-contained finite-state machine to implement walking (Sup et al., Reference Sup, Bohara and Goldfarb2008). This is often termed an intrinsic or mid-level controller. For example, a load cell can easily be used to separate the stance and swing phases of gait, and joint velocity can be used to differentiate between swing flexion and swing extension. Next, each joint can be modeled as a set of virtual impedance parameters in every sub-phase. When all of the parameters are carefully tuned, usually by hand (Simon et al., Reference Simon, Ingraham, Fey, Finucane, Lipschutz, Young and Hargrove2014), but more recently with machine learning approaches (Huang et al., Reference Huang, Crouch, Liu, Sawicki and Wang2016) or deep learning techniques (Wen et al., Reference Wen, Si, Brandt, Gao and Huang2019), a smooth gait can be generated by the user. Alternative approaches can be used to generate mid-level controllers that are less reliant on finite-state machines. For example, prosthesis control strategies that mimic the underlying dynamics and control policies of the human neuromuscular system can be created (Thatte and Geyer, Reference Thatte and Geyer2015). Instead of approximating the behavior of individual human leg joints with impedance or similar functions, they model the human control system that generates this behavior (Song and Geyer, Reference Song and Geyer2015). Another alternative is to use a phase-based mid-level control architecture (Quintero et al., Reference Quintero, Villarreal, Lambert, Kapp and Gregg2018). Using this approach, the gait cycle is typically viewed as a periodic sequence of events over time, starting with heel contact during the initial stance and ending with knee extension during a late swing. Recent work has shown that a mechanical variable, specifically the phase angle of the hip joint, can accurately parameterize human leg patterns across perturbations and be used to control a powered leg during walking (Villarreal and Gregg, Reference Villarreal and Gregg2016; Villarreal et al., Reference Villarreal, Poonawala and Gregg2016).

Regardless of the style of mid-level control, the difficulty in creating a finite-state machine is compounded when more activities need to be restored. This is especially true when allowing seamless and automatic transitions between activities. This is usually achieved using a high-level controller. Examples of high-level controllers include key-fobs, threshold-based rules (Xu et al., Reference Xu, Feng, Mai and Wang2018), or machine learning algorithms (Liu et al., Reference Liu, Zhang and Huang2017) that automatically predict which low-level controller should be executed. In addition, a human-inspired variable-based prediction method (Quintero et al., Reference Quintero, Lambert, Villarreal and Gregg2017) was proposed.

The user interacts directly with their device (and indirectly with the environment) through the socket. The device responds to these interactions through the control system using the previously described approach. Rather than imposing specific mid- or high-level controllers, we believe that an alternative approach to solving this problem would be to use deep-learning tools to create a direct mapping between the sensors available on the device and the motor commands required to restore locomotion. Machine learning, including statistical pattern recognition classifiers and artificial neural networks, has been used to help control upper-limb prostheses for decades (Graupe et al., Reference Graupe, Salahi and Kohn1982; Hudgins et al., Reference Hudgins, Parker and Scott1993) and has recently been applied to infer user intention as part of the high-level controller for powered leg prostheses (Huang et al., Reference Huang, Zhang, Hargrove, Dou, Rogers and Englehart2011; Young et al., Reference Young, Simon and Hargrove2013). Ambulation modes were estimated by analyzing muscle activation patterns from ultrasound images (Rabe et al., Reference Rabe, Jahanandish, Boehm, Fey, Hoyt and Fey2020) or electromyography signals (Spanias et al., Reference Spanias, Simon, Finucane, Perreault and Hargrove2018).

Deep neural networks (DNNs) include several layers that can be connected in a variety of ways depending on the applications. DNNs enable end-to-end learning from inputs to outputs with little or no prior knowledge, and this simplifies the system. The input data space, which is usually of high dimension, can be encoded into a lower-dimensional hidden (or latent) space to uncover structure within the data. The latent space can then be classified to determine user intention or, more relevantly for this application, decoded into a set of continuous outputs. DNNs have recently been demonstrated in a wide range of applications, including several in the powered prosthetics and orthotics field. For robust, high-level control decisions, convolutional neural networks (CNNs) were used for vision-based environment sensing to predict ambulation modes prior to physical movement (Laschowski et al., Reference Laschowski, McNally, Wong and McPhee2022). A CNN-based method (Su et al., Reference Su, Wang, Liu, Sheng, Jiang and Xiang2019) using inertial measurement units (IMUs) was used to predict 13 ambulation modes, including transitional states. A stacked autoencoder was employed to estimate multi-degrees of freedom wrist torque (Yu et al., Reference Yu, Chen, Sheng and Zhu2020) based on EMG signals, and it outperformed the conventional regression methods for eight able-bodied subjects and one amputee subject. A multi-stream long short-term memory (LSTM) dueling model (Ren et al., Reference Ren, Chien, Chia, Fu and Lai2019) has been proposed to predict arm trajectory based on EMG signals and IMU data for control of an upper limb rehabilitation exoskeleton. For control of a robotic hip exoskeleton (Kang et al., Reference Kang, Molinaro, Duggal, Chen, Kunapuli and Young2021), CNNs and LSTM networks were used to estimate the human gait phase and generate desired hip torque for various ambulation modes: level-ground walking (LGW), ascending/descending stairs (SA/SD), and ascending/descending a ramp (RA/RD). Although this approach showed accurate performance of about 5% phase estimation error between actual and normative trajectories for 10 subjects, the model was trained on a user-specific basis (i.e., all 10 individual models for 10 subjects). Our group has used DNNs with data augmentation to learn robust representations of movement intention for powered leg prostheses (Hu et al., Reference Hu, Simon and Hargrove2019) and to improve classification performance in the presence of noise for EMG-based control of upper-limb prosthetic limbs (Teh and Hargrove, Reference Teh and Hargrove2021).

In this article, we propose a DNN-based unified model to control prosthetic legs for transfemoral amputees. We demonstrated that the DNN model (i) extracts latent features to distinguish ambulation modes and time sequence features to recognize the gait phase; (ii) facilitates control for five ambulation modes with seamless transition between modes; (iii) facilitates subject-independent and tuning-free control.

2. Methods

2.1. Powered prosthetic leg

Data collection and online evaluation of the DNNs were performed using a self-contained open-source bionic leg (OSL) (Azocar et al., Reference Azocar, Mooney, Duval, Simon, Hargrove and Rouse2020). The OSL includes a powered knee and ankle joint (actuation along the sagittal plane only) and has a mass of $ \sim $ 4,000 g. Actuation at each joint is provided using exterior rotor brushless DC motors with a closed loop controller (Dephy Inc., Maynard, MA) powered by a 36 V lithium-polymer battery back. The knee uses a 3-stage belt-drive transmission, and the ankle uses a 2-stage belt-drive transmission in series with a single-stage kinematic linkage. The OSL also includes several sensors which can be used to monitor the state of the device. For this experiment, we collected data from 18 mechanical sensors (Table 1) embedded in the OSL: data of the six-axis force/moment, knee/ankle joint angles, and velocities, acceleration and angular velocity, and thigh/shank angles.

Table 1. Configuration of mechanical sensors

Abbreviation: IMU, inertial measurement unit.

We implemented an impedance controller such that the motors generated torque for the knee and ankle based on the following equation:

(1) $$ {\tau}_i\hskip0.35em =\hskip0.35em -{k}_i\left({\theta}_i-{\theta}_i^{eq}\right)-b{\dot{\theta}}_i, $$

where $ i $ represents the knee or ankle joint, $ \tau $ represents the joint torque, $ \theta $ and $ \dot{\theta} $ represent the joint angle and velocity, respectively, and $ k $ , $ b $ , and $ {\theta}^{eq} $ denote the stiffness, damping coefficient, and equilibrium angle, respectively. The knee joint measured zero at full extension, and knee flexion was measured as a positive value. Sign convention for ankle dorsiflexion was positive, and ankle plantar flexion was negative. In the case of the thigh, extension was positive, and flexion was negative. Additional details with respect to the hardware, available sensors, and impedance controller can be found in Azocar et al. (Reference Azocar, Mooney, Duval, Simon, Hargrove and Rouse2020).

2.2. State machine

A state machine-based impedance controller (Simon et al., Reference Simon, Ingraham, Fey, Finucane, Lipschutz, Young and Hargrove2014) was used to collect training data. The state machine was configured to include five ambulation modes: LGW, SA/SD, and RA/RD. Generally, each ambulation mode was subdivided into four states corresponding to early stance, late stance, early swing, and late swing. The impedance parameters within each state were adjusted in each session based on a combination of user feedback and visual inspections of the kinematics conducted by the prosthetist and therapist. The latest range of impedance parameters by ambulation mode and phase for all users can be found in the Supplementary Material.

State transitions between individual states were configured using the mechanical sensors on the limb to allow for seamless transitions between activities. The state machine ran on an embedded controller and updated impedance parameters every 25 ms. Additional details on the finite-state machine and configuration process can be found in Simon et al. (Reference Simon, Ingraham, Fey, Finucane, Lipschutz, Young and Hargrove2014 and Reference Simon, Ingraham, Spanias, Young, Finucane, Halsne and Hargrove2016).

2.3. Deep neural network

The overview of the proposed DNN architecture is presented in Figure 1. Network modules include flattening, dense sigmoid, reshaping, concatenation, split, LSTM sigmoid, and time distributed modules (Figure 1e). An encoder module and decoder module were constructed from these modules. The encoder module maps the input data to lower-dimensional latent features, and the decoder module maps the encoded latent features back to the original input data. These modules were arranged to create a DNN comprised of two sub-networks: the latent and main networks; the networks have a total of 37,601 trainable parameters. The hyperparameters in the networks, such as units of the dense layers, were heuristically chosen.

  1. 1. Latent network: The purpose of the latent network is to extract latent features that discriminate the differences between the ambulation modes. The latent network (Figure 1a) extracts latent features (Figure 1b) from 50 ms of data recorded from all 18 sensors, using the encoder and decoder. The network was trained for 50 epochs (batch size: 256, RMSprop (Hinton et al., Reference Hinton, Srivastava and Swersky2012) with a learning rate of 0.001 and the loss function of the mean squared error (MSE)). At each epoch, the data were shuffled and split into training and validation data with a ratio of 7:3.

  2. 2. Main network: After the encoder parameters were identified, the main network (Figure 1c), including the LSTM network, was trained. The purpose of the LSTM network is to extract discriminant latent features that change sensitively during the gait cycle. The LSTM network only used the user weight, thigh angle, and shank angle among the 18 sensors. After that, the time sequence features (Figure 1d) and the latent features (Figure 1b) were combined to extract output impedance parameters. This main network was trained with the following learning configuration: 50 epochs, batch size: 256, RMSprop with a learning rate of 0.001 and the loss function of the MSE). At each epoch, the data were shuffled and split into training and validation data with a ratio of 7:3.

Figure 1. Architecture of the proposed DNN. The model takes a history of 18 mechanical sensor datapoints (see Table 1) as the inputs to obtain the single-time step impedance parameters. The latent network (a) extracts latent features (b). The latent features and history of three sensor data (weight, thigh angle, and shank angle) are used as the inputs of the main network (c) to extract time sequence features (d) and output impedance parameters. (e) The network modules represent the function of layers. The number in the brackets represents the number of units in the layers (e.g., Dense (N) represents the Dense layer with N units, and Load cell (Hist, N) represents N load cell sensor data points of the Hist-time step).

Here, all sensor data and impedance parameters were scaled to values between 0 to 1 for application to the DNN model. In the case of the sensor data, forces and moments were normalized by weight; joint angles and velocities were divided by 36 and 500, respectively; and accelerations and angular velocities were divided by 0.8 and 200, respectively. Then, additional scaling was applied as follows:

(2) $$ {v}_n\hskip0.35em =\hskip0.35em \left(v+5\right)/10, $$

where $ v $ denotes the raw sensor data, and $ {v}_n $ denotes the normalized data that is bounded from 0 to 1 centered at 0.5.

Additionally, the output impedance parameters from the DNN models were limited (Table 2) for safety.

Table 2. The upper and lower limits of the impedance parameters

The pairs of sensor data (i.e., network inputs) and impedance parameters (i.e., network outputs) calculated by the state machine were used to train the DNN model with TensorFlow (v. 2.4.1, Google) in Python on a laptop (ROG Strix G17, ASUS) under Windows 10 with NVIDIA GeForce RTX 2070 SUPER GPU. The trained DNN model was deployed to an Android 10 smartphone (Galaxy Note 9, Samsung) and ran on a CPU (Exynos 9810). The execution time of the proposed DNN on the smartphone was about 1.2 ms, and it is sufficient to control prosthetic legs in real time. A small microcontroller module (Pyboard D-Series) acted as a bidirectional translation module between the smartphone and the OSL. Its purpose was simply to accept commands from the phone over USB serial communication and transmit them to the OSL via CAN and vice versa. The smartphone parsed sensor data and generated impedance parameters every 5 ms, but the parameters could be transmitted to the leg only every 25 ms due to limitations in the translation board. The proposed system diagram is shown in Figure 2.

Figure 2. System configuration. The Pyboard was connected with the OSL and the Android smartphone, respectively, by CAN and USB. The Pyboard acted as a bidirectional translation module between the smartphone and the OSL.

2.4. Experimental protocol

Six individuals (TF1–6) with unilateral transfemoral amputation participated in this study (Table 3). All individuals provided written informed consent to a protocol approved by the Northwestern University Institutional Review Board. TF1 and TF6, who had less experience (<4 hr) walking on the OSL, were considered novice users; and TF2–5, who had experience (>10 hr) walking on the OSL, were considered experienced users. All users were fit with the prosthesis by a certified prosthetist and instructed by a licensed therapist. The users performed in-laboratory ambulation modes, including LGW at self-selected speed, SA and SD on a six-step staircase, and RA and RD on a 10° inclined surface.

Table 3. Subject demographics

Abbreviation: OSL, open-source bionic leg.

The collected datasets consisted of three cases: training data, offline testing data, and online testing data. The training data and offline testing data were obtained with the state machine control; they were collected separately on different days (i.e., the offline testing data were not included in the training data). During these sessions, we changed the ambulation mode manually using a key fob to prevent incorrect transitions between different ambulation modes. The online testing data were obtained with the DNN model. During each session, we collected as much data as possible as experimental time allowed per subject. In total, 5,417, 1,127, and 744 steps on all ambulation modes were collected for training, offline testing, and online testing, respectively. The detailed number of steps per user is shown in Table 4.

Table 4. Collected datasets for training, offline testing, and online testing

Abbreviations: LGW, level-ground walking; RA/RD, ascending/descending a ramp; SA/SD, ascending/descending stairs.

A single unified DNN model was trained using all ambulation mode data from all training users. Model training took about 312 min. Subsequently, the DNN model was applied to the offline testing data to compare output parameters to those of the state machine.

In the online testing session, the single unified DNN model controlled the OSL for all online testing users. During online testing, at least five repetitions of each ambulation mode for each subject were performed. If the leg did not perform as expected, the user re-attempted the activity without any changes made to the DNN model. After several attempts, if the therapist judged that completion of the activity was not possible for the user, that activity was not attempted anymore.

2.5. Data analysis

The performance of the proposed DNN model was evaluated in two aspects: offline testing data analysis and online testing data analysis. For offline testing analysis, we computed the root mean square difference (RMSD) between impedance parameters: those from the state machine that controlled the leg and those estimated using the proposed DNN. To evaluate the differences in the impedance parameters between the state machine and DNN, the coefficient of determination ( $ {R}^2 $ ) was computed as follows:

(3) $$ {R}^2\hskip0.35em =\hskip0.35em 1-\frac{\sum {\left({p}_{\mathrm{SM}}^i-{p}_{\mathrm{DNN}}^i\right)}^2}{\sum {\left({p}_{\mathrm{SM}}^i-\overline{p_{\mathrm{SM}}}\right)}^2}, $$

where $ i $ represents the impedance, $ {p}_{\mathrm{SM}} $ and $ {p}_{\mathrm{DNN}} $ denote trajectory of individual impedance parameters (i.e., $ k $ , $ b $ , or $ {\theta}^{eq} $ ) obtained from the state machine and the DNN model, respectively. $ \overline{p_{\mathrm{SM}}} $ denotes the mean of the state machine-based impedance parameter.

For online testing analysis, the ratio between the number of successful steps and total steps was computed. A successful step was categorized as a step where a natural gait pattern, judged by the therapist who was supervising the subject, occurred.

In addition, latent and time sequence features of the proposed DNN for online testing were investigated to verify that the DNN can properly extract information regarding ambulation modes and gait phase from sensor data. To visualize the distribution of latent features, t-distributed stochastic neighbor embedding (t-SNE, Van der Maaten and Hinton, Reference Van der Maaten and Hinton2008) was employed for dimension reduction. To evaluate the correlation of time sequence features across ambulation modes, Pearson correlation coefficients between time sequence feature sets (i.e., $ {f}^i $ and $ {f}^k $ ) were obtained as follows:

(4) $$ r\left({f}^i,{f}^k\right)\hskip0.35em =\hskip0.35em \frac{\sum_m{\sum}_n\left({f}_{mn}^i-\overline{f^i}\right)\left({f}_{mn}^k-\overline{f^k}\right)}{\sqrt{\left({\sum}_m{\sum}_n{\left({f}_{mn}^i-\overline{f^i}\right)}^2\right)\left({\sum}_m{\sum}_n{\left({f}_{mn}^k-\overline{f^k}\right)}^2\right)}}, $$

where $ r $ denotes correlation coefficient, $ f\hskip0.35em \in \hskip0.35em {R}^{m\times n} $ denotes the time sequence feature set, $ m $ denotes the number of time sequence features (i.e., 12 in this study, see Figure 1d), $ n $ denotes the length of the feature (i.e., [0–100]% gait cycle), and $ \overline{f} $ denotes mean of the time sequence feature set.

To evaluate the statistical differences, two-sample t-tests were performed using the assumption that differences are from normal distributions with unknown and unequal variances.

3. Results

3.1. Offline testing

Figures 3 and 4 show the fitted offline impedance parameters and their RMSD, respectively. The median $ {R}^2 $ across all the impedance parameters for all ambulation modes between the DNN model and the state machine for the user who was used to train the model (i.e., TF2) and the users who were not used to train the DNN model (i.e., TF5 and TF6) were 0.77, 0.46, and 0.79, respectively.

Figure 3. Comparison of impedance parameters generated from the state machine (blue lines) and DNN (red lines) in offline tests. In general, the DNN made impedance parameters similar to the state machine for the user who was used to train the DNN model (i.e., TF2) and the new users (i.e., TF5 and TF6). The median $ {R}^2 $ across all impedance parameters for all ambulation modes were 0.77, 0.46, and 0.79 for TF2, TF5, and TF6, respectively; the median $ {R}^2 $ for all users was 0.72. All plots show 75th and 25th percentiles in lighter bands.

Figure 4. RMSD of all five ambulation modes in offline tests. The users who were not used for the training (i.e., TF5 and TF6) showed a similar error level as the user who was used for training (i.e., TF2). Bar plots show the 25th, 50th (median), and 75th percentiles. Asterisk indicates $ p $  < .05.

Although TF5 had the largest deviation, there was no difference in the RMSD between TF2, TF5, and TF6 at a 5% significance level for impedance parameters except the ankle equilibrium angle, as shown in Figure 4. Thus, the proposed DNN showed consistent performance characteristics for the new and trained users.

3.2. Online testing

During ambulation, the DNN generated the impedance parameters for all configured activities (Figure 5). These parameters allowed the subjects to ambulate successfully in most trials (Table 5), with an overall success rate of 96%. However, we found that some users had problems performing specific activities. TF2 had difficulty in performing SD. She could not bend the knee, and the leg remained too stiff to descend the stairs using a reciprocal gait pattern. TF3 had difficulty in performing SA. The leg remained in an LGW even though the user wished to transition into a SA activity. Qualitatively, the users felt differences between the state machine and proposed DNN approach, but they did not consistently choose one method over the other. Some notes from users follow: TF2 preferred the DNN because she said it generated more natural motions in LGW and RD; TF3 stated he felt less support (i.e., a looser knee joint) in RD when using DNN; and TF5 stated he felt more support (i.e., a stiffer knee joint) in SD and RD.

Figure 5. Gait trajectory and corresponding impedance parameters across five ambulation modes during online testing. The blue, red, and yellow lines represent TF2, TF3, and TF5, respectively. All plots show 75th and 25th percentiles in lighter bands.

Table 5. Successful motions during online testing with respect to total trials

Abbreviations: LGW, level-ground walking; RA/RD, ascending/descending a ramp; SA/SD, ascending/descending stairs.

3.3. Feature extraction

The proposed DNN architecture extracts latent and time sequence features; the combination of latent and time sequence features allows the system to distinguish user intention across five ambulation modes and to generate the desired impedance parameters. The latent features shown in this section are a result of performing t-SNE from data recorded during the online testing. LGW and RA have similar gait trajectories (Figure 6). By contrast, SA has the most distinctive trajectory among the modes. The visualized features have long and curved shapes, as shown in Figure 6b–f. Although the axes in t-SNE have no physical meaning, we can presume that the extracted features represent a continuous gait trajectory.

Figure 6. Visualization of latent features using t-SNE from the online test dataset. (a) Feature distributions for all ambulation modes. Features are clearly separated except for LGW and RA because their characteristics are similar. (b)–(f) present feature distributions of individual ambulation modes. The feature distributions have long and curved shapes because the features represent a continuous gait trajectory.

Figure 7 presents the time sequence features extracted from the online test. The time sequence features were sensitive and changed according to the gait phases regardless of the ambulation modes. For example, the fourth feature from the bottom (purple line) was activated at the end of the gait cycle for all ambulation modes, except for SA; the fourth feature from the top (red line) was more activated in the swing phase than in the stance phase. Therefore, time sequence features may contain phase information for each ambulation mode. Furthermore, the time sequence features had a high correlation (median of 0.82) for all users across all ambulation modes, as shown in Figure 8. These results indicate that the time sequence features are sensitive to the gait phases and are less sensitive to ambulation modes. In other words, the time sequence features contain information about the gait phase.

Figure 7. Median gait trajectory (upper figures) and corresponding time sequence feature distribution (lower figures) during online testing. The blue, yellow, and red lines in the upper figures represent the normalized weight, ankle angle, and knee angle, respectively. Each color in the lower figures represents the activation level of each time sequence feature. The time sequence features were normalized with respect to the maximum and minimum. The features were sensitively changed according to the gait phase.

Figure 8. Feature correlation with respect to ambulation modes and users. High correlation (minimum of 0.46 and median of 0.82) indicates that the time sequence features are sensitive to the gait phase, rather than ambulation mode. SD for TF2 was excluded because TF2 was unable to complete the mode.

In conclusion, the proposed DNN model generated the impedance parameters by identifying the ambulation mode and gait phase through the latent and main networks, respectively.

3.4. Gait comparison

TF3 and TF5 participated in both the offline and online tests. Their gait trajectories are presented in Figure 9. In general, they show similar gait trajectory shapes. While the shape was similar, the timing of the DNN was typically longer and more variable (Figure 10). We speculate that the DNN model trained on a variety of users has more gait variability than the user-specific state machine.

Figure 9. Gait comparison of offline (blue lines) and online data (red lines). All plots show 75th and 25th percentiles in lighter bands.

Figure 10. Comparison of gait duration between offline data (blue lines) and online data (red lines) for five ambulation modes. In general, the DNN results in longer gait duration than the state machine. Bar plots show the 25th, 50th (median), and 75th percentiles. Asterisk indicates $ p $  < .05.

4. Discussion

The DNN model extracts time sequence features and latent features representing information regarding the gait phase and intended ambulation modes to learn the inherent structure of the state machine, including impedance parameter values, within ambulation mode state-transitions and between ambulation mode state-transitions. Though the model was only trained from a relatively small number of subjects who walked a combined total of 5,417 steps, the method showed promising results, even when tested by a novel subject.

The DNN training data used in the study was based on data generated when subjects were ambulating with state machine control. The input data were sensor readings, and the output data were impedance parameters. This approach could be applied to different input/output mappings. For example, it would be possible to perhaps map the input sensors to motor currents, bypassing the need to use finite-state machine-based impedance control. In such a situation, it could be possible to incorporate different mid-level control approaches into the overall control scheme. For example, the gait-phase-based control style proposed by Gregg’s group (Quintero et al., Reference Quintero, Villarreal and Gregg2016) could be used to generate training data for walking, while an impedance-based controller could be used to generate training data for walking on stairs. When applying the DNN approach, the resulting controller could learn attributes from both the phase-based and impedance-based control approaches. This remains to be tested.

The impedance parameters included in the state machine should not be considered as ground truth or optimal. They were configured to allow safe and comfortable walking. The parameters predicted by the DNN are different. Still, they should, in most cases, not be considered better or worse, with the exception of the situations where the users could not perform the activity. This is reinforced by the qualitative feedback from the users who reported that they felt differences between the two methods, but most noted that it was difficult to say which method, if either, was preferred. However, as shown in Figure 10, there was a difference in terms of gait duration between the two methods. Detailed comparisons that consider variable factors, such as metabolic costs, may be needed to verify which is better.

In the future, a more detailed and quantitative analysis should be conducted to investigate the performance of the DNN in terms of the naturalness and intuitiveness of the gait. We expect that the performance can be improved with more training datasets since the current model was trained with a dataset combined from only four transfemoral amputee users. Including data from more users as well as users performing a wider variety of postures to initiate movement, including a greater variability within each ambulation mode, may allow the model to recognize a higher percentage of tasks. Our study had several limitations. The collected data were unbalanced, and no data sampling method was used to compensate for the imbalance. There are far more data available for LGW than the sum of the rest ambulation modes. Although this may be likely consistent with real-world use (i.e., ambulation in LGW occurs with much higher frequency than on stairs and ramps), there may be a risk that the model parameters will not properly generate motion for stairs and ramps. In other words, the model parameters can be trained by focusing only on LGW.

TF2 and TF3 had more steps than other users for training. A large amount of data obtained from a particular user can lead to model bias since the model parameters may be overfitted to specific physical characteristics. For instance, the users have different stride lengths. Ascending ramps of the same distance (i.e., a 14-foot ramp) took about 9, 6, and 4 steps for TF2, TF3, and TF5, respectively. This imbalance may have produced a biased model working for users who have similar physical characteristics to users for training. The low $ {R}^2 $ obtained for TF5 in offline testing (Section 3.1) may be induced by the model bias since the height and weight of TF5 were significantly different from those of other users, as shown in Table 3.

Prior to testing unilateral transfemoral users, we performed bypass testing with four non-disabled individuals for model validation. This study used 50 ms of sensor data history to generate the impedance parameters. Although we investigated models that use a longer history (e.g., 75, 100, or 150 ms), there was no difference in performance based on visual inspections of gait. Thus, we chose the 50 ms history to reduce computation cost. The latent network was crucial to generating impedance parameters for the ambulation modes; the DNN model tended to generate parameters only for LGW when the latent network was not used. We speculate that the unbalanced data caused this problem. Time sequence features in the main network were extracted from three of 18 sensors (i.e., weight, thigh angle, and shank angle) because we believe this data directly represents lower limb movement. In addition, when we included knee and ankle angles in the input of the main network, the model generated equilibrium angles very close to the current knee and ankle angles; the leg did not move during the online testing. We speculate that the model predicted the equilibrium angles using only the current joint angles (i.e., knee and ankle). Therefore, the model may have tried to maintain the current stationary joint angle instead of generating a trajectory to restore locomotion.

Ideally, we would have included more data in our analysis and testing. The COVID-19 pandemic resulted in many subjects canceling visits. In spite of this, we were still able to collect a sufficient amount of data to demonstrate the feasibility of the approach with online evaluation. Future research will require experiments under well-organized conditions: considering a wider variety of subjects (i.e., stride length, height, weight, and natural speed) and under various environments (i.e., stairs with different heights and ramps with different slopes).

5. Conclusion

In this study, we designed and tested DNN-based powered prosthetic leg control for unilateral transfemoral amputee users. The proposed DNN model was trained with four subjects and applied in offline and online tests with four subjects. The DNN successfully generated impedance parameters for the gait across various ambulation modes and enabled seamless transitions.

The proposed DNN consists of two sub-networks: the latent and main networks. Their combination can distinguish user intention for all five ambulation modes and gait phases. However, further investigation is needed because there were wrong step motions during SA; more specifically, the DNN generated LGW motions in some cases instead. In addition, although participant TF2 was used to train the DNN, she could not perform SD. We believe that testing with big data will result in more reliable gait generation.

In conclusion, the proposed DNN can facilitate its application to any patient because it does not require tuning; parameter fitting for state machines is time-consuming and inconvenient for prosthetists/therapists and patients. This training-free and intuitive control method that works across different ambulation modes will help develop technologies to improve the quality of life for lower limb amputees.

Supplementary Material

The supplementary material for this article can be found at https://doi.org/10.1017/wtc.2022.19.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Funding Statement

This work was supported by the National Institutes of Health 2R01HD079428–05.

Ethical Standards

The authors assert that all procedures contributing to this work were approved by the Northwestern University Institutional Review Board. All individuals provided written informed consent.

Competing Interests

The authors declare no competing interests exist.

Authorship Contributions

M.K. designed the study, performed data collection, analyzed data, and wrote the first draft. A.M.S. performed data collection, and revised the manuscript. L.J.H. revised the manuscript. All authors approved the final version.

References

Au, S, Berniker, M and Herr, H (2008) Powered ankle-foot prosthesis to assist level-ground and stair-descent gaits. Neural Networks 21(4), 654666.CrossRefGoogle ScholarPubMed
Azocar, AF, Mooney, LM, Duval, J-F, Simon, AM, Hargrove, LJ and Rouse, EJ (2020) Design and clinical implementation of an open-source bionic leg. Nature Biomedical Engineering 4(10), 941953.CrossRefGoogle ScholarPubMed
Graupe, D, Salahi, J and Kohn, KH (1982) Multifunctional prosthesis and orthosis control via microcomputer identification of temporal pattern differences in single-site myoelectric signals. Journal of Biomedical Engineering 4(1), 1722.Google ScholarPubMed
Hinton, G, Srivastava, N and Swersky, K (2012) Neural networks for machine learning lecture 6a overview of mini-batch gradient descent. Cited on 14(8), 2.Google Scholar
Hu, B, Simon, AM and Hargrove, L (2019) Deep generative models with data augmentation to learn robust representations of movement intention for powered leg prostheses. IEEE Transactions on Medical Robotics and Bionics 1(4), 267278.CrossRefGoogle Scholar
Huang, H, Crouch, DL, Liu, M, Sawicki, GS and Wang, D (2016) A cyber expert system for auto-tuning powered prosthesis impedance control parameters. Annals of Biomedical Engineering 44(5), 16131624.CrossRefGoogle ScholarPubMed
Huang, H, Zhang, F, Hargrove, LJ, Dou, Z, Rogers, DR and Englehart, KB (2011) Continuous locomotion-mode identification for prosthetic legs based on neuromuscular–mechanical fusion. IEEE Transactions on Biomedical Engineering 58(10), 28672875.CrossRefGoogle ScholarPubMed
Hudgins, B, Parker, P and Scott, RN (1993) A new strategy for multifunction myoelectric control. IEEE Transactions on Biomedical Engineering 40(1), 8294.CrossRefGoogle ScholarPubMed
Kang, I, Molinaro, DD, Duggal, S, Chen, Y, Kunapuli, P and Young, AJ (2021) Real-time gait phase estimation for robotic hip exoskeleton control during multimodal locomotion. IEEE Robotics and Automation Letters 6(2), 34913497.Google ScholarPubMed
Kim, J, Wensman, J, Colabianchi, N and Gates, DH (2021) The influence of powered prostheses on user perspectives, metabolics, and activity: A randomized crossover trial. Journal of Neuroengineering and Rehabilitation 18(1), 113.CrossRefGoogle ScholarPubMed
Laschowski, B, McNally, W, Wong, A and McPhee, J (2022) Environment classification for robotic leg prostheses and exoskeletons using deep convolutional neural networks. Frontiers in Neurorobotics 15, 730965.Google ScholarPubMed
Lawson, BE, Mitchell, J, Truex, D, Shultz, A, Ledoux, E and Goldfarb, M (2014) A robotic leg prosthesis: Design, control, and implementation. IEEE Robotics & Automation Magazine 21(4), 7081.CrossRefGoogle Scholar
Liu, M, Zhang, F and Huang, H (2017) An adaptive classification strategy for reliable locomotion mode recognition. Sensors 17(9), 2020.Google ScholarPubMed
Montgomery, JR and Grabowski, AM (2018) Use of a powered ankle–foot prosthesis reduces the metabolic cost of uphill walking and improves leg work symmetry in people with transtibial amputations. Journal of the Royal Society Interface 15(145), 20180442.Google ScholarPubMed
Quintero, D, Lambert, DJ, Villarreal, DJ and Gregg, RD (2017) Real-time continuous gait phase and speed estimation from a single sensor. In 2017 IEEE Conference on Control Technology and Applications (CCTA). Hawaii: IEEE, pp. 847852.Google Scholar
Quintero, D, Villarreal, DJ and Gregg, RD (2016) Preliminary experiments with a unified controller for a powered knee-ankle prosthetic leg across walking speeds. In 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, pp. 54275433.CrossRefGoogle Scholar
Quintero, D, Villarreal, DJ, Lambert, DJ, Kapp, S and Gregg, RD (2018) Continuous-phase control of a powered knee–ankle prosthesis: Amputee experiments across speeds and inclines. IEEE Transactions on Robotics 34(3), 686701.Google ScholarPubMed
Rabe, KG, Jahanandish, MH, Boehm, JR, Fey, AM, Hoyt, K and Fey, NP (2020) Ultrasound sensing can improve continuous classification of discrete ambulation modes compared to surface electromyography. IEEE Transactions on Biomedical Engineering 68(4), 13791388.Google Scholar
Ren, J-L, Chien, Y-H, Chia, E-Y, Fu, L-C and Lai, J-S (2019) Deep learning based motion prediction for exoskeleton robot control in upper limb rehabilitation. In 2019 International Conference on Robotics and Automation (ICRA). Montreal, QC: IEEE, pp. 50765082.CrossRefGoogle Scholar
Simon, AM, Ingraham, KA, Fey, NP, Finucane, SB, Lipschutz, RD, Young, AJ and Hargrove, LJ (2014) Configuring a powered knee and ankle prosthesis for transfemoral amputees within five specific ambulation modes. PLoS One 9(6), e99387.CrossRefGoogle ScholarPubMed
Simon, AM, Ingraham, KA, Spanias, JA, Young, AJ, Finucane, SB, Halsne, EG and Hargrove, LJ (2016) Delaying ambulation mode transition decisions improves accuracy of a flexible control system for powered knee-ankle prosthesis. IEEE Transactions on Neural Systems and Rehabilitation Engineering 25(8), 11641171.CrossRefGoogle ScholarPubMed
Song, S and Geyer, H (2015) A neural circuitry that emphasizes spinal feedback generates diverse behaviours of human locomotion. The Journal of Physiology 593(16), 34933511.CrossRefGoogle ScholarPubMed
Spanias, J, Simon, A, Finucane, S, Perreault, E and Hargrove, L (2018) Online adaptive neural control of a robotic lower limb prosthesis. Journal of Neural Engineering 15(1), 016015.CrossRefGoogle ScholarPubMed
Su, B-Y, Wang, J, Liu, S-Q, Sheng, M, Jiang, J and Xiang, K (2019) A cnn-based method for intent recognition using inertial measurement units and intelligent lower limb prosthesis. IEEE Transactions on Neural Systems and Rehabilitation Engineering 27(5), 10321042.CrossRefGoogle ScholarPubMed
Sup, F, Bohara, A and Goldfarb, M (2008) Design and control of a powered transfemoral prosthesis. The International Journal of Robotics Research 27(2), 263273.CrossRefGoogle ScholarPubMed
Teh, Y and Hargrove, LJ (2021) Using latent representations of muscle activation patterns to mitigate myoelectric interface noise. In 2021 10th International IEEE/EMBS Conference on Neural Engineering (NER). IEEE, pp. 11481151.CrossRefGoogle Scholar
Thatte, N and Geyer, H (2015) Toward balance recovery with leg prostheses using neuromuscular model control. IEEE Transactions on Biomedical Engineering 63(5), 904913.CrossRefGoogle ScholarPubMed
Van der Maaten, L and Hinton, G (2008) Visualizing data using t-SNE. Journal of Machine Learning Research 9, 25792605.Google Scholar
Villarreal, DJ and Gregg, RD (2016) Unified phase variables of relative degree two for human locomotion. In 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Orlando, FL: IEEE, pp. 62626267.CrossRefGoogle Scholar
Villarreal, DJ, Poonawala, HA and Gregg, RD (2016) A robust parameterization of human gait patterns across phase-shifting perturbations. IEEE Transactions on Neural Systems and Rehabilitation Engineering 25(3), 265278.CrossRefGoogle ScholarPubMed
Wen, Y, Si, J, Brandt, A, Gao, X and Huang, HH (2019) Online reinforcement learning control for the personalization of a robotic knee prosthesis. IEEE Transactions on Cybernetics 50(6), 23462356.CrossRefGoogle ScholarPubMed
Xu, D, Feng, Y, Mai, J and Wang, Q (2018) Real-time on-board recognition of continuous locomotion modes for amputees with robotic transtibial prostheses. IEEE Transactions on Neural Systems and Rehabilitation Engineering 26(10), 20152025.CrossRefGoogle ScholarPubMed
Young, AJ, Simon, AM and Hargrove, LJ (2013) A training method for locomotion mode prediction using powered lower limb prostheses. IEEE Transactions on Neural Systems and Rehabilitation Engineering 22(3), 671677.Google ScholarPubMed
Yu, Y, Chen, C, Sheng, X and Zhu, X (2020) Multi-dof continuous estimation for wrist torques using stacked autoencoder. Biomedical Signal Processing and Control 57, 101733.CrossRefGoogle Scholar
Ziegler-Graham, K, MacKenzie, EJ, Ephraim, PL, Travison, TG and Brookmeyer, R (2008) Estimating the prevalence of limb loss in the United States: 2005 to 2050. Archives of Physical Medicine and Rehabilitation 89(3), 422429.Google ScholarPubMed
Figure 0

Table 1. Configuration of mechanical sensors

Figure 1

Figure 1. Architecture of the proposed DNN. The model takes a history of 18 mechanical sensor datapoints (see Table 1) as the inputs to obtain the single-time step impedance parameters. The latent network (a) extracts latent features (b). The latent features and history of three sensor data (weight, thigh angle, and shank angle) are used as the inputs of the main network (c) to extract time sequence features (d) and output impedance parameters. (e) The network modules represent the function of layers. The number in the brackets represents the number of units in the layers (e.g., Dense (N) represents the Dense layer with N units, and Load cell (Hist, N) represents N load cell sensor data points of the Hist-time step).

Figure 2

Table 2. The upper and lower limits of the impedance parameters

Figure 3

Figure 2. System configuration. The Pyboard was connected with the OSL and the Android smartphone, respectively, by CAN and USB. The Pyboard acted as a bidirectional translation module between the smartphone and the OSL.

Figure 4

Table 3. Subject demographics

Figure 5

Table 4. Collected datasets for training, offline testing, and online testing

Figure 6

Figure 3. Comparison of impedance parameters generated from the state machine (blue lines) and DNN (red lines) in offline tests. In general, the DNN made impedance parameters similar to the state machine for the user who was used to train the DNN model (i.e., TF2) and the new users (i.e., TF5 and TF6). The median $ {R}^2 $ across all impedance parameters for all ambulation modes were 0.77, 0.46, and 0.79 for TF2, TF5, and TF6, respectively; the median $ {R}^2 $ for all users was 0.72. All plots show 75th and 25th percentiles in lighter bands.

Figure 7

Figure 4. RMSD of all five ambulation modes in offline tests. The users who were not used for the training (i.e., TF5 and TF6) showed a similar error level as the user who was used for training (i.e., TF2). Bar plots show the 25th, 50th (median), and 75th percentiles. Asterisk indicates $ p $ < .05.

Figure 8

Figure 5. Gait trajectory and corresponding impedance parameters across five ambulation modes during online testing. The blue, red, and yellow lines represent TF2, TF3, and TF5, respectively. All plots show 75th and 25th percentiles in lighter bands.

Figure 9

Table 5. Successful motions during online testing with respect to total trials

Figure 10

Figure 6. Visualization of latent features using t-SNE from the online test dataset. (a) Feature distributions for all ambulation modes. Features are clearly separated except for LGW and RA because their characteristics are similar. (b)–(f) present feature distributions of individual ambulation modes. The feature distributions have long and curved shapes because the features represent a continuous gait trajectory.

Figure 11

Figure 7. Median gait trajectory (upper figures) and corresponding time sequence feature distribution (lower figures) during online testing. The blue, yellow, and red lines in the upper figures represent the normalized weight, ankle angle, and knee angle, respectively. Each color in the lower figures represents the activation level of each time sequence feature. The time sequence features were normalized with respect to the maximum and minimum. The features were sensitively changed according to the gait phase.

Figure 12

Figure 8. Feature correlation with respect to ambulation modes and users. High correlation (minimum of 0.46 and median of 0.82) indicates that the time sequence features are sensitive to the gait phase, rather than ambulation mode. SD for TF2 was excluded because TF2 was unable to complete the mode.

Figure 13

Figure 9. Gait comparison of offline (blue lines) and online data (red lines). All plots show 75th and 25th percentiles in lighter bands.

Figure 14

Figure 10. Comparison of gait duration between offline data (blue lines) and online data (red lines) for five ambulation modes. In general, the DNN results in longer gait duration than the state machine. Bar plots show the 25th, 50th (median), and 75th percentiles. Asterisk indicates $ p $ < .05.

Supplementary material: PDF

Kim et al. supplementary material

Kim et al. supplementary material

Download Kim et al. supplementary material(PDF)
PDF 168.3 KB