Human activity recognition (HAR) is a rapidly expanding field with applications ranging from biometric authentication to home-based rehabilitation for people with traumatic brain injuries. While HAR is traditionally performed using accelerometry data, a team of students led by researchers in the BIG IDEAS Lab will explore HAR with physiological data from wrist wearables. Using deep learning methods, students will extract features from wearable sensor data to classify human activity. The student team will develop a reproducible machine learning model that will be integrated into the Big Ideas Lab Digital Biomarker Discovery Pipeline (DBDP), a source of code for researchers and clinicians developing digital biomarkers from wearable sensors and mobile health technologies.
This project is in partnership with the Rhodes Information Initiative Data+ undergraduate summer research program. For more information on this project: https://bigdata.duke.edu/projects/human-activity-recognition-using-physiological-data-wearables For more information on Data+: https://bigdata.duke.edu/data
Traditional Human Activity Recognition (HAR) uses accelerometry (movement) data to classify activities. This summer, Team #4 examined whether physiological sensors can improve HAR accuracy and generalizability. The team developed ML models that will be made available open source in the Digital Biomarker Discovery Pipeline (DBDP) to enable other researchers and clinicians to draw useful insights in the field of HAR.
In sum, the goal of the Human Activity Recognition Team is to create a predictive model that:
- Takes in multimodal data from mechanical sensors (such as accelerometers) and physiological sensors (such as electrodermal sensors and pulse oximeters).
- Classifies human activity (Rest, Deep Breathing, Walking, Typing) at high accuracy and precision, while being generalizable and adaptable to other HAR datasets.
Many HAR models use only one or two kinds of mechanical sensor data as inputs to infer behaviors (Dernbach, 2012; Kwapisz, 2011; Zeng, 2014). To our knowledge, our model is the first to incorporate several kinds of physiological data alongside mechanical sensor data. The model uses Blood Volume Pulse (physiological), Electrodermal Activity (physiological), Skin Temperature (physiological), and 3-axis accelerometry (mechanical).
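To make the expected input concrete, here is a minimal sketch (not our actual pipeline code) of how the four sensor streams might be segmented into fixed-length windows for a classifier. The sampling rate, channel ordering, and per-sample labels below are illustrative assumptions.

```python
# Minimal windowing sketch for multimodal wearable signals.
# Assumed channel order: [BVP, EDA, skin temp, acc_x, acc_y, acc_z].
import numpy as np

def make_windows(signals: np.ndarray, labels: np.ndarray,
                 window_size: int = 128, step: int = 64):
    """Slide a fixed-length window over a (time, channels) signal array.

    signals: shape (n_samples, n_channels)
    labels:  shape (n_samples,), one activity label per time step
    Returns (windows, window_labels), labeling each window by the
    majority activity within it.
    """
    windows, window_labels = [], []
    for start in range(0, len(signals) - window_size + 1, step):
        end = start + window_size
        windows.append(signals[start:end])
        # Majority vote over the window's per-sample labels.
        values, counts = np.unique(labels[start:end], return_counts=True)
        window_labels.append(values[np.argmax(counts)])
    return np.stack(windows), np.array(window_labels)

# Example with synthetic data: 10 minutes at an assumed 32 Hz, 6 channels.
X = np.random.randn(10 * 60 * 32, 6)
y = np.random.choice(["Rest", "Deep Breathing", "Walking", "Typing"],
                     size=len(X))
windows, window_labels = make_windows(X, y)
print(windows.shape)  # (n_windows, 128, 6)
```

A window-plus-label format like this is a common input shape for both classical baselines and deep sequence models.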
By combining mechanical and physiological data for activity recognition, our approach could provide more detailed insight into user behavior and habits. Making our model flexible enough to work well with other HAR datasets could therefore help provide better care for chronic diseases, which often require major lifestyle changes, and support long-term monitoring of vulnerable patients, such as elderly patients or patients in intensive care. Because our model is multimodal, including a range of different types of physiological and mechanical data can help make care more personalized and effective. Lastly, using elements of multi-attribute classification, the inclusion of contextual data such as age and gender to further tailor HAR models to each user, could improve the model's accuracy on new datasets that include demographic data (Lara, 2013).
Our data come from Bent et al. (2020), "Investigating sources of inaccuracy in wearable optical heart rate sensors".
Deliverable | Status |
---|---|
Literature Review on Human Activity Recognition | Completed |
Data Cleaning | Completed |
Exploratory Data Analysis | Completed |
Random Forest Model | Completed |
Deep Learning Models | Completed |
Documentation | In Progress |
- Literature Review
- Data Cleaning
- Exploratory Data Analysis
- Time Series Preprocessing
- Deep Learning Models
- Random Forest Model (see the baseline sketch below)
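As referenced in the list above, here is a minimal sketch of what a random forest baseline could look like, assuming windowed arrays like those produced by the `make_windows` sketch earlier. The per-channel summary-statistic features are illustrative, not necessarily the features the team engineered.

```python
# Random forest baseline sketch over windowed sensor data.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

def summary_features(windows: np.ndarray) -> np.ndarray:
    """Collapse each (window_size, n_channels) window into simple
    per-channel summary statistics: mean, std, min, max."""
    return np.concatenate([windows.mean(axis=1), windows.std(axis=1),
                           windows.min(axis=1), windows.max(axis=1)], axis=1)

# `windows` and `window_labels` as returned by make_windows(...) above.
features = summary_features(windows)
X_train, X_test, y_train, y_test = train_test_split(
    features, window_labels, test_size=0.2, stratify=window_labels,
    random_state=42)

clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
print(classification_report(y_test, clf.predict(X_test)))
```

A baseline like this is cheap to train and gives a reference point for judging whether the deep learning models add accuracy worth their extra complexity.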
- Python - Primary language for building the end-to-end data-to-model pipeline
- PyTorch - Python library for tensor computation and deep neural networks
- TensorFlow - Python library for neural networks and deep learning
- Dataiku - Automated platform that powers both self-service analytics and the operationalization of our baseline machine learning model
- JupyterLab - Our team's primary interface to develop the project in Python
- Tableau - Visualization tool for outliers, summary statistics, etc.
- Brinnae Bent (Project Manager)
- Kush Gulati: LinkedIn, Github
- Annie Hirsch: LinkedIn, Github
- Noah Lanier: Github
- Nathan Warren: LinkedIn, Github
This project was developed under the Big Ideas Lab - see their webpage for more details. We aspire to make our work open source as part of the overarching Digital Biomarker Discovery Pipeline (DBDP).
Author | Model Name | Model Description | Github Link | Paper Link |
---|---|---|---|---|
Ordóñez, 2016 | DeepConvLSTM | DeepConvLSTM combines a CNN with LSTM recurrent layers and a dense layer structure: the deep convolutional network extracts features and the recurrent layers learn time dependencies (see the sketch after this table). | GitHub (from authors) GitHub (Pytorch) | Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition |
Singh, 2020 | DeepConvLSTM w/ self-attention | Similar to the Ordóñez model, but adds a self-attention layer to the CNN, LSTM, and dense layer structure. | GitHub | Deep ConvLSTM with self-attention for human activity decoding using wearables |
Ma, 2019 | AttnSense | AttnSense applies a convolutional layer to each sensor, followed by an attention layer that weights each sensor's contribution. Stacked GRUs extract important temporal features, and a softmax layer gives the probability of each classification. | GitHub | AttnSense: Multi-level Attention Mechanism For Multimodal Human Activity Recognition |
Yao, 2017 | DeepSense | DeepSense feeds CNN features into an RNN with GRUs: a convolutional network extracts local features from each sensor and merges them with an additional convolutional layer, while the recurrent network learns time dependencies. It is the state-of-the-art model on the Heterogeneous dataset. | GitHub | DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing |
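To ground the first row of the table, here is a minimal PyTorch sketch of the DeepConvLSTM idea (Ordóñez & Roggen, 2016): convolutions extract local features, an LSTM models temporal dependencies, and a dense layer classifies from the last hidden state. Layer counts and sizes are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal DeepConvLSTM-style sketch; sizes are illustrative assumptions.
import torch
import torch.nn as nn

class DeepConvLSTM(nn.Module):
    def __init__(self, n_channels: int = 6, n_classes: int = 4,
                 conv_filters: int = 64, lstm_hidden: int = 128):
        super().__init__()
        # 1-D convolutions along the time axis extract local features.
        self.conv = nn.Sequential(
            nn.Conv1d(n_channels, conv_filters, kernel_size=5), nn.ReLU(),
            nn.Conv1d(conv_filters, conv_filters, kernel_size=5), nn.ReLU(),
        )
        # Stacked LSTM layers learn time dependencies over the conv features.
        self.lstm = nn.LSTM(conv_filters, lstm_hidden, num_layers=2,
                            batch_first=True)
        self.classifier = nn.Linear(lstm_hidden, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, window_size, n_channels)
        x = self.conv(x.transpose(1, 2))      # -> (batch, filters, time')
        x, _ = self.lstm(x.transpose(1, 2))   # -> (batch, time', hidden)
        return self.classifier(x[:, -1, :])   # logits from last time step

model = DeepConvLSTM()
logits = model(torch.randn(8, 128, 6))  # batch of 8 windows, 6 channels
print(logits.shape)  # torch.Size([8, 4])
```

Training would pair these logits with a cross-entropy loss over the four activity classes.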
- Reiss, A., & Stricker, D. (2012). Introducing a New Benchmarked Dataset for Activity Monitoring. The 16th IEEE International Symposium on Wearable Computers (ISWC).
- Bent, B., Goldstein, B. A., Kibbe, W. A., et al. (2020). Investigating sources of inaccuracy in wearable optical heart rate sensors. npj Digital Medicine, 3, 18. https://doi.org/10.1038/s41746-020-0226-6
- Ma, H., Li, W., Zhang, X., Gao, S., & Lu, S. (2019). AttnSense: Multi-level Attention Mechanism For Multimodal Human Activity Recognition. Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence. doi: 10.24963/ijcai.2019/431
- Ordóñez, F., & Roggen, D. (2016). Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition. Sensors, 16(1), 115. doi: 10.3390/s16010115
- Singh, S. P., Lay-Ekuakille, A., Gangwar, D., Sharma, M. K., & Gupta, S. (2020). Deep ConvLSTM with self-attention for human activity decoding using wearables. arXiv, abs/2005.00698.
- Yao, S., Hu, S., Zhao, Y., Zhang, A., & Abdelzaher, T. (2017). DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing. Proceedings of the 26th International Conference on World Wide Web. doi: 10.1145/3038912.3052577