Hi,
I've gathered approximately 5 GB of time-series data from my Apple Watch, with activities labeled into two classes (e.g., walking vs. not walking).
The model trained successfully, and the attached screenshot shows the best model's results.
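For context, the training step looks roughly like this (a minimal sketch of the standard Turi Create activity classifier workflow; the file name, the 'session_id'/'activity' column names, and the prediction_window value are placeholders for my actual setup):

```python
import turicreate as tc

# Labeled sensor data: an SFrame with accelerometer/gyroscope feature columns,
# an 'activity' label column and a 'session_id' column (placeholder names).
data = tc.SFrame('watch_sensor_data.sframe')

# Split by recording session so samples from one session never appear in both sets.
train, test = tc.activity_classifier.util.random_split_by_session(
    data, session_id='session_id', fraction=0.8)

# Train the activity classifier; prediction_window is the number of samples
# aggregated into one prediction (example value only).
model = tc.activity_classifier.create(
    train,
    session_id='session_id',
    target='activity',
    prediction_window=50,
    max_iterations=20)
```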
However, when I test on an unseen dataset, the performance is unsatisfactory.
Despite high accuracy, F1, precision, and recall during training, deploying the model for real-time activity classification on the Apple Watch gives disappointing results.
Also, I can only see log_loss; there is no validation/training loss graph.
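For reference, this is roughly how I evaluate on the held-out data (a sketch; I'm assuming evaluate() returns the metric keys documented for the Turi Create activity classifier, such as 'accuracy' and 'confusion_matrix'):

```python
# Evaluate on the held-out sessions; evaluate() returns a dictionary of metrics.
metrics = model.evaluate(test)

print(metrics['accuracy'])
# The per-class confusion matrix is more informative than overall accuracy
# for spotting which activity is being misclassified.
print(metrics['confusion_matrix'])
```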
While I understand my dataset might not count as real Big Data, I believe 5 GB should be enough to distinguish between two activities such as walking and not walking.
I think metrics like accuracy, recall, precision, and F1 aren't adequate on their own to take this model into practical deployment. How do I handle this challenge and build a trustworthy model?
How can I ensure the reliability of my model in such a scenario?
PS: The training, validation, and testing pipeline was built with Turi Create, following the human activity classifier documentation.
The subsequent deployment of the model in the watch app still gave unsatisfactory results.
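For completeness, the export step for the watch app is just this (file names are placeholders):

```python
# Save the Turi Create model and export it to Core ML so it can run on-device.
model.save('activity_classifier.model')
model.export_coreml('ActivityClassifier.mlmodel')
```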