Skip to content

hobein/ailia-models

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ailia-models

The collection of pre-trained, state-of-the-art AI models.

About ailia SDK

ailia SDK is a cross-platform high speed inference SDK. The ailia SDK provides a consistent C++ API on Windows, Mac, Linux, iOS, Android, Jetson and Raspberry Pi. It supports Unity, Python and JNI for efficient AI implementation. The ailia SDK makes great use of the GPU via Vulkan and Metal to serve accelerated computing.

How to use

ailia MODELS tutorial

Supported models

Action recognition

Model Reference Exported From Supported Ailia Version
mars MARS: Motion-Augmented RGB Stream for Action Recognition Pytorch 1.2.4 and later
st-gcn ST-GCN Pytorch 1.2.5 and later
ax_action_recognition Realtime-Action-Recognition Pytorch 1.2.7 and later
va-cnn View Adaptive Neural Networks (VA) for Skeleton-based Human Action Recognition Pytorch 1.2.7 and later

Anomaly detection

Model Reference Exported From Supported Ailia Version
padim PaDiM-Anomaly-Detection-Localization-master Pytorch 1.2.6 and later

Audio processing

Model Reference Exported From Supported Ailia Version
crnn_audio_classification crnn-audio-classification Pytorch 1.2.5 and later
deepspeech2 deepspeech.pytorch Pytorch 1.2.2 and later
pytorch-dc-tts Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention Pytorch 1.2.6 and later
unet_source_separation source_separation Pytorch 1.2.6 and later
transformer-cnn-emotion-recognition Combining Spatial and Temporal Feature Representions of Speech Emotion by Parallelizing CNNs and Transformer-Encoders Pytorch 1.2.5 and later

Crowd counting

Model Reference Exported From Supported Ailia Version
crowdcount-cascaded-mtl CNN-based Cascaded Multi-task Learning of
High-level Prior and Density Estimation for Crowd Counting
(Single Image Crowd Counting)
Pytorch 1.2.1 and later
c-3-framework Crowd Counting Code Framework(C^3-Framework) Pytorch 1.2.5 and later

Deep fashion

Model Reference Exported From Supported Ailia Version
clothing-detection Clothing-Detection Pytorch 1.2.1 and later
mmfashion MMFashion Pytorch 1.2.5 and later
mmfashion_tryon MMFashion virtula try-on Pytorch 1.2.8 and later
fashionai-key-points-detection A Pytorch Implementation of Cascaded Pyramid Network for FashionAI Key Points Detection Pytorch 1.2.5 and later

Depth estimation

Model Reference Exported From Supported Ailia Version
monodepth2 Monocular depth estimation from a single image Pytorch 1.2.2 and later
midas Towards Robust Monocular Depth Estimation:
Mixing Datasets for Zero-shot Cross-dataset Transfer
Pytorch 1.2.4 and later
fcrn-depthprediction Deeper Depth Prediction with Fully Convolutional Residual Networks TensorFlow 1.2.6 and later
fast-depth ICRA 2019 "FastDepth: Fast Monocular Depth Estimation on Embedded Systems" Pytorch 1.2.5 and later

Face detection

Model Reference Exported From Supported Ailia Version
yolov1-face YOLO-Face-detection Darknet 1.1.0 and later
yolov3-face Face detection using keras-yolov3 Keras 1.2.1 and later
blazeface BlazeFace-PyTorch Pytorch 1.2.1 and later
face-mask-detection Face detection using keras-yolov3 Keras 1.2.1 and later
dbface DBFace : real-time, single-stage detector for face detection,
with faster speed and higher accuracy
Pytorch 1.2.2 and later
retinaface RetinaFace: Single-stage Dense Face Localisation in the Wild. Pytorch 1.2.5 and later

Face identification

Model Reference Exported From Supported Ailia Version
vggface2 VGGFace2 Dataset for Face Recognition Caffe 1.1.0 and later
arcface pytorch implement of arcface Pytorch 1.2.1 and later
insightface InsightFace: 2D and 3D Face Analysis Project Pytorch 1.2.5 and later

Face recognition

Model Reference Exported From Supported Ailia Version
face_classification Real-time face detection and emotion/gender classification Keras 1.1.0 and later
facial_feature kaggle-facial-keypoints Pytorch 1.2.0 and later
face_alignment 2D and 3D Face alignment library build using pytorch Pytorch 1.2.1 and later
prnet Joint 3D Face Reconstruction and Dense Alignment
with Position Map Regression Network
TensorFlow 1.2.2 and later
gazeml A deep learning framework based on Tensorflow
for the training of high performance gaze estimation
TensorFlow 1.2.0 and later
facemesh facemesh.pytorch Pytorch 1.2.2 and later
mediapipe_iris irislandmarks.pytorch Pytorch 1.2.2 and later
hopenet deep-head-pose Pytorch 1.2.2 and later
ax_gaze_estimation ax Gaze Estimation Pytorch 1.2.2 and later

Frame Interpolation

Model Reference Exported From Supported Ailia Version
flavr FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation Pytorch 1.2.7 and later

Generative adversarial networks

Model Reference Exported From Supported Ailia Version
pytorch-gan Code repo for the Pytorch GAN Zoo project (used to train this model) Pytorch 1.2.4 and later
council-GAN Council-GAN Pytorch 1.2.4 and later

Hand detection

Model Reference Exported From Supported Ailia Version
yolov3-hand Hand detection branch of Face detection using keras-yolov3 Keras 1.2.1 and later
hand_detection_pytorch hand-detection.PyTorch Pytorch 1.2.2 and later
blazepalm MediaPipePyTorch Pytorch 1.2.5 and later

Hand recognition

Model Reference Exported From Supported Ailia Version
blazehand MediaPipePyTorch Pytorch 1.2.5 and later
hand3d ColorHandPose3D network TensorFlow 1.2.5 and later
minimal-hand Minimal Hand TensorFlow 1.2.8 and later

Image captioning

Model Reference Exported From Supported Ailia Version
illustration2vec Illustration2Vec Caffe 1.2.2 and later
image_captioning_pytorch Image Captioning pytorch Pytorch 1.2.5 and later

Image classification

Model Reference Exported From Supported Ailia Version
vgg16 Very Deep Convolutional Networks for Large-Scale Image Recognition Keras 1.1.0 and later
googlenet Going Deeper with Convolutions Pytorch 1.2.0 and later
resnet50 Deep Residual Learning for Image Recognition Chainer 1.2.0 and later
inceptionv3 Rethinking the Inception Architecture for Computer Vision Pytorch 1.2.0 and later
inceptionv4 Keras Inception-V4 Keras 1.2.5 and later
mobilenetv2 PyTorch Implemention of MobileNet V2 Pytorch 1.2.0 and later
mobilenetv3 PyTorch Implemention of MobileNet V3 Pytorch 1.2.1 and later
partialconv Partial Convolution Layer for Padding and Image Inpainting Pytorch 1.2.0 and later
efficientnet A PyTorch implementation of EfficientNet Pytorch 1.2.3 and later
vit Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) Pytorch 1.2.7 and later
efficientnetv2 EfficientNetV2 Pytorch 1.2.4 and later

Image manipulation

Model Reference Exported From Supported Ailia Version
noise2noise Learning Image Restoration without Clean Data Pytorch 1.2.0 and later
dewarpnet DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks Pytorch 1.2.1 and later
illnet Document Rectification and Illumination Correction using a Patch-based CNN Pytorch 1.2.2 and later
colorization Colorful Image Colorization Pytorch 1.2.2 and later
u2net_portrait U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection Pytorch 1.2.2 and later
style2paints Style2Paints TensorFlow 1.2.6 and later
deep_white_balance Deep White-Balance Editing, CVPR 2020 (Oral) PyTorch 1.2.6 and later
inpainting-with-partial-conv pytorch-inpainting-with-partial-conv PyTorch 1.2.6 and later
inpainting_gmcnn Image Inpainting via Generative Multi-column Convolutional Neural Networks TensorFlow 1.2.6 and later
deblur_gan DeblurGAN Pytorch 1.2.6 and later
3d-photo-inpainting 3D Photography using Context-aware Layered Depth Inpainting Pytorch 1.2.7 and later

Image segmentation

Model Reference Exported From Supported Ailia Version
deeplabv3 Xception65 for backbone network of DeepLab v3+ Chainer 1.2.0 and later
hrnet_segmentation High-resolution networks (HRNets) for Semantic Segmentation Pytorch 1.2.1 and later
hair_segmentation hair segmentation in mobile device Keras 1.2.1 and later
pspnet-hair-segmentation pytorch-hair-segmentation Pytorch 1.2.2 and later
U-2-Net U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection Pytorch 1.2.2 and later
deep-image-matting Deep Image Matting Keras 1.2.3 and later
human_part_segmentation Self Correction for Human Parsing Pytorch 1.2.4 and later
semantic-segmentation-mobilenet-v3 Semantic segmentation with MobileNetV3 TensorFlow 1.2.5 and later
pytorch-unet Pytorch-Unet Pytorch 1.2.5 and later
pytorch-enet PyTorch-ENet Pytorch 1.2.8 and later
yet-another-anime-segmenter Yet-Another-Anime-Segmenter Pytorch 1.2.6 and later
swiftnet SwiftNet Pytorch 1.2.6 and later
codes-for-lane-detection Codes-for-Lane-Detection Pytorch 1.2.6 and later
dense_prediction_transformers dense_prediction_transformers Pytorch 1.2.7 and later
u2net-portrait-matting U^2-Net - Portrait matting Pytorch 1.2.7 and later
u2net-human-seg U^2-Net - human segmentation Pytorch 1.2.4 and later
indexnet Indices Matter: Learning to Index for Deep Image Matting Pytorch 1.2.7 and later
modnet MODNet: Trimap-Free Portrait Matting in Real Time Pytorch 1.2.7 and later

Line Segment Detection

Model Reference Exported From Supported Ailia Version
mlsd M-LSD: Towards Light-weight and Real-time Line Segment Detection TensorFlow 1.2.8 and later

Natural language processing

Model Reference Exported From Supported Ailia Version
bert pytorch-pretrained-bert Pytorch 1.2.2 and later
bert_maskedlm huggingface/transformers Pytorch 1.2.5 and later
bert_ner huggingface/transformers Pytorch 1.2.5 and later
bert_question_answering huggingface/transformers Pytorch 1.2.5 and later
bert_sentiment_analysis huggingface/transformers Pytorch 1.2.5 and later
bert_zero_shot_classification huggingface/transformers Pytorch 1.2.5 and later
bert_tweets_sentiment huggingface/transformers Pytorch 1.2.5 and later

Object detection

Model Reference Exported From Supported Ailia Version
yolov1-tiny YOLO: Real-Time Object Detection Darknet 1.1.0 and later
yolov2 YOLO: Real-Time Object Detection Pytorch 1.2.0 and later
yolov3 YOLO: Real-Time Object Detection ONNX Runtime 1.2.1 and later
yolov3-tiny YOLO: Real-Time Object Detection ONNX Runtime 1.2.1 and later
yolov4 Pytorch-YOLOv4 Pytorch 1.2.4 and later
yolov4-tiny Pytorch-YOLOv4 Pytorch 1.2.5 and later
yolov5 yolov5 Pytorch 1.2.5 and later
mobilenet_ssd MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch Pytorch 1.2.1 and later
maskrcnn Mask R-CNN: real-time neural network for object instance segmentation Pytorch 1.2.3 and later
m2det M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Pytorch 1.2.3 and later
centernet CenterNet : Objects as Points Pytorch 1.2.1 and later
pedestrian_detection Pedestrian-Detection-on-YOLOv3_Research-and-APP Keras 1.2.1 and later
efficientdet EfficientDet: Scalable and Efficient Object Detection, in PyTorch Pytorch 1.2.6 and later
3d_bbox 3D Bounding Box Estimation Using Deep Learning and Geometry Pytorch 1.2.6 and later
nanodet NanoDet Pytorch 1.2.6 and later
yolor yolor Pytorch 1.2.5 and later
3d-object-detection.pytorch 3d-object-detection.pytorch Pytorch 1.2.8 and later
mediapipe_objectron MediaPipe Objectron TensorFlow Lite 1.2.5 and later

Object tracking

Model Reference Exported From Supported Ailia Version
deepsort Deep Sort with PyTorch Pytorch 1.2.3 and later
person_reid_baseline_pytorch UTS-Person-reID-Practical Pytorch 1.2.6 and later
roneld RONELD-Lane-Detection Pytorch 1.2.6 and later

Point segmentation

Model Reference Exported From Supported Ailia Version
pointnet_pytorch PointNet.pytorch Pytorch 1.2.6 and later

Pose estimation

Model Reference Exported From Supported Ailia Version
openpose Code repo for realtime multi-person pose estimation in CVPR'17 (Oral) Caffe 1.2.1 and later
lightweight-human-pose-estimation Fast and accurate human pose estimation in PyTorch.
Contains implementation of
"Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
Pytorch 1.2.1 and later
lightweight-human-pose-estimation-3d Real-time 3D multi-person pose estimation demo in PyTorch.
OpenVINO backend can be used for fast inference on CPU.
Pytorch 1.2.1 and later
3d-pose-baseline A simple baseline for 3d human pose estimation in tensorflow.
Presented at ICCV 17.
TensorFlow 1.2.3 and later
pose_resnet Simple Baselines for Human Pose Estimation and Tracking Pytorch 1.2.1 and later
blazepose MediaPipePyTorch Pytorch 1.2.5 and later
blazepose-fullbody MediaPipe TensorFlow Lite 1.2.5 and later
3dmppe_posenet PoseNet of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image" Pytorch 1.2.6 and later
efficientpose Code repo for EfficientPose TensorFlow 1.2.6 and later
pose-hg-3d Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach Pytorch 1.2.6 and later
gast A Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video (GAST-Net) Pytorch 1.2.7 and later
animalpose MMPose - 2D animal pose estimation Pytorch 1.2.7 and later
movenet Code repo for movenet TensorFlow 1.2.8 and later

Rotation prediction

Model Reference Exported From Supported Ailia Version
rotnet CNNs for predicting the rotation angle of an image to correct its orientation Keras 1.2.1 and later

Style transfer

Model Reference Exported From Supported Ailia Version
adain Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization Pytorch 1.2.1 and later
psgan PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer Pytorch 1.2.7 and later
beauty_gan BeautyGAN Pytorch 1.2.7 and later

Super resolution

Model Reference Exported From Supported Ailia Version
srresnet Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network Pytorch 1.2.0 and later
edsr Enhanced Deep Residual Networks for Single Image Super-Resolution Pytorch 1.2.6 and later
han Single Image Super-Resolution via a Holistic Attention Network Pytorch 1.2.6 and later

Text detection

Model Reference Exported From Supported Ailia Version
craft_pytorch CRAFT: Character-Region Awareness For Text detection Pytorch 1.2.2 and later
pixel_link Pixel-Link TensorFlow 1.2.6 and later
east EAST: An Efficient and Accurate Scene Text Detector TensorFlow 1.2.6 and later

Text recognition

Model Reference Exported From Supported Ailia Version
etl Japanese Character Classification Keras 1.1.0 and later
deep-text-recognition-benchmark deep-text-recognition-benchmark Pytorch 1.2.6 and later
crnn.pytorch Convolutional Recurrent Neural Network Pytorch 1.2.6 and later
paddleocr PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle Pytorch 1.2.6 and later

Commercial model

Model Reference Exported From Supported Ailia Version
acculus-pose Acculus, Inc. Caffe 1.2.3 and later

Other languages

unity version

c++ version

About

Pretrained models for ailia SDK

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 99.5%
  • Shell 0.5%