ailia-models

The collection of pre-trained, state-of-the-art AI models.

About ailia SDK

ailia SDK is a cross-platform high speed inference SDK. The ailia SDK provides a consistent C++ API on Windows, Mac, Linux, iOS, Android, Jetson and Raspberry Pi. It supports Unity, Python and JNI for efficient AI implementation. The ailia SDK makes great use of the GPU via Vulkan and Metal to serve accelerated computing.

How to use

ailia MODELS tutorial

Supported models

Action recognition

Model	Reference	Exported From	Supported Ailia Version
mars	MARS: Motion-Augmented RGB Stream for Action Recognition	Pytorch	1.2.4 and later
st-gcn	ST-GCN	Pytorch	1.2.5 and later
ax_action_recognition	Realtime-Action-Recognition	Pytorch	1.2.7 and later
va-cnn	View Adaptive Neural Networks (VA) for Skeleton-based Human Action Recognition	Pytorch	1.2.7 and later

Anomaly detection

	Model	Reference	Exported From	Supported Ailia Version
	padim	PaDiM-Anomaly-Detection-Localization-master	Pytorch	1.2.6 and later

Audio processing

Model	Reference	Exported From	Supported Ailia Version
crnn_audio_classification	crnn-audio-classification	Pytorch	1.2.5 and later
deepspeech2	deepspeech.pytorch	Pytorch	1.2.2 and later
pytorch-dc-tts	Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention	Pytorch	1.2.6 and later
unet_source_separation	source_separation	Pytorch	1.2.6 and later
transformer-cnn-emotion-recognition	Combining Spatial and Temporal Feature Representions of Speech Emotion by Parallelizing CNNs and Transformer-Encoders	Pytorch	1.2.5 and later

Crowd counting

	Model	Reference	Exported From	Supported Ailia Version
	crowdcount-cascaded-mtl	CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting (Single Image Crowd Counting)	Pytorch	1.2.1 and later
	c-3-framework	Crowd Counting Code Framework(C^3-Framework)	Pytorch	1.2.5 and later

Deep fashion

Model	Reference	Exported From	Supported Ailia Version
clothing-detection	Clothing-Detection	Pytorch	1.2.1 and later
mmfashion	MMFashion	Pytorch	1.2.5 and later
mmfashion_tryon	MMFashion virtula try-on	Pytorch	1.2.8 and later
fashionai-key-points-detection	A Pytorch Implementation of Cascaded Pyramid Network for FashionAI Key Points Detection	Pytorch	1.2.5 and later

Depth estimation

Model	Reference	Exported From	Supported Ailia Version
monodepth2	Monocular depth estimation from a single image	Pytorch	1.2.2 and later
midas	Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer	Pytorch	1.2.4 and later
fcrn-depthprediction	Deeper Depth Prediction with Fully Convolutional Residual Networks	TensorFlow	1.2.6 and later
fast-depth	ICRA 2019 "FastDepth: Fast Monocular Depth Estimation on Embedded Systems"	Pytorch	1.2.5 and later

Face detection

Model	Reference	Exported From	Supported Ailia Version
yolov1-face	YOLO-Face-detection	Darknet	1.1.0 and later
yolov3-face	Face detection using keras-yolov3	Keras	1.2.1 and later
blazeface	BlazeFace-PyTorch	Pytorch	1.2.1 and later
face-mask-detection	Face detection using keras-yolov3	Keras	1.2.1 and later
dbface	DBFace : real-time, single-stage detector for face detection, with faster speed and higher accuracy	Pytorch	1.2.2 and later
retinaface	RetinaFace: Single-stage Dense Face Localisation in the Wild.	Pytorch	1.2.5 and later

Face identification

Model	Reference	Exported From	Supported Ailia Version
vggface2	VGGFace2 Dataset for Face Recognition	Caffe	1.1.0 and later
arcface	pytorch implement of arcface	Pytorch	1.2.1 and later
insightface	InsightFace: 2D and 3D Face Analysis Project	Pytorch	1.2.5 and later

Face recognition

Model	Reference	Exported From	Supported Ailia Version
face_classification	Real-time face detection and emotion/gender classification	Keras	1.1.0 and later
facial_feature	kaggle-facial-keypoints	Pytorch	1.2.0 and later
face_alignment	2D and 3D Face alignment library build using pytorch	Pytorch	1.2.1 and later
prnet	Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network	TensorFlow	1.2.2 and later
gazeml	A deep learning framework based on Tensorflow for the training of high performance gaze estimation	TensorFlow	1.2.0 and later
facemesh	facemesh.pytorch	Pytorch	1.2.2 and later
mediapipe_iris	irislandmarks.pytorch	Pytorch	1.2.2 and later
hopenet	deep-head-pose	Pytorch	1.2.2 and later
ax_gaze_estimation	ax Gaze Estimation	Pytorch	1.2.2 and later

Frame Interpolation

	Model	Reference	Exported From	Supported Ailia Version
	flavr	FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation	Pytorch	1.2.7 and later

Generative adversarial networks

	Model	Reference	Exported From	Supported Ailia Version
	pytorch-gan	Code repo for the Pytorch GAN Zoo project (used to train this model)	Pytorch	1.2.4 and later
	council-GAN	Council-GAN	Pytorch	1.2.4 and later

Hand detection

Model	Reference	Exported From	Supported Ailia Version
yolov3-hand	Hand detection branch of Face detection using keras-yolov3	Keras	1.2.1 and later
hand_detection_pytorch	hand-detection.PyTorch	Pytorch	1.2.2 and later
blazepalm	MediaPipePyTorch	Pytorch	1.2.5 and later

Hand recognition

Model	Reference	Exported From	Supported Ailia Version
blazehand	MediaPipePyTorch	Pytorch	1.2.5 and later
hand3d	ColorHandPose3D network	TensorFlow	1.2.5 and later
minimal-hand	Minimal Hand	TensorFlow	1.2.8 and later

Image captioning

	Model	Reference	Exported From	Supported Ailia Version
	illustration2vec	Illustration2Vec	Caffe	1.2.2 and later
	image_captioning_pytorch	Image Captioning pytorch	Pytorch	1.2.5 and later

Image classification

Model	Reference	Exported From	Supported Ailia Version
vgg16	Very Deep Convolutional Networks for Large-Scale Image Recognition	Keras	1.1.0 and later
googlenet	Going Deeper with Convolutions	Pytorch	1.2.0 and later
resnet50	Deep Residual Learning for Image Recognition	Chainer	1.2.0 and later
inceptionv3	Rethinking the Inception Architecture for Computer Vision	Pytorch	1.2.0 and later
inceptionv4	Keras Inception-V4	Keras	1.2.5 and later
mobilenetv2	PyTorch Implemention of MobileNet V2	Pytorch	1.2.0 and later
mobilenetv3	PyTorch Implemention of MobileNet V3	Pytorch	1.2.1 and later
partialconv	Partial Convolution Layer for Padding and Image Inpainting	Pytorch	1.2.0 and later
efficientnet	A PyTorch implementation of EfficientNet	Pytorch	1.2.3 and later
vit	Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)	Pytorch	1.2.7 and later
efficientnetv2	EfficientNetV2	Pytorch	1.2.4 and later

Image manipulation

Model	Reference	Exported From	Supported Ailia Version
noise2noise	Learning Image Restoration without Clean Data	Pytorch	1.2.0 and later
dewarpnet	DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks	Pytorch	1.2.1 and later
illnet	Document Rectification and Illumination Correction using a Patch-based CNN	Pytorch	1.2.2 and later
colorization	Colorful Image Colorization	Pytorch	1.2.2 and later
u2net_portrait	U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection	Pytorch	1.2.2 and later
style2paints	Style2Paints	TensorFlow	1.2.6 and later
deep_white_balance	Deep White-Balance Editing, CVPR 2020 (Oral)	PyTorch	1.2.6 and later
inpainting-with-partial-conv	pytorch-inpainting-with-partial-conv	PyTorch	1.2.6 and later
inpainting_gmcnn	Image Inpainting via Generative Multi-column Convolutional Neural Networks	TensorFlow	1.2.6 and later
deblur_gan	DeblurGAN	Pytorch	1.2.6 and later
3d-photo-inpainting	3D Photography using Context-aware Layered Depth Inpainting	Pytorch	1.2.7 and later

Image segmentation

Model	Reference	Exported From	Supported Ailia Version
deeplabv3	Xception65 for backbone network of DeepLab v3+	Chainer	1.2.0 and later
hrnet_segmentation	High-resolution networks (HRNets) for Semantic Segmentation	Pytorch	1.2.1 and later
hair_segmentation	hair segmentation in mobile device	Keras	1.2.1 and later
pspnet-hair-segmentation	pytorch-hair-segmentation	Pytorch	1.2.2 and later
U-2-Net	U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection	Pytorch	1.2.2 and later
deep-image-matting	Deep Image Matting	Keras	1.2.3 and later
human_part_segmentation	Self Correction for Human Parsing	Pytorch	1.2.4 and later
semantic-segmentation-mobilenet-v3	Semantic segmentation with MobileNetV3	TensorFlow	1.2.5 and later
pytorch-unet	Pytorch-Unet	Pytorch	1.2.5 and later
pytorch-enet	PyTorch-ENet	Pytorch	1.2.8 and later
yet-another-anime-segmenter	Yet-Another-Anime-Segmenter	Pytorch	1.2.6 and later
swiftnet	SwiftNet	Pytorch	1.2.6 and later
codes-for-lane-detection	Codes-for-Lane-Detection	Pytorch	1.2.6 and later
dense_prediction_transformers	dense_prediction_transformers	Pytorch	1.2.7 and later
u2net-portrait-matting	U^2-Net - Portrait matting	Pytorch	1.2.7 and later
u2net-human-seg	U^2-Net - human segmentation	Pytorch	1.2.4 and later
indexnet	Indices Matter: Learning to Index for Deep Image Matting	Pytorch	1.2.7 and later
modnet	MODNet: Trimap-Free Portrait Matting in Real Time	Pytorch	1.2.7 and later

Line Segment Detection

	Model	Reference	Exported From	Supported Ailia Version
	mlsd	M-LSD: Towards Light-weight and Real-time Line Segment Detection	TensorFlow	1.2.8 and later

Natural language processing

Model	Reference	Exported From	Supported Ailia Version
bert	pytorch-pretrained-bert	Pytorch	1.2.2 and later
bert_maskedlm	huggingface/transformers	Pytorch	1.2.5 and later
bert_ner	huggingface/transformers	Pytorch	1.2.5 and later
bert_question_answering	huggingface/transformers	Pytorch	1.2.5 and later
bert_sentiment_analysis	huggingface/transformers	Pytorch	1.2.5 and later
bert_zero_shot_classification	huggingface/transformers	Pytorch	1.2.5 and later
bert_tweets_sentiment	huggingface/transformers	Pytorch	1.2.5 and later

Object detection

Model	Reference	Exported From	Supported Ailia Version
yolov1-tiny	YOLO: Real-Time Object Detection	Darknet	1.1.0 and later
yolov2	YOLO: Real-Time Object Detection	Pytorch	1.2.0 and later
yolov3	YOLO: Real-Time Object Detection	ONNX Runtime	1.2.1 and later
yolov3-tiny	YOLO: Real-Time Object Detection	ONNX Runtime	1.2.1 and later
yolov4	Pytorch-YOLOv4	Pytorch	1.2.4 and later
yolov4-tiny	Pytorch-YOLOv4	Pytorch	1.2.5 and later
yolov5	yolov5	Pytorch	1.2.5 and later
mobilenet_ssd	MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch	Pytorch	1.2.1 and later
maskrcnn	Mask R-CNN: real-time neural network for object instance segmentation	Pytorch	1.2.3 and later
m2det	M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network	Pytorch	1.2.3 and later
centernet	CenterNet : Objects as Points	Pytorch	1.2.1 and later
pedestrian_detection	Pedestrian-Detection-on-YOLOv3_Research-and-APP	Keras	1.2.1 and later
efficientdet	EfficientDet: Scalable and Efficient Object Detection, in PyTorch	Pytorch	1.2.6 and later
3d_bbox	3D Bounding Box Estimation Using Deep Learning and Geometry	Pytorch	1.2.6 and later
nanodet	NanoDet	Pytorch	1.2.6 and later
yolor	yolor	Pytorch	1.2.5 and later
3d-object-detection.pytorch	3d-object-detection.pytorch	Pytorch	1.2.8 and later
mediapipe_objectron	MediaPipe Objectron	TensorFlow Lite	1.2.5 and later

Object tracking

Model	Reference	Exported From	Supported Ailia Version
deepsort	Deep Sort with PyTorch	Pytorch	1.2.3 and later
person_reid_baseline_pytorch	UTS-Person-reID-Practical	Pytorch	1.2.6 and later
roneld	RONELD-Lane-Detection	Pytorch	1.2.6 and later

Point segmentation

	Model	Reference	Exported From	Supported Ailia Version
	pointnet_pytorch	PointNet.pytorch	Pytorch	1.2.6 and later

Pose estimation

Model	Reference	Exported From	Supported Ailia Version
openpose	Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)	Caffe	1.2.1 and later
lightweight-human-pose-estimation	Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.	Pytorch	1.2.1 and later
lightweight-human-pose-estimation-3d	Real-time 3D multi-person pose estimation demo in PyTorch. OpenVINO backend can be used for fast inference on CPU.	Pytorch	1.2.1 and later
3d-pose-baseline	A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.	TensorFlow	1.2.3 and later
pose_resnet	Simple Baselines for Human Pose Estimation and Tracking	Pytorch	1.2.1 and later
blazepose	MediaPipePyTorch	Pytorch	1.2.5 and later
blazepose-fullbody	MediaPipe	TensorFlow Lite	1.2.5 and later
3dmppe_posenet	PoseNet of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image"	Pytorch	1.2.6 and later
efficientpose	Code repo for EfficientPose	TensorFlow	1.2.6 and later
pose-hg-3d	Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach	Pytorch	1.2.6 and later
gast	A Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video (GAST-Net)	Pytorch	1.2.7 and later
animalpose	MMPose - 2D animal pose estimation	Pytorch	1.2.7 and later
movenet	Code repo for movenet	TensorFlow	1.2.8 and later

Rotation prediction

	Model	Reference	Exported From	Supported Ailia Version
	rotnet	CNNs for predicting the rotation angle of an image to correct its orientation	Keras	1.2.1 and later

Style transfer

Model	Reference	Exported From	Supported Ailia Version
adain	Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization	Pytorch	1.2.1 and later
psgan	PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer	Pytorch	1.2.7 and later
beauty_gan	BeautyGAN	Pytorch	1.2.7 and later

Super resolution

Model	Reference	Exported From	Supported Ailia Version
srresnet	Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network	Pytorch	1.2.0 and later
edsr	Enhanced Deep Residual Networks for Single Image Super-Resolution	Pytorch	1.2.6 and later
han	Single Image Super-Resolution via a Holistic Attention Network	Pytorch	1.2.6 and later

Text detection

Model	Reference	Exported From	Supported Ailia Version
craft_pytorch	CRAFT: Character-Region Awareness For Text detection	Pytorch	1.2.2 and later
pixel_link	Pixel-Link	TensorFlow	1.2.6 and later
east	EAST: An Efficient and Accurate Scene Text Detector	TensorFlow	1.2.6 and later

Text recognition

Model	Reference	Exported From	Supported Ailia Version
etl	Japanese Character Classification	Keras	1.1.0 and later
deep-text-recognition-benchmark	deep-text-recognition-benchmark	Pytorch	1.2.6 and later
crnn.pytorch	Convolutional Recurrent Neural Network	Pytorch	1.2.6 and later
paddleocr	PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle	Pytorch	1.2.6 and later

Commercial model

Model	Reference	Exported From	Supported Ailia Version
acculus-pose	Acculus, Inc.	Caffe	1.2.3 and later

Other languages

unity version

c++ version

Name		Name	Last commit message	Last commit date
Latest commit History 2,163 Commits
.vscode		.vscode
action_recognition		action_recognition
anomaly_detection/padim		anomaly_detection/padim
audio_processing		audio_processing
commercial_model/acculus-pose		commercial_model/acculus-pose
crowd_counting		crowd_counting
deep_fashion		deep_fashion
demo/dms		demo/dms
depth_estimation		depth_estimation
face_detection		face_detection
face_identification		face_identification
face_recognition		face_recognition
frame_interpolation/flavr		frame_interpolation/flavr
generative_adversarial_networks		generative_adversarial_networks
hand_detection		hand_detection
hand_recognition		hand_recognition
image_captioning		image_captioning
image_classification		image_classification
image_manipulation		image_manipulation
image_segmentation		image_segmentation
line_segment_detection/mlsd		line_segment_detection/mlsd
neural_language_processing		neural_language_processing
object_detection		object_detection
object_tracking		object_tracking
point_segmentation/pointnet_pytorch		point_segmentation/pointnet_pytorch
pose_estimation		pose_estimation
rotation_prediction/rotnet		rotation_prediction/rotnet
scripts		scripts
style_transfer		style_transfer
super_resolution		super_resolution
text_detection		text_detection
text_recognition		text_recognition
util		util
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
TUTORIAL.md		TUTORIAL.md
launcher.py		launcher.py
requirements.txt		requirements.txt

hobein/ailia-models

Folders and files

Latest commit

History

Repository files navigation