https://github.com/labcontext/text-to-image-with-SNGAN-and-WGAN (=== almost same git ===) https://github.com/wooramkang/TEXT-TO-IMAGE-GANs-V.RESEARCH- (=== almost same git ===)
papers i have red for image caption, image description
A. design neural networks and object detection
- Going deeper with convolutions
- Rethinking the Inception Architecture for Computer Vision
- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
- Deep Residual Learning for Image Recognition
- Residual Networks are Exponential Ensembles of Relatively Shallow Networks
- Speed/accuracy trade-offs for modern convolutional object detectors
- Dropout: A simple way to prevent neural networks from overfitting
B. Viusal semantic embedding
- Deep Visual-Semantic Alignments for Generating Image Descriptions
- Order-embeddings of images and language
- Unifying visual-semantic embeddings with multimodal neural language models
- Multimodal convolutional neural networks for matching image and sentence
C. Image caption
- Show and Tell: a neural image caption generator
- Show, Adapt and Tell: Adversarial training of cross-domain image captioner
- Show, Attend and Tell: Neural image caption generation with visual attention
D. Reinforcement learning
- Deep Reinforcement learning-based image captioning with embedding reward
E. image segmentation
- learning deconvolution network for semantic segmentation
F. unsupervised learning, Deep Generative Model
- Building high-level features using large scale unsupervised learning
- Auto-encoding variational bayes
- Generative adversarial nets
- Unsupervised representation learning with deep convolutional generative adversarial networks
- DRAW: A recurrent neural network for image generation.
- Pixel recurrent neural networks
- Conditional image generation with PixelCNN decoders
G. image attention machanism
- Show, Attend and Tell: Neural image caption generation with visual attention
- Generating images from captions with attention
- Self-Attention Generative Adversarial Networks
- Bottom-up and top-down attention for image captioning and visual question answering
- Generative Image Inpainting with Contextual Attention
- Watch What You Just Said: Image Captioning with Text-Conditional Attention
- Aligning where to see and what to tell: image caption with region-based attention and scene factorization
- Self-Attention Generative Adversarial Networks
H. generative adversarial nets
- generative-adversarial-nets
- Unsupervised representation learning with deep convolutional generative adversarial networks
- Least Squares Generative Adversarial Networks
- Semi-Supervised Learning with Generative Adversarial Networks
- Conditional Generative Adversarial Nets
- Conditional Image Synthesis with Auxiliary Classifier GANs
- Unpaired Image-to-Image Translation
- StackGAN, Text to Photo-realistic Image Synthesis
- Wang Stacked Conditional Generative CVPR_2018_paper
- Mueller GANerated Hands for_CVPR_2018_paper
- chang PairedCycleGAN Asymmetric Style CVPR_2018_paper
- Finding Tiny Faces in the Wild with Generative Adversarial Network
- Wei Person Transfer GAN CVPR_2018_paper