
Self attention for pooling linear classifier #28

Open · wants to merge 3 commits into master

Commits on Jan 6, 2019

  1. Self attention for pooling linear classifier

    This PR introduces a `BiAttentionPoolingClassifier` in the style of [Attention Is All You Need](https://arxiv.org/abs/1706.03762), following the discussion with @sebastianruder in Teams.
    
    I ran out of memory on my 1060 while testing the attention module, but was able to at least verify that it is functionally correct. Some changes may still be needed to ensure that the tensor passed to `self.layers` has the right shape (I'm not sure yet).
    
    I'll move everything over to Colab for testing and see if that helps. (A sketch of such an attention-pooling head appears after the commit list below.)
    aayux authored Jan 6, 2019 · d0c1547
  2. 8fe2891

Commits on Feb 14, 2019

  1. 03c4a2c
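
---

Since the diff itself isn't shown on this page, here is a minimal sketch of what a self-attention pooling head feeding a linear classifier might look like in PyTorch. Everything below is an illustrative assumption, not the PR's actual code: the class names (`SelfAttentionPooling`, `AttentionPoolingClassifier`), the single-head attention (the paper uses multi-head), and the concat-pooling choice are all placeholders around the idea described in the commit message.

```python
# A hedged sketch of self-attention as a pooling step before a linear
# classifier head. Names and dimensions are illustrative, not the PR's code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SelfAttentionPooling(nn.Module):
    """Scaled dot-product self-attention over encoder states, followed by
    concat pooling (last state, max over time, mean over time)."""

    def __init__(self, d_model: int):
        super().__init__()
        # Single-head projections; "Attention Is All You Need" uses multi-head.
        self.q = nn.Linear(d_model, d_model)
        self.k = nn.Linear(d_model, d_model)
        self.v = nn.Linear(d_model, d_model)
        self.scale = d_model ** 0.5

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model) -- encoder outputs
        q, k, v = self.q(x), self.k(x), self.v(x)
        attn = F.softmax(q @ k.transpose(1, 2) / self.scale, dim=-1)
        ctx = attn @ v  # (batch, seq_len, d_model)
        # Concat pooling: last attended state, max-pool, mean-pool.
        pooled = torch.cat(
            [ctx[:, -1], ctx.max(dim=1).values, ctx.mean(dim=1)], dim=1)
        return pooled  # (batch, 3 * d_model)


class AttentionPoolingClassifier(nn.Module):
    """Hypothetical stand-in for the PR's BiAttentionPoolingClassifier."""

    def __init__(self, d_model: int, n_classes: int):
        super().__init__()
        self.pool = SelfAttentionPooling(d_model)
        # `self.layers` must expect the pooled (batch, 3 * d_model) tensor,
        # which is the shape concern raised in the commit message.
        self.layers = nn.Sequential(
            nn.Linear(3 * d_model, 50), nn.ReLU(), nn.Linear(50, n_classes))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.layers(self.pool(x))


# Usage: a batch of 2 sequences, 10 timesteps, 400-dim encoder states.
if __name__ == "__main__":
    model = AttentionPoolingClassifier(d_model=400, n_classes=2)
    out = model(torch.randn(2, 10, 400))
    print(out.shape)  # torch.Size([2, 2])
```

The O(seq_len²) attention matrix is consistent with the out-of-memory report on a 1060: memory grows quadratically with sequence length, which is one reason to verify shapes and batch sizes on Colab before merging.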