Adding algorithm TD Learning with N-Tuple Networks for 2048 #1107

Jazeem · 2023-08-09T12:29:53Z

An algorithm that can win at 2048 without using tree search or hand crafted state evaluator functions.

Currently the code is somewhat coupled with 2048. It is possible to decouple it, if that is valuable for open_spiel. Will have to find a way to move n_tuples into game.

Have modified the game 2048 to consider a move that doesn't change the board as invalid

Implementation is similar to https://github.com/moporgic/TDL2048-Demo

lanctot · 2023-08-09T20:15:19Z

Cool! We had quite a big refactor committed to the master branch yesterday. Could you pull changes from master and commit the merge (or rebase)?

We basically just moved all the games into subdirectories.

I cannot run the tests while there are conflicts.

Jazeem · 2023-08-10T05:37:38Z

Sure. Rebased now 👍🏻

lanctot

Can you lint your code by following Step 9 in the Adding a New Game on this page? https://github.com/deepmind/open_spiel/blob/master/docs/developer_guide.md

It greatly reduced the load on our end to have the code formatted as required by Google python standards.

E.g. indentation should be two spaces, not four. Etc.

open_spiel/python/examples/2048_td_n_tuple_network.py

…etwork.py

Jazeem · 2023-09-01T06:46:35Z

@lanctot

Used pylint and formatted the code
Introduced a class NTupleNetwork and added a brief description about what N-Tuples are and a citation to a IEEE paper that this implementation is based on
The output of the code is very straightforward similar to breakthrough_dqn.py in that for every 1000 runs; the code prints what's the average score reached, max score and largest tile that was unlocked. Running the code you can see the average scores increasing indicating that learning is happening. Once the 2048 tile is unlocked, the agent is able to win the game

lanctot · 2023-09-05T12:53:45Z

Hi @Jazeem,

Can you respond to the conversations above (or resolve them now if they are resolved)

Jazeem · 2023-09-07T09:39:36Z

@lanctot

Have resolved them

Jazeem force-pushed the ntuple branch from e7bbde5 to 4b54abd Compare August 10, 2023 04:36

lanctot requested changes Aug 28, 2023

View reviewed changes

Jazeem added 11 commits August 31, 2023 16:11

Move considered not a legal action if it does not change the board

7e946c5

Added TD Learning algorithm with N-Tuple Networks for 2048

626915e

Modified tests for 2048

a25e35b

Minor changes

0275a4b

Minor changes

b2e799b

Moved 2048_td_n_tuple_network.py from algorithms to examples

c478705

Variable renames

e8991bf

Fixed line lengths going above 80 chars

ca94472

Fixed code formatting issues

1fdd278

class NTupleNetwork introduced

ae94452

2048_td_n_tuple_network.py renamed to twenty_forty_eight_td_n_tuple_n…

68f6c73

…etwork.py

Jazeem force-pushed the ntuple branch from e3416be to 68f6c73 Compare August 31, 2023 11:13

New line added

91f43a6

lanctot approved these changes Sep 7, 2023

View reviewed changes

lanctot added imported This PR has been imported and awaiting internal review. Please avoid any more local changes, thanks! merged internally The code is now submitted to our internal repo and will be merged in the next github sync. labels Sep 7, 2023

lanctot merged commit 2847cef into google-deepmind:master Sep 12, 2023
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding algorithm TD Learning with N-Tuple Networks for 2048 #1107

Adding algorithm TD Learning with N-Tuple Networks for 2048 #1107

Jazeem commented Aug 9, 2023 •

edited

Loading

lanctot commented Aug 9, 2023

Jazeem commented Aug 10, 2023

lanctot left a comment

Jazeem commented Sep 1, 2023

lanctot commented Sep 5, 2023

Jazeem commented Sep 7, 2023

Adding algorithm TD Learning with N-Tuple Networks for 2048 #1107

Adding algorithm TD Learning with N-Tuple Networks for 2048 #1107

Conversation

Jazeem commented Aug 9, 2023 • edited Loading

lanctot commented Aug 9, 2023

Jazeem commented Aug 10, 2023

lanctot left a comment

Choose a reason for hiding this comment

Jazeem commented Sep 1, 2023

lanctot commented Sep 5, 2023

Jazeem commented Sep 7, 2023

Jazeem commented Aug 9, 2023 •

edited

Loading