NUS ME5406 Deep Learning for Robotics Project 1
Before run this project, please make sure that you have installed these packages in your environment:
- torch (only used to help plot figures of results)
- numpy
- matplotlib
- os
After git clone or download this project, please run main.py
file.
There are three reinforcement learning algorithms (Monte Carlo Control, SARSA and Q-learning), two size of maps (
For example:
python main.py --task SARSA --map_size 4 --epsilon 0.1 --gamma 0.9 --time 10000
python main.py --task Monte_Carlo --map_size 4 --epsilon 0.1 --gamma 0.9 --time 10000
python main.py --task Q-learning --map_size 4 --epsilon 0.1 --gamma 0.9 --time 50000
The options for task
are Monte_Carlo
, SARSA
and Q-learning
, for map_size
is 4
and 10
.
epsilon
, gamma
and time
don't have fixed options, but if you want to try other parameters, please make sure that the parameters you choose are reasonable.
After running the code, it will generate three figures showing the evaluation results of average step, numbers of successful and failed episodes bar and average reward.
For example:
Run the code:
python main.py --task SARSA --map_size 4 --epsilon 0.1 --gamma 0.9 --time 10000
It will generate three evalution figures:
Average step:
Number bar:
Average reward: