Windy Gridworld

This repository contains my solution to the windy gridworld problem from Sutton & Barto. It is a gridworld with a crosswind running upward through the middle of the grid. The actions are the standard four: up, down, right, and left. The goal is to reach the end position from the start.

The strength of the wind is given below each column. If you are in a column with wind strength 2 and you choose to move right, you move 2 cells up and 1 cell right. The wind was randomly initialized, and 1000 simulations were run for each algorithm, with different wind speeds each time.
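The wind rule above can be sketched as a single transition function. This is only an illustration, not the code in main.cpp: the names `Position`, `Action`, and `step` are hypothetical, the 7x10 grid size is taken from the textbook example rather than from this repository, and clamping at the grid border is an assumption about how out-of-bounds moves are handled.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>

// Hypothetical names and grid size (7x10 as in the textbook example);
// the repository's main.cpp may be organised differently.
constexpr int kRows = 7;
constexpr int kCols = 10;

struct Position {
    int row;  // row 0 is the top of the grid
    int col;
};

enum class Action { Up, Down, Left, Right };

// Apply one move: the chosen action plus the upward push of the wind in the
// column the agent starts from. The result is clamped to stay on the grid
// (an assumption about border handling).
constexpr Position step(Position p, Action a,
                        const std::array<int, kCols>& wind) {
    int d_row = 0;
    int d_col = 0;
    switch (a) {
        case Action::Up:    d_row = -1; break;
        case Action::Down:  d_row = 1;  break;
        case Action::Left:  d_col = -1; break;
        case Action::Right: d_col = 1;  break;
    }
    const int push = wind[static_cast<std::size_t>(p.col)];
    const int row = std::clamp(p.row + d_row - push, 0, kRows - 1);
    const int col = std::clamp(p.col + d_col, 0, kCols - 1);
    return Position{row, col};
}
```

With a wind of 2 in the current column, `step` for `Action::Right` moves the agent two rows up and one column right, matching the example in the text.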

Algorithms used

On-policy SARSA and off-policy Q-learning were used. To compare the two, I plotted episode length against the number of episodes. The lower the episode length, the faster the agent reached the goal. I also experimented with a few features of C++20.
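The two algorithms differ only in the target they bootstrap from, which a minimal sketch can make concrete. The function names, the `ActionValues` alias, and the parameters `alpha` (step size) and `gamma` (discount) are assumptions for illustration and are not taken from main.cpp.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>

// Hypothetical layout: one array of action values per state.
constexpr std::size_t kActions = 4;
using ActionValues = std::array<double, kActions>;

// SARSA (on-policy): the target uses the action actually selected in the
// next state, so exploration affects the learned values.
inline void sarsa_update(ActionValues& q_s, std::size_t a, double reward,
                         const ActionValues& q_next, std::size_t a_next,
                         double alpha, double gamma) {
    const double target = reward + gamma * q_next[a_next];
    q_s[a] += alpha * (target - q_s[a]);
}

// Q-learning (off-policy): the target uses the greedy (maximum) action value
// in the next state, regardless of which action is taken next.
inline void q_learning_update(ActionValues& q_s, std::size_t a, double reward,
                              const ActionValues& q_next,
                              double alpha, double gamma) {
    const double best = *std::max_element(q_next.begin(), q_next.end());
    const double target = reward + gamma * best;
    q_s[a] += alpha * (target - q_s[a]);
}
```

In both functions the target is computed before the write, so the update stays correct when `q_s` and `q_next` refer to the same state, which can happen if a move leaves the agent in place.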

To Build (dependency fetched automatically, no installation required)

cmake -B build
cmake --build build
./build/out

warg-void/Windy-Gridworld

Folders and files

Latest commit

History

Repository files navigation

Windy Gridworld

Algorithms used

To Build (dependency fetched automatically, no installation required)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages