Projeto Final - 2015.2

Aprendizado por Reforço para Jogar Tetris

O objetivo desse projeto é criar um jogador inteligente de Tetris.

Para isso é utilizado o conceito de aprendizado por reforço, onde a máquina aprende algo sem nenhuma assisteência externa, ou seja, dada apenas um valor para a derrota e outro para a vitória, que no caso é chegar a 200 movimentos sem perder, o algoritmo aprende a jogar por si só, escolhendo novas jogadas tanto para explorar novas oportunidades quanto para aproveitar oportunidades já identificadas como positivas. Ao final desse processo de treinamento, temos uma polītica que consegue escolher um tabuleiro dentre uma lista de opções, e portanto sabe como jogar Tetris!

Reinforcement Learning to play Tetris

The goal of this project is to create an intelligent Twtris player.

To make this possible, the reinforcement learning technique is applied. In this technique the machine learns about the game without any outside assistance, given only an arbitrary value for losing the game and another one for winning it, the algorithm learns to play by itself. It chooses new plays based on an explore/exploit technique where it tries to find the best possible plays by itself. By the end of this training, we come up with a policy which is able to choose a specific Tetris board from a list of avaiable boards and thus knows how to play Tetris!

Fontes utilizadas / Sources:

https://webdocs.cs.ualberta.ca/~sutton/book/the-book.html

http://www.cs.cmu.edu/afs/cs/project/ACRL/www/TetrisReports/Breelyn_Eric_Don_Project.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
policies		policies
tetris		tetris
.gitignore		.gitignore
README.md		README.md
inputs.txt		inputs.txt
inputs2.txt		inputs2.txt
main.go		main.go
projeto_final		projeto_final
resultados.csv		resultados.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

policies

policies

tetris

tetris

.gitignore

.gitignore

README.md

README.md

inputs.txt

inputs.txt

inputs2.txt

inputs2.txt

main.go

main.go

projeto_final

projeto_final

resultados.csv

resultados.csv

Repository files navigation

Projeto Final - 2015.2

Aprendizado por Reforço para Jogar Tetris

Reinforcement Learning to play Tetris

About

Releases

Packages

Languages

hbejgel/projeto_final

Folders and files

Latest commit

History

Repository files navigation

Projeto Final - 2015.2

Aprendizado por Reforço para Jogar Tetris

Reinforcement Learning to play Tetris

About

Resources

Stars

Watchers

Forks

Languages