Hi Related to RL Project #1

gaoyuankidult · 2014-01-19T16:02:39Z

Hi Everyone

The course webpage [2] says that 26.1 is the deadline for choosing a topic. I hope we can start the project well before that.

We are planning to base the project on a paper.
The name of this paper is

Playing Atari with Deep Reinforcement Learning [1]

The paper is mainly about an algorithm called Deep Q-Networks (DQN), which is really just a fancy name for combining a variation of Q learning with a convolutional neural network (CNN).

I hope we can all go through the paper first. I think we should at least know:

Atari[3]
Q learning

For Q learning, you can read the wiki and then the section "Q-Learning Using Matlab" of this article [4].

The project will be run as small-team work. This GitHub repo (git@github.com:gaoyuankidult/DRL-AI.git) will be used (please join the repo). Questions and discussions can be posted on the GitHub thread.
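As a quick refresher on the Q learning item above, here is a minimal sketch of the standard tabular Q-learning update. It is not the DQN from the paper (which replaces the table with a CNN), and the function name `q_update`, the state and action labels, and the values of `alpha` and `gamma` are only assumptions chosen for illustration.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9, done=False):
    """Apply one tabular Q-learning update and return the new Q(state, action).

    q       : dict mapping (state, action) -> estimated value
    actions : all actions available in next_state
    alpha   : learning rate (illustrative default)
    gamma   : discount factor (illustrative default)
    done    : True if next_state is terminal (no future reward)
    """
    # Value of the best action in the next state; zero if the episode ended.
    best_next = 0.0 if done else max(q[(next_state, a)] for a in actions)
    # Target: immediate reward plus discounted best future value.
    target = reward + gamma * best_next
    # Move the current estimate a fraction alpha towards the target.
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-state example, just to show the call.
q_table = defaultdict(float)
q_update(q_table, "s0", "right", 1.0, "s1", ["left", "right"], alpha=0.5)
```

A `defaultdict(float)` keeps unseen state-action pairs at zero, which is a common but arbitrary initialisation.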

### Discussion of Next Meeting

Understand CNN (nice work done by yaolu [5])
Environment of Project
Licenses
Language and Architecture of Project

If you have other topics, please inform all members.

The proposed meeting time is 9 pm Tuesday in ida. It should take about two and a half hours (it can be shortened if the goals are achieved).

[1] http://arxiv.org/pdf/1312.5602v1.pdf
[2] http://www.cs.helsinki.fi/en/courses/58314105/2014/k/s/1
[3] http://yavar.naddaf.name/ale/
[4] http://pcframe.net/bbs/zboard.php?id=scrap&page=1&sn1=&divpage=1&sn=off&ss=on&sc=on&select_arrange=headnum&desc=asc&no=4
[5] https://github.com/yaolubrain/cnn_linear_max

yaolubrain · 2014-01-19T16:44:43Z

Nice job, Yuan! The following is the best tutorial on CNNs for object recognition. Please read it.
http://www.cs.toronto.edu/~ranzato/publications/ranzato_cvpr13.pdf

gaoyuankidult · 2014-01-20T12:14:00Z

Hi everyone. It seems everyone is free around 9 pm tomorrow. We will meet at 9 pm at one member's home, preferably mine. The place can be changed if things do not go as planned.

Br

yaolubrain · 2014-01-20T12:19:20Z

no problem!

Yao Lu
Department of Computer Science,
University of Helsinki, Finland

gaoyuankidult · 2014-01-22T16:28:24Z

Hi Yao

Under the pressure of the course, Nick thinks he should work on another reinforcement learning task. As a consequence, the two of us will build this project. I will take over Nick's responsibilities. Let's meet next week for the project.

gaoyuankidult · 2014-02-17T09:03:04Z

Video Output Format:

A 2D array of 7-bit pixels, 160 pixels wide by 210 pixels high.

The data is sent in this order: first the 128 bytes of RAM (taking values in 0–255), then the 33,600 screen pixels (taking values in 0–127). The screen is provided in row order, i.e. beginning with the 160 pixels that compose the first row.
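The layout above can be sketched as a small parser. This assumes the frame has already been decoded into a flat sequence of integers; how the values are encoded on the wire is not covered here. The name `split_frame` and the constants are hypothetical and chosen only for this example.

```python
RAM_SIZE = 128          # bytes of RAM, each in 0..255
SCREEN_WIDTH = 160      # pixels per row
SCREEN_HEIGHT = 210     # number of rows
NUM_PIXELS = SCREEN_WIDTH * SCREEN_HEIGHT  # 33,600 pixels, each in 0..127


def split_frame(values):
    """Split one flat frame into (ram, screen).

    values : sequence of RAM_SIZE + NUM_PIXELS integers, RAM first,
             then the screen in row order.
    Returns the RAM as a list of 128 ints and the screen as a list of
    210 rows, each a list of 160 pixel values.
    """
    values = list(values)
    if len(values) != RAM_SIZE + NUM_PIXELS:
        raise ValueError("expected %d values, got %d"
                         % (RAM_SIZE + NUM_PIXELS, len(values)))
    ram = values[:RAM_SIZE]
    pixels = values[RAM_SIZE:]
    if any(not 0 <= b <= 255 for b in ram):
        raise ValueError("RAM value outside 0..255")
    if any(not 0 <= p <= 127 for p in pixels):
        raise ValueError("pixel value outside 0..127 (7-bit)")
    # Row order: the first 160 pixels form row 0, the next 160 row 1, etc.
    screen = [pixels[r * SCREEN_WIDTH:(r + 1) * SCREEN_WIDTH]
              for r in range(SCREEN_HEIGHT)]
    return ram, screen
```

With this shape, `screen[row][col]` addresses a pixel directly, which should make it easy to crop or downsample frames before feeding them to the CNN.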

gaoyuankidult · 2014-03-05T14:41:51Z

Hello everyone

What is the situation now?

In this project, we use a relatively loose cooperative style.
But we also need to keep up with our goal.

Our final goal is to present the results with a detailed comparison.
To do that, we need to go through the following steps:

Working prototypes for DQN and HyperNEAT
Further enhanced versions (comparable with each other)

As you may already know, my current status is that I can provide a fully functioning environment that outputs a suitable picture stream and the instantaneous reward.

What is your current status on the project?

Cheers

yaolubrain · 2014-03-05T15:23:09Z

I think I can finish the CNN in C++ in at most two weeks. But I will try to
get it done in a week.

jhb86253817 · 2014-03-05T15:36:46Z

I am now working on NEAT. If things go well, it can be done by this
weekend. Then I will continue with HyperNEAT, which is based on NEAT; this
may require another week.

Haibo Jin

gaoyuankidult · 2014-03-05T16:23:48Z

Great, guys! You are all excellent collaborators!

gaoyuankidult · 2014-03-05T18:09:30Z

Let us keep working on this project and produce an impressive result!

gaoyuankidult closed this as completed Feb 4, 2015
