Skip to content

Assignments and Labs for Behavioral & Cognitive Robotics Course in Innopolis University for Spring Semester 2020

License

Notifications You must be signed in to change notification settings

Zikt/BC_IURobotics_Spring20

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BC_IURobotics_Spring20

Assignments and Labs for Behavioral & Cognitive Robotics Course in Innopolis University for Spring Semester 2020

The guide, instructions for this course and other materials can be found in the repository here. Course Instructor - Prof. Stefano Nolfi.

All the software required is available ready to be used through the docker container docker container prepared by Vladislav Kurenkov that can be pulled, built and run. Check the repository above for further instructions. Learn more about Gym

Lecture notes here


Exercise 1

Getting used to working with Gym. Testing and learning how to use gym environments.

Exercise 2

Implement Evolutionary Hill-Climber with simple Neural Network and train it.

Ex 2a: The task is to implement a neural network controller for a Gym problem (the CartPole-v0). I initialized the network with random parameters, and evaluated the neuro-agent for 10 evaluation episodes each lasting 200 steps and printed the toal sum of rewards. Since the cartPole-v0 problem returns a reward of 1 for each step in which the pole is balanced, the fitness(total reward) corresponds to the total number of steps in which the agent manages to keep the pole balanced. As expected, the agent did not balance the pole for many steps by using a policy with random parameters.

Next I implemented the steady state evolutionary strategy (Pagliuca, Milano and Nolfi, 2018) as described by the given pseudo-code. Note that When the population includes only 2 individuals, as in the described example, the algorithm is equivalent to a stochastic hill-climber that operates on a single candidate solution by (i) adding random variation to the parameters, and (ii) by retaining or discarding the variations depending on whether the addition of variations produce an increase or a decrease of the fitness, respectively.

Ex 2b:

Exercise 3

About

Assignments and Labs for Behavioral & Cognitive Robotics Course in Innopolis University for Spring Semester 2020

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages