A few thoughts about the evolution. In a population of 50 neural networks, how do I select which one decides on the next motor action?
So basically, the environment will be reset for each individual, and that individual will be the only one choosing actions during its trial.
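To make the per-individual trial concrete, here is a minimal sketch of how that loop might look. Everything in it except the population size of 50 is an assumption for illustration: `ToyEnv` is a made-up one-dimensional environment, `Individual` is a single linear unit standing in for a real neural network, and `STEPS_PER_TRIAL` and the reward are arbitrary choices.

```python
import random

POPULATION_SIZE = 50   # from the text
STEPS_PER_TRIAL = 20   # assumption: fixed trial length


class ToyEnv:
    """Hypothetical 1-D environment: the agent should move toward position 0."""

    def reset(self):
        # Every trial starts from the same state.
        self.pos = 5.0
        return self.pos

    def step(self, action):
        # Clamp the motor action to [-1, 1] and move.
        self.pos += max(-1.0, min(1.0, action))
        reward = -abs(self.pos)  # closer to 0 is better
        return self.pos, reward


class Individual:
    """Stand-in for a neural network: one linear unit with random weights."""

    def __init__(self, rng):
        self.w = rng.uniform(-1.0, 1.0)
        self.b = rng.uniform(-1.0, 1.0)

    def act(self, obs):
        return self.w * obs + self.b


def evaluate(individual, env, steps=STEPS_PER_TRIAL):
    """Run one trial in which only this individual picks the actions."""
    obs = env.reset()  # fresh environment for this individual
    total_reward = 0.0
    for _ in range(steps):
        obs, reward = env.step(individual.act(obs))
        total_reward += reward
    return total_reward


rng = random.Random(0)
env = ToyEnv()
population = [Individual(rng) for _ in range(POPULATION_SIZE)]

# One trial per individual; the total reward becomes its fitness.
fitness = [evaluate(ind, env) for ind in population]
best = max(range(POPULATION_SIZE), key=fitness.__getitem__)
```

In this toy setup each individual starts from the same state, so the fitness values can be compared directly when picking parents for the next generation.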
Question: if I reset the environment for each trial, won't that affect the learning?