Show simple item record

dc.creatorKevin Robb
dc.date.accessioned2020-05-26T01:54:02Z
dc.date.available2020-05-26T01:54:02Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/11244.46/1547
dc.description.abstractReinforcement learning depends on agents being learning individuals, and when agents rely on their instincts rather than gathering data and acting accordingly, the population tends to be less successful than a true RL population. ÒRiskinessÓ is the elementary metric for determining how willing to rely on learning an individual or a population is. With a high learning parameter, as we denote riskiness in this paper, agents find the safest option and seldom deviate from it, essentially using learning to become a non-learning individual. With a low learning rate, agents ignore recency entirely and seek out the highest reward, regardless of the risk. We attempt in this paper to evolve this Òrisk neutralityÓ in a population by adding a safe exploration nurturing period during which agents are free to explore without consequence. We discovered the environmental conditions necessary for our hypotheses to be mostly satisfied and found that nurturing enables agents to distinguish between two different risky options to evolve risk neutrality. Too long of a nurturing period causes the evolution to waver before settling on a path with essentially random results, while a short nurturing period causes a successful evolution of risk neutrality. The non-nurturing case evolves risk aversion by default as we expected from a reinforcement learning system, because agents are unable to distinguish between the good risk and bad risk, so they decide to avoid risks altogether.
dc.format.mediumText
dc.languageeng
dc.relation.ispartofseriesNo
dc.subjectUniversity Libraries Undergraduate Research Award
dc.titleOn Reinforcement Learning, Nurturing, and the Evolution of Risk Neutral
dc.typeArticle
dc.description.peerreviewNo


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record