{"id":1604,"date":"2020-07-16T19:57:51","date_gmt":"2020-07-16T19:57:51","guid":{"rendered":"http:\/\/techclot.com\/index.php\/2020\/07\/16\/deepminds-ai-has-learnt-how-to-become-highly-aggressive\/"},"modified":"2020-07-16T19:57:51","modified_gmt":"2020-07-16T19:57:51","slug":"deepminds-ai-has-learnt-how-to-become-highly-aggressive","status":"publish","type":"post","link":"https:\/\/techclot.com\/index.php\/2020\/07\/16\/deepminds-ai-has-learnt-how-to-become-highly-aggressive\/","title":{"rendered":"DeepMind&#8217;s AI has Learnt how to Become Highly Aggressive"},"content":{"rendered":"<p>Artificial intelligence changes the way it behaves based on the environment it is in, much like humans do, according to the <a target=\"_blank\" href=\"https:\/\/deepmind.com\/blog\/understanding-agent-cooperation\/\">latest research from DeepMind<\/a>.<\/p>\n<p>Computer scientists have studied how their AI behaves in social situations by using principles from game theory and social sciences. <strong>During the work, they found it is possible for AI to act in an &#8220;aggressive manner&#8221; when it feels it is going to lose out, but agents will work as a team when there is more to be gained.<\/strong><\/p>\n<p>For the research, the AI was tested on two games: a fruit gathering game and a Wolfpack hunting game. These are both basic, 2D games that used AI characters (known as agents) similar to those used in DeepMind&#8217;s original work with Atari.<\/p>\n<p>Within DeepMind&#8217;s work, the gathering game saw the systems trained using deep reinforcement learning to collect apples (represented by green pixels). 
When a player, or in this case an AI, collected an apple, it was rewarded with a &#8216;1&#8217; and the apple disappeared from the game&#8217;s map.<\/p>\n<p>To beat competitors, an agent can direct a &#8216;beam&#8217; at an opposing player. A player hit twice is removed from the game for a set period. The obvious way to beat an opponent is therefore to knock them out of the game and collect all the apples.<\/p>\n<p>Two agents, Red and Blue, collect apples (green) and occasionally tag the other agent (yellow lines).<\/p>\n<figure >\n<blockquote data-animation-role=\"quote\" data-animation-override><p>     <span>&#8220;<\/span>Intuitively, a defecting policy in this game is one that is aggressive \u2013 i.e., involving frequent attempts to tag rival players to remove them from the game.<span>&#8221;<\/span>   <\/p><\/blockquote>\n<\/figure>\n<p>After 40 million in-game steps, the researchers found that the agents learnt &#8220;highly aggressive&#8221; policies when resources (apples) were scarce and there was a possibility of a costly action (not getting a reward).<\/p>\n<figure >\n<blockquote data-animation-role=\"quote\" data-animation-override><p>     <span>&#8220;<\/span>Less aggressive policies emerge from learning in relatively abundant environments with less possibility for costly action. The greed motivation reflects the temptation to take out a rival and collect all the apples oneself.<span>&#8221;<\/span>   <\/p><\/blockquote>\n<\/figure>\n<p>In the second game, Wolfpack, two in-game characters acting as wolves chased a third character, the prey. 
If both wolves were near the prey when it was captured, they both received a reward.<\/p>\n<figure >\n<blockquote data-animation-role=\"quote\" data-animation-override><p>     <span>&#8220;<\/span>The idea is that the prey is dangerous, a lone wolf can overcome it, but is at risk of losing the carcass to scavengers.<span>&#8221;<\/span>   <\/p><\/blockquote>\n<\/figure>\n<p>Two wolves working together could protect the prey from scavengers and get a higher reward.<\/p>\n<p>(Red) wolves chase the blue dot while avoiding (grey) obstacles.<\/p>\n<p>   \t<a href=\"http:\/\/www.wired.co.uk\/article\/artificial-intelligence-social-impact-deepmind\" class=\"sqs-block-button-element--small sqs-block-button-element\" target=\"_blank\">Read More<\/a><br \/>\n<a rel=\"nofollow\" href=\"https:\/\/www.artificial-intelligence.blog\/news\/deepminds-ai-has-learnt-how-to-become-highly-aggressive\">2019 Artificial Intelligence News &#8211; AI News<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Artificial intelligence changes the way it behaves based on the environment it is in, 
much&#8230;<\/p>\n","protected":false},"author":1,"featured_media":2336,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3],"tags":[4355,230,719,1387,3251],"class_list":["post-1604","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","tag-aggressive","tag-become","tag-deepminds","tag-highly","tag-learnt"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/techclot.com\/wp-content\/uploads\/2020\/08\/4G1jb9.jpg?fit=500%2C325&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3orZX-pS","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/posts\/1604","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/comments?post=1604"}],"version-history":[{"count":0,"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/posts\/1604\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/media\/2336"}],"wp:attachment":[{"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/media?parent=1604"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/categories?post=1604"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/techclot.com\/index.php\/wp-json\/wp\/v2\/tags?post=1604"}],"cu
ries":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}