Dota 2 Reinforcement Learning
2.3K views -
7 years ago
-
The challenge of creating a bot for Dota 2 lies in the vast amount of information available at every frame and the continuous space of possible actions.
This video shows one model I am testing: I discretize the actions into 8 movement directions plus 1 hold, and use only the (x, y) offsets of the creeps relative to the hero as state input. The objective is to block the creeps as much as possible. Every 5 episodes, I use a hardcoded bot to "bootstrap" the training, in an effort to get the model to learn faster.
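The action discretization and state encoding described above can be sketched as follows. This is an illustrative reconstruction, not code from the linked repo; the function names, the step size, and the direction ordering are all assumptions.

```python
import math

NUM_ACTIONS = 9  # 8 movement directions + 1 hold

def action_to_offset(action: int, step: float = 1.0):
    """Map a discrete action index to a (dx, dy) movement offset.

    Actions 0-7 are 8 evenly spaced compass directions; action 8 is hold.
    """
    if action == 8:  # hold position
        return (0.0, 0.0)
    angle = action * (2 * math.pi / 8)
    return (step * math.cos(angle), step * math.sin(angle))

def make_state(hero_xy, creep_xys):
    """State = (x, y) offsets of each creep relative to the hero, flattened."""
    hx, hy = hero_xy
    state = []
    for cx, cy in creep_xys:
        state.extend([cx - hx, cy - hy])
    return state
```

Keeping the state relative to the hero makes the policy translation-invariant: the model sees the same input wherever on the map the blocking happens.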
I trained this model using actor-critic, with a value network and a policy network. Both networks take the state data as input. The value network outputs a single value estimating the score of a given state, and the policy network outputs 9 values estimating the probability of each action in that state.
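A minimal one-step advantage actor-critic update matching that description might look like the sketch below. Linear function approximators stand in for the actual networks, and every hyperparameter and name here is illustrative, not taken from the repo.

```python
import numpy as np

rng = np.random.default_rng(0)
STATE_DIM = 8        # e.g. (x, y) offsets of 4 creeps
NUM_ACTIONS = 9      # 8 directions + 1 hold

W_policy = rng.normal(scale=0.1, size=(NUM_ACTIONS, STATE_DIM))
w_value = np.zeros(STATE_DIM)

def softmax(z):
    z = z - z.max()          # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def policy(state):
    return softmax(W_policy @ state)   # 9 action probabilities

def value(state):
    return float(w_value @ state)      # scalar state-value estimate

def update(state, action, reward, next_state, done,
           gamma=0.99, lr_v=1e-2, lr_p=1e-3):
    """One-step advantage actor-critic update (illustrative)."""
    global W_policy, w_value
    target = reward + (0.0 if done else gamma * value(next_state))
    advantage = target - value(state)
    # Critic: move the value estimate toward the TD target.
    w_value += lr_v * advantage * state
    # Actor: policy-gradient step on the log-probability of the taken
    # action, weighted by the advantage.
    probs = policy(state)
    grad_logits = -probs
    grad_logits[action] += 1.0         # d log pi(a|s) / d logits
    W_policy += lr_p * advantage * np.outer(grad_logits, state)
```

The advantage (TD target minus current value estimate) is what couples the two networks: the critic's error signal scales the actor's gradient step.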
Link to Thread: http://dev.dota2.com/showthread.php?t...
Link to Code: https://github.com/BeyondGodlikeBot/C...
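The "every 5 episodes" bootstrapping schedule mentioned above could be as simple as alternating which controller drives the hero; whether the scripted bot's trajectories feed the same update rule is my assumption, and the names below are made up for illustration.

```python
def choose_controller(episode: int) -> str:
    """Every 5th episode, let the hardcoded blocking bot act so its
    trajectories can seed training; otherwise use the learned policy."""
    return "hardcoded_bot" if episode % 5 == 0 else "learned_policy"
```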
Published 1396/05/30 (August 21, 2017)
2,302 views