


[Original film/TV review] AlphaGo on YouTube

一条大鱼


Posted on 2020-4-23 23:40

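This article was written or reposted by 一条大鱼 and does not represent the position or views of this site. Copyright belongs to oursteps.com.au and the author, 一条大鱼. Reposts must credit the author, the source, and this notice, and must keep the content intact.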

This post was last edited by 一条大鱼 on 2020-4-23 22:42

The documentary about AlphaGo on YouTube is an exciting masterpiece. To me, the three-layer structure of AlphaGo's algorithm means more than a piece of computer programming.

The algorithm has three layers, as described at around 47:20 in the clip:
1. The policy network is trained on high-level games to imitate those Go players.
2. The value network evaluates a board position to tell what the probability of winning is from that particular position.
3. Tree search tries to figure out what would happen in the future.
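
For anyone curious how the three layers fit together, here is a toy sketch in Python. It is only an illustration under my own assumptions, not AlphaGo's actual code: the real system uses neural networks and Monte Carlo tree search on Go, while this uses hand-written stand-ins and a plain depth-limited lookahead on a made-up stone-taking game (take 1 to 3 stones per turn, whoever takes the last stone wins). The names `legal_moves`, `policy`, `value` and `search` are hypothetical.

```python
# Toy illustration only: hand-written stand-ins, not neural networks,
# and a plain depth-limited search rather than Monte Carlo tree search.
# Assumed game: players alternately take 1-3 stones from a pile;
# whoever takes the last stone wins.

def legal_moves(stones):
    """Moves available with `stones` left in the pile."""
    return [m for m in (1, 2, 3) if m <= stones]

def policy(stones):
    """Stand-in for the policy network: a prior over moves.
    Here it is simply uniform over the legal moves."""
    moves = legal_moves(stones)
    return {m: 1.0 / len(moves) for m in moves}

def value(stones):
    """Stand-in for the value network: estimated probability that the
    player to move wins from this position (a constant, crude guess)."""
    return 0.5

def search(stones, depth):
    """Stand-in for tree search: look `depth` plies ahead and return
    (win probability for the player to move, best move)."""
    if stones == 0:
        return 0.0, None            # the previous player took the last stone
    if depth == 0:
        return value(stones), None  # out of lookahead: trust the value guess
    best_p, best_move = -1.0, None
    # Try moves in order of policy prior, highest first.
    for move, _prior in sorted(policy(stones).items(), key=lambda kv: -kv[1]):
        opp_p, _ = search(stones - move, depth - 1)
        p = 1.0 - opp_p             # the opponent's win chance is our loss chance
        if p > best_p:
            best_p, best_move = p, move
    return best_p, best_move
```

With enough depth, `search(5, 5)` picks the move that leaves the opponent 4 stones, a position this toy game cannot be won from. With a shallow depth such as `search(6, 1)`, the answer is only as good as the `value` guess at the leaves, which is roughly why a better value estimate matters.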

This algorithm is very similar to the thinking plan followed by a great player.
1. Using the policy network, he read all the books in his hometown's library to imitate the existing great players when he was only a child.
2. Using the valuation network, he knows where to fish, and his business partner knows it in reverse.
3. Using tree search, he figures out what probably won't change in the future.

The difference is that he figured the whole plan out 50 years ago. Luckily, he is still well known to most of us.

A road grows wider because more people walk it; it is not that more people walk it because the road grew wider.
...pursuit; ...love；...desire.
