This AI Can Solve 604 Tasks [Paper Analysis of Gato by DeepMind]

Valerio Velardo - The Sound of AI منتشر شده در تاریخ 1401/02/26

6.3 هزار بار بازدید - 2 سال پیش - DeepMind published a revolutionary paper

DeepMind published a revolutionary paper 🔥 They introduced Gato, a generalist AI agent that can carry out more than 600 tasks with a single transformer neural architecture. The tasks are varied, from playing Atari games to providing captions to images.

This paper demonstrates that:

📌 Generalist agents can perform reasonably well on many tasks / embodiments / modalities
📌 Generalist agents have the potential to learn new tasks with few data points
📌 By scaling up the parameter size, we can build a general-purpose agent

This work shocked me. I’ve always tackled AI from the perspective of Narrow Intelligence: build a specialised model that does well on a single - quite constrained - task.

👉 Gato paves the way for Artificial General Intelligence (AGI). In so doing, it opens new ethical dilemmas that should at least spark discussions in the AI community.

Since I’ve finished reading this paper, I can’t stop asking a question: is it ethical to push this research line given the grave dangers which may come with quasi-AGI agents?

Would you like to learn more? Check my last video, where I provide a breakdown of the paper, and analyse its ethical implications.

A Generalist Agent by DeepMind:
https://www.deepmind.com/publications...

===============================

Interested in hiring me as a consultant/freelancer?
https://valeriovelardo.com/

Join The Sound Of AI Slack community:
https://valeriovelardo.com/the-sound-...

Connect with Valerio on Linkedin:
LinkedIn: valeriovelardo

Follow Valerio on Facebook:
Facebook: TheSoundOfAI

Follow Valerio on Twitter:
Twitter: musikalkemist

===============================

Content:

0:00 Intro
1:11 General vs Narrow intellicence
3:06 Research hypotheses
4:25 Idea to approach AGI
8:23 Benefits of single network for many tasks
10:04 Datasets used
11:59 Data preparation
18:24 Model architecture
20:31 Training
22:48 Loss function
27:25 Recognising a task
30:50 Inference
33:37 How does the model perform?
39:00 Scale analysis
40:18 Can the model tackle unseen tasks?
44:14 Key discoveries
47:40 Ethical implications

2 سال پیش در تاریخ 1401/02/26 منتشر شده است.

6,399 بـار بازدید شده

... بیشتر