Explaining the Segment Anything Model - Network architecture, Dataset, Training

Neural Breakdown with AVB
Neural Breakdown with AVB
17.8 هزار بار بازدید - پارسال - In this video, I dive
In this video, I dive deep into the technical details and architecture behind the Segment Anything Model, also known as SAM. SAM is the world's first foundation model on image segmentation and is an amazing tool that can segment any image provided to it at multiple nested levels of granularity at interactive latency.

#deeplearning #computervision #machinelearning

To support the channel and access the Word documents/slides used in this video,  consider JOINING the channel on Youtube or Patreon. Members get access to scripts, slides, animations, and illustrations for most of the videos on my channel!
Join and support the channel - https://www.seevid.ir/c/avb_fj/join
Patreon - Patreon: NeuralBreakdownwithAVB

Project page: https://segment-anything.com/
Give the paper a read: https://arxiv.org/pdf/2304.02643.pdf

0:00 - Intro
1:29 - Architecture
4:50 - Interactive Training
6:30 - Dataset
7:27 - Model Architecture
12:30 - Outro

Other papers cited:
Focal Loss for Dense Object Detection: https://arxiv.org/pdf/1708.02002.pdf
CLIP: https://arxiv.org/pdf/2103.00020.pdf
Masked Autoencoders Are Scalable Vision Learners: https://arxiv.org/pdf/2111.06377.pdf

Songs:
Sunny Days - Anno Domini Beats
Wellington Coffee Shop - Dyalla
No 3 Morning Folk Song - Esther Abrami
پارسال در تاریخ 1402/02/11 منتشر شده است.
17,803 بـار بازدید شده
... بیشتر