Computer Vision Meetup: Multi-Modal Visual Question Answering using UForm models and Milvus - Part 1

Voxel51
Voxel51
100 بار بازدید - 2 ماه پیش - UForm is a multimodal AI
UForm is a multimodal AI library that will help you understand and search visual and textual content across various languages. UForm not only supports RAG chat use-cases, but is also capable of Visual Question Answering (VQA). Compact custom pre-trained transformer models can run anywhere from your server farm down to your laptop. I’ll be giving a demo of RAG and VQA using Milvus vector database.

Speaker: Christy Bergman is a passionate Developer Advocate at Zilliz. She previously worked in distributed computing at Anyscale and as a Specialist AI/ML Solutions Architect at AWS.

Not a Meetup member? Sign up to attend the next event:

https://voxel51.com/computer-vision-a...

Recorded on May 30, 2024 at the AI, Machine Learning and Data Science Meetup.

#computervision #machinelearning #datascience #ai #artificialintelligence
2 ماه پیش در تاریخ 1403/03/14 منتشر شده است.
100 بـار بازدید شده
... بیشتر