Description: Further DetailsTitle: Multimodal Foundation ModelsCondition: NewFormat: PaperbackSubtitle: From Specialists to General-Purpose AssistantsEAN: 9781638283362ISBN: 9781638283362Publisher: now publishers IncRelease Date: 05/06/2024Description: This monograph presents a comprehensive survey of the taxonomy and evolution of multimodal foundation models that demonstrate vision and vision-language capabilities, focusing on the transition from specialist models to general-purpose assistants.The focus encompasses five core topics, categorized into two classes; (i) a survey of well-established research areas: multimodal foundation models pre-trained for specific purposes, including two topics – methods of learning vision backbones for visual understanding and text-to-image generation; (ii) recent advances in exploratory, open research areas: multimodal foundation models that aim to play the role of general-purpose assistants, including three topics – unified vision models inspired by large language models (LLMs), end-to-end training of multimodal LLMs, and chaining multimodal tools with LLMs.The target audience of the monograph is researchers, graduate students, and professionals in computer vision and vision-language multimodal communities who are eager to learn the basics and recent advances in multimodal foundation models.Language: EnglishCountry/Region of Manufacture: USItem Height: 234mmItem Length: 156mmItem Weight: 330gAuthor: Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng GaoGenre: Computing & InternetBook Series: Foundations and Trends® in Computer Graphics and VisionRelease Year: 2024 Missing Information?Please contact us if any details are missing and where possible we will add the information to our listing.
Price: 160.52 USD
Location: GU14 0GT
End Time: 2024-10-11T19:29:22.000Z
Shipping Cost: 0 USD
Product Images
Item Specifics
Return shipping will be paid by: Buyer
All returns accepted: Returns Accepted
Item must be returned within: 30 Days
Refund will be given as: Money back or replacement (buyer's choice)
Return policy details:
Book Title: Multimodal Foundation Models
Title: Multimodal Foundation Models
Subtitle: From Specialists to General-Purpose Assistants
EAN: 9781638283362
ISBN: 9781638283362
Release Date: 05/06/2024
Release Year: 2024
Country/Region of Manufacture: US
Item Height: 234mm
Genre: Computing & Internet
Language: English
Publication Name: Multimodal Foundation Models : from Specialists to General-Purpose Assistants
Publisher: Now Publishers
Subject: Computer Vision & Pattern Recognition
Publication Year: 2024
Item Weight: 11.6 Oz
Type: Textbook
Subject Area: Computers
Item Length: 9.2 in
Author: Zhengyuan Yang, Linjie Li, Zhe Gan, Chunyuan Li, Jianwei Yang
Series: Foundations and Trends in Computer Graphics and Vision Ser.
Item Width: 6.1 in
Format: Trade Paperback