福模

免费开源AI模型下载_本地AI工具资源平台

多模态AIMultimodal AI

ALIGN多模态AI模型 - 大规模图像文本对齐

ALIGN Multimodal AI Model - Large-Scale Image-Text Alignment

ALIGN多模态AI模型,利用大规模图像文本对进行对比学习。在多个视觉语言任务中取得了优异成果,支持图像检索和文本生成。

ALIGN multimodal AI model, utilizing large-scale image-text pairs for contrastive learning. Achieves excellent results in multiple vision-language tasks, supporting image retrieval and text generation.

ALIGN多模态图像文本对比学习ALIGNMultimodalImage-TextContrastive Learning

文件大小

5.6 GB

Upload Size

5.6 GB

上传日期

2024-12-15

Upload Date

2024-12-15

下载次数

12,700

Downloads

12,700

评分

4.7/5.0

Rating

4.7/5.0

下载资源 Download Resources

下载资源表示您同意我们的使用条款和隐私政策

By downloading this resource, you agree to our Terms of Service and Privacy Policy

相关资源推荐

MUSE多模态AI生成模型 - 高质量文本到图像合成MUSE Multimodal AI Generation Model - High-Quality Text-to-Image Synthesis

MUSE多模态AI生成模型,基于Transformer的高质量文本到图像生成系统。结合了扩散模型和Transformer的优势,生成高质量图像。

MUSE multimodal AI generation model, a high-quality text-to-image generation system based on Transformer. Combines the advantages of diffusion models and Transformers to generate high-quality images.

MUSE多模态文本到图像MUSEMultimodalText-to-Image
18.7 GB2025-02-03
多模态 AI 模型资源 - 图像文本联合理解模型Multimodal AI Model Resources - Joint Image-Text Understanding Model

多模态AI模型资源,实现图像与文本的联合理解。支持图像描述、视觉问答、图文检索等任务,为跨模态AI应用提供强大支持。

Multimodal AI model resources that enable joint understanding of images and text. Supports tasks such as image captioning, visual question answering, and image-text retrieval, providing strong support for cross-modal AI applications.

多模态图像理解文本理解MultimodalImage UnderstandingText Understanding
15.4 GB2024-01-05
Flamingo视觉语言模型 - 少样本视觉语言理解Flamingo Vision-Language Model - Few-Shot Visual Language Understanding

Flamingo视觉语言模型,实现少样本视觉语言理解。结合图像和文本信息,支持问答、描述生成等多模态任务,具有优秀的泛化能力。

Flamingo vision-language model, achieving few-shot visual language understanding. Combines image and text information, supporting multimodal tasks such as question answering and description generation, with excellent generalization capabilities.

视觉语言多模态FlamingoVision-LanguageMultimodalFlamingo
72.6 GB2025-03-11