M-CLIP/M-BERT-Distil-40有戏ai
M-BERT Distil 40百度ai智能云
Github Model Card即梦al
Usage
To use this model along with the original CLIP vision encoder you need to download the code and additional linear weights from the Multilingual-CLIP Github.al一键脱装入口
Once this is done, you can load and use the model with the following code即梦al
from src import multilingual_clip
model = multilingual_clip.load_model('M-BERT-Distil-40')
embeddings = model(['Älgen är skogens konung!', 'Wie leben Eisbären in der Antarktis?', 'Вы знали, что все белые медведи левши?'])
print(embeddings.shape)
# Yields: torch.Size([3, 640])
About
A distilbert人工智能ai哪个好-base-multilingual tuned to match the embedding space for 40 languages, to the embedding space of the CLIP text encoder which accompanies the Res50x4 vision encoder.
A full list of the 100 languages used during pre-training can be found here, and a list of the 40 languages used during fine-tuning can be found in SupportedLanguages.md.
Training data pairs was generated by sampling 40k sentences for each language from the combined descriptions of GCC + MSCOCO + VizWiz, and translating them into the corresponding language.
All translation was done using the AWS translate service, the quality of these translations have currently not been analyzed, but one can assume the quality varies between the 40 languages.
Evaluation
These results can be viewed at Github.
A non-rigorous qualitative evaluation shows that for the languages French, German, Spanish, Russian, Swedish and Greek it seemingly yields respectable results for most instances. The exception being that Greeks are apparently unable to recognize happy persons.
When testing on Kannada, a language which was included during pre-training but not fine-tuning, it performed close to random
数据统计
数据评估
本站菠萝导航提供的M-CLIP/M-BERT-Distil-40都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由菠萝导航实际控制,在2023年5月9日 下午7:12收录时,该网页上的内容,都属于合规合法,后期网页的内容如出现违规,可以直接联系网站管理员进行删除,菠萝导航不承担任何责任。grok中文版下载

