rinna/Japanese–clip-vit-b-16

rinna-icon

This is a Japanese CLIP (Contrastive Language-Image Pre-Training) model trained by rinna Co., Ltd..

Please see japanese-clip for the other available models.

How to use the model

Install package

$ pip install git+https://github.com/rinnakk/japanese-clip.git

Run

import io
import requests
from PIL import Image
import torch
import japanese_clip as ja_clip
device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = ja_clip.load("rinna/japanese-clip-vit-b-16", cache_dir="/tmp/japanese_clip", device=device)
tokenizer = ja_clip.load_tokenizer()
img = Image.open(io.BytesIO(requests.get('https://images.pexels.com/photos/2253275/pexels-photo-2253275.jpeg?auto=compress&cs=tinysrgb&dpr=3&h=750&w=1260').content))
image = preprocess(img).unsqueeze(0).to(device)
encodings = ja_clip.tokenize(
    texts=["犬", "猫", "象"],
    max_seq_len=77,
    device=device,
    tokenizer=tokenizer, # this is optional. if you don't pass, load tokenizer each time
)
with torch.no_grad():
    image_features = model.get_image_features(image)
    text_features = model.get_text_features(**encodings)
    text_probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)
print("Label probs:", text_probs)  # prints: [[1.0, 0.0, 0.0]]

Model architecture

The model was trained a ViT-B/16 Transformer architecture as an image encoder and uses a 12-layer BERT as a text encoder. The image encoder was initialized from the AugReg vit-base-patch16-224 model.

Training

The model was trained on CC12M translated the captions to Japanese.

License

The Apache 2.0 license

rinna/japanese-clip-vit-b-16浏览人数已经达到526，如你需要查询该站的相关权重信息，可以点击"5118数据""爱站数据""Chinaz数据"进入；以目前的网站数据参考，建议大家请以爱站数据为准，更多网站价值评估因素如：rinna/japanese-clip-vit-b-16的访问速度、搜索引擎收录以及索引量、用户体验等；当然要评估一个站的价值，最主要还是需要根据您自身的需求以及需要，一些确切的数据则需要找rinna/japanese-clip-vit-b-16的站长进行洽谈提供。如该站的IP、PV、跳出率等！

特别声明

本站Ai导航提供的rinna/japanese-clip-vit-b-16都来源于网络，不保证外部链接的准确性和完整性，同时，对于该外部链接的指向，不由Ai导航实际控制，在2023年5月9日下午7:16收录时，该网页上的内容，都属于合规合法，后期网页的内容如出现违规，可以直接联系网站管理员进行删除，Ai导航不承担任何责任。

Ai导航致力于优质、实用的网络站点资源收集与分享！本文地址https://www.ainavpro.com/sites/3043.html转载请注明

google/vit-base-patch32-224-in21k

Vision Transformer (base-si...

IDEA-CCNL/Taiyi-CLIP-Roberta-large-326M-Chinese

Taiyi-CLIP-Roberta-large-32...

facebook/bart-base

BART (base-sized model) B...

IDEA-CCNL/Erlangshen-SimCSE-110M-Chinese

Erlangshen-SimCSE-110M-Chin...

microsoft/wavlm-large

WavLM-Large Microsoft's W...

sonoisa/sentence-bert-base-ja-mean-tokens-v2

This is a Japanese sentence...

暂无评论

暂无评论...

rinna/japanese-clip-vit-b-16

rinna/Japanese–clip-vit-b-16

How to use the model

Model architecture

Training

License

数据统计

数据评估

相关导航

暂无评论

热门标签

随机网址

rinna/japanese-clip-vit-b-16

rinna/Japanese–clip-vit-b-16

How to use the model

Model architecture

Training

License

数据统计

数据评估

相关导航

暂无评论

热门标签

随机网址

广告位