Japanese InstructBLIP Alpha Model Details Japanese InstructBLIP Alpha is a vision-language instruction-following model that enables to generate Japanese descriptions for input images and optionally input texts such as questions. Usage First install additional dependencies in requirements.txt: pip install sentencepiece einops import torch from transformers import LlamaTokenizer, AutoModelForVision2