Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sbintuitions
/
sarashina2.2-vision-3b
like
13
Follow
SB Intuitions
243
Image-to-Text
Transformers
Safetensors
Japanese
English
sarashina2_vision
text-generation
multimodal
vision-language
custom_code
arxiv:
5 papers
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
sarashina2.2-vision-3b
7.61 GB
1 contributor
History:
3 commits
toshi-456
Upload README.md
147b6cc
verified
18 days ago
.gitattributes
Safe
1.57 kB
Upload 16 files
20 days ago
LICENSE
Safe
1.07 kB
Upload 16 files
20 days ago
README.md
Safe
6.95 kB
Upload README.md
18 days ago
chat_template.json
Safe
1.12 kB
Upload 16 files
20 days ago
config.json
Safe
1.53 kB
Upload 16 files
20 days ago
configuration_sarashina2_vision.py
Safe
2.92 kB
Upload 16 files
20 days ago
generation_config.json
Safe
133 Bytes
Upload 16 files
20 days ago
model.safetensors
7.6 GB
xet
Upload 16 files
20 days ago
modeling_sarashina2_vision.py
Safe
11.7 kB
Upload 16 files
20 days ago
preprocessor_config.json
Safe
646 Bytes
Upload 16 files
20 days ago
processing_sarashina2_vision.py
Safe
24 kB
Upload 16 files
20 days ago
processor_config.json
Safe
152 Bytes
Upload 16 files
20 days ago
sample.jpg
819 kB
xet
Upload 16 files
20 days ago
special_tokens_map.json
Safe
968 Bytes
Upload 16 files
20 days ago
tokenizer.json
Safe
6.72 MB
Upload 16 files
20 days ago
tokenizer.model
Safe
1.83 MB
xet
Upload 16 files
20 days ago
tokenizer_config.json
Safe
5.05 kB
Upload 16 files
20 days ago