google/pix2struct-screen2words-base
Tags: Visual Question Answering, Transformers, PyTorch, 5 languages, pix2struct, image-to-text
arXiv: 2210.03347
License: apache-2.0
Branch: main
Total size: 1.13 GB
2 contributors
History: 9 commits
Latest commit: ybelkada, "Update README.md" (1751472), over 2 years ago
File                      Size            Last commit message                          Updated
.gitattributes            1.48 kB         initial commit                               almost 3 years ago
README.md                 4.5 kB          Update README.md                             over 2 years ago
config.json               4.9 kB          Update config.json                           over 2 years ago
preprocessor_config.json  249 Bytes       Upload processor                             almost 3 years ago
pytorch_model.bin         1.13 GB (xet)   Upload Pix2StructForConditionalGeneration    almost 3 years ago
special_tokens_map.json   2.2 kB          Upload processor                             almost 3 years ago
spiece.model              851 kB (xet)    Upload processor                             almost 3 years ago
tokenizer.json            3.27 MB         Upload processor                             almost 3 years ago
tokenizer_config.json     2.58 kB         Upload processor                             almost 3 years ago