Why do you need an API key to tokenize the text? Isn't it supposed to be a cheap step that everything else in the model relies on?
I'd love it if that API (which I do not believe Anthropic charge anything for) worked without an API key.
Interesting findings. Might need a way to downsample images on upload to keep costs down.
Yeah that should work - it looks like the same pixel dimension image at smaller sizes has about the same token cost for 4.6 and 4.7, so the image cost increase only kicks in if you use larger images that 4.6 would have presumably resized before inspecting.