Google announces new PaliGemma 2 vision-language models

Google announces new PaliGemma 2 vision-language models (Image Source: Google)

Google's PaliGemma 2 models are available in multiple sizes and resolutions, and they can understand text, images, and videos. Google is also touting the ability to create detailed, contextually relevant captions.

Rohith Bhaskar, Published 12/06/2024 🇫🇷 🇪🇸 ...

Google has announced the follow-up to the visual-language model PaliGemma launched in May 2024. PaliGemma 2 is available in multiple sizes ranging from 3 billion parameters to 28 billion and various resolution sizes up to 896px.

The company says the model displays "leading performance on chemical formula recognition, music score recognition, spatial reasoning, and chest X-ray report generation."

It also has long captioning capabilities with "detailed, contextually relevant captions for images, going beyond simple object identification to describe actions, emotions, and the overall narrative of the scene."

The new models will be offered as a "drop-in replacement" in multiple sizes without "major code modifications." The pre-trained models are available on Hugging Face and Kaggle and are free for anyone to download and try out. It also supports multiple frameworks including Hugging Face Transformers, Keras, PyTorch, JAX, and Gemma.cpp.

Google says PaliGemma 2's "flexibility makes fine-tuning for specific tasks and datasets straightforward, empowering you to tailor its capabilities to your precise needs."

Source(s)

Google

Google Docs now uses AI to create formatted documents 12/09/2024

Google is suing the US CSFB for government overreach (Image Source: Photo by Matthew Kwong on Unsplash)

Google is suing the US CSFB for government overreach 12/09/2024

Google DeepMind's Genie 2 is a real-time 3D world generator (Image Source: Google DeepMind)

Google DeepMind's Genie 2 is a real-time 3D world generator 12/06/2024

Google's GenCast AI can make fast and accurate weather predictions (Image Source: Google)

Google's GenCast AI can make fast and accurate weather predictions 12/05/2024

Google unveils a new video generation model for Vertex AI (Image Source: Google)

Google unveils a new video generation model for Vertex AI 12/05/2024

Loading Comments

Comment on this article

Tesla vs Waymo self-driving test ha...

OnePlus tipped to release a new com...

Rohith Bhaskar - Tech Writer - 327 articles published on Notebookcheck since 2024

I might look like a normal human being, but I’m secretly powered by tech news and bad puns. With a soft spot for all things digital, I dive into the world of innovation and bring back stories that make sense to techies and newbies alike.

contact me via: LinkedIn

Please share our article, every link counts!