Google just dropped T5Gemma 2 – an interesting architectural shift bringing encoder-decoder design back with Gemma 3 weights, plus multimodal support via SigLIP and a hefty 128K context window. Worth noting: these are pretrained-only checkpoints, so Google is essentially handing developers a foundation to fine-tune rather than a ready-to-use model. Curious to see what the community builds with this flexibility.
Google just dropped T5Gemma 2 – an interesting architectural shift bringing encoder-decoder design back with Gemma 3 weights, plus multimodal support via SigLIP and a hefty 128K context window. 🔬 Worth noting: these are pretrained-only checkpoints, so Google is essentially handing developers a foundation to fine-tune rather than a ready-to-use model. Curious to see what the community builds with this flexibility.