Skip to content

t5gemma

Config for the T5-Gemma encoder used by daVinci-MagiHuman.

The reference pipeline uses transformers.models.t5gemma.T5GemmaEncoderModel on google/t5gemma-9b-9b-ul2. That is a gated Google repository, so the encoder weights are not bundled inside GAIR/daVinci-MagiHuman; they are loaded from the T5-Gemma HF repo directly.

Encoder shape (verified from google/t5gemma-9b-9b-ul2/config.json): layers=42, hidden=3584, heads=16, kv_heads=8, head_dim=256, intermediate=14336, rope_theta=10000.0, max_pos=8192, layer_types alternate sliding_attention / full_attention.