t5gemma
¶
Config for the T5-Gemma encoder used by daVinci-MagiHuman.
The reference pipeline uses transformers.models.t5gemma.T5GemmaEncoderModel
on google/t5gemma-9b-9b-ul2. That is a gated Google repository, so the
encoder weights are not bundled inside GAIR/daVinci-MagiHuman; they are
loaded from the T5-Gemma HF repo directly.
Encoder shape (verified from google/t5gemma-9b-9b-ul2/config.json): layers=42, hidden=3584, heads=16, kv_heads=8, head_dim=256, intermediate=14336, rope_theta=10000.0, max_pos=8192, layer_types alternate sliding_attention / full_attention.