AlbertForMaskedLM,4
AllenaiLongformerBase,4
BartForCausalLM,4
BertForMaskedLM,16
BigBird,32
BlenderbotForCausalLM,32
DebertaV2ForMaskedLM,16
DistilBertForMaskedLM,128
DistillGPT2,16
ElectraForCausalLM,8
GoogleFnet,16
GPT2ForSequenceClassification,4
LayoutLMForMaskedLM,16
M2M100ForConditionalGeneration,16
MBartForCausalLM,4
MegatronBertForCausalLM,4
MobileBertForMaskedLM,64
MT5ForConditionalGeneration,16
OPTForCausalLM,2
PegasusForCausalLM,32
PLBartForCausalLM,8
RobertaForCausalLM,16
T5ForConditionalGeneration,4
T5Small,1
TrOCRForCausalLM,32
XGLMForCausalLM,8
XLNetLMHeadModel,8
YituTechConvBert,16
