Porting v2 models to flash attention (#15) e55e319 verified bwang0911 Markus28 committed on Mar 23, 2024
feat: choose flash attention heuristically if not set explicitly 2e2b8d0 Markus28 committed on Mar 6, 2024
feat: added get_input_embeddings method to BertForPreTraining bb281f0 Markus28 committed on Feb 22, 2024
feat: added from_config, also pass additional kwargs from config to model 4164fd6 Markus28 committed on Feb 22, 2024