30 Commits

Author SHA1 Message Date
sunyt32
50174a3078 fix fairseq example 2023-09-29 03:50:24 +00:00
sunyt32
fd8234c2ac rollback variant name 2023-09-28 16:44:51 +00:00
sunyt32
5c89ffbeea modify rms norm and value dim in retention 2023-09-28 14:24:37 +00:00
Shuming Ma
258eda3308 Update vocab links 2023-08-11 16:46:37 +08:00
Shuming Ma
5356b252c4 Update MT config 2023-07-31 09:17:03 -07:00
Li Dong
bf65397b26 RetNet 2023-07-24 14:30:13 +08:00
shumingma
891f84f302 Fix MoE sample size 2023-03-08 01:19:36 -08:00
shumingma
0a07df1e5b Update Bert MoE 2023-03-07 21:21:48 -08:00
shumingma
c397ebb013 Fix Bert MoE 2023-03-07 21:11:05 -08:00
shumingma
670113e446 Update MoE criterions 2023-03-07 20:53:41 -08:00
shumingma
8d8b80a731 Merge branch 'main' of https://github.com/microsoft/torchscale into main 2023-03-05 19:24:31 -08:00
shumingma
a788e67ef2 Fix Bert dense 2023-03-05 19:24:14 -08:00
Shaohan Huang
5b0be94ab8 add --pad-to-max-length in bert+moe example 2023-03-05 19:39:04 +08:00
Shaohan Huang
95aea9c1b4 set numpy version 2023-03-05 19:36:07 +08:00
buaahsh
bc140c65bb fx bert moe 2023-03-05 07:43:58 +00:00
Shaohan Huang
cbdbc1dfc8 Update README.md 2023-03-04 07:37:01 +08:00
shumingma
20c1e6c611 Bert MoE 2023-03-02 02:54:19 -08:00
shumingma
9f105b591d Support Pytorch LayerNorm 2023-01-16 20:17:28 -08:00
shumingma
1a5d2c26fe Batch size first 2023-01-05 01:19:51 -08:00
shumingma
9d968a24ed Update XPos 2023-01-03 22:54:24 -08:00
shumingma
7e12b582e4 Support latest fairseq 2022-12-15 03:44:15 -08:00
shumingma
2518ea030c Fix example fsdp 2022-12-08 04:20:27 -08:00
shumingma
be167b3dda Add an example for vocab 2022-12-01 20:40:09 -08:00
Kashif Rasul
e8be99f8f1 fix typo 2022-11-29 10:48:56 +01:00
Kashif Rasul
c69aba2a73 fix call to activation_fn 2022-11-29 00:11:38 +01:00
shumingma
7eca1a531c Code reformatting 2022-11-26 09:01:02 -08:00
shumingma
994e4665a2 flake8 lint checks 2022-11-26 08:10:15 -08:00
Shaohan Huang
bdf759f116 decoder_embed_dim -> args.decoder_embed_dim 2022-11-24 14:30:39 +08:00
shumingma
65fe50f466 update copyright 2022-11-23 08:36:55 -08:00
shumingma
ede048831f torchscale released 2022-11-23 08:21:58 -08:00