| sunyt32 | 50174a3078 | fix fairseq example | 2023-09-29 03:50:24 +00:00 |
| sunyt32 | fd8234c2ac | rollback variant name | 2023-09-28 16:44:51 +00:00 |
| sunyt32 | 5c89ffbeea | modify rms norm and value dim in retention | 2023-09-28 14:24:37 +00:00 |
| Shuming Ma | 258eda3308 | Update vocab links | 2023-08-11 16:46:37 +08:00 |
| Shuming Ma | 5356b252c4 | Update MT config | 2023-07-31 09:17:03 -07:00 |
| Li Dong | bf65397b26 | RetNet | 2023-07-24 14:30:13 +08:00 |
| shumingma | 891f84f302 | Fix MoE sample size | 2023-03-08 01:19:36 -08:00 |
| shumingma | 0a07df1e5b | Update Bert MoE | 2023-03-07 21:21:48 -08:00 |
| shumingma | c397ebb013 | Fix Bert MoE | 2023-03-07 21:11:05 -08:00 |
| shumingma | 670113e446 | Update MoE criterions | 2023-03-07 20:53:41 -08:00 |
| shumingma | 8d8b80a731 | Merge branch 'main' of https://github.com/microsoft/torchscale into main | 2023-03-05 19:24:31 -08:00 |
| shumingma | a788e67ef2 | Fix Bert dense | 2023-03-05 19:24:14 -08:00 |
| Shaohan Huang | 5b0be94ab8 | add --pad-to-max-length in bert+moe example | 2023-03-05 19:39:04 +08:00 |
| Shaohan Huang | 95aea9c1b4 | set numpy version | 2023-03-05 19:36:07 +08:00 |
| buaahsh | bc140c65bb | fx bert moe | 2023-03-05 07:43:58 +00:00 |
| Shaohan Huang | cbdbc1dfc8 | Update README.md | 2023-03-04 07:37:01 +08:00 |
| shumingma | 20c1e6c611 | Bert MoE | 2023-03-02 02:54:19 -08:00 |
| shumingma | 9f105b591d | Support Pytorch LayerNorm | 2023-01-16 20:17:28 -08:00 |
| shumingma | 1a5d2c26fe | Batch size first | 2023-01-05 01:19:51 -08:00 |
| shumingma | 9d968a24ed | Update XPos | 2023-01-03 22:54:24 -08:00 |
| shumingma | 7e12b582e4 | Support latest fairseq | 2022-12-15 03:44:15 -08:00 |
| shumingma | 2518ea030c | Fix example fsdp | 2022-12-08 04:20:27 -08:00 |
| shumingma | be167b3dda | Add an example for vocab | 2022-12-01 20:40:09 -08:00 |
| Kashif Rasul | e8be99f8f1 | fix typo | 2022-11-29 10:48:56 +01:00 |
| Kashif Rasul | c69aba2a73 | fix call to activation_fn | 2022-11-29 00:11:38 +01:00 |
| shumingma | 7eca1a531c | Code reformatting | 2022-11-26 09:01:02 -08:00 |
| shumingma | 994e4665a2 | flake8 lint checks | 2022-11-26 08:10:15 -08:00 |
| Shaohan Huang | bdf759f116 | decoder_embed_dim -> args.decoder_embed_dim | 2022-11-24 14:30:39 +08:00 |
| shumingma | 65fe50f466 | update copyright | 2022-11-23 08:36:55 -08:00 |
| shumingma | ede048831f | torchscale released | 2022-11-23 08:21:58 -08:00 |