Phil Wang
|
19b9ef34b9
|
whoops
|
2023-03-10 08:59:49 -08:00 |
|
Phil Wang
|
c99b44f774
|
do the epsilon beta2 switcharoo within the cuda code, and not within the python class (so that the state dict still makes sense)
|
2023-03-10 08:57:59 -08:00 |
|
Phil Wang
|
c83888aa1a
|
use epsilon as beta2 for lion, complete most of the logic in kernel.cu for all functions
|
2023-03-09 11:54:54 -08:00 |
|
Phil Wang
|
cb4c3c8c66
|
do a bunch of typical bookkeeping before getting to main lion logic
|
2023-03-09 10:10:19 -08:00 |
|
Phil Wang
|
d43ea9722c
|
make sure interface is correct
|
2023-03-09 09:45:33 -08:00 |
|
Phil Wang
|
7247cb4554
|
initial commit, slowly work from interface into the kernel
|
2023-03-09 08:08:46 -08:00 |
|
Tom Aarsen
|
62c0bd2278
|
Fix several typos in logging and comments
Via codespell
|
2022-11-01 09:53:47 +01:00 |
|
Tom Aarsen
|
7727fa4c8c
|
Remove f-prefix from strings that don't use formatting
|
2022-10-27 13:36:39 +02:00 |
|
Tom Aarsen
|
4a05df34c2
|
Fix critical bug in PytorchLARS().step: Undefined variable
|
2022-10-27 13:19:09 +02:00 |
|
Tom Aarsen
|
7a3c9af05d
|
Sort imports
Via isort
|
2022-10-27 13:15:21 +02:00 |
|
Tom Aarsen
|
0b078403ee
|
Simplify statements into equivalent, modern variants
via pyupgrade --py37-plus. The changes e.g. are subclassing from object, calling super() with super(ThisClass, self), or old-style syntax formatting.
|
2022-10-27 13:14:13 +02:00 |
|
Tim Dettmers
|
7e0fb655e1
|
Some initial code. Needs to be tested.
|
2022-08-23 13:59:34 -07:00 |
|
Titus von Koeller
|
ea7c14f8ef
|
reran black with linelength 80 for greater readability
|
2022-08-01 09:32:47 -07:00 |
|
Titus von Koeller
|
bfa0e33294
|
ran black and isort for coherent code formatting
|
2022-08-01 03:31:48 -07:00 |
|
Max Ryabinin
|
e4cf33f2a3
|
Fix imports
|
2022-07-01 17:25:44 +03:00 |
|
Max Ryabinin
|
8258b4364a
|
Add a CPU-only build option
|
2022-07-01 17:16:10 +03:00 |
|
Max Ryabinin
|
33efe4a09f
|
Remove unused imports, fix NotImplementedError
|
2022-06-30 18:14:20 +03:00 |
|
Tim Dettmers
|
20e1677dfd
|
Added module override, bnb.nn.Embedding #13 #15 #19
|
2021-11-29 09:32:13 -08:00 |
|
Tim Dettmers
|
3cff6795fb
|
Merge branch 'main' of github.com:facebookresearch/bitsandbytes into 0.26.0
|
2021-11-29 08:24:17 -08:00 |
|
Tim Dettmers
|
108cf9fc1f
|
Fixed unsafe use of eval. #8
|
2021-11-29 08:21:05 -08:00 |
|
Tim Dettmers
|
2f8083bd8b
|
Added AdamW. #10 #13
|
2021-11-28 21:18:11 -08:00 |
|
Tim Dettmers
|
8b3c0f355c
|
Added adagrad with tests (no clipping).
|
2021-11-10 15:10:02 -08:00 |
|
Konstantin Ditschuneit
|
56f5274848
|
Add missing imports to adam
|
2021-10-31 16:38:38 +01:00 |
|
Tim Dettmers
|
0c5fa5a637
|
Fixed syntax and import error.
|
2021-10-21 16:04:18 -07:00 |
|
Tim Dettmers
|
1ec0d54529
|
Added analysis Adam.
|
2021-10-21 15:53:20 -07:00 |
|
Tim Dettmers
|
eaf35ab949
|
Copied over Analysis Adam.
|
2021-10-21 10:20:41 -07:00 |
|
Tim Dettmers
|
a6eae2e7f2
|
Added skip_zeros; tests are passing.
|
2021-10-20 19:15:47 -07:00 |
|
Tim Dettmers
|
bb34fd50a1
|
Initial plumbing for skip_zeros.
|
2021-10-20 18:37:44 -07:00 |
|
Tim Dettmers
|
7923c4a066
|
Changed from testpypi to pypi. Release 0.0.24
|
2021-10-07 08:39:38 -07:00 |
|
Tim Dettmers
|
7439924891
|
Initial commit
|
2021-10-05 19:16:20 -07:00 |
|