Author | SHA1 | Message | Date
Titus von Koeller | 57fa64628f | minor refactor to more concise syntax | 2022-08-01 03:22:12 -07:00
Tim Dettmers | 4a6ea7e24b | Added adjusted build file. | 2022-07-31 20:59:34 -07:00
Tim Dettmers | 28d1e7dc01 | Initial build script changes (untested on PyPi). | 2022-07-31 19:41:56 -07:00
Tim Dettmers | dd50382b32 | Full evaluate_cuda setup with integration test. | 2022-07-31 17:47:44 -07:00
Titus von Koeller | 5d90b38c4d | adding CLI tool for CUDA install debugging - intermediate commit | 2022-07-27 21:16:04 -07:00
Tim Dettmers | bd515328d7 | Fixed deployment script to check for LD_LIBRARY_PATH. | 2022-07-27 05:57:50 -07:00
Tim Dettmers | 389f66ca5a | Fixed direct extraction masking. | 2022-07-27 01:46:35 -07:00
Tim Dettmers | a409213656 | Fixed make default to compile with cublaslt. | 2022-07-26 19:38:17 -07:00
Tim Dettmers | 5737f2b027 | Merge branch 'patch_merge' into extract_outliers | 2022-07-26 19:38:01 -07:00
Tim Dettmers | 47a73d94c3 | Matmullt with direct outlier extraction for 8-bit inference. | 2022-07-26 19:15:35 -07:00
Tim Dettmers | 32fa459ed7 | Added col_ampere outlier extraction kernel. | 2022-07-26 18:15:51 -07:00
Tim Dettmers | bcab99ec87 | Working outlier extraction for Turing. | 2022-07-26 17:39:30 -07:00
Tim Dettmers | cbb901ac51 | Boilerplate and test for extract_outliers. | 2022-07-26 12:12:38 -07:00
Tim Dettmers | dc8c9efdb3 | Changed setup.py; deployed on test pypi. | 2022-07-26 10:32:22 -07:00
Tim Dettmers | 953b7285dd | Fixed cpuonly build. | 2022-07-26 09:12:16 -07:00
Tim Dettmers | f2dd703251 | Added matmul build and flags. | 2022-07-25 22:34:14 -07:00
Tim Dettmers | 9268dc9d88 | Some progress on build script; added multi-cuda install script. | 2022-07-25 19:30:37 -07:00
Tim Dettmers | 1e88edd8c0 | Removed rowscale (segfaults on ampere). | 2022-07-25 17:27:57 -07:00
Tim Dettmers | 8b1fd32e3e | Fixed makefile; fixed Ampere igemmlt_8 bug. | 2022-07-25 14:02:14 -07:00
Tim Dettmers | 7d2ecd30c0 | Fixed rowcol synchronization bug. | 2022-07-22 15:21:37 -07:00
Tim Dettmers | c771b3a75a | Most tests passing. | 2022-07-22 14:41:05 -07:00
Tim Dettmers | 4cd7ea62b2 | Merge pull request #3 from TimDettmers/cpuonly: Add a CPU-only build option | 2022-07-18 09:51:37 -07:00
Max Ryabinin | fd750cd237 | Update README.md | 2022-07-01 17:46:29 +03:00
Max Ryabinin | 025824d29b | Reduce diff | 2022-07-01 17:42:58 +03:00
Max Ryabinin | 575aa698fa | Reduce diff | 2022-07-01 17:41:48 +03:00
Max Ryabinin | 4d1d5b569f | Reduce diff | 2022-07-01 17:40:02 +03:00
Max Ryabinin | 31ce1b3708 | Reduce diff | 2022-07-01 17:36:30 +03:00
Max Ryabinin | e4cf33f2a3 | Fix imports | 2022-07-01 17:25:44 +03:00
Max Ryabinin | 8258b4364a | Add a CPU-only build option | 2022-07-01 17:16:10 +03:00
Tim Dettmers | 3418cd390e | Merge pull request #2 from TimDettmers/fix_imports: Remove unused imports, fix NotImplementedError | 2022-06-30 08:21:24 -07:00
Max Ryabinin | 33efe4a09f | Remove unused imports, fix NotImplementedError | 2022-06-30 18:14:20 +03:00
Tim Dettmers | 4e60e7dc62 | Fixed makefile compute capabilities. | 2021-11-29 09:54:19 -08:00
Tim Dettmers | 85287b9eda | Bumped version for release. | 2021-11-29 09:34:29 -08:00
Tim Dettmers | 20e1677dfd | Added module override, bnb.nn.Embedding #13 #15 #19 | 2021-11-29 09:32:13 -08:00
Tim Dettmers | 3cff6795fb | Merge branch 'main' of github.com:facebookresearch/bitsandbytes into 0.26.0 | 2021-11-29 08:24:17 -08:00
Tim Dettmers | 262350c10f | Merge pull request #14 from SirRob1997/main: [FIX] passing of sparse in StableEmbedding | 2021-11-29 08:22:16 -08:00
Tim Dettmers | 108cf9fc1f | Fixed unsafe use of eval. #8 | 2021-11-29 08:21:05 -08:00
Tim Dettmers | b3fe8a6d0f | Upgraded to -std=c++14; printing gpp version. #12 | 2021-11-28 21:31:03 -08:00
Tim Dettmers | 2f8083bd8b | Added AdamW. #10 #13 | 2021-11-28 21:18:11 -08:00
Robin Schmidt | 67a1283501 | [FIX] passing of sparse in StableEmbedding | 2021-11-15 17:27:02 +01:00
Tim Dettmers | 037022e878 | Merge pull request #9 from ditschuk/fix_adam_imports: Add missing imports to adam | 2021-11-15 07:58:44 -08:00
Tim Dettmers | ca2078a697 | Updated changelog. | 2021-11-10 15:12:39 -08:00
Tim Dettmers | 8b3c0f355c | Added adagrad with tests (no clipping). | 2021-11-10 15:10:02 -08:00
Konstantin Ditschuneit | 56f5274848 | Add missing imports to adam | 2021-10-31 16:38:38 +01:00
Tim Dettmers | 22b2877c7f | Changed versioning scheme to <major>.<minor>.<patch>. | 2021-10-21 22:43:08 -07:00
Tim Dettmers | c1ed5d39b9 | Fixed compilation flag for CUDA 11.0. | 2021-10-21 22:30:55 -07:00
Tim Dettmers | d8515f316d | Bumped version number for release. | 2021-10-21 22:08:37 -07:00
Tim Dettmers | 3dc1688acb | Updated docs and changelog. | 2021-10-21 17:26:18 -07:00
Tim Dettmers | 0fb378b4ee | Added compilation from source instructions; easier compilation. | 2021-10-21 17:22:43 -07:00
Tim Dettmers | d2f1672690 | Added changes for analysis Adam. | 2021-10-21 16:07:24 -07:00