Commit Graph

254 Commits

Author SHA1 Message Date
Tim Dettmers
a409213656 Fixed make default to compile with cublaslt. 2022-07-26 19:38:17 -07:00
Tim Dettmers
5737f2b027 Merge branch 'patch_merge' into extract_outliers 2022-07-26 19:38:01 -07:00
Tim Dettmers
47a73d94c3 Matmullt with direct outlier extraction for 8-bit inference. 2022-07-26 19:15:35 -07:00
Tim Dettmers
32fa459ed7 Added col_ampere outlier extraction kernel. 2022-07-26 18:15:51 -07:00
Tim Dettmers
bcab99ec87 Working outlier extraction for Turing. 2022-07-26 17:39:30 -07:00
Tim Dettmers
cbb901ac51 Boilerplate and test for extract_outliers. 2022-07-26 12:12:38 -07:00
Tim Dettmers
dc8c9efdb3 Changed setup.py; deployed on test pypi. 2022-07-26 10:32:22 -07:00
Tim Dettmers
953b7285dd Fixed cpuonly build. 2022-07-26 09:12:16 -07:00
Tim Dettmers
f2dd703251 Added matmul build and flags. 2022-07-25 22:34:14 -07:00
Tim Dettmers
9268dc9d88 Some progress on build script; added multi-cuda install script. 2022-07-25 19:30:37 -07:00
Tim Dettmers
1e88edd8c0 Removed rowscale (segfaults on ampere). 2022-07-25 17:27:57 -07:00
Tim Dettmers
8b1fd32e3e Fixed makefile; fixed Ampere igemmlt_8 bug. 2022-07-25 14:02:14 -07:00
Tim Dettmers
7d2ecd30c0 Fixed rowcol synchronization bug. 2022-07-22 15:21:37 -07:00
Tim Dettmers
c771b3a75a Most tests passing. 2022-07-22 14:41:05 -07:00
Tim Dettmers
4cd7ea62b2
Merge pull request #3 from TimDettmers/cpuonly
Add a CPU-only build option
2022-07-18 09:51:37 -07:00
Max Ryabinin
fd750cd237 Update README.md 2022-07-01 17:46:29 +03:00
Max Ryabinin
025824d29b Reduce diff 2022-07-01 17:42:58 +03:00
Max Ryabinin
575aa698fa Reduce diff 2022-07-01 17:41:48 +03:00
Max Ryabinin
4d1d5b569f Reduce diff 2022-07-01 17:40:02 +03:00
Max Ryabinin
31ce1b3708 Reduce diff 2022-07-01 17:36:30 +03:00
Max Ryabinin
e4cf33f2a3 Fix imports 2022-07-01 17:25:44 +03:00
Max Ryabinin
8258b4364a Add a CPU-only build option 2022-07-01 17:16:10 +03:00
Tim Dettmers
3418cd390e
Merge pull request #2 from TimDettmers/fix_imports
Remove unused imports, fix NotImplementedError
2022-06-30 08:21:24 -07:00
Max Ryabinin
33efe4a09f Remove unused imports, fix NotImplementedError 2022-06-30 18:14:20 +03:00
Tim Dettmers
4e60e7dc62 Fixed makefile compute capabilities. 2021-11-29 09:54:19 -08:00
Tim Dettmers
85287b9eda Bumped version for release. 2021-11-29 09:34:29 -08:00
Tim Dettmers
20e1677dfd Added module override, bnb.nn.Embedding #13 #15 #19 2021-11-29 09:32:13 -08:00
Tim Dettmers
3cff6795fb Merge branch 'main' of github.com:facebookresearch/bitsandbytes into 0.26.0 2021-11-29 08:24:17 -08:00
Tim Dettmers
262350c10f
Merge pull request #14 from SirRob1997/main
[FIX] passing of sparse in StableEmbedding
2021-11-29 08:22:16 -08:00
Tim Dettmers
108cf9fc1f Fixed unsafe use of eval. #8 2021-11-29 08:21:05 -08:00
Tim Dettmers
b3fe8a6d0f Upgraded to -std=c++14; printing gpp version. #12 2021-11-28 21:31:03 -08:00
Tim Dettmers
2f8083bd8b Added AdamW. #10 #13 2021-11-28 21:18:11 -08:00
Robin Schmidt
67a1283501 [FIX] passing of sparse in StableEmbedding 2021-11-15 17:27:02 +01:00
Tim Dettmers
037022e878
Merge pull request #9 from ditschuk/fix_adam_imports
Add missing imports to adam
2021-11-15 07:58:44 -08:00
Tim Dettmers
ca2078a697 Updated changelog. 2021-11-10 15:12:39 -08:00
Tim Dettmers
8b3c0f355c Added adagrad with tests (no clipping). 2021-11-10 15:10:02 -08:00
Konstantin Ditschuneit
56f5274848 Add missing imports to adam 2021-10-31 16:38:38 +01:00
Tim Dettmers
22b2877c7f Changed versioning scheme to <major>.<minor>.<patch>. 2021-10-21 22:43:08 -07:00
Tim Dettmers
c1ed5d39b9 Fixed compilation flag for CUDA 11.0. 2021-10-21 22:30:55 -07:00
Tim Dettmers
d8515f316d Bumped version number for release. 2021-10-21 22:08:37 -07:00
Tim Dettmers
3dc1688acb Updated docs and changelog. 2021-10-21 17:26:18 -07:00
Tim Dettmers
0fb378b4ee Added compilation from source instructions; easier compilation. 2021-10-21 17:22:43 -07:00
Tim Dettmers
d2f1672690 Added changes for analysis Adam. 2021-10-21 16:07:24 -07:00
Tim Dettmers
0c5fa5a637 Fixed syntax and import error. 2021-10-21 16:04:18 -07:00
Tim Dettmers
1ec0d54529 Added analysis Adam. 2021-10-21 15:53:20 -07:00
Tim Dettmers
eaf35ab949 Copied over Analysis Adam. 2021-10-21 10:20:41 -07:00
Tim Dettmers
d06c5776e4 Updated changelog. 2021-10-20 19:26:43 -07:00
Tim Dettmers
a6eae2e7f2 Added skip_zeros; tests are passing. 2021-10-20 19:15:47 -07:00
Tim Dettmers
bb34fd50a1 Initial plumbing for skip_zeros. 2021-10-20 18:37:44 -07:00
Tim Dettmers
8400b58cbb Added Kepler and fixed V100+CUDA101 support. #4 #5 2021-10-17 21:21:39 -07:00