Commit Graph

415 Commits

Author SHA1 Message Date
Titus von Koeller
3fd06fb620 refactored subshell execution code for greater readability and moved it to utils 2022-08-01 09:30:29 -07:00
Titus von Koeller
54efd874a8 flake8 found some stuff that needs fixing before the release 2022-08-01 03:32:34 -07:00
Titus von Koeller
bfa0e33294 ran black and isort for coherent code formatting 2022-08-01 03:31:48 -07:00
Titus von Koeller
597a8521b2 fix typo 2022-08-01 03:22:44 -07:00
Titus von Koeller
57fa64628f minor refactor to more concise syntax 2022-08-01 03:22:12 -07:00
Tim Dettmers
4a6ea7e24b Added adjusted build file. 2022-07-31 20:59:34 -07:00
Tim Dettmers
28d1e7dc01 Initial build script changes (untested on PyPi). 2022-07-31 19:41:56 -07:00
Tim Dettmers
dd50382b32 Full evaluate_cuda setup with integration test. 2022-07-31 17:47:44 -07:00
Titus von Koeller
5d90b38c4d adding CLI tool for CUDA install debugging - intermediate commit 2022-07-27 21:16:04 -07:00
Tim Dettmers
bd515328d7 Fixed deployment script to check for LD_LIBRARY_PATH. 2022-07-27 05:57:50 -07:00
Tim Dettmers
389f66ca5a Fixed direct extraction masking. 2022-07-27 01:46:35 -07:00
Tim Dettmers
a409213656 Fixed make default to compile with cublaslt. 2022-07-26 19:38:17 -07:00
Tim Dettmers
5737f2b027 Merge branch 'patch_merge' into extract_outliers 2022-07-26 19:38:01 -07:00
Tim Dettmers
47a73d94c3 Matmullt with direct outlier extraction for 8-bit inference. 2022-07-26 19:15:35 -07:00
Tim Dettmers
32fa459ed7 Added col_ampere outlier extraction kernel. 2022-07-26 18:15:51 -07:00
Tim Dettmers
bcab99ec87 Working outlier extraction for Turing. 2022-07-26 17:39:30 -07:00
Tim Dettmers
cbb901ac51 Boilerplate and test for extract_outliers. 2022-07-26 12:12:38 -07:00
Tim Dettmers
dc8c9efdb3 Changed setup.py; deployed on test pypi. 2022-07-26 10:32:22 -07:00
Tim Dettmers
953b7285dd Fixed cpuonly build. 2022-07-26 09:12:16 -07:00
Tim Dettmers
f2dd703251 Added matmul build and flags. 2022-07-25 22:34:14 -07:00
Tim Dettmers
9268dc9d88 Some progress on build script; added multi-cuda install script. 2022-07-25 19:30:37 -07:00
Tim Dettmers
1e88edd8c0 Removed rowscale (segfaults on ampere). 2022-07-25 17:27:57 -07:00
Tim Dettmers
8b1fd32e3e Fixed makefile; fixed Ampere igemmlt_8 bug. 2022-07-25 14:02:14 -07:00
Tim Dettmers
7d2ecd30c0 Fixed rowcol synchronization bug. 2022-07-22 15:21:37 -07:00
Tim Dettmers
c771b3a75a Most tests passing. 2022-07-22 14:41:05 -07:00
Tim Dettmers
4cd7ea62b2
Merge pull request #3 from TimDettmers/cpuonly
Add a CPU-only build option
2022-07-18 09:51:37 -07:00
Max Ryabinin
fd750cd237 Update README.md 2022-07-01 17:46:29 +03:00
Max Ryabinin
025824d29b Reduce diff 2022-07-01 17:42:58 +03:00
Max Ryabinin
575aa698fa Reduce diff 2022-07-01 17:41:48 +03:00
Max Ryabinin
4d1d5b569f Reduce diff 2022-07-01 17:40:02 +03:00
Max Ryabinin
31ce1b3708 Reduce diff 2022-07-01 17:36:30 +03:00
Max Ryabinin
e4cf33f2a3 Fix imports 2022-07-01 17:25:44 +03:00
Max Ryabinin
8258b4364a Add a CPU-only build option 2022-07-01 17:16:10 +03:00
Tim Dettmers
3418cd390e
Merge pull request #2 from TimDettmers/fix_imports
Remove unused imports, fix NotImplementedError
2022-06-30 08:21:24 -07:00
Max Ryabinin
33efe4a09f Remove unused imports, fix NotImplementedError 2022-06-30 18:14:20 +03:00
Tim Dettmers
4e60e7dc62 Fixed makefile compute capabilities. 2021-11-29 09:54:19 -08:00
Tim Dettmers
85287b9eda Bumped version for release. 2021-11-29 09:34:29 -08:00
Tim Dettmers
20e1677dfd Added module override, bnb.nn.Embedding #13 #15 #19 2021-11-29 09:32:13 -08:00
Tim Dettmers
3cff6795fb Merge branch 'main' of github.com:facebookresearch/bitsandbytes into 0.26.0 2021-11-29 08:24:17 -08:00
Tim Dettmers
262350c10f
Merge pull request #14 from SirRob1997/main
[FIX] passing of sparse in StableEmbedding
2021-11-29 08:22:16 -08:00
Tim Dettmers
108cf9fc1f Fixed unsafe use of eval. #8 2021-11-29 08:21:05 -08:00
Tim Dettmers
b3fe8a6d0f Upgraded to -std=c++14; printing gpp version. #12 2021-11-28 21:31:03 -08:00
Tim Dettmers
2f8083bd8b Added AdamW. #10 #13 2021-11-28 21:18:11 -08:00
Robin Schmidt
67a1283501 [FIX] passing of sparse in StableEmbedding 2021-11-15 17:27:02 +01:00
Tim Dettmers
037022e878
Merge pull request #9 from ditschuk/fix_adam_imports
Add missing imports to adam
2021-11-15 07:58:44 -08:00
Tim Dettmers
ca2078a697 Updated changelog. 2021-11-10 15:12:39 -08:00
Tim Dettmers
8b3c0f355c Added adagrad with tests (no clipping). 2021-11-10 15:10:02 -08:00
Konstantin Ditschuneit
56f5274848 Add missing imports to adam 2021-10-31 16:38:38 +01:00
Tim Dettmers
22b2877c7f Changed versioning scheme to <major>.<minor>.<patch>. 2021-10-21 22:43:08 -07:00
Tim Dettmers
c1ed5d39b9 Fixed compilation flag for CUDA 11.0. 2021-10-21 22:30:55 -07:00