Commit Graph

86 Commits

Author SHA1 Message Date
justheuristic
56a074f6dc un-fuse bias 2022-09-17 23:46:37 +03:00
justheuristic
d9ca0ed905 un-fuse bias 2022-09-17 23:44:28 +03:00
justheuristic
eac9aca460 cast bias too 2022-09-17 23:38:09 +03:00
justheuristic
a9fe0ff98c recast to fp16 2022-09-17 23:34:22 +03:00
justheuristic
fc4a135ed1 clearer assertions 2022-09-17 23:24:26 +03:00
justheuristic
cc4858c2fd some kind of warning or something when this is first executed to make people aware that a cast happens and the operation quantization is performed in fp16. 2022-09-17 20:46:04 +03:00
justheuristic
3634fc738b Merge branch 'TimDettmers:main' into memory-efficient-backward 2022-09-17 18:42:22 +03:00
Tim Dettmers
8ccc0b0ee1 Merge branch 'main' of github.com:TimDettmers/bitsandbytes into main 2022-09-11 11:58:40 -07:00
Tim Dettmers
19a7adca7a Fixed 2^31 max size issue for cpu blockwise quant. 2022-09-11 11:55:09 -07:00
dbaranchuk
e2a75769f2 bug fix 2022-09-11 21:41:46 +03:00
dbaranchuk
4dd475ced4 refactoring 2022-09-11 06:28:17 +03:00
dbaranchuk
d358999e9e refactoring 2022-09-11 06:26:15 +03:00
dbaranchuk
ee325f0215 clarified an exception message 2022-09-11 06:18:44 +03:00
dbaranchuk
42b5fc9acc add memory effcient backward option 2022-09-11 05:51:29 +03:00
Dmitry Baranchuk
843ad0631c Merge pull request #1 from TimDettmers/main (Update main branch) 2022-09-10 19:33:21 -07:00
Tim Dettmers
2e630b55f5 Version bump + bnb.utils import fix. 2022-09-08 13:16:16 -07:00
Tim Dettmers
aca55881b9 Merge branch 'main' into remove_unused_code 2022-09-05 16:29:25 -07:00
dbaranchuk
8d34d36f15 req_gradA for casted & more efficient and accurate fp16 backward 2022-08-29 00:56:08 +03:00
dbaranchuk
b3fee1ed6a add dtype <-> fp16 cast 2022-08-26 04:11:40 +03:00
dbaranchuk
4d6174bc63 memory efficient fp16 backward 2022-08-25 19:09:23 +03:00
Max Ryabinin
92a3363096 Replace print_stderr with warnings.warn 2022-08-24 18:45:17 +03:00
Max Ryabinin
9fc0ab415c Remove unused code 2022-08-24 18:43:18 +03:00
Tim Dettmers
ee5b947e63 Fixed issue where Pascal was not displaying proper error. 2022-08-23 16:00:26 -07:00
dbaranchuk
ef2936a90d delete CxB from state 2022-08-24 01:33:04 +03:00
dbaranchuk
876387dc0c minor fixes 2022-08-24 01:12:48 +03:00
Tim Dettmers
7e0fb655e1 Some initial code. Needs to be tested. 2022-08-23 13:59:34 -07:00
dbaranchuk
656de8ed11 minor fixes 2022-08-23 23:53:43 +03:00
dbaranchuk
1753aa0418 refactoring 2022-08-23 23:51:00 +03:00
dbaranchuk
8ae9bb23ad add memory efficient backward 2022-08-23 23:39:54 +03:00
Tim Dettmers
9d60b3c527 Fixed bug in Linear8bitLt, when the bias is None. 2022-08-17 03:45:57 -07:00
Tim Dettmers
a6664de072 Enhanced error handling in CUDA SETUP failures. 2022-08-16 19:03:19 -07:00
Tim Dettmers
de354f7ded Added fused bias to matmullt. 2022-08-16 12:00:54 -07:00
Tim Dettmers
111b876449 Merge branch 'cuda-bin-switch-and-cli' of github.com:TimDettmers/bitsandbytes into cuda-bin-switch-and-cli 2022-08-16 10:57:10 -07:00
Tim Dettmers
1ed2fa2f21 Removed storage() from get_ptr; added boilerplate for bias dequant_mm. 2022-08-16 10:56:17 -07:00
Tim Dettmers
1ced47c504 Added CUDA version warning and fixed cuda_install for 9.2/10.2. 2022-08-09 20:02:47 -07:00
Tim Dettmers
f9cbe2fe99 Fixed prod Python < 3.7 compatibility in function.py. 2022-08-08 09:13:22 -07:00
Tim Dettmers
62441815bc Removed prod for Python <= 3.7 compatibility. 2022-08-08 05:20:36 -07:00
Tim Dettmers
26efb154c8 Fixed bug where python -m bitsandbytes was failing. 2022-08-07 09:49:36 -07:00
Tim Dettmers
c472bd56f0 Added the case that all env variables are empty (CUDA docker). 2022-08-05 08:57:52 -07:00
Tim Dettmers
e35337f05e Now determining cuda version via libcudart.so call. 2022-08-05 07:13:24 -07:00
Tim Dettmers
8f84674d67 Fixed bugs in cuda setup. 2022-08-04 09:16:00 -07:00
Tim Dettmers
758c7175a2 Merge branch 'debug' into cuda-bin-switch-and-cli 2022-08-04 08:03:00 -07:00
Tim Dettmers
ab72a1294f Added pre/post device call for extract outliers. 2022-08-04 07:47:22 -07:00
Tim Dettmers
cc5b323876 Merge branch 'extract_outliers' into debug 2022-08-04 07:40:48 -07:00
Tim Dettmers
6101a8fb9f Added pre and post device call to transform. 2022-08-04 07:28:12 -07:00
Tim Dettmers
320eacb4c2 Removed print statement. 2022-08-03 14:17:54 -07:00
Tim Dettmers
451fd9506e Added fixes for the case that matmullt dim A is zero, e.g. [0, 768]. 2022-08-03 11:54:01 -07:00
Tim Dettmers
2f01865a2f Added CUDA block assert and is_on_gpu check. 2022-08-03 09:05:37 -07:00
Titus von Koeller
96bc209baf tentative refactoring of the compute capabilities code 2022-08-02 21:27:36 -07:00
Titus von Koeller
59a615b386 factored cuda_setup.main out into smaller modules and functions 2022-08-02 21:26:50 -07:00