Tim Dettmers
|
5d9bc7a301
|
Merge pull request #19 from mbrukman/patch-1
Add syntax highlighting for BibTeX entries
|
2022-09-05 16:11:38 -07:00 |
|
Tim Dettmers
|
4e4668ab09
|
Merge pull request #13 from chessgecko/patch-1
fix param name
|
2022-09-05 16:10:47 -07:00 |
|
dbaranchuk
|
8d34d36f15
|
req_gradA for casted & more efficient and accurate fp16 backward
|
2022-08-29 00:56:08 +03:00 |
|
dbaranchuk
|
b3fee1ed6a
|
add dtype <-> fp16 cast
|
2022-08-26 04:11:40 +03:00 |
|
dbaranchuk
|
4d6174bc63
|
memory efficient fp16 backward
|
2022-08-25 19:09:23 +03:00 |
|
Max Ryabinin
|
92a3363096
|
Replace print_stderr with warnings.warn
|
2022-08-24 18:45:17 +03:00 |
|
Max Ryabinin
|
9fc0ab415c
|
Remove unused code
|
2022-08-24 18:43:18 +03:00 |
|
Tim Dettmers
|
8e7053a988
|
0.32.2: cpuonly fix and pascal error message.
|
2022-08-23 16:08:26 -07:00 |
|
Tim Dettmers
|
ee5b947e63
|
Fixed issue where Pascal was not displaying proper error.
|
2022-08-23 16:00:26 -07:00 |
|
dbaranchuk
|
ef2936a90d
|
delete CxB from state
|
2022-08-24 01:33:04 +03:00 |
|
dbaranchuk
|
876387dc0c
|
minor fixes
|
2022-08-24 01:12:48 +03:00 |
|
Tim Dettmers
|
7e0fb655e1
|
Some initial code. Needs to be tested.
|
2022-08-23 13:59:34 -07:00 |
|
dbaranchuk
|
656de8ed11
|
minor fixes
|
2022-08-23 23:53:43 +03:00 |
|
dbaranchuk
|
1753aa0418
|
refactoring
|
2022-08-23 23:51:00 +03:00 |
|
dbaranchuk
|
8ae9bb23ad
|
add memory efficient backward
|
2022-08-23 23:39:54 +03:00 |
|
Misha Brukman
|
616e23491d
|
Add syntax highlighting for BibTeX entries
Also minor punctuation fixes.
|
2022-08-20 12:04:31 -04:00 |
|
David Silin
|
e4e13db812
|
fix param name
|
2022-08-17 14:39:10 -05:00 |
|
Tim Dettmers
|
9d60b3c527
|
Fixed bug in Linear8bitLt, when the bias is None.
|
2022-08-17 03:45:57 -07:00 |
|
Tim Dettmers
|
b00cc9137f
|
Added rudimentary LLM.int8() documentation.
|
2022-08-16 19:33:28 -07:00 |
|
Tim Dettmers
|
a6664de072
|
Enhanced error handling in CUDA SETUP failures.
|
2022-08-16 19:03:19 -07:00 |
|
Tim Dettmers
|
de354f7ded
|
Added fused bias to matmullt.
|
2022-08-16 12:00:54 -07:00 |
|
Tim Dettmers
|
dede343033
|
Added fused bias in dequant_mm.
|
2022-08-16 11:12:09 -07:00 |
|
Tim Dettmers
|
111b876449
|
Merge branch 'cuda-bin-switch-and-cli' of github.com:TimDettmers/bitsandbytes into cuda-bin-switch-and-cli
|
2022-08-16 10:57:10 -07:00 |
|
Tim Dettmers
|
1ed2fa2f21
|
Removed storage() from get_ptr; added boilerplate for bias dequant_mm.
|
2022-08-16 10:56:17 -07:00 |
|
Tim Dettmers
|
1ced47c504
|
Added CUDA version warning and fixed cuda_install for 9.2/10.2.
|
2022-08-09 20:02:47 -07:00 |
|
Tim Dettmers
|
f9cbe2fe99
|
Fixed prod Python < 3.7 compatibility in function.py.
|
2022-08-08 09:13:22 -07:00 |
|
Tim Dettmers
|
62441815bc
|
Removed prod for Python <= 3.7 compatibility.
|
2022-08-08 05:20:36 -07:00 |
|
Tim Dettmers
|
26efb154c8
|
Fixed bug where python -m bitsandbytes was failing.
|
2022-08-07 09:49:36 -07:00 |
|
Tim Dettmers
|
a4532c59f7
|
Removed faulty asserts.
|
2022-08-06 09:31:05 -07:00 |
|
Tim Dettmers
|
c472bd56f0
|
Added the case that all env variables are empty (CUDA docker).
|
2022-08-05 08:57:52 -07:00 |
|
Tim Dettmers
|
6ad8796cfc
|
Bumping version for TestPyPi release.
|
2022-08-05 07:17:36 -07:00 |
|
Tim Dettmers
|
e35337f05e
|
Now determining cuda version via libcudart.so call.
|
2022-08-05 07:13:24 -07:00 |
|
Tim Dettmers
|
8f84674d67
|
Fixed bugs in cuda setup.
|
2022-08-04 09:16:00 -07:00 |
|
Tim Dettmers
|
758c7175a2
|
Merge branch 'debug' into cuda-bin-switch-and-cli
|
2022-08-04 08:03:00 -07:00 |
|
Tim Dettmers
|
ab72a1294f
|
Added pre/post device call for extract outliers.
|
2022-08-04 07:47:22 -07:00 |
|
Tim Dettmers
|
cc5b323876
|
Merge branch 'extract_outliers' into debug
|
2022-08-04 07:40:48 -07:00 |
|
Tim Dettmers
|
6101a8fb9f
|
Added pre and post device call to transform.
|
2022-08-04 07:28:12 -07:00 |
|
Tim Dettmers
|
320eacb4c2
|
Removed print statement.
|
2022-08-03 14:17:54 -07:00 |
|
Tim Dettmers
|
451fd9506e
|
Added fixes for the case that matmullt dim A is zero, e.g. [0, 768].
|
2022-08-03 11:54:01 -07:00 |
|
Tim Dettmers
|
2f01865a2f
|
Added CUDA block assert and is_on_gpu check.
|
2022-08-03 09:05:37 -07:00 |
|
Titus von Koeller
|
96bc209baf
|
tentative refactoring of the compute capabilities code
|
2022-08-02 21:27:36 -07:00 |
|
Titus von Koeller
|
59a615b386
|
factored cuda_setup.main out into smaller modules and functions
|
2022-08-02 21:26:50 -07:00 |
|
Titus von Koeller
|
3809236428
|
move cuda_setup code into subpackage
|
2022-08-02 07:42:27 -07:00 |
|
Tim Dettmers
|
e120c4a550
|
Fixed syntax error; bumped revision for beta release.
|
2022-08-01 20:05:03 -07:00 |
|
Tim Dettmers
|
3479d02a76
|
Added some more docs and comments.
|
2022-08-01 19:43:09 -07:00 |
|
Tim Dettmers
|
8bf3e9faab
|
Added full env variable search; CONDA_PREFIX priority.
|
2022-08-01 19:22:41 -07:00 |
|
Titus von Koeller
|
c4fe6c69a3
|
deleted function that was moved but accidentally not removed in commit
|
2022-08-01 09:40:41 -07:00 |
|
Titus von Koeller
|
ea7c14f8ef
|
reran black with linelength 80 for greater readability
|
2022-08-01 09:32:47 -07:00 |
|
Titus von Koeller
|
3fd06fb620
|
refactored subshell execution code for greater readability and moved it to utils
|
2022-08-01 09:30:29 -07:00 |
|
Titus von Koeller
|
54efd874a8
|
flake8 found some stuff that needs fixing before the release
|
2022-08-01 03:32:34 -07:00 |
|