Commit Graph

484 Commits

Author SHA1 Message Date
blackhc
ed17aa9a31 Don't mark it as failure though. 2022-12-29 23:50:48 +00:00
blackhc
7b39a5511d Fix issue #97 2022-12-29 23:47:21 +00:00
Tim Dettmers
c059bd2848 Added additional blocksizes: {64, 128, 256}. 2022-11-20 14:18:15 -08:00
Tim Dettmers
eb028e6ebc Fixed k-bit quantization maps. 2022-11-19 07:24:03 -08:00
Tom Aarsen
b104ce3b62
Merge branch 'main' into cleanup 2022-11-17 15:22:29 +01:00
Tim Dettmers
08fa2e7b01 Fixed bug in cpu quant; faster GPU dequant. 2022-11-07 18:06:18 -08:00
Tim Dettmers
62a333ac40 Added pre/post calls do quantize_blockwise. 2022-11-06 17:17:51 -08:00
Tim Dettmers
e0e697b150 Fixed blockwise test and logic. 2022-11-06 16:36:31 -08:00
Tim Dettmers
6bc2b992be Added blocksizes 2048, 1024, and 512 to blockwise quant. 2022-11-06 16:27:48 -08:00
Tim Dettmers
2f2063bac2 Added k<256 quantile estimate. 2022-11-06 13:05:25 -08:00
Tim Dettmers
98cbc4bc4f Added k-bit fp8 map. 2022-11-06 11:59:37 -08:00
Tim Dettmers
caf1832526 Added k-bit linear quantization. 2022-11-06 11:47:54 -08:00
Victor Nova
62d39a237c
add device and dtype parameters to StableEmbedding 2022-11-04 14:12:46 -07:00
Tim Dettmers
1efb87d89d Added FP8 quantization map. 2022-11-03 19:49:50 -07:00
Tom Aarsen
62c0bd2278 Fix several typos in logging and comments
Via codespell
2022-11-01 09:53:47 +01:00
Tom Aarsen
d504050ff7 Call isort over cuda_setup/main.py 2022-11-01 09:46:03 +01:00
Tom Aarsen
30f28b94a0
Merge branch 'main' into cleanup 2022-11-01 09:43:49 +01:00
Tim Dettmers
8d87c0b852 Fixed CUDA setup bugs, including #81. 2022-10-31 18:04:49 -07:00
adpkadspokasdk
8724c990c7 allow hiding of the welcome message 2022-10-27 16:04:49 -06:00
Tom Aarsen
2a91e15113 Remove outdated linter log 2022-10-27 20:50:49 +02:00
Tim Dettmers
4844aef4ff Fixing bad error when GPU was not detected for #73. 2022-10-27 08:54:30 -07:00
Tim Dettmers
96ab2af1ef Bump version. 2022-10-27 07:09:08 -07:00
Tim Dettmers
29e239e4d1
Merge pull request #72 from tomaarsen/hotfix/uncalled_func
Prevent `AttributeError: 'function' object has no attribute 'add_log_entry'` on logging
2022-10-27 07:06:54 -07:00
Tom Aarsen
c6dad28a08 Remove extraneous get_ptr calls 2022-10-27 13:53:16 +02:00
Tom Aarsen
7727fa4c8c Remove f-prefix from strings that don't use formatting 2022-10-27 13:36:39 +02:00
Tom Aarsen
54bd6ed1d6 Remove unused imports 2022-10-27 13:32:01 +02:00
Tom Aarsen
ef70f2adcd Fix bad indentation 2022-10-27 13:27:18 +02:00
Tom Aarsen
697bd02c60 Resolve dangerous default value [] as argument 2022-10-27 13:25:51 +02:00
Tom Aarsen
b5cf706341 Removing unnecessary else's 2022-10-27 13:25:07 +02:00
Tom Aarsen
4a05df34c2 Fix critical bug in PytorchLARS().step: Undefined variable 2022-10-27 13:19:09 +02:00
Tom Aarsen
f6978ae2a2 Fix critical bug in histogram_scatter_add_2d: Undefined variable 2022-10-27 13:16:53 +02:00
Tom Aarsen
7a3c9af05d Sort imports
Via isort
2022-10-27 13:15:21 +02:00
Tom Aarsen
0b078403ee Simplify statements into equivalent, modern variants
via pyupgrade --py37-plus. The changes e.g. are subclassing from object, calling super() with super(ThisClass, self), or old-style syntax formatting.
2022-10-27 13:14:13 +02:00
Tom Aarsen
1eec77d34c Remove trailing whitespace & ensure newline at EOF 2022-10-27 13:11:29 +02:00
Tom Aarsen
31f6689504 Remove references to unused cli 2022-10-27 13:10:32 +02:00
Tom Aarsen
4faf6cb7e9 Replace seemingly incorrect use of CUDA_RUNTIME_LIB 2022-10-26 09:43:57 +02:00
Tom Aarsen
c584482f1f Resolve cases of CUDASetup.get_instance not being called when used 2022-10-26 09:37:16 +02:00
Tim Dettmers
a371be302d Added CUDA SETUP instruction generator. 2022-10-25 08:01:19 -07:00
Tim Dettmers
62e1649357 Bumped version. Fixes for diverse issues relating CUDA SETUP. 2022-10-24 14:47:56 -07:00
Tim Dettmers
df86625a93 Isolated CUDASetup logging; all tests green. 2022-10-24 11:54:25 -07:00
Tim Dettmers
b844e104b7 Updated docs (#32) and changelog. 2022-10-09 19:31:43 -07:00
Tim Dettmers
62b6a9399d Added CUDA 11.8 install and deployment. 2022-10-09 19:02:28 -07:00
Tim Dettmers
ed2e3b9db4
Merge pull request #36 from tomaarsen/hotfix/os_error_name_too_long
Fixes `OSError: File name too long` when environment variable is too long
2022-10-09 16:47:11 -07:00
Tim Dettmers
76699b4a8d
Merge pull request #37 from tomaarsen/hotfix/colab_just_cpu
Perform check using implicit list length
2022-10-09 16:43:58 -07:00
Tim Dettmers
7740c6e9c9 Fixed url in setup.py (#38), updated changelog. 2022-09-19 21:13:40 -07:00
Tim Dettmers
439f2b0c10
Merge pull request #33 from dbaranchuk/memory-efficient-backward
Memory efficient backward
2022-09-19 21:09:25 -07:00
justheuristic
76ce9aa6da try fp32 2022-09-20 06:51:25 +03:00
Tim Dettmers
292a478716 set threshold 2022-09-20 06:42:05 +03:00
justheuristic
a07825ac31 review 2022-09-20 06:40:36 +03:00
Tim Dettmers
9b7d307b8c review 2022-09-20 06:36:32 +03:00