Tim Dettmers
|
211ad594df
|
Added error+instructions for unsupported CUDA 10.0 version #82
|
2023-01-03 06:07:35 -08:00 |
|
Tim Dettmers
|
9d353ca786
|
Merge pull request #87 from lostmsu/main
Add `device` and `dtype` parameters to `StableEmbedding`
|
2023-01-02 13:22:45 +01:00 |
|
Tim Dettmers
|
7a6563b6c8
|
Default to CPU library on CUDA error+small refactor.
|
2023-01-02 03:47:09 -08:00 |
|
Tim Dettmers
|
d9112dc55b
|
Merge pull request #110 from BlackHC/cublaslt_version
Improve cc version detection for cublaslt
|
2023-01-02 12:35:53 +01:00 |
|
Tim Dettmers
|
336e24696c
|
CUDASetup only executed once + fixed circular import.
|
2023-01-02 03:31:43 -08:00 |
|
Tim Dettmers
|
be5cecb88f
|
Merge branch 'main' into main
|
2023-01-02 11:23:17 +01:00 |
|
Tim Dettmers
|
c91f592ad7
|
Merge branch 'main' into cleanup
|
2023-01-02 11:19:16 +01:00 |
|
blackhc
|
ed17aa9a31
|
Don't mark it as failure though.
|
2022-12-29 23:50:48 +00:00 |
|
blackhc
|
7b39a5511d
|
Fix issue #97
|
2022-12-29 23:47:21 +00:00 |
|
Tim Dettmers
|
c059bd2848
|
Added additional blocksizes: {64, 128, 256}.
|
2022-11-20 14:18:15 -08:00 |
|
Tim Dettmers
|
eb028e6ebc
|
Fixed k-bit quantization maps.
|
2022-11-19 07:24:03 -08:00 |
|
Tom Aarsen
|
b104ce3b62
|
Merge branch 'main' into cleanup
|
2022-11-17 15:22:29 +01:00 |
|
Tim Dettmers
|
08fa2e7b01
|
Fixed bug in cpu quant; faster GPU dequant.
|
2022-11-07 18:06:18 -08:00 |
|
Tim Dettmers
|
62a333ac40
|
Added pre/post calls to quantize_blockwise.
|
2022-11-06 17:17:51 -08:00 |
|
Tim Dettmers
|
e0e697b150
|
Fixed blockwise test and logic.
|
2022-11-06 16:36:31 -08:00 |
|
Tim Dettmers
|
6bc2b992be
|
Added blocksizes 2048, 1024, and 512 to blockwise quant.
|
2022-11-06 16:27:48 -08:00 |
|
Tim Dettmers
|
2f2063bac2
|
Added k<256 quantile estimate.
|
2022-11-06 13:05:25 -08:00 |
|
Tim Dettmers
|
98cbc4bc4f
|
Added k-bit fp8 map.
|
2022-11-06 11:59:37 -08:00 |
|
Tim Dettmers
|
caf1832526
|
Added k-bit linear quantization.
|
2022-11-06 11:47:54 -08:00 |
|
Victor Nova
|
62d39a237c
|
add device and dtype parameters to StableEmbedding
|
2022-11-04 14:12:46 -07:00 |
|
Tim Dettmers
|
1efb87d89d
|
Added FP8 quantization map.
|
2022-11-03 19:49:50 -07:00 |
|
Tom Aarsen
|
62c0bd2278
|
Fix several typos in logging and comments
Via codespell
|
2022-11-01 09:53:47 +01:00 |
|
Tom Aarsen
|
d504050ff7
|
Call isort over cuda_setup/main.py
|
2022-11-01 09:46:03 +01:00 |
|
Tom Aarsen
|
30f28b94a0
|
Merge branch 'main' into cleanup
|
2022-11-01 09:43:49 +01:00 |
|
Tim Dettmers
|
8d87c0b852
|
Fixed CUDA setup bugs, including #81.
|
2022-10-31 18:04:49 -07:00 |
|
adpkadspokasdk
|
8724c990c7
|
allow hiding of the welcome message
|
2022-10-27 16:04:49 -06:00 |
|
Tim Dettmers
|
4844aef4ff
|
Fixing bad error when GPU was not detected for #73.
|
2022-10-27 08:54:30 -07:00 |
|
Tom Aarsen
|
c6dad28a08
|
Remove extraneous get_ptr calls
|
2022-10-27 13:53:16 +02:00 |
|
Tom Aarsen
|
7727fa4c8c
|
Remove f-prefix from strings that don't use formatting
|
2022-10-27 13:36:39 +02:00 |
|
Tom Aarsen
|
54bd6ed1d6
|
Remove unused imports
|
2022-10-27 13:32:01 +02:00 |
|
Tom Aarsen
|
ef70f2adcd
|
Fix bad indentation
|
2022-10-27 13:27:18 +02:00 |
|
Tom Aarsen
|
697bd02c60
|
Resolve dangerous default value [] as argument
|
2022-10-27 13:25:51 +02:00 |
|
Tom Aarsen
|
b5cf706341
|
Removing unnecessary else's
|
2022-10-27 13:25:07 +02:00 |
|
Tom Aarsen
|
4a05df34c2
|
Fix critical bug in PytorchLARS().step: Undefined variable
|
2022-10-27 13:19:09 +02:00 |
|
Tom Aarsen
|
f6978ae2a2
|
Fix critical bug in histogram_scatter_add_2d: Undefined variable
|
2022-10-27 13:16:53 +02:00 |
|
Tom Aarsen
|
7a3c9af05d
|
Sort imports
Via isort
|
2022-10-27 13:15:21 +02:00 |
|
Tom Aarsen
|
0b078403ee
|
Simplify statements into equivalent, modern variants
via pyupgrade --py37-plus. The changes include, e.g., dropping explicit subclassing from object, replacing super(ThisClass, self) with super(), and updating old-style string formatting.
|
2022-10-27 13:14:13 +02:00 |
|
Tom Aarsen
|
1eec77d34c
|
Remove trailing whitespace & ensure newline at EOF
|
2022-10-27 13:11:29 +02:00 |
|
Tom Aarsen
|
31f6689504
|
Remove references to unused cli
|
2022-10-27 13:10:32 +02:00 |
|
Tom Aarsen
|
4faf6cb7e9
|
Replace seemingly incorrect use of CUDA_RUNTIME_LIB
|
2022-10-26 09:43:57 +02:00 |
|
Tom Aarsen
|
c584482f1f
|
Resolve cases of CUDASetup.get_instance not being called when used
|
2022-10-26 09:37:16 +02:00 |
|
Tim Dettmers
|
a371be302d
|
Added CUDA SETUP instruction generator.
|
2022-10-25 08:01:19 -07:00 |
|
Tim Dettmers
|
df86625a93
|
Isolated CUDASetup logging; all tests green.
|
2022-10-24 11:54:25 -07:00 |
|
Tim Dettmers
|
ed2e3b9db4
|
Merge pull request #36 from tomaarsen/hotfix/os_error_name_too_long
Fixes `OSError: File name too long` when environment variable is too long
|
2022-10-09 16:47:11 -07:00 |
|
Tim Dettmers
|
76699b4a8d
|
Merge pull request #37 from tomaarsen/hotfix/colab_just_cpu
Perform check using implicit list length
|
2022-10-09 16:43:58 -07:00 |
|
Tim Dettmers
|
9b7d307b8c
|
review
|
2022-09-20 06:36:32 +03:00 |
|
justheuristic
|
5d65817101
|
debug
|
2022-09-18 01:09:24 +03:00 |
|
justheuristic
|
4da2227fcb
|
debug
|
2022-09-18 01:03:21 +03:00 |
|
justheuristic
|
4b4a9effd1
|
debugprint
|
2022-09-18 01:02:13 +03:00 |
|
justheuristic
|
7906dc4c9a
|
debugprint
|
2022-09-18 00:57:26 +03:00 |
|