broncotc
|
1b52f4243f
|
fixed, works on gfx1030, does save RAM
|
2022-11-24 05:15:08 +00:00 |
|
broncotc
|
2dcf38289d
|
should be hipified, and all CUDA checks cleaned up; Makefile not updated yet
|
2022-11-23 17:52:19 -08:00 |
|
Tim Dettmers
|
c059bd2848
|
Added additional blocksizes: {64, 128, 256}.
|
2022-11-20 14:18:15 -08:00 |
|
Tim Dettmers
|
eb028e6ebc
|
Fixed k-bit quantization maps.
|
2022-11-19 07:24:03 -08:00 |
|
Tim Dettmers
|
08fa2e7b01
|
Fixed bug in cpu quant; faster GPU dequant.
|
2022-11-07 18:06:18 -08:00 |
|
Tim Dettmers
|
62a333ac40
|
Added pre/post calls to quantize_blockwise.
|
2022-11-06 17:17:51 -08:00 |
|
Tim Dettmers
|
e0e697b150
|
Fixed blockwise test and logic.
|
2022-11-06 16:36:31 -08:00 |
|
Tim Dettmers
|
6bc2b992be
|
Added blocksizes 2048, 1024, and 512 to blockwise quant.
|
2022-11-06 16:27:48 -08:00 |
|
Tim Dettmers
|
2f2063bac2
|
Added k<256 quantile estimate.
|
2022-11-06 13:05:25 -08:00 |
|
Tim Dettmers
|
98cbc4bc4f
|
Added k-bit fp8 map.
|
2022-11-06 11:59:37 -08:00 |
|
Tim Dettmers
|
caf1832526
|
Added k-bit linear quantization.
|
2022-11-06 11:47:54 -08:00 |
|
Tim Dettmers
|
1efb87d89d
|
Added FP8 quantization map.
|
2022-11-03 19:49:50 -07:00 |
|
Tim Dettmers
|
8d87c0b852
|
Fixed CUDA setup bugs, including #81.
|
2022-10-31 18:04:49 -07:00 |
|
Tim Dettmers
|
4844aef4ff
|
Fixing bad error when GPU was not detected for #73.
|
2022-10-27 08:54:30 -07:00 |
|
Tom Aarsen
|
4faf6cb7e9
|
Replace seemingly incorrect use of CUDA_RUNTIME_LIB
|
2022-10-26 09:43:57 +02:00 |
|
Tom Aarsen
|
c584482f1f
|
Resolve cases of CUDASetup.get_instance not being called when used
|
2022-10-26 09:37:16 +02:00 |
|
Tim Dettmers
|
a371be302d
|
Added CUDA SETUP instruction generator.
|
2022-10-25 08:01:19 -07:00 |
|
Tim Dettmers
|
df86625a93
|
Isolated CUDASetup logging; all tests green.
|
2022-10-24 11:54:25 -07:00 |
|
Tim Dettmers
|
ed2e3b9db4
|
Merge pull request #36 from tomaarsen/hotfix/os_error_name_too_long
Fixes `OSError: File name too long` when environment variable is too long
|
2022-10-09 16:47:11 -07:00 |
|
Tim Dettmers
|
76699b4a8d
|
Merge pull request #37 from tomaarsen/hotfix/colab_just_cpu
Perform check using implicit list length
|
2022-10-09 16:43:58 -07:00 |
|
Tim Dettmers
|
9b7d307b8c
|
review
|
2022-09-20 06:36:32 +03:00 |
|
justheuristic
|
5d65817101
|
debug
|
2022-09-18 01:09:24 +03:00 |
|
justheuristic
|
4da2227fcb
|
debug
|
2022-09-18 01:03:21 +03:00 |
|
justheuristic
|
4b4a9effd1
|
debugprint
|
2022-09-18 01:02:13 +03:00 |
|
justheuristic
|
7906dc4c9a
|
debugprint
|
2022-09-18 00:57:26 +03:00 |
|
justheuristic
|
591f60395a
|
add memory efficient backward
|
2022-09-18 00:52:53 +03:00 |
|
justheuristic
|
579b8c782f
|
reduce diff
|
2022-09-18 00:47:58 +03:00 |
|
justheuristic
|
76ece2c126
|
rollback
|
2022-09-18 00:43:56 +03:00 |
|
justheuristic
|
18f142e268
|
addmm_
|
2022-09-18 00:43:02 +03:00 |
|
justheuristic
|
ab9dee062d
|
cast edge case
|
2022-09-18 00:36:46 +03:00 |
|
justheuristic
|
cbfdf0b5ef
|
cast edge case
|
2022-09-18 00:35:42 +03:00 |
|
justheuristic
|
e35e2c665a
|
cast properly
|
2022-09-18 00:35:03 +03:00 |
|
justheuristic
|
577275bd8c
|
cast properly
|
2022-09-18 00:30:57 +03:00 |
|
justheuristic
|
45dc1983e9
|
cast properly
|
2022-09-18 00:28:03 +03:00 |
|
justheuristic
|
702cc72018
|
debug assert
|
2022-09-18 00:26:46 +03:00 |
|
justheuristic
|
a214824f93
|
matmul -1- addmm
|
2022-09-18 00:24:59 +03:00 |
|
justheuristic
|
14048a3c16
|
safer cast
|
2022-09-18 00:24:20 +03:00 |
|
justheuristic
|
5b169f18e4
|
change typecast behavior
|
2022-09-18 00:21:15 +03:00 |
|
justheuristic
|
1da4880262
|
change typecast behavior
|
2022-09-18 00:19:22 +03:00 |
|
justheuristic
|
1145589f84
|
change typecast behavior
|
2022-09-18 00:15:57 +03:00 |
|
justheuristic
|
d6e25b5f5e
|
change typecast behavior
|
2022-09-18 00:15:18 +03:00 |
|
justheuristic
|
e2b523d071
|
change typecast behavior
|
2022-09-18 00:07:05 +03:00 |
|
justheuristic
|
85bf5294a6
|
debug assert
|
2022-09-18 00:01:25 +03:00 |
|
justheuristic
|
210b9ed9ce
|
debug assert
|
2022-09-18 00:00:45 +03:00 |
|
justheuristic
|
647c976a74
|
change order
|
2022-09-17 23:59:36 +03:00 |
|
justheuristic
|
0de1a4494b
|
change order
|
2022-09-17 23:53:49 +03:00 |
|
justheuristic
|
e9b87112ee
|
un-fuse bias
|
2022-09-17 23:51:28 +03:00 |
|
justheuristic
|
56a074f6dc
|
un-fuse bias
|
2022-09-17 23:46:37 +03:00 |
|
justheuristic
|
d9ca0ed905
|
un-fuse bias
|
2022-09-17 23:44:28 +03:00 |
|
justheuristic
|
eac9aca460
|
cast bias too
|
2022-09-17 23:38:09 +03:00 |
|