Tim Dettmers
|
0f9d30207f
|
Added nested quantization for blockwise quantization.
|
2023-04-19 11:48:47 -07:00 |
|
Tim Dettmers
|
7dc198feb7
|
Added 32-bit optimizer for bfloat16 gradients.
|
2023-04-17 18:01:49 -07:00 |
|
Tim Dettmers
|
b8ea2b416d
|
Fixed bias conversion in Linear4bit
|
2023-04-12 12:28:35 -07:00 |
|
Tim Dettmers
|
e9fa03b717
|
Some fixed for loading PEFT modules with Params4bit.
|
2023-04-07 09:59:21 -07:00 |
|
Tim Dettmers
|
1ccb7bdec6
|
Fixed ParamsIn4 init; fixed PyTorch 2.0 test failure.
|
2023-04-03 18:47:00 -07:00 |
|
Tim Dettmers
|
4ea489d3bf
|
Refactor FP4 into 4Bit and integrate NF4 data type.
|
2023-04-03 11:00:12 -07:00 |
|
Tim Dettmers
|
64cc05920d
|
First draft of NF4.
|
2023-04-02 16:10:35 -07:00 |
|
Tim Dettmers
|
4ad999d144
|
Added quantization tree generation.
|
2023-04-02 14:42:45 -07:00 |
|
Tim Dettmers
|
0d332a641f
|
Added normal with extra value.
|
2023-04-02 14:09:08 -07:00 |
|
Tim Dettmers
|
2dd5d69056
|
Generalized FP4 data type.
|
2023-04-02 12:42:01 -07:00 |
|
Tim Dettmers
|
51a21df728
|
Added 8-bit compression to quantization statistics.
|
2023-04-01 16:10:18 -07:00 |
|
Tim Dettmers
|
c4cfe4fbdd
|
Added bf16 Adam.
|
2023-04-01 10:33:03 -07:00 |
|
Tim Dettmers
|
8645d1f71c
|
Added normal quant.
|
2023-03-29 18:41:37 -07:00 |
|
Tim Dettmers
|
69810521d3
|
Some small changes.
|
2023-03-27 09:12:57 -07:00 |
|
Artidoro Pagnoni
|
6c31a5fe99
|
t5 model fix
|
2023-02-27 14:23:21 -08:00 |
|
Tim Dettmers
|
9851a10b46
|
Added cast to fp4 layer for speed.
|
2023-02-24 10:17:57 -08:00 |
|
Tim Dettmers
|
c93a90d075
|
Fixed FP4 import and data type conversion in backward.
|
2023-02-14 13:31:39 -08:00 |
|
Tim Dettmers
|
7f0773aede
|
Added backprop test for Linear8bitLt and LinearFP4.
|
2023-02-05 06:49:54 -08:00 |
|
Tim Dettmers
|
c0c352b379
|
Added bias test for LinearFP4 and basic test.
|
2023-02-05 06:29:52 -08:00 |
|
Tim Dettmers
|
c361f84239
|
Fixed matmul_fp4 transpose.
|
2023-02-05 06:16:56 -08:00 |
|
Tim Dettmers
|
cfe4705e32
|
Added matmul_fp4 to the benchmark.
|
2023-02-04 22:00:04 -08:00 |
|
Tim Dettmers
|
13c0a4dc5d
|
Backward matmul_fp4 passes.
|
2023-02-04 21:35:43 -08:00 |
|
Tim Dettmers
|
160a83580d
|
Forward matmul_fp4 tests pass.
|
2023-02-04 21:11:21 -08:00 |
|
Tim Dettmers
|
3ac5840c03
|
Added fp4 quant/dequant and dequant optimizations.
|
2023-02-04 14:52:04 -08:00 |
|
Tim Dettmers
|
0f5c394870
|
Added version 0.37.0.
|
2023-02-01 20:27:01 -08:00 |
|
Tim Dettmers
|
de53588934
|
Added Int8 matmul support for all GPUs. Full backward support.
|
2023-02-01 20:09:31 -08:00 |
|
Tim Dettmers
|
92ab6a8d5f
|
Merge pull request #119 from stas00/patch-1
improve install instructions
|
2023-02-01 19:21:36 -08:00 |
|
Stas Bekman
|
c5372a8567
|
improve install instructions
|
2023-01-05 13:34:51 -08:00 |
|
Tim Dettmers
|
1341fb44ad
|
Fixed issue where the CUDA SETUP was not printed.
|
2023-01-04 03:50:53 -08:00 |
|
Tim Dettmers
|
3901ebf7ae
|
Added CUDA 12.0 support; removed CC 3.0 support.
|
2023-01-04 02:28:33 -08:00 |
|
Tim Dettmers
|
b3de19218e
|
Added error message for unexpected CUDA exception.
|
2023-01-03 06:57:07 -08:00 |
|
Tim Dettmers
|
81990491ff
|
Merge pull request #113 from Borzik/fix-warnings
Import missing warn function
|
2023-01-03 15:46:58 +01:00 |
|
Tim Dettmers
|
9180b4cc11
|
Added additional error message for cudart error #85
|
2023-01-03 06:44:11 -08:00 |
|
Tim Dettmers
|
dfb049f8e4
|
Added Python >= 3.8 requirement.
|
2023-01-03 06:20:06 -08:00 |
|
Tim Dettmers
|
211ad594df
|
Added error+instructions for unsupported CUDA 10.0 version #82
|
2023-01-03 06:07:35 -08:00 |
|
Felix Borzik
|
f3800bab75
|
import warn function
|
2023-01-03 13:23:34 +00:00 |
|
Tim Dettmers
|
9d353ca786
|
Merge pull request #87 from lostmsu/main
Add `device` and `dtype` parameters to `StableEmbedding`
|
2023-01-02 13:22:45 +01:00 |
|
Tim Dettmers
|
7a6563b6c8
|
Default to CPU library on CUDA error+small refactor.
|
2023-01-02 03:47:09 -08:00 |
|
Tim Dettmers
|
d9112dc55b
|
Merge pull request #110 from BlackHC/cublaslt_version
Improve cc version detection for cublaslt
|
2023-01-02 12:35:53 +01:00 |
|
Tim Dettmers
|
336e24696c
|
CUDASetup only executed once + fixed circular import.
|
2023-01-02 03:31:43 -08:00 |
|
Tim Dettmers
|
df9a9b0c4c
|
Merge pull request #77 from Cyberes/main
Allow hiding of the welcome message
|
2023-01-02 11:28:17 +01:00 |
|
Tim Dettmers
|
be5cecb88f
|
Merge branch 'main' into main
|
2023-01-02 11:23:17 +01:00 |
|
Tim Dettmers
|
f0ec93d016
|
Merge pull request #76 from tomaarsen/cleanup
Cleanup involving a handful of failures, some optimization and a lot of code quality improvements
|
2023-01-02 11:19:28 +01:00 |
|
Tim Dettmers
|
c91f592ad7
|
Merge branch 'main' into cleanup
|
2023-01-02 11:19:16 +01:00 |
|
blackhc
|
ed17aa9a31
|
Don't mark it as failure though.
|
2022-12-29 23:50:48 +00:00 |
|
blackhc
|
7b39a5511d
|
Fix issue #97
|
2022-12-29 23:47:21 +00:00 |
|
Tim Dettmers
|
c059bd2848
|
Added additional blocksizes: {64, 128, 256}.
|
2022-11-20 14:18:15 -08:00 |
|
Tim Dettmers
|
eb028e6ebc
|
Fixed k-bit quantization maps.
|
2022-11-19 07:24:03 -08:00 |
|
Tom Aarsen
|
b104ce3b62
|
Merge branch 'main' into cleanup
|
2022-11-17 15:22:29 +01:00 |
|
Tim Dettmers
|
08fa2e7b01
|
Fixed bug in cpu quant; faster GPU dequant.
|
2022-11-07 18:06:18 -08:00 |
|