justheuristic | d9ca0ed905 | un-fuse bias | 2022-09-17 23:44:28 +03:00
justheuristic | 7facedda38 | copypaste tolerances | 2022-09-17 23:41:40 +03:00
justheuristic | e29c5f5c41 | clearer assertions | 2022-09-17 23:22:04 +03:00
justheuristic | 9379df85d2 | check dtypes first | 2022-09-17 23:13:23 +03:00
justheuristic | 140cdbe876 | check dtypes first | 2022-09-17 23:12:58 +03:00
justheuristic | a9c7953e0a | cast to half before double_quant | 2022-09-17 23:10:21 +03:00
justheuristic | 469d5a631d | test_bf16 | 2022-09-17 23:06:57 +03:00
Tim Dettmers | 7e0fb655e1 | Some initial code. Needs to be tested. | 2022-08-23 13:59:34 -07:00
Tim Dettmers | de354f7ded | Added fused bias to matmullt. | 2022-08-16 12:00:54 -07:00
Tim Dettmers | 8f84674d67 | Fixed bugs in cuda setup. | 2022-08-04 09:16:00 -07:00
Tim Dettmers | 758c7175a2 | Merge branch 'debug' into cuda-bin-switch-and-cli | 2022-08-04 08:03:00 -07:00
Tim Dettmers | 451fd9506e | Added fixes for the case that matmullt dim A is zero, e.g. [0, 768]. | 2022-08-03 11:54:01 -07:00
Titus von Koeller | ea7c14f8ef | reran black with linelength 80 for greater readability | 2022-08-01 09:32:47 -07:00
Titus von Koeller | bfa0e33294 | ran black and isort for coherent code formatting | 2022-08-01 03:31:48 -07:00
Tim Dettmers | c771b3a75a | Most tests passing. | 2022-07-22 14:41:05 -07:00