Commit Graph

70 Commits

Author SHA1 Message Date
Tim Dettmers
4ea489d3bf Refactor FP4 into 4Bit and integrate NF4 data type. 2023-04-03 11:00:12 -07:00
Tim Dettmers
9851a10b46 Added cast to fp4 layer for speed. 2023-02-24 10:17:57 -08:00
Tim Dettmers
c93a90d075 Fixed FP4 import and data type conversion in backward. 2023-02-14 13:31:39 -08:00
Tim Dettmers
c361f84239 Fixed matmul_fp4 transpose. 2023-02-05 06:16:56 -08:00
Tim Dettmers
cfe4705e32 Added matmul_fp4 to the benchmark. 2023-02-04 22:00:04 -08:00
Tim Dettmers
13c0a4dc5d Backward matmul_fp4 passes. 2023-02-04 21:35:43 -08:00
Tim Dettmers
160a83580d Forward matmul_fp4 tests pass. 2023-02-04 21:11:21 -08:00
Tim Dettmers
de53588934 Added Int8 matmul support for all GPUs. Full backward support. 2023-02-01 20:09:31 -08:00
Tom Aarsen
697bd02c60 Resolve dangerous default value [] as argument 2022-10-27 13:25:51 +02:00
Tom Aarsen
7a3c9af05d Sort imports
Via isort
2022-10-27 13:15:21 +02:00
Tom Aarsen
0b078403ee Simplify statements into equivalent, modern variants
via pyupgrade --py37-plus. The changes e.g. are subclassing from object, calling super() with super(ThisClass, self), or old-style syntax formatting.
2022-10-27 13:14:13 +02:00
Tom Aarsen
1eec77d34c Remove trailing whitespace & ensure newline at EOF 2022-10-27 13:11:29 +02:00
Tim Dettmers
9b7d307b8c review 2022-09-20 06:36:32 +03:00
justheuristic
5d65817101 debug 2022-09-18 01:09:24 +03:00
justheuristic
4da2227fcb debug 2022-09-18 01:03:21 +03:00
justheuristic
4b4a9effd1 debugprint 2022-09-18 01:02:13 +03:00
justheuristic
7906dc4c9a debugpritn 2022-09-18 00:57:26 +03:00
justheuristic
591f60395a add memory efficient backward 2022-09-18 00:52:53 +03:00
justheuristic
579b8c782f reduce diff 2022-09-18 00:47:58 +03:00
justheuristic
76ece2c126 rollback 2022-09-18 00:43:56 +03:00
justheuristic
18f142e268 addmm_ 2022-09-18 00:43:02 +03:00
justheuristic
ab9dee062d cast edge case 2022-09-18 00:36:46 +03:00
justheuristic
cbfdf0b5ef cast edge case 2022-09-18 00:35:42 +03:00
justheuristic
e35e2c665a cast properly 2022-09-18 00:35:03 +03:00
justheuristic
577275bd8c cast properly 2022-09-18 00:30:57 +03:00
justheuristic
45dc1983e9 cast properly 2022-09-18 00:28:03 +03:00
justheuristic
702cc72018 debug asset 2022-09-18 00:26:46 +03:00
justheuristic
a214824f93 matmul -1- addmm 2022-09-18 00:24:59 +03:00
justheuristic
14048a3c16 safer cast 2022-09-18 00:24:20 +03:00
justheuristic
5b169f18e4 change typecast behavior 2022-09-18 00:21:15 +03:00
justheuristic
1da4880262 change typecast behavior 2022-09-18 00:19:22 +03:00
justheuristic
1145589f84 change typecast behavior 2022-09-18 00:15:57 +03:00
justheuristic
d6e25b5f5e change typecast behavior 2022-09-18 00:15:18 +03:00
justheuristic
e2b523d071 change typecast behavior 2022-09-18 00:07:05 +03:00
justheuristic
85bf5294a6 debug assert 2022-09-18 00:01:25 +03:00
justheuristic
210b9ed9ce debug assert 2022-09-18 00:00:45 +03:00
justheuristic
647c976a74 change order 2022-09-17 23:59:36 +03:00
justheuristic
0de1a4494b change order 2022-09-17 23:53:49 +03:00
justheuristic
e9b87112ee un-fuse bias 2022-09-17 23:51:28 +03:00
justheuristic
56a074f6dc un-fuse bias 2022-09-17 23:46:37 +03:00
justheuristic
d9ca0ed905 un-fuse bias 2022-09-17 23:44:28 +03:00
justheuristic
eac9aca460 cast bias too 2022-09-17 23:38:09 +03:00
justheuristic
a9fe0ff98c recast to fp16 2022-09-17 23:34:22 +03:00
justheuristic
fc4a135ed1 clearer assertions 2022-09-17 23:24:26 +03:00
justheuristic
cc4858c2fd some kind of warning or something when this is first executed to make people aware that a cast happens and the operation quantization is performed in fp16. 2022-09-17 20:46:04 +03:00
dbaranchuk
e2a75769f2 bug fix 2022-09-11 21:41:46 +03:00
dbaranchuk
4dd475ced4 refactoring 2022-09-11 06:28:17 +03:00
dbaranchuk
d358999e9e refactoring 2022-09-11 06:26:15 +03:00
dbaranchuk
ee325f0215 clarified an exception message 2022-09-11 06:18:44 +03:00
dbaranchuk
42b5fc9acc add memory effcient backward option 2022-09-11 05:51:29 +03:00