Commit Graph

45 Commits

Author SHA1 Message Date
Tim Dettmers
7140c01405 Merge branch 'main' into fp8_merge 2023-04-12 11:44:39 -07:00
Tim Dettmers
dd562c24f1 Refactored simulated fp8 modules into research.nn. 2023-04-12 11:24:44 -07:00
Tim Dettmers
ec1ea63711 Refactored triton into its own folder. Refactored fp8 matmuls. 2023-04-12 09:39:39 -07:00
Mitchell Wortsman
d677a71607 typo 2023-04-08 19:36:17 +00:00
Mitchell Wortsman
da524d97c9 mem efficient" 2023-04-08 19:34:18 +00:00
Mitchell Wortsman
7f87ba83ee cleaning and refactor 2023-04-01 18:46:04 +00:00
Tim Dettmers
a13a522c4c Added first triton test. 2023-03-31 11:20:54 -07:00
Mitchell Wortsman
5f3d9ada8d triton-v1 2023-03-29 06:47:08 +00:00
Mitchell Wortsman
51f8bb7133 pre-triton update 2023-03-24 05:44:42 +00:00
Max Ryabinin
24609b66af Reduce diff 2023-02-25 06:24:58 +01:00
Max Ryabinin
d15822a54b Refactor _tile_indices into a cached property, fix device bug 2023-02-25 06:23:07 +01:00
Max Ryabinin
cc608c04c2 Revert the layout if weights were reordered 2023-02-25 06:02:06 +01:00
Max Ryabinin
cd4d904a4c Raise an error when loading a quantized checkpoint before quantization 2023-02-25 06:01:34 +01:00
Mitchell Wortsman
75377d125e new experiments 2023-02-24 00:10:15 +00:00
Mitchell Wortsman
3fbf60ad83 sim now worse than real 2023-02-23 08:27:15 +00:00
Max Ryabinin
58b09ee1b1 [WIP] Implement proper serialization of Linear8bitLt 2023-02-21 12:04:47 +01:00
Mitchell Wortsman
7b764d3569 adding half() cast 2023-02-21 03:53:44 +00:00
Tim Dettmers
2dfa3ce16d Fixed LinearFP8 and added tests. 2023-02-13 17:48:52 -08:00
Tim Dettmers
fa255cbc56 Added missing import. 2023-02-13 17:29:39 -08:00
Tim Dettmers
ca3236587a Added forward/backward tests; removed bias. 2023-02-13 17:20:52 -08:00
Tim Dettmers
6bdb6c351e Added fp8 simulation layer. 2023-02-13 16:53:07 -08:00
Tim Dettmers
de53588934 Added Int8 matmul support for all GPUs. Full backward support. 2023-02-01 20:09:31 -08:00
Tim Dettmers
c9f505064e Added outlier detector and fake quantization layer. 2023-01-28 17:05:22 -08:00
Tim Dettmers
9d353ca786 Merge pull request #87 from lostmsu/main: Add `device` and `dtype` parameters to `StableEmbedding` 2023-01-02 13:22:45 +01:00
Victor Nova
62d39a237c add device and dtype parameters to StableEmbedding 2022-11-04 14:12:46 -07:00
Tom Aarsen
54bd6ed1d6 Remove unused imports 2022-10-27 13:32:01 +02:00
Tom Aarsen
0b078403ee Simplify statements into equivalent, modern variants via pyupgrade --py37-plus. The changes e.g. are subclassing from object, calling super() with super(ThisClass, self), or old-style syntax formatting. 2022-10-27 13:14:13 +02:00
Tom Aarsen
1eec77d34c Remove trailing whitespace & ensure newline at EOF 2022-10-27 13:11:29 +02:00
Tim Dettmers
df86625a93 Isolated CUDASetup logging; all tests green. 2022-10-24 11:54:25 -07:00
justheuristic
5d65817101 debug 2022-09-18 01:09:24 +03:00
dbaranchuk
4dd475ced4 refactoring 2022-09-11 06:28:17 +03:00
dbaranchuk
42b5fc9acc add memory effcient backward option 2022-09-11 05:51:29 +03:00
dbaranchuk
4d6174bc63 memory efficient fp16 backward 2022-08-25 19:09:23 +03:00
dbaranchuk
ef2936a90d delete CxB from state 2022-08-24 01:33:04 +03:00
dbaranchuk
656de8ed11 minor fixes 2022-08-23 23:53:43 +03:00
dbaranchuk
8ae9bb23ad add memory efficient backward 2022-08-23 23:39:54 +03:00
Tim Dettmers
9d60b3c527 Fixed bug in Linear8bitLt, when the bias is None. 2022-08-17 03:45:57 -07:00
Tim Dettmers
de354f7ded Added fused bias to matmullt. 2022-08-16 12:00:54 -07:00
Titus von Koeller
ea7c14f8ef reran black with linelength 80 for greater readability 2022-08-01 09:32:47 -07:00
Titus von Koeller
bfa0e33294 ran black and isort for coherent code formatting 2022-08-01 03:31:48 -07:00
Tim Dettmers
c771b3a75a Most tests passing. 2022-07-22 14:41:05 -07:00
Max Ryabinin
33efe4a09f Remove unused imports, fix NotImplementedError 2022-06-30 18:14:20 +03:00
Tim Dettmers
20e1677dfd Added module override, bnb.nn.Embedding #13 #15 #19 2021-11-29 09:32:13 -08:00
Robin Schmidt
67a1283501 [FIX] passing of sparse in StableEmbedding 2021-11-15 17:27:02 +01:00
Tim Dettmers
7439924891 Initial commit 2021-10-05 19:16:20 -07:00