| Tim Dettmers | 7140c01405 | Merge branch 'main' into fp8_merge | 2023-04-12 11:44:39 -07:00 |
| Tim Dettmers | dd562c24f1 | Refactored simulated fp8 modules into research.nn. | 2023-04-12 11:24:44 -07:00 |
| Tim Dettmers | ec1ea63711 | Refactored triton into its own folder. Refactored fp8 matmuls. | 2023-04-12 09:39:39 -07:00 |
| Mitchell Wortsman | d677a71607 | typo | 2023-04-08 19:36:17 +00:00 |
| Mitchell Wortsman | da524d97c9 | mem efficient" | 2023-04-08 19:34:18 +00:00 |
| Mitchell Wortsman | 7f87ba83ee | cleaning and refactor | 2023-04-01 18:46:04 +00:00 |
| Tim Dettmers | a13a522c4c | Added first triton test. | 2023-03-31 11:20:54 -07:00 |
| Mitchell Wortsman | 5f3d9ada8d | triton-v1 | 2023-03-29 06:47:08 +00:00 |
| Mitchell Wortsman | 51f8bb7133 | pre-triton update | 2023-03-24 05:44:42 +00:00 |
| Max Ryabinin | 24609b66af | Reduce diff | 2023-02-25 06:24:58 +01:00 |
| Max Ryabinin | d15822a54b | Refactor _tile_indices into a cached property, fix device bug | 2023-02-25 06:23:07 +01:00 |
| Max Ryabinin | cc608c04c2 | Revert the layout if weights were reordered | 2023-02-25 06:02:06 +01:00 |
| Max Ryabinin | cd4d904a4c | Raise an error when loading a quantized checkpoint before quantization | 2023-02-25 06:01:34 +01:00 |
| Mitchell Wortsman | 75377d125e | new experiments | 2023-02-24 00:10:15 +00:00 |
| Mitchell Wortsman | 3fbf60ad83 | sim now worse than real | 2023-02-23 08:27:15 +00:00 |
| Max Ryabinin | 58b09ee1b1 | [WIP] Implement proper serialization of Linear8bitLt | 2023-02-21 12:04:47 +01:00 |
| Mitchell Wortsman | 7b764d3569 | adding half() cast | 2023-02-21 03:53:44 +00:00 |
| Tim Dettmers | 2dfa3ce16d | Fixed LinearFP8 and added tests. | 2023-02-13 17:48:52 -08:00 |
| Tim Dettmers | fa255cbc56 | Added missing import. | 2023-02-13 17:29:39 -08:00 |
| Tim Dettmers | ca3236587a | Added forward/backward tests; removed bias. | 2023-02-13 17:20:52 -08:00 |
| Tim Dettmers | 6bdb6c351e | Added fp8 simulation layer. | 2023-02-13 16:53:07 -08:00 |
| Tim Dettmers | de53588934 | Added Int8 matmul support for all GPUs. Full backward support. | 2023-02-01 20:09:31 -08:00 |
| Tim Dettmers | c9f505064e | Added outlier detector and fake quantization layer. | 2023-01-28 17:05:22 -08:00 |
| Tim Dettmers | 9d353ca786 | Merge pull request #87 from lostmsu/main: Add `device` and `dtype` parameters to `StableEmbedding` | 2023-01-02 13:22:45 +01:00 |
| Victor Nova | 62d39a237c | add device and dtype parameters to StableEmbedding | 2022-11-04 14:12:46 -07:00 |
| Tom Aarsen | 54bd6ed1d6 | Remove unused imports | 2022-10-27 13:32:01 +02:00 |
| Tom Aarsen | 0b078403ee | Simplify statements into equivalent, modern variants via pyupgrade --py37-plus. The changes e.g. are subclassing from object, calling super() with super(ThisClass, self), or old-style syntax formatting. | 2022-10-27 13:14:13 +02:00 |
| Tom Aarsen | 1eec77d34c | Remove trailing whitespace & ensure newline at EOF | 2022-10-27 13:11:29 +02:00 |
| Tim Dettmers | df86625a93 | Isolated CUDASetup logging; all tests green. | 2022-10-24 11:54:25 -07:00 |
| justheuristic | 5d65817101 | debug | 2022-09-18 01:09:24 +03:00 |
| dbaranchuk | 4dd475ced4 | refactoring | 2022-09-11 06:28:17 +03:00 |
| dbaranchuk | 42b5fc9acc | add memory effcient backward option | 2022-09-11 05:51:29 +03:00 |
| dbaranchuk | 4d6174bc63 | memory efficient fp16 backward | 2022-08-25 19:09:23 +03:00 |
| dbaranchuk | ef2936a90d | delete CxB from state | 2022-08-24 01:33:04 +03:00 |
| dbaranchuk | 656de8ed11 | minor fixes | 2022-08-23 23:53:43 +03:00 |
| dbaranchuk | 8ae9bb23ad | add memory efficient backward | 2022-08-23 23:39:54 +03:00 |
| Tim Dettmers | 9d60b3c527 | Fixed bug in Linear8bitLt, when the bias is None. | 2022-08-17 03:45:57 -07:00 |
| Tim Dettmers | de354f7ded | Added fused bias to matmullt. | 2022-08-16 12:00:54 -07:00 |
| Titus von Koeller | ea7c14f8ef | reran black with linelength 80 for greater readability | 2022-08-01 09:32:47 -07:00 |
| Titus von Koeller | bfa0e33294 | ran black and isort for coherent code formatting | 2022-08-01 03:31:48 -07:00 |
| Tim Dettmers | c771b3a75a | Most tests passing. | 2022-07-22 14:41:05 -07:00 |
| Max Ryabinin | 33efe4a09f | Remove unused imports, fix NotImplementedError | 2022-06-30 18:14:20 +03:00 |
| Tim Dettmers | 20e1677dfd | Added module override, bnb.nn.Embedding #13 #15 #19 | 2021-11-29 09:32:13 -08:00 |
| Robin Schmidt | 67a1283501 | [FIX] passing of sparse in StableEmbedding | 2021-11-15 17:27:02 +01:00 |
| Tim Dettmers | 7439924891 | Initial commit | 2021-10-05 19:16:20 -07:00 |