bitsandbytes-rocm/bitsandbytes
2023-04-12 11:44:39 -07:00
..
autograd Merge branch 'main' into fp8_merge 2023-04-12 11:44:39 -07:00
cuda_setup Fixing #300. 2023-04-11 16:14:29 -07:00
nn Merge branch 'main' into fp8_merge 2023-04-12 11:44:39 -07:00
optim Fixed bug where beta2 was not passed into Lion 32-bit. 2023-04-11 09:16:01 -07:00
research Refactored simulated fp8 modules into research.nn. 2023-04-12 11:24:44 -07:00
triton Added missing triton and fp8 files. 2023-04-12 10:06:18 -07:00
__init__.py Refactored triton into its own folder. Refactored fp8 matmuls. 2023-04-12 09:39:39 -07:00
__main__.py Added more detailed cuda setup debug and debugging instructions. 2023-04-11 13:47:10 -07:00
cextension.py Added better error message for debugging on CUDA not detected failures. 2023-04-12 07:56:52 -07:00
functional.py Merge branch 'main' into fp8_merge 2023-04-12 11:44:39 -07:00
utils.py Added outlier detector and fake quantization layer. 2023-01-28 17:05:22 -08:00