Issues
- 0
bug: Fails running dynamic shapes
#338 opened by michaelfeil - 0
Confuse about block_delta
#337 opened by zhanglei1172 - 4
bug: optimize_model() fails on HF's GPT2 with "RuntimeError: CUDA error: operation not permitted when stream is capturing"
#336 opened by CorentinJ - 4
bug: Llama model optimization failing
#317 opened by AndrewMead10 - 0
- 0
- 0
- 0
bug: Bart speedup only 1.6x
#327 opened by sinking-point - 1
bug: Llama reproduce error with kernl
#321 opened by yychen016 - 3
bug: How to save the optimized model to file?
#313 opened by aaronchan90 - 0
bug: does kernl support pipeline parallel?
#323 opened by ninisy - 6
bug: Could not get kernl running on CodeT5
#283 opened by TheSeamau5 - 0
proposal: Write GEMM (matrice mulitplication) triton optimization animation
#318 opened by pommedeterresautee - 0
- 2
- 1
docs: automatic code reference generation
#280 opened by jonathlela - 2
feature: using TorchBench to test the coverage
#305 opened by xuzhao9 - 1
- 1
- 0
[FRONT] Linking kernl and the blog
#281 opened by white-gorilla - 0
- 5
- 0
[FRONT] Remove/comment empty sections or put something more engaging, less crude.
#259 opened by white-gorilla - 5
- 0
bug: tests failing at nvidia-driver-530
#304 opened by christallire - 4
Accelerate warmup IRL?
#242 opened by JaheimLee - 1
feature: non verbose CI
#302 opened by pommedeterresautee - 7
Installation problem
#293 opened by p-christ - 2
feature: run tests on CI
#289 opened by pommedeterresautee - 2
- 6
- 0
feature: introduce int8 quant kernel
#288 opened by pommedeterresautee - 2
- 1
bug: ERROR: Package 'kernl' requires a different Python: 3.8.10 not in '==3.9.*'
#282 opened by silvacarl2 - 2
version is still 0.1.0
#273 opened by JaheimLee - 0
- 0
feature: reduce memory overhead in CG
#267 opened by pommedeterresautee - 0
- 0
- 0
- 1
bug: memory leak
#256 opened by pommedeterresautee - 1
[M4M Insiders] Upgrade version.
#254 opened by white-gorilla - 1
- 1
[M4MInsiders] Fix automatic docker image update
#248 opened by white-gorilla - 0
[FRONT] The contribution guide is broken.
#249 opened by white-gorilla - 0
Add conventions to triton kernel writing
#229 opened by gaetansnl - 1
bug: RuntimeError("GPU compute capability 8.0 (Ampere) or higher is required to use Kernl")
#246 opened by Oxi84 - 0
bug: memory leak in Pytest / CUDA Graph
#244 opened by pommedeterresautee - 1
- 0
Linter and formatter configuration incompatibility
#230 opened by gaetansnl