Issues
- Qwen2: cannot run with the latest Qwen2 models (#2256, opened by zensh, 3 comments)
- Since cudarc 0.11.4, error with PTX: CUDA_ERROR_UNSUPPORTED_PTX_VERSION (#2237, opened by CoffeeVampir3, 0 comments)
- How to implement new operators using CUDA host functions along with the Thrust and CUB libraries (#2258, opened by chenwanqq, 26 comments)
- Linear layer with the same weights, biases, and inputs gives different output than PyTorch (#2250, opened by EricLBuehler, 0 comments)
- ONNX: MaxPool with pads != 0 (#2255, opened by limymy, 0 comments)
- Unsupported op_type STFT for op (#2254, opened by mzdk100, 1 comment)
- Unsupported op_type Pad for op (#2196, opened by mzdk100, 1 comment)
- Improve extracting values from `gguf_file::Value` (#2245, opened by polarathene, 0 comments)
- Dynamic linking feature breaks pyo3 wrappers (#2252, opened by qooba, 1 comment)
- nvcc fatal: Cannot find compiler 'cl.exe' in PATH (#2241, opened by kdletters, 1 comment)
- [question] Difference from tvm-unity / mlc-llm (#2249, opened by louis030195, 6 comments)
- Automatically upcasting GGUF values (#2243, opened by EricLBuehler, 0 comments)
- Improving the versatility of `Tensor::slice_assign` (#2242, opened by EricLBuehler, 5 comments)
- `Tensor::to_scalar` has very high latency (#2239, opened by RoggeOhta, 1 comment)
- MetaVoice WASM example? (#2232, opened by overheat, 2 comments)
- CUBLAS_STATUS_NOT_SUPPORTED for Conv2d (#2218, opened by EricLBuehler, 0 comments)
- Misleading `Tensor::matmul` documentation (#2228, opened by kckeiks, 3 comments)
- Unsupported CUDA toolkit version: `12050` (#2210, opened by Gadersd, 3 comments)
- Unable to convert a T5 model to GGUF (#2215, opened by niranjanakella, 1 comment)
- SeparableConv2d implementation (#2219, opened by PacoDu, 2 comments)
- Implement `torch.bucketize` (#2185, opened by EricLBuehler, 2 comments)
- Quantization issue with Mixtral 8x22B (#2201, opened by edesalve, 0 comments)
- Error: cannot seed the CPU RNG with set_seed (#2216, opened by siddthartha, 2 comments)
- Unsupported CUDA toolkit version: `12040` (#2169, opened by kdletters, 2 comments)
- "Using MKL" documentation link goes to a 404 (#2198, opened by CoffeeVampir3, 1 comment)
- How to slice a tensor? (#2197, opened by Gadersd, 6 comments)
- Problem loading metadata from a GGUF file (#2152, opened by cnlancehu, 6 comments)
- Example of loading a model via `include_bytes!`? (#2186, opened by boustrophedon, 2 comments)
- Whisper microphone example outputs gibberish (#2182, opened by krzysztofwos, 0 comments)
- `sort_last_dim` fails on CUDA (#2181, opened by lucasavila00, 6 comments)
- `VarBuilder::from_bytes`? (#2177, opened by boustrophedon, 0 comments)
- Upgrade the cudarc dependency to v0.11.1 (#2173, opened by sidharthrajaram, 2 comments)
- Transparent Huge Pages support (#2149, opened by michaeleisel, 2 comments)
- How to write an Axum SSE handler for Candle? (#2167, opened by sunnyregion, 4 comments)
- Why is the answer from my Gemma example not as expected? Did I miss something? (#2170, opened by coolbeevip, 0 comments)
- No backward pass for `RmsNorm` if the tensor is contiguous (#2168, opened by agerasev, 9 comments)
- `broadcast_as` error when processing multiple tokens at once in the quantized example (#2153, opened by EricLBuehler, 2 comments)
- Metal error while loading function: "Function 'cast_bf16_f16' does not exist" with Llama 3 (#2163, opened by yIllusionSky, 2 comments)
- Model-to-architecture mapping (#2161, opened by BDUG, 4 comments)
- Quantized Phi-3 example fails with "cannot find llama.attention.head_count in metadata" (#2154, opened by MoonKraken, 3 comments)
- Top-p halves the generation speed in the Llama example (#2147, opened by Ayuei, 0 comments)
- Tensor filtering (#2148, opened by michaeleisel, 3 comments)