Pinned issues
Issues
Subreddit is down
#451 opened by Gamithon - 0
Question: GPU support
#456 opened by toastxc - 0
Out of context answers
#453 opened by DravidVaishnav - 1
Include bytes of model in program?
#452 opened by Lamby777 - 2
Currently in dev any inference is broken
#450 opened by gadLinux - 1
Behavior when missing quantization version
#447 opened by Reichenbachian - 1
EOS is not read from gguf format
#446 opened by Alisa-lisa - 0
Support Separate Loading of Vocabulary or Tensors
#445 opened by skirodev - 5
Support for Mistral-7b
#434 opened by ranaya-formant - 5
Why is the feed_prompt process so slow?
#439 opened by zackshen - 3
Video Tutorials
#399 opened by okpatil4u - 0
Reduce dependencies
#437 opened by philpax - 6
Supporting Llama-2 70B param
#402 opened by AmineDiro - 5
Api Server Example
#414 opened by e253 - 1
How to disable ggml logging?
#433 opened by mrwilby - 1
Default String for ConfiguredSamplers
#420 opened by JuliaMerz - 2
Clarify MSRV policy
#431 opened by chris-ha458 - 1
How do I use Huggingface tokenization to use a model on Huggingface in MODEL_PATH instead of my local machine?
#427 opened by alfellati - 1
Medusa Speculative Decoding
#423 opened by someone13574 - 2
SIGTRAP triggered on MacOS
#422 opened by jafioti - 0
Metal Prompt Feeding
#403 opened by jafioti - 3
WizardCoder llama assert failure
#417 opened by jacohend - 2
AMD ROCm support with HIPBLAS
#415 opened by xangelix - 5
GPT-2 load errors
#397 opened by pabl-o-ce - 0
LLaMA-2 GGML formats fail to generate any new token
#413 opened by AnubhabB - 1
System prompts
#411 opened by kpcyrd - 8
Issues using with whisper-rs
#408 opened by jafioti - 1
Proper Rewind+Refeed when stop token is detected.
#407 opened by JuliaMerz - 4
Cannot Compile llm-base on main
#405 opened by carllippert - 2
Installation of CLI on macOS
#388 opened by twardoch - 3
stack overflow after new merge
#391 opened by fluffydolphin - 2
Add classifier-free guidance
#377 opened by philpax - 0
Build failure on Windows
#396 opened by NuSkooler - 6
Reddit community Wiki page is disabled
#392 opened by viktor-ferenczi - 1
Implement SuperHOT/interpolated RoPE support
#378 opened by philpax - 1
[compile fail] Undefined symbols for architecture x86_64: "_ggml_graph_plan", referenced from: ggml::GraphExecutionPlan:: new ::h19443c1d77cb997e in libggml-a3125eb72f014293.rlib(ggml-a3125eb72f014293.ggml.9c60abbf9368402d-cgu.1.rcgu.o)
#387 opened by twardoch - 4
Runtime GPU backend selection
#386 opened by philpax - 2
Invalid magic number while loading LoRAs
#384 opened by clarkmcc - 1
I can't compile llm with nix
#374 opened by GENDRAUD - 1
Shared `ExecutionParameters` for all operations
#372 opened by philpax