Issues
Running on Mac gets a traceback error
#123 opened by gr3enarr0w - 4
attn impl to sdpa...
#107 opened by saa1028 - 3
Generation takes forever
#111 opened by Kira-Pgr - 0
Error with Llama3: ValueError: Trying to set a tensor of shape torch.Size([1024, 8192]) in "weight" (which has shape torch.Size([8192, 8192])), this look incorrect.
#131 opened by Cangshanqingshi - 6
Mac M2 running airllm with garage-bAInd/Platypus2-7B gets error: Input must be a file-like object opened in binary mode, or string
#116 opened by wuxiongwei - 2
Error on Apple Mac M3
#134 opened by mustangs0786 - 2
Insufficient disk space
#136 opened by ulisesbussi - 3
segmentation fault python3 airllm2.py
#129 opened by taozhiyuai - 0
safetensors_rust.SafetensorError: Error while deserializing header: MetadataIncompleteBuffer
#137 opened by chuangzhidan - 0
CPU ram offload
#135 opened by NicolasMejiaPetit - 1
Discord Invite Expired in the readme
#90 opened by birdup000 - 0
Does airllm support quantized gguf/gptq/awq models ?
#133 opened by robik72 - 0
COMPILED_WITH_CUDA error requires libcuda.so
#132 opened by nickums - 1
AirLLM: Support for DirectML
#108 opened by vegax87 - 2
Can't get chatglm3 to run; please advise.
#130 opened by ZiQiangXie - 0
Trying to run llama3-70b, but import fails. Why?
#128 opened by taozhiyuai - 0
Any CoreML implementation plans?
#127 opened by Proryanator - 0
Mac: 'str' object has no attribute 'sequences'
#126 opened by gr3enarr0w - 0
"src" directory name is conflicted
#125 opened by Rambo55555 - 0
How can a model downloaded via Ollama be used directly in airllm?
#122 opened by w1005444804 - 3
Request: support llama3
#121 opened by CrazyBoyM - 0
Compression parameter on Mac doesn't work
#119 opened by dnvs - 2
Support for OPT Architecture
#118 opened by varunlmxd - 1
For me this model is extremely underperforming
#105 opened by SadafShafi - 2
Seems to generate only very few characters
#115 opened by andeyeluguo - 0
Which 70B model does macOS support?
#112 opened by ruifengma - 1
ValueError: LlamaForCausalLM does not support an attention implementation through torch.nn.functional.scaled_dot_product_attention yet.
#101 opened by sleeper1023 - 0
Optimize for consumer GPUs, e.g. 11GB or 16GB
#109 opened by profintegra - 0
AMD GPU support
#106 opened by hanq-moreh - 1
Running the Yi-34B-chat model with airllm, this error is reported after layer splitting
#103 opened by peiyanyang - 0
Will the airllm framework be adapted for the streaming output functionality of different models in the future?
#102 opened by wangqn1 - 1
Can chat models be used with airllm?
#99 opened by wzz981 - 0
AirLLMLlamaMlx fails to load model with mlx==0.0.7
#100 opened by jakule - 1
How to infer on multiple GPUs?
#98 opened by yuxx0218 - 1
Finetune 70B on 24GB 4090?
#96 opened by Naozumi520 - 6
safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
#93 opened by fudp - 1
microsoft-phi2: max() arg is an empty sequence
#95 opened by zazaji - 1
ImportError: cannot import name AutoMode
#94 opened by zazaji - 0
Would adding Parallelism speed up AirLLM?
#89 opened by birdup000 - 0
Mac quantization
#88 opened by ageorgios - 0
Mac Airllm Inference tigerbot-70b-chat-v2
#87 opened by ageorgios - 0
configure the chunk split size
#86 opened by ageorgios - 1
Mixtral models seem to run forever
#84 opened by Josh-XT - 1