Issues
What is the future plan of this torchchat project?
#1114 opened by yanbing-j - 0
Cannot build ET Runner
#1107 opened by byjlw - 2
Support for quantized llm for smaller memory devices
#1024 opened by jhetuts - 1
Can't run ET Install
#1106 opened by byjlw - 0
should have the ability to output debug artifacts when exporting to a .pte file
#1101 opened by byjlw - 1
Cannot install ET
#1103 opened by byjlw - 0
HelloGitHub Badge
#1105 opened by 521xueweihan - 0
[Distributed Inference] running under .inference_mode() = assert Dtensor NotImplementedError: Operator aten.matmul.default does not have a sharding strategy registered.
#1087 opened by lessw2020 - 4
[Distributed Inference] moving stage.submod to non-fp32 (bf16, fp16) results in dtensor assert "self.mask_buffer.data is not None"
#1086 opened by lessw2020 - 2
download for 3.1 is broken
#1069 opened by byjlw - 0
Slow eval performance for .pte models
#1066 opened by vmpuri - 6
Streamlit Browser Issue
#1033 opened by nobelchowdary - 0
Slimming down torchchat: Replace replace_attention_with_custom_sdpa_attention() with ET's implementation
#1058 opened by Jack-Khuu - 3
support load model locally
#1040 opened by irasin - 4
Open AI API Maturity
#973 opened by Jack-Khuu - 3
ImportError: cannot import name 'int4_weight_only' from 'torchao.quantization.quant_api'
#1017 opened by sadimanna - 1
[Feature request] Langchain Support - Chat Model
#1009 opened by raymon-io - 4
How to deploy a new model by torchchat?
#1038 opened by liu8060 - 0
Dataclass Type Enforcement
#1025 opened by vmpuri - 0
Make quantization a first class feature
#1032 opened by byjlw - 0
Improve support for and documentation of custom models
#1041 opened by Jack-Khuu - 4
Can't install requirements when using Python-3.12
#963 opened by malfet - 2
Include final softmax in forward() to run on GPU when compiling w/ aoti or other accelerator on et
#1012 opened by sunshinesfbay - 4
AOTI/DSO model does not run in Linux
#996 opened by lhl - 4
[Raspbian] streamlit GUI interface does not work / no documentation how to install
#1001 opened by sunshinesfbay - 7
`scripts/build_native.sh et` errors out
#985 opened by sunshinesfbay - 1
Does torchchat have inference acceleration features? Are the qwen1.5 and qwen2 series currently unsupported?
#1010 opened by yawzhe - 7
Crashes with internal assert while parsing options
#976 opened by malfet - 2
No attribute 'prompt' and 'num_samples'
#1003 opened by YeonwooSung - 3
No attribute Prompt
#1002 opened by bookandlover - 4
Weird model behaviour on Server/Browser: Looks like it's not using the template
#989 opened by akhilreddy0703 - 2
Android app crash with stories110m model
#977 opened by iceychris - 1
Leverage the HF cache for models
#992 opened by byjlw - 1
Could we request support for a smallish (~4-5B param) modern vision LLM? LLava-1.6 or Nanollava?
#988 opened by kinchahoy - 1
Android App Should Crash gracefully when the tokenizer in the aar doesn't match the model
#978 opened by Jack-Khuu - 0
Add an "Intro to torchchat" diagram to the README
#982 opened by Jack-Khuu - 0
Update CLI arg builders to check for only args that the subcommand uses: Export/Generate
#932 opened by Jack-Khuu - 1