quic/ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
PythonBSD-3-Clause
Issues
- 1
- 1
[BUG] IOT BYOM Issue: <insert issue here>
#128 opened by mbenencase - 2
[Feature Request] Add the use of https://pypi.org/project/torch-npu/ to enable the NPU of x1e-78-100
#127 opened by BrickDesignerNL - 1
[Feature Request] New feature request
#125 opened by 1826133674 - 1
[Question] What's the groupsize of w4a16 + w8a16
#112 opened by xiguadong - 4
[BUG] Fail to run llama v2 7B quantized on Galaxy S24 Ultra using GENIE C API
#102 opened by taeyeonlee - 1
[BUG] Error exporting OpenAI-Clip
#110 opened by rachillesf - 1
- 2
[BUG] Ollama not using GPU and not using NPU
#126 opened by BrickDesignerNL - 7
[BUG] Windows install Whisper not working
#123 opened by BrickDesignerNL - 2
qai-hub SDV2_1 demo running question
#120 opened by 1826133674 - 8
[BUG] Wrong Device/CPU detection & Using deprecated function & Key needed for demo & Should run local, but uploads model & Very slow Qualcomm cloud
#122 opened by BrickDesignerNL - 1
[MODEL REQUEST] Whisper-large-v3
#115 opened by BrickDesignerNL - 1
[MODEL REQUEST] Bark - text to singing & sound effects - from suno.ai / suno.com
#116 opened by BrickDesignerNL - 1
- 1
- 0
- 0
[MODEL FORMAT REQUEST] Support safetensors
#119 opened by BrickDesignerNL - 1
[BUG] File bug report : After logging in again, user is presented with login screen.
#111 opened by quic-rneti - 6
[BUG] Repo incompatible with Python 3.12.7 (ARM64) & 3.11.8 (ARM64). Lower versions of Python for Windows aren't available for ARM64. - Snapdragon X Elite has no native working version.
#113 opened by BrickDesignerNL - 1
[BUG] YoLo (and other models) DEMO not working - dataset folder is locally available but reference is not correct
#114 opened by BrickDesignerNL - 13
- 2
[Feature Request] Whisper Small.En Quantized
#81 opened by Carl-2008 - 1
qti.tvm.error.OpNotImplemented: The following operators are not supported in frontend TFLite: 'GELU'
#104 opened by chenxiao521 - 3
- 0
- 2
[BUG] File bug report I'm having a problem installing the gen_ondevice_llama operation on my phone deploying llama to run on the final phone
#107 opened by LLIKKE - 2
[Feature Request] New feature request
#106 opened by holylong - 3
In QCS8550 development board, the inference of llama-v2-7b-chat using NPU failed."
#105 opened by holylong - 7
[BUG] IOT BYOM Issue: Compiling PyTorch model to a QNN Context Binary on AiHub fails
#95 opened by Midi12 - 1
- 1
Running Model Locally with Custom Prompts
#96 opened by xuandy05 - 3
- 1
What is the limitation of Hexagon V75 that the Llama v2 7B Quantized model should be split into 8 Bin files ?
#100 opened by taeyeonlee - 6
No Python on Windows ARM available for < 3.11
#89 opened by nsteblay - 4
- 7
- 4
- 4
[BUG] The generated text is strange from 8 QNN Context Bins which are generated in AI Hub.
#87 opened by taeyeonlee - 3
[BUG] fails to create context from binary for the 4 QNN Context Bin files (llama_v2_7b_chat_quantized_TokenGenerator_x_Quantized.bin)
#85 opened by taeyeonlee - 1
Does not support XR2 GEN2 chip?
#92 opened by brisyramshere - 2
- 2
Has the AiHub's Whisper model been retrained?
#90 opened by zzy981019 - 1
[BUG] Mobile BYOM Issue: <insert issue here>
#91 opened by weekendcheng - 2
- 1
- 1
Operate with NPU
#83 opened by a4073631 - 1
Genie C API : Sample android app
#79 opened by taeyeonlee - 1
error occured when run llama_v2_7b_chat_quantized_PromptProcessor_3_Quantized
#84 opened by yolanda1224git - 0
Error uploading to QDC: status code=500
#78 opened by yolanda1224git