Pinned Repositories
ipex-llm
Accelerate local LLM inference and fine-tuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., a local PC with iGPU and NPU, or a discrete GPU such as Arc, Flex, and Max); seamlessly integrates with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc.
neural-compressor
State-of-the-art low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) and sparsity; leading model compression techniques for TensorFlow, PyTorch, and ONNX Runtime.
ai-documents
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
intel-extension-for-tensorflow
Intel® Extension for TensorFlow*
neural-compressor
Intel® Neural Compressor (formerly Intel® Low Precision Optimization Tool) provides unified APIs for network compression techniques such as low-precision quantization, sparsity, pruning, and knowledge distillation across different deep learning frameworks, targeting optimal inference performance.
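To give a sense of what the low-precision quantization mentioned above involves, here is a minimal, self-contained sketch of symmetric per-tensor INT8 quantization in plain Python. This is an illustration of the general technique, not Neural Compressor's actual API; the function names are hypothetical.

```python
def quantize_int8(values):
    """Map floats to int8 codes using a single symmetric per-tensor scale.

    Hypothetical helper for illustration only; real toolkits (including
    Neural Compressor) add per-channel scales, calibration, and more.
    """
    amax = max(abs(v) for v in values) or 1.0
    scale = amax / 127.0  # int8 representable range is [-128, 127]
    codes = [max(-128, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate float values from int8 codes and the scale."""
    return [c * scale for c in codes]


weights = [0.5, -1.2, 3.3, 0.0]
codes, scale = quantize_int8(weights)
approx = dequantize_int8(codes, scale)
```

The round-trip error of each value is bounded by half the scale, which is why larger dynamic ranges (handled by per-channel scales or formats like NF4) matter in practice.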
oneapi-hackathon
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Unnati-FakeNews-Detection
Unnati_workshop
nazneenn's Repositories
nazneenn/Unnati_workshop
nazneenn/oneapi-hackathon
nazneenn/Unnati-FakeNews-Detection
nazneenn/ai-documents
nazneenn/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
nazneenn/intel-extension-for-tensorflow
Intel® Extension for TensorFlow*
nazneenn/neural-compressor
Intel® Neural Compressor (formerly Intel® Low Precision Optimization Tool) provides unified APIs for network compression techniques such as low-precision quantization, sparsity, pruning, and knowledge distillation across different deep learning frameworks, targeting optimal inference performance.
nazneenn/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.