Pinned Repositories
nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
jm-hm-ubuntu
Set up the JM (AVC) and HM (HEVC) reference software in Ubuntu
nervana-distiller
Quick start and examples for the Intel Nervana Distiller
nn_pruning
Prune a model while finetuning or training.
nncf
PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference
openvino-ubuntu
Set up and run OpenVINO in a Docker Ubuntu environment on an Intel CPU with integrated graphics
vuiseng9's Repositories
vuiseng9/nncf
PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference
vuiseng9/bench-softmax
vuiseng9/cats
vuiseng9/data-parallel-CPP
Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xinmin Tian (Apress, 2020).
vuiseng9/dejavu-lm
vuiseng9/EAGLE
[ICML'24] EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
vuiseng9/ipex
A Python package that extends the official PyTorch to easily obtain performance gains on Intel platforms
vuiseng9/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.
vuiseng9/llm-awq
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
vuiseng9/lm-evaluation-harness
A framework for few-shot evaluation of language models.
vuiseng9/mlperf-inference
Reference implementations of MLPerf™ inference benchmarks
vuiseng9/mlperf-v3.0-intel
This repository contains the results and code for the MLPerf™ Inference v3.0 benchmark.
vuiseng9/mlperf-v3.1-intel
This repository contains the results and code for the MLPerf™ Inference v3.1 benchmark.
vuiseng9/mm_amx
Matrix multiplication (matmul) using AMX instructions
vuiseng9/oneAPI-samples
Samples for Intel® oneAPI Toolkits
vuiseng9/openvino.genai
vuiseng9/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
vuiseng9/optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
vuiseng9/ov-llm-tld
vuiseng9/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
vuiseng9/sd-perf
Quick script to profile Stable Diffusion performance
vuiseng9/SparseFinetuning
Repository for sparse finetuning of LLMs via a modified version of the MosaicML llmfoundry
vuiseng9/Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
vuiseng9/speculative-sampling
Simple implementation of Speculative Sampling in NumPy for GPT-2 (a minimal sketch of the core accept/reject step appears after this list)
vuiseng9/SqueezeLLM
SqueezeLLM: Dense-and-Sparse Quantization
vuiseng9/torch-custom-linear
Custom implementation of a linear (fully connected) layer
vuiseng9/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
vuiseng9/trl
Train transformer language models with reinforcement learning.
vuiseng9/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vuiseng9/wanda
A simple and effective LLM pruning approach.
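
Several repositories above (vuiseng9/speculative-sampling, vuiseng9/Spec-Bench, vuiseng9/EAGLE) revolve around speculative decoding. For orientation, here is a minimal NumPy sketch of one speculative-sampling step in the spirit of vuiseng9/speculative-sampling; the toy draft_probs and target_probs callables are hypothetical stand-ins for real GPT-2 forward passes, not code from that repository.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB = 8  # toy vocabulary size

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Hypothetical stand-ins for the draft (small) and target (large) models;
# a real setup would return next-token distributions from GPT-2 checkpoints.
def draft_probs(prefix):
    return softmax(rng.normal(size=VOCAB))

def target_probs(prefix):
    return softmax(rng.normal(size=VOCAB))

def speculative_step(prefix, k=4):
    # 1) The cheap draft model proposes k tokens autoregressively.
    ctx, proposed, q_dists = list(prefix), [], []
    for _ in range(k):
        q = draft_probs(ctx)
        tok = int(rng.choice(VOCAB, p=q))
        proposed.append(tok)
        q_dists.append(q)
        ctx.append(tok)
    # 2) The target model verifies each proposal: a token x drawn from q
    #    is accepted with probability min(1, p(x) / q(x)).
    out = list(prefix)
    for tok, q in zip(proposed, q_dists):
        p = target_probs(out)
        if rng.random() < min(1.0, p[tok] / q[tok]):
            out.append(tok)  # accepted draft token
        else:
            # Rejected: resample from the residual max(p - q, 0), renormalized;
            # in the real algorithm this keeps the output distribution exactly p.
            resid = np.maximum(p - q, 0.0)
            out.append(int(rng.choice(VOCAB, p=resid / resid.sum())))
            return out  # stop at the first rejection
    # 3) All k drafts accepted: take one extra token from the target model.
    out.append(int(rng.choice(VOCAB, p=target_probs(out))))
    return out

print(speculative_step([1, 2, 3]))
```

In the real algorithm the target model scores all k draft positions in a single batched forward pass, which is where the speedup comes from; the per-position calls here are only for readability.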