Pinned Repositories
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Gaudi-tutorials
Tutorials for running models on first-gen Gaudi and Gaudi2 for training and inference. Source files for the tutorials at https://developer.habana.ai/.
Habana_Custom_Kernel
Examples showing how to write and build Habana custom kernels using HabanaTools.
hccl_demo
Megatron-DeepSpeed
Intel Gaudi's Megatron-DeepSpeed fork for training large language models.
Model-References
Reference models for Intel(R) Gaudi(R) AI Accelerator
Setup_and_Install
Setup and installation instructions for Habana binaries and Docker image creation.
SynapseAI_Core
SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
tpc_llvm
TPC-CLANG, a compiler for the TPC-C programming language used in Habana Labs deep-learning accelerators.
vllm-fork
A high-throughput and memory-efficient inference and serving engine for LLMs
Intel® Gaudi® AI Accelerator's Repositories
HabanaAI/Model-References
Reference models for Intel(R) Gaudi(R) AI Accelerator
HabanaAI/Gaudi-tutorials
Tutorials for running models on first-gen Gaudi and Gaudi2 for training and inference. Source files for the tutorials at https://developer.habana.ai/.
HabanaAI/vllm-fork
A high-throughput and memory-efficient inference and serving engine for LLMs
HabanaAI/tpc_llvm
TPC-CLANG, a compiler for the TPC-C programming language used in Habana Labs deep-learning accelerators.
HabanaAI/Setup_and_Install
Setup and installation instructions for Habana binaries and Docker image creation.
HabanaAI/Habana_Custom_Kernel
Examples showing how to write and build Habana custom kernels using HabanaTools.
HabanaAI/hccl_demo
HabanaAI/Megatron-DeepSpeed
Intel Gaudi's Megatron-DeepSpeed fork for training large language models.
HabanaAI/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
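As a minimal illustration of how DeepSpeed is typically driven, the sketch below builds the configuration dict that would be passed to `deepspeed.initialize(..., config=ds_config)`. It uses only standard upstream DeepSpeed config keys (`train_batch_size`, `bf16`, `zero_optimization`); the values are illustrative assumptions, and nothing here is confirmed to be specific to the HabanaAI fork.

```python
# Hypothetical minimal DeepSpeed configuration, expressed as the dict
# you would hand to deepspeed.initialize(..., config=ds_config).
# All values are illustrative; keys are standard upstream DeepSpeed options.
ds_config = {
    "train_batch_size": 32,            # global batch size across all workers
    "gradient_accumulation_steps": 1,
    "bf16": {"enabled": True},         # bf16 is the usual mixed-precision dtype on Gaudi
    "zero_optimization": {"stage": 2}, # ZeRO stage 2: shard optimizer state and gradients
}
```

In practice this dict (or an equivalent `ds_config.json` file) is supplied when wrapping the model, after which DeepSpeed handles the distributed optimizer and precision details.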
HabanaAI/Gaudi-solutions
Full end-to-end examples showing how to use first-gen Gaudi and Gaudi2 in common use cases.
HabanaAI/HCL
HabanaAI/Gaudi2-Workshop
HabanaAI/deepspeed_old
HabanaAI/hl-thunk-open
Thunk library for HabanaLabs kernel driver
HabanaAI/Intel_Gaudi3_Software
Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3
HabanaAI/Megatron-LM
Ongoing research training transformer models at scale
HabanaAI/habanalabs-k8s-device-plugin
HABANA device plugin for Kubernetes
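A device plugin like this exposes the accelerator to Kubernetes as an extended resource that pods can request in their resource limits. As a sketch, assuming the plugin advertises the resource name `habana.ai/gaudi` (the image name below is a placeholder, not a real path):

```yaml
# Hypothetical pod spec requesting one Gaudi device through the device plugin.
apiVersion: v1
kind: Pod
metadata:
  name: gaudi-test
spec:
  containers:
    - name: workload
      image: my-gaudi-image:latest   # placeholder image for illustration
      resources:
        limits:
          habana.ai/gaudi: 1         # extended resource assumed to be advertised by the plugin
```

The scheduler then places the pod only on nodes where the plugin reports available Gaudi devices.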
HabanaAI/Fairseq
HabanaAI/optimum-habana-fork
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
HabanaAI/TOWL
HabanaAI/vllm-hpu-extension
HabanaAI/hccl_ofi_wrapper
HabanaAI/slurm
Slurm: A Highly Scalable Workload Manager
HabanaAI/drivers.accel.habanalabs.kernel
HabanaAI/papers
Academic papers by the Habana research team
HabanaAI/pytorch-fork
Tensors and Dynamic neural networks in Python with strong GPU acceleration
HabanaAI/rdma-core
RDMA core userspace libraries and daemons
HabanaAI/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
HabanaAI/drivers.gpu.linux-nic.kernel
NIC drivers (Ethernet, IBverbs, and common) for the NIC IP inside Intel's data-center GPUs.
HabanaAI/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime